Abstract
Context
The Comprehensive Osteopathic Medical Licensing Examination of the United States (COMLEX-USA) is a three-level national standardized licensure examination designed for the practice of osteopathic medicine. Following several years of analysis and considering input from across the continuum of osteopathic medical education, training, and licensure, the National Board of Osteopathic Medical Examiners (NBOME) transitioned COMLEX-USA Level 1 (Level 1) score reporting from a numeric score with a pass/fail result to a pass/fail result only beginning with the 2022–2023 testing cycle in May 2022.
Objectives
The purpose of this study is to investigate the differences in student Level 1 self-reported stress levels, test preparation, and performance following the transition to pass/fail score reporting.
Methods
The study utilized data from Level 1, including end-of-examination survey responses from first-time test takers in the 2023–2024 administration and examination performance data from first-time test takers across the 2021–2022, 2022–2023, and 2023–2024 administrations. The analysis examined changes in three key outcomes: self-reported stress during examination preparation, perceived examination preparation, and performance on Level 1 examination.
Results
After Level 1 transitioned to pass/fail score reporting, 60 % of students responding to the 2023–2024 end-of-examination survey reported reduced stress in Level 1 examination preparation, and 75 % of respondents reported no change in preparation time. In the first testing cycle after the transition, declines were observed in several performance indicators, including pass rate and the mean, standard deviation, and maximum of z-scores (converted from Level 1 numeric scores). In the subsequent cycle, the pass rate rebounded and the mean z-score stabilized, while both the standard deviation and maximum z-score continued to decline.
Conclusions
The COMLEX-USA series is designed as a licensure examination to assess competencies essential for the practice of osteopathic medicine. Passage of Level 1 indicates that a student has demonstrated competence in the foundational biomedical sciences, osteopathic principles, and related physician competency domains to enter supervised patient care settings. The transition in score reporting for Level 1 preserves that purpose. This study provides preliminary insights into the perceived differences in students’ stress level, examination preparation, and examination performance patterns following the change of score reporting. Further study will be conducted as these students transition through their osteopathic education.
The Comprehensive Osteopathic Medical Licensing Examination of the United States (COMLEX-USA) is a three-level national standardized licensure examination designed for the practice of osteopathic medicine. COMLEX-USA Level 1 (Level 1) is typically taken toward the end or after the second year of osteopathic undergraduate medical education (UME). Passing indicates that the student has demonstrated competency in the foundational biomedical sciences, osteopathic principles, and related physician competency domains for osteopathic medical care of patients as required to enter supervised patient care settings [1]. Following several years of analysis and considering input from across the continuum of osteopathic medical education, training, and licensure, the National Board of Osteopathic Medical Examiners (NBOME) transitioned Level 1 score reporting from a three-digit numeric score with a pass/fail outcome to a pass/fail only result in the 2022–2023 testing cycle starting in May 2022 [2].
The transition to pass/fail score reporting in high-stakes medical licensing examinations has been studied for its impact on students’ stress levels, study time, and examination performance. Twardowski et al. [3] analyzed survey data from students at Rocky Vista University College of Osteopathic Medicine (RVU-COM), comparing cohorts before (Class of 2023) and after (Class of 2024) the transition. Although they found no reduction in self-reported stress levels, students reported fewer weeks of study and fewer practice examinations after the United States Medical Licensing Examination (USMLE) Step 1 (Step 1) and Level 1 adopted pass/fail scoring. Baniadam et al. [4] observed that students at Georgetown University School of Medicine experienced significantly lower stress related to Step 1 during the second year of medical school in the pass/fail cohort (Class of 2024) compared to the numeric score cohort (Class of 2023), although this difference disappeared during the dedicated study period. Girard et al. [5] conducted a study across multiple medical schools prior to the Step 1 transition and found that students reported that they would dedicate less time to studying for Step 1 if it was scored on a pass/fail basis. Yadav et al. [6] examined national performance data for Step 1 and noted a decline in first-time pass rates for students from US/Canadian Doctor of Medicine (MD) and Doctor of Osteopathic Medicine (DO) schools and non-US/Canadian schools in the first year of transition to pass/fail scoring.
In the preclinical years of medical education, pass/fail reporting has been associated with improved student well-being [7], [8], [9] without negatively affecting coursework or Step 1 performance [7]. Despite the importance of the Level 1 score reporting change to students and educators in osteopathic medical education and graduate medical education (GME), limited research has explored its effects. To fill this gap, this study utilized nationwide data from students at all colleges of osteopathic medicine (COMs) to investigate three key questions: Did the transition to pass/fail scoring affect students’ perceived stress levels during Level 1 examination preparation? Did students feel that they prepare differently than they would have if the Level 1 examination still had a three-digit score? Has there been a shift in Level 1 performance following the score reporting change?
Methods
Data
The research project was reviewed by the Institutional Review Board (IRB) of the NBOME and was determined to be exempt.
Two sets of data were utilized in the study. The first dataset consisted of the survey responses from first-time Level 1 test takers during the 2023–2024 administration who completed two newly added questions in the optional end-of-examination survey. These questions were designed to gather students’ perspectives on how pass/fail score reporting influenced their stress level and preparation strategy during examination preparation. The second dataset included Level 1 three-digit numeric scores and pass/fail outcomes for first-time test takers across three consecutive testing cycles: the final cycle with numeric score reporting (2021–2022) and the first two cycles with pass/fail reporting (2022–2023 and 2023–2024). For analysis, numeric scores from all three cycles were converted to z-scores (mean=0, standard deviation [SD]=1), acknowledging that numeric scores were no longer reported after the transition.
Analysis
Changes in stress level during Level 1 examination preparation
In 2023–2024 administration, a new end-of-examination survey question asked for a response to the statement, “The COMLEX-USA Level 1 reporting change from a numeric score to a ‘pass’ or ‘fail’ score reduced stress in preparation for the examination”. Response options were Strongly disagree, Disagree, Neither agree nor disagree, Agree, and Strongly agree. For analytic purposes, responses were collapsed and recoded into a three-point Likert scale: 1=Disagree (Disagree/Strongly disagree), 2=Neutral (Neither agree nor disagree), 3=Agree (Agree/Strongly agree). Frequencies, pass rates, and z-scores were summarized by response option.
We hypothesized that perceived stress reduction would vary by competence. Candidates with low competence may not have perceived stress relief, given that achieving a passing threshold remained difficult, whereas candidates with high competence may have reported stress reduction due to the relative ease of attaining a passing score. To evaluate this, two hypothesis tests were conducted: (1) whether the Spearman rank correlation between scaled responses and z-scores differed from zero; and (2) whether mean z-scores differed across response categories.
Changes in Level 1 examination preparation
Another new end-of-examination survey question asked, “Compared to numeric score reporting, how does ‘pass’ or ‘fail’ score reporting for COMLEX-USA Level 1 affect how you prepare?”. The response options were Significantly more time preparing, More time preparing, No change, Less time preparing, and Significantly less time preparing. These were collapsed and converted into a three-point Likert scale: 1=More time (More time/Significantly more time preparing), 2=No change, 3=Less time (Less time/Significantly less time preparing). Frequencies, pass rates, and z-scores were again summarized by response category.
We hypothesized that the changes in Level 1 preparation would be associated with competence. Candidates with low competence might invest the same or even more preparation time to ensure a passing outcome, while candidates with high competence might reduce preparation, because maximizing numeric score was no longer necessary. This hypothesis was examined utilizing two tests: (1) whether the Spearman rank correlation between scaled responses and z-scores differed from zero; and (2) whether mean z-scores differed across response categories.
Changes in Level 1 examination performance
Level 1 examination data from 2021–2022 to 2023–2024 were analyzed to evaluate changes in performance following the reporting transition. Pass rates and z-scores were summarized by testing cycle. We hypothesized that the elimination of numeric score reporting would reduce motivation to pursue higher-level performance, thereby resulting in a decline in mean z-scores. This hypothesis was tested by evaluating whether mean z-scores differed across three testing cycles.
Results
Changes in stress level during Level 1 examination preparation
Among 9,222 first-time test takers in 2023–2024 cycle, 7,309 (79 %) responded to both new questions in the end-of-examination survey. Table 1 shows the survey findings on perceived stress reduction. Among the respondents, 60 % agreed that pass/fail reporting reduced their stress in Level 1 preparation, 25 % were neutral, and 15 % disagreed.
Level 1 performance summary by survey response on stress reduction.
| Response | n | Prop | Pass rate | Z-score | |||
|---|---|---|---|---|---|---|---|
| Mean | SD | Min | Max | ||||
| Disagree | 1,121 | 0.15 | 91.5 % | −0.19 | 0.86 | −2.89 | 2.67 |
| Neutral | 1,824 | 0.25 | 92.2 % | −0.17 | 0.84 | −3.02 | 2.79 |
| Agree | 4,364 | 0.60 | 94.4 % | −0.04 | 0.85 | −3.99 | 3.83 |
| Total | 7,309 | 1.00 | 93.4 % | −0.09 | 0.85 | −3.99 | 3.83 |
-
SD, standard deviation.
Candidates who agreed with stress reduction demonstrated the highest pass rate and mean z-score, followed by those who held neutral and then those who disagreed. The Spearman correlation between perceived stress changes and z-scores was statistically significant (p<0.0001) but negligible in magnitude (r=0.08). Pairwise comparisons showed that the agree group had a significantly higher mean z-score than the neutral and disagree groups (p<0.0001), with small effect sizes (Cohen’s d=0.16 and 0.18, respectively). No significant difference in mean z-scores was observed between the neutral and disagree groups (p=0.50).
As illustrated in Figure 1, each response category included students across a wide range of z-scores, spanning low to high competency levels. However, the agree group tended to have a slightly higher mean z-score compared to the neutral and disagree groups. This helps explain why the overall correlation between perceived stress reduction and z-score was minimal, yet meaningful differences in mean z-scores were observed between the agree group and the other two groups.

A box plot of z-scores by survey response on stress reduction.
Changes in Level 1 examination preparation
Table 2 presents a summary of the survey findings regarding changes in Level 1 preparation. A majority of respondents (75 %) reported no change following the transition to pass/fail score reporting, 16 % reported more preparation time, and 10 % reported less preparation time.
Level 1 performance summary by survey response on examination preparation.
| Response | n | Prop | Pass rate | Z-score | |||
|---|---|---|---|---|---|---|---|
| Mean | SD | Min | Max | ||||
| More time | 1,138 | 0.16 | 88.8 % | −0.38 | 0.79 | −3.99 | 2.50 |
| No change | 5,460 | 0.75 | 94.0 % | −0.07 | 0.85 | −2.95 | 3.83 |
| Less time | 711 | 0.10 | 96.6 % | 0.22 | 0.86 | −3.02 | 3.38 |
| Total | 7,309 | 1.00 | 93.4 % | −0.09 | 0.85 | −3.99 | 3.83 |
-
SD, standard deviation. The sum of proportions exceeds 1.00 as a result of rounding of individual proportions.
Candidates reporting less preparation time had the highest pass rate and mean z-score, followed by those reporting no change and then those reporting more preparation time. Perceived changes in examination preparation were significantly but weakly correlated with z-scores (r=0.18, p<0.0001). Pairwise comparisons revealed clear differences in mean z-scores between the groups. The largest difference was observed between the less-preparation and more-preparation groups (Cohen’s d=0.72), whereas differences between the less-preparation and no-change groups and between the no-change and more-preparation groups were of small to medium magnitude (d=0.34 and 0.37, respectively).
Figure 2 shows the z-score distributions across the three groups. While each group included students from a wide range of competency levels, noticeable shifts in mean z-scores were observed. This may explain why the individual-level correlation between preparation change and z-score was weak, while aggregated group-level comparisons revealed significant differences in mean z-scores.

A box plot of z-scores by survey response on examination preparation.
Changes in Level 1 examination performance
Table 3 summarizes Level 1 examination performance from 2021–2022 to 2023–2024, with 2022–2023 being the first cycle of pass/fail score reporting. The pass rate declined in 2022–2023 but rose slightly above its pretransition level in 2023–2024. Mean z-score also declined in 2022–2023 and then remained stable. The mean z-score decrease in 2022–2023 was statistically significant (p<0.0001) with a small-to-medium effect size (d=0.31), whereas no significant difference (p=0.17) was observed between 2022–2023 and 2023–2024. Additionally, both the SD and maximum z-score decreased in 2022–2023 and continued to decline in 2023–2024.
Level 1 performance summary by testing cycles.
| Exam cycle | Score reporting | n | Pass rate | Z-score | |||
|---|---|---|---|---|---|---|---|
| Mean | SD | Min | Max | ||||
| 2021–2022 | Numeric score and pass/fail | 8,089 | 92.2 % | 0.23 | 1.13 | −3.77 | 4.42 |
| 2022–2023 | Pass/fail | 8,798 | 90.6 % | −0.10 | 0.98 | −3.52 | 4.13 |
| 2023–2024 | Pass/fail | 9,222 | 93.0 % | −0.11 | 0.85 | −3.99 | 3.83 |
-
SD, standard deviation.
Discussion
There is limited research exploring the impact that transitioning from numeric scoring to a pass/fail score report has had on Level 1. This study addresses this gap by evaluating the effects of the transition on student stress, examination preparation, and performance outcomes. Our findings indicate that the transition alleviated stress for a substantial proportion of candidates, with 60 % of the survey respondents reporting reduced stress. However, the transition’s influence on Level 1 preparation was small, as 75 % reported no change in preparation time. This pattern suggests that while removing numeric scores reduced psychological pressure, students continue to invest similar effort, likely reflecting the high-stakes nature of the examination and the desire to ensure a passing outcome. Aggregated analyses further indicate that perceptions of stress reduction and changes in examination preparation may vary across competency levels, although the correlations between these perceptions and competency were negligible to weak at the individual level. The transition was also associated with declines in national performance indicators during the first pass/fail cycle, including the pass rate, mean score, score variability, and maximum score. In the subsequent cycle, the pass rate rebounded and the mean score stabilized, while score variability and maximum score continued to decline. This pattern is consistent with expectations: with only a “pass” required, the incentive for candidates to pursue higher scores have diminished, leading to a lower mean, fewer extreme high scores, and a more compressed overall score distribution.
Consistent with the previous research [4], [7], [8], [9], this study observed an improvement in students’ well-being following the transition to pass/fail score reporting. However, our findings regarding student stress level and examination preparation differ from those of Twardowski et al., [3] which reported no reduction in student stress and noted fewer studying weeks and practice examinations after the transition to pass/fail scoring for Level 1. This may be related to the differences in sample composition. This study analyzed data from 7,309 first-time Level 1 takers in the 2023–2024 cycle (primarily from the Class of 2025) from all COMs, with some students not taking the Step 1 examination. In contrast, Twardowski et al. [3] focused on Classes of 2023 and 2024 from a single COM, where all students were required to take both Level 1 and Step 1.
Furthermore, trends in first-time pass rates for Level 1 differ from those observed for Step 1 following the transition. For Level 1, the first-time pass rate declined slightly in the first testing cycle after the change but rebounded in the second cycle. In contrast, the Step 1 first-time pass rates showed a slight decline in the first cycle, followed by a continued slight decrease in the second cycle [10]. These differences can be attributed to the unique characteristics of the two examinations and the differences in their examinee populations.
This study has several limitations. First, participation in the end-of-examination survey was voluntary, and 1,913 (21 %) first-time test takers in the 2023–2024 cycle chose not to respond. As a result, the findings may be influenced by nonresponse bias. However, the response rate was consistent with historical patterns observed in prior Level 1 end-of-examination surveys. Compared to all respondents, nonrespondents demonstrated lower performance, with a pass rate of 91.7 % and a mean z-score of −0.20. Their z-scores also covered a wide range of competency levels, spanning from −3.40 to 3.32. Second, the study did not include a pretransition cohort of students who took the Level 1 examination under numeric score reporting. Instead, the posttransition cohort was asked to compare their stress and preparation time under pass/fail reporting with how they believed they would have experienced the examination had numeric score reporting been retained. As numeric score reporting was discontinued starting from the 2022–2023 cycle and the posttransition cohort consisted of the first-time test takers in 2023–2024 cycle, such comparisons were necessarily hypothetical, and responses therefore reflect perceived rather than actual experiences, introducing potential bias into the findings. Third, the study lacked objective measures of stress and preparation time. Stress was self-reported rather than assessed with standardized instruments, and preparation time was estimated rather than directly recorded. Practical considerations limited additional data collection: the end-of-exam survey already included 18 items on test administration, format, and content, and it was administered immediately after a demanding 8-h examination when students were likely fatigued. To minimize respondent burden, only two brief questions with simple response options (e.g., agree, strongly agree, less time, significantly less time) were included to capture students’ general perceptions of the transition.
Conclusions
The COMLEX-USA series is designed as a licensure examination to assess competencies essential for the practice of osteopathic medicine. The transition in score reporting for Level 1 preserves that purpose. This study provides preliminary insights into the differences in student stress levels, examination preparation, and examination results following the change of Level 1 score reporting. Further evaluation of these areas is ongoing to ensure that the scoring transition continues to provide valid, reliable information regarding student progress through osteopathic medical education and preparation for licensure to practice osteopathic medicine.
Acknowledgments
The authors would like to thank National Board of Osteopathic Medical Examiners staff members Yusong Chen for assistance with figure formatting, and Melinda Gray for proofreading.
-
Research ethics: The research project was reviewed by the Institutional Review Board of the NBOME and was determined to be exempt.
-
Informed consent: Informed consent was obtained from all individuals in this study for the use of their data.
-
Author contributions: The authors have accepted responsibility for the entire content of this manuscript and approved its submission.
-
Use of Large Language Models, AI and Machine Learning Tools: None declared.
-
Conflict of interest: COMLEX-USA is owned by the NBOME, and the authors are employees of the NBOME or consultants to the NBOME.
-
Research funding: None declared.
-
Data availability: The research data are not shared.
References
1. COMLEX-USA Level 1. https://www.nbome.org/assessments/comlex-usa/comlex-usa-level-1/ [Accessed 10 February 2025].Search in Google Scholar
2. National Board of Osteopathic Medical Examiners (NBOME). COMLEX-USA Level 1 to eliminate numeric scores. NBOME Website;2022. https://www.nbome.org/news/comlex-usa-level-1-to-eliminate-numeric-scores/. Accessed February 10, 2025.Search in Google Scholar
3. Twardowski, DA, Montemayor, J, Payton, M, Waller, J. Impact of the USMLE Step 1 and COMLEX Level 1 transition to pass/fail on osteopathic medical student stress levels and board preparation. J Osteopath Med 2023;123:563–9. https://doi.org/10.1515/jom-2023-0045.Search in Google Scholar PubMed
4. Baniadam, K, Elkadi, S, Towfighi, P, Aminpour, N, Sutariya, R, Chen, HC. The impact on medical student stress in relation to a change in USMLE Step 1 examination score reporting to pass/fail. Med Sci Educ 2023;33:401–7. https://doi.org/10.1007/s40670-023-01749-4.Search in Google Scholar PubMed PubMed Central
5. Girard, AO, Qiu, C, Lake, IV, Chen, J, Lopez, CD, Yang, R. US medical student perspectives on the impact of a pass/fail USMLE Step 1. J Surg Educ 2022;79:397–408. https://doi.org/10.1016/j.jsurg.2021.09.010.Search in Google Scholar PubMed
6. Yadav, S, Dekhne, A, Harikrishnan, S, Saini, B, Shukla, J, Tango, T, et al.. The pass/fail effect: a longitudinal study of United States Medical Licensing Examination (USMLE) Step 1 performance over a decade. Cureus 2023;15:e41702. https://doi.org/10.7759/cureus.41702.Search in Google Scholar PubMed PubMed Central
7. Bloodgood, RA, Short, JG, Jackson, JM, Martindale, JR. A change to pass/fail grading in the first two years at one medical school results in improved psychological well-being. Acad Med 2009;84:655–62. https://doi.org/10.1097/ACM.0b013e31819f6d78.Search in Google Scholar PubMed
8. Reed, DA, Shanafelt, TD, Satele, DW, Power, DV, Eacker, A, Harper, W, et al.. Relationship of pass/fail grading and curriculum structure with well-being among preclinical medical students: a multi-institutional study. Acad Med 2011;86:1367–73. https://doi.org/10.1097/ACM.0b013e3182305d81.Search in Google Scholar PubMed
9. Spring, L, Robillard, D, Gehlbach, L, Simas, TA. Impact of pass/fail grading on medical students’ well-being and academic outcomes. Med Educ 2011;45:867–77. https://doi.org/10.1111/j.1365-2923.2011.03989.x.Search in Google Scholar PubMed
10. National Board of Medical Examiners (NBME). USMLE performance data. NBME Website https://www.usmle.org/performance-data [Accessed 10 February 2025].Search in Google Scholar
© 2025 the author(s), published by De Gruyter, Berlin/Boston
This work is licensed under the Creative Commons Attribution 4.0 International License.