Research
When Should You Delay Step 1? A Data-Driven Decision Framework
The question comes up every dedicated period. You're two weeks from your scheduled date, your NBME scores are borderline, and you're trying to figure out whether to push through or push back. Friends and forums give contradictory advice. Your school might have an opinion. Your gut has another.
None of those are reliable decision inputs. The published data, however, gives you something concrete to work with. This post lays out a framework for the delay decision grounded in assessment scores, pass rate statistics, and the documented consequences of sitting for Step 1 before you are ready.
The Cost of Failure Is Not Symmetric
Before talking about when to delay, you need to understand what happens if you don't delay and fail. The consequences are not “you study more and try again.” They compound across your entire career.
McDougle et al. (2012) tracked medical students longitudinally and found that students who fail Step 1 are 2.2 times more likely to never achieve board certification (PMC8048154). That is not a typo. Failing Step 1 does not just delay your career. It statistically doubles your risk of never completing it. Their data also showed that students who failed Step 1 scored an average of 180.4 on Step 2 CK, compared to 219.4 for first-time passers. The gap follows you.
According to USMLE 2025 Performance Data (usmle.org), retaker pass rates sit at 71% for US MD students and 54% for IMGs. So roughly half of IMG retakers and 3 in 10 US MD retakers will fail again.
The asymmetry in plain terms:
Delaying Step 1 by 2 to 4 weeks costs you some scheduling inconvenience and possibly a rotation shift. Failing Step 1 costs you a 2.2x increased risk of never getting board certified, a retake pass rate under 71%, and a permanent mark on your USMLE transcript that every residency program will see. These two outcomes are not comparable.
Pass Rates Are Declining, Not Improving
If you are banking on the pass/fail transition making Step 1 “easier,” the data says the opposite. English (2024), published in the Avicenna Journal of Medicine, documented that pass rates dropped across every group after the 2022 scoring change (PMC11896725). US MD first-time rates fell from 95% to 91%. IMG rates dropped from 82% to 74%.
The hypothesis: without a three-digit score to maximize, students complete fewer practice questions during dedicated study. That behavioral shift removes the single strongest predictor of passing. The exam itself did not get harder. Student preparation got lighter.
This matters for the delay decision because the margin of safety is thinner than it was three years ago. Borderline scores that might have been “probably fine” under the old system are genuinely risky now.
The Assessment Score Framework
Here is the decision framework based on published prediction data and the NBME score interpretation research we have covered previously. These thresholds apply to scores from untaken, timed, full-length practice exams only. Anything else is unreliable.
Decision zones (based on most recent NBME or UWSA):
- ▸Below 55% correct (high fail risk): Delay. This is not a close call. Students in this range have a statistically high probability of failing, and the research is clear that simply adding more study weeks at the same approach does not fix it (Coumarbatch et al., PubMed 20135567). A structural change in study method is needed before rescheduling.
- ▸55 to 64% correct (borderline): Delay unless your score trend is clearly upward AND you have at least one more untaken NBME to confirm. A single borderline score is not enough data. Two consecutive scores above 60% with an upward trajectory give more confidence. One isolated 62% does not.
- ▸65 to 72% correct (likely passing): Proceed with caution. Most students in this range pass, but confirm with a second assessment. If your scores are stable or rising and your test-taking mechanics are sound, you are likely ready.
- ▸Above 72% correct (comfortable margin): Sit for the exam. Extended study beyond this point shows diminishing returns and risks burnout-related score drops.
These are not arbitrary cutoffs. They track with the published NBME prediction intervals and the pass rate data described above. The key principle: you need at least two data points from different practice exams, taken under timed conditions, before making the call.
Why “I Feel Ready” Is Not a Valid Signal
Rahmani (2020), in a review published in the Journal of Graduate Medical Education, found that low-performing trainees overestimate their own abilities by 30 to 40 percentile ranks (PMC7594774). This is not about overconfidence as a personality trait. It is a cognitive bias: the same knowledge gaps that cause poor performance also prevent accurate self-evaluation.
For the delay decision, this means subjective readiness is inversely correlated with accuracy. The students most likely to feel ready when they are not are exactly the students most at risk of failing. Relying on gut feeling to decide whether to sit for Step 1 is like asking the person who got lost for directions.
This is why the framework above relies exclusively on objective assessment data. Not how many hours you studied. Not how confident you feel after a UWorld block. Not what your study partner thinks. Practice exam scores, taken under real conditions, scored by the testing platform. That is the signal. Everything else is noise.
More Time Alone Does Not Help
Research published in Family Medicine (Coumarbatch et al., PubMed 20135567) found that simply extending the dedicated study period is not associated with improved outcomes. Students who studied for 12 weeks did not perform meaningfully better than those who studied for 8 weeks when the study methods stayed the same.
This is the critical nuance in the delay decision. Delaying your exam date only helps if you change what you are doing with the extra time. If you are scoring 52% on NBMEs and your plan is “keep doing what I'm doing for three more weeks,” the research predicts that will not work.
A productive delay includes specific structural changes:
- ▸Shifting from passive review (re-reading, re-watching) to active retrieval practice and question-based learning
- ▸Implementing a Wrong Answer Journal that categorizes errors by type (content gap, misread question, reasoning error, changed correct answer)
- ▸Adding external accountability through a tutor, study group, or structured program
- ▸Reserving at least one untaken NBME form for a readiness check before the new date
Delay without a plan change is just procrastination with extra anxiety. Delay with a structural overhaul is a strategic decision backed by evidence.
What About External Pressure to Stay on Schedule?
Schools, clerkship directors, and financial aid timelines all create pressure to take Step 1 on the original date. That pressure is real. But it needs to be weighed against the data.
A two-week delay affects your rotation schedule. A Step 1 failure affects your entire residency application. It appears on your USMLE transcript permanently. It statistically predicts lower Step 2 CK scores (McDougle 2012, PMC8048154). And for IMGs, where the retake pass rate is 54%, a failure can mean the difference between matching and not matching.
If your school's policy penalizes delays, have a conversation with your dean of students. Bring your practice exam scores. Frame it in terms of the published failure consequences. Most schools, when presented with a student who has borderline scores and a concrete plan for improvement, will accommodate a short delay. The alternative costs them a failed student, which is worse for everyone.
The Decision Checklist
Run through these questions before making your call. Answer honestly, using data, not feelings.
- ▸Have you taken at least two full-length, timed, untaken practice exams in the last 10 days?
- ▸Are both scores above 60% correct?
- ▸Is the trend flat or upward (not declining)?
- ▸Can you identify your top 3 weakest subjects by data, not by feel?
- ▸Have you completed at least 70% of your primary QBank?
If you answered “no” to two or more of these, the data favors delaying. If you answered “no” to the first question, you do not have enough information to decide at all. Take the practice exams first.
The Bottom Line
Delaying Step 1 is not a sign of weakness or poor preparation. When the practice exam data says you are not ready, delaying is the most rational decision you can make. The research is consistent on this: failure carries lasting, measurable consequences that a short delay does not.
Make the decision with numbers, not nerves. If your scores say go, go. If they say wait, wait and change something while you do.
Not sure whether your scores say go or wait? Bring your NBMEs to a free diagnostic session and get an objective read.
Book a Free Diagnostic Session