Score 40% on a full IB Math AA SL paper and the number tells you almost nothing useful. Did you pick the wrong approach on a mixed-strand question—one that looked like trigonometry until the second part demanded calculus? Did you run out of time? Fail to recognize what you were even being asked? The paper doesn’t separate those causes. That interpretive gap is the actual cost of entering full simulations before the underlying recognition skill is ready: you generate scores, but you can’t diagnose them.
Paper 1 sharpens the decision further. Non-calculator, 90 minutes, 80 marks—with algebraic fluency doing as much work as syllabus coverage. Enter full papers too early and you collect noisy marks that can’t direct your next revision session; too late and you never properly train synthesis and pacing across both sessions. Getting the switch right is a readiness question, not a week-count one, and that distinction shapes everything about how to prepare.
Three-Phase Preparation Arc
When several syllabus areas are still generating significant errors under mild time pressure, a full paper mostly confirms there are large gaps without identifying where any of them are. Topic consolidation—Phase 1—is the right mode here. With that many variables unsettled at once, a weak mock score cannot separate missing content from unreliable algebra, and focused, lightly timed topic drills return more marks per revision hour than full papers can at this stage.
The gap most preparation plans leave unfilled sits between isolated topic work and full simulation—and it’s where recognition failures quietly accumulate. Phase 2, the bridge, trains the specific skill AA SL papers punish most reliably: identifying what kind of mathematics a question is asking for before you start any working. Begin with label-suppressed mixed sets where you still choose the topic pool but strip the headings; then widen the mix until the set is genuinely unpredictable. The core habit is a deliberate 10–15 second pause at each new question to name the operative strands before committing to a method. Time pressure comes last—first loosely timed, then strict blocks—once recognition accuracy is mostly solid. Preparation platforms that sequence topic quizzes, intermediate difficulty sets, and full mock papers as distinct practice modes, as Revision Village’s AA SL practice page does, reflect this same logic.
Once mixed-topic recognition is consistent and gaps have narrowed, full papers shift function. That transition marks Phase 3: simulation. IB Math AA SL practice exams used at this stage are no longer gap-detection instruments—they’re pacing tools, training the composure and output quality needed to hold performance across two back-to-back 90-minute sessions. It’s a different cognitive job than anything in Phase 1 or 2, and it’s only accessible once the foundations are stable enough to stop generating interference on every other question.

Grade Boundary Criterion
Calendar proximity isn’t a readiness check. The common practitioner guidance—begin full timed papers roughly 4–6 weeks before the exam, as MathZem and others suggest—answers a scheduling question, not a preparation one. It doesn’t tell you whether your mixed-topic performance has cleared any meaningful threshold, and treating time-to-exam as a proxy for readiness is precisely what produces flat or declining mock scores in the final stretch.
A more reliable anchor is grade boundary data. Photon Academy’s analysis of AA SL boundaries across 2021–2025 reports Grade 5 sitting in the mid-40% range on average and Grade 7 in the mid-70s. These are historical averages—not a prediction for any upcoming session—and should be read as a reference range rather than a fixed target. But they give you a workable floor: a conservative Grade-4-ish threshold per paper that, combined with an error-pattern profile, turns readiness into something you can test rather than estimate. The two-paper structure matters here too—Paper 1 non-calculator, Paper 2 calculator, both 80 marks over 90 minutes—so the mini-benchmark below runs each style under matching conditions.
- Build two mini-benchmarks: one Paper 1-style and one Paper 2-style, about 45 minutes each, mixed-topic and unlabeled, drawn from sources you are happy to reuse (such as pre‑2021 question parts or third‑party sets) and long enough to feel like short sessions rather than quizzes.
- Sit each benchmark under matching conditions: strict timing, no notes; for the Paper 1-style set use no calculator, and for the Paper 2-style set allow a calculator.
- Mark immediately and record, for each set, the raw marks you earned out of the total available, then convert that to a percentage.
- For every place you lost marks, assign one primary tag based on the first thing that went wrong: Topic-gap (did not know the method or facts), Recognition (picked the wrong strand or approach first), Integration (knew methods but could not connect them or got stuck mid-solution), Execution (algebra or arithmetic slip), or Timing (ran out of time or rushed).
- Use a simple decision rule: begin full-paper simulation only if each percentage is at least in the Grade‑4‑ish floor reference range you have inferred from the historical boundaries and your dominant error tags are not Topic-gap. If Topic-gap is still the main source of lost marks, stay out of full papers, run another targeted drilling cycle on your top two or three weak areas, and then repeat this mini-benchmark.
Tiering Practice Materials by Simulation Value
Not all practice papers serve the same purpose. Authentic post-2021 IB Math AA SL papers match the current specification most closely and deserve to be treated as the highest-fidelity simulations—reserved mainly for Phase 3. Pre-2021 papers fall under a slightly different specification and form a much larger archive; their main value is as raw material for bridge-phase mixed sets rather than as benchmarks. Third-party predicted papers can offer useful exposure to unfamiliar question styles, but their calibration varies enough to make them unreliable as precise readiness indicators.
The core allocation decision follows directly: do fewer full papers at the right stage, not more at the wrong one. A small number of well-timed simulations yields sharper diagnostic returns than a long sequence run prematurely. Getting the tiering right is a necessary condition—but even correct tiering doesn’t protect against a separate and harder problem, the one where Phase 3 scores stop moving despite continued effort.
Plateau Circuit-Breaker
Stalled Phase 3 scores are rarely a stamina problem or an exposure problem. They’re almost always an unfinished bridge problem. Full simulations measure how well you use skills you already own; they aren’t particularly effective at building recognition from scratch. To identify the actual fault line, review the last two or three completed mocks and apply a single diagnostic question to each significant mark loss: was the first wrong move selecting the wrong method or strand, or did you choose a reasonable approach and then lose marks through integration, execution, or timing? That split matters. If wrong-method selection is the dominant pattern, more full papers won’t move the score—they’ll just reproduce the same errors at full length.
A deliberate two-week pause from full papers breaks the cycle. Use the completed mocks as maps: identify which topic intersections generated the most recognition failures, then build targeted mixed-topic, unlabeled drills specifically around those intersections. Run them with the same 10–15 second strand-identification step from Phase 2—loosely timed first, then with tighter constraints. Re-entry into full simulations is the right call when, on a fresh mixed set, most lost marks arrive after you’ve correctly identified the approach rather than because you didn’t know what type of problem you were facing.
Full Papers as a High-Value Resource
All three phases serve the same underlying logic: full papers are a finite resource, and their diagnostic return depends entirely on the preparation that precedes them. Use them before the foundations are solid and they generate noise. Use them at the right moment and they compress whatever’s left into something the exam can actually reveal.
The readiness check, the error-pattern tags, the circuit-breaker—these aren’t procedural extras. They’re the mechanism by which IB Math AA SL practice exams stay useful rather than becoming expensive confirmation of preparation that isn’t finished yet. Every paper you protect from premature use is one you can still sit at peak readiness, when what the score says is finally worth hearing.
