Where should AI be used in learning, and where not?

Keep AI out of the first hard attempt (Probe) and the final unaided check (Test); allow a guarded, answer-withholding AI for hints, examples, and practice in between.

What is the effortless trap?

When AI makes a learning task feel smooth and effortless it often signals the tool is doing the very work that builds the skill, producing an illusion of learning that collapses on the unaided task.

← Writing

The Effortless Trap: Productive Struggle, AI, and the Illusion of Learning

By Mario Brcic and Stjepan Frljic, PhD·June 21, 2026

Stjepan Frljic, PhD, guest co-author. Faculty of Electrical Engineering and Computing, University of Zagreb.

In short. With AI advancing fast, educators face a dilemma: ban it or allow it. Evidence that it both helps and hurts learning only deepens the confusion. Placement and pacing make all the difference: used well, AI becomes a personalized tutor that never runs out of patience, putting real power in educators' hands and stronger tools in students' minds. Used wrong, it creates only an illusion of learning. The article gives a simple map for that choice: six moves of learning, the two places AI must stay out, and the guarded uses where it can multiply feedback, examples, and practice.

Read the preprint

Every teacher now faces the educator’s dilemma about AI: allow the tool or ban it. That is a false choice. Ban it and you give up the biggest boost to learning in a generation. Allow it freely and students hand the hard part to a machine and call it studying. There is a better, middle way. With the right pacing, choosing where you hold AI back and where you let it in, you get both: the struggle that builds the skill and the boost that scales it. So where does AI belong in your lessons? We must differentiate where it helps a student build the skill, and where it quietly does the work for them instead.

Start with the ceiling, because it tells you what is possible. For most of history, the best education on earth was one devoted teacher beside one learner. Almost no one ever got it. It went to princes and prodigies, and everyone else made do.

Aristotle tutored Alexander of Macedon in person, from about the age of thirteen. A decade later, Alexander had conquered the known world. The greatest mind of the age, working as one boy’s private tutor: that was a thing exactly one boy could have.

Helen Keller lost her sight and hearing before she was two, and with them every door into language. Then Anne Sullivan sat with her, one to one, hand to hand, year after year, until a single word finally meant something. Keller went on to a university degree, wrote a dozen books, and built a life of public work. A child with no way in at all, given one devoted teacher, reached the top. That is the whole promise of teaching, written in a single life.

And the rarest teaching does not lift one student. It lifts a generation. One small school in Budapest, the Fasori Lutheran Gymnasium, sent into the twentieth century John von Neumann, who designed the logic that still sits inside almost every computer, and two future Nobel laureates, the physicist Eugene Wigner and the economist John Harsanyi. One school, one city, one generation. It came from a culture of teaching, hard problems set before the method and expectations set high, carried by teachers like László Rátz, who drilled von Neumann and Wigner in mathematics. The same thing happened again in our own time. Ryszard Szubartowski, one informatics teacher in Gdynia, coached his students to sixty medals at the international olympiads, thirty-three of them gold, and several of them went on to build OpenAI, its chief scientist among them. Netflix has announced a film about him.¹ Places that teach like this have always been vanishingly rare.

Figure 1. Who got the close attention. For most of history it reached a single student, at most a single school. The promise of AI is to put that patient, one-to-one attention within reach of a whole room, with the teacher still the one teaching.

And what these teachers gave is not a mystery, and not a gift you have to be born with. We can measure the effect, and we can name the method. Give an ordinary student a good tutor, one to one, and they climb by roughly 0.8 standard deviations, from the middle of the class up toward the top of it.²³ Large, though short of the famous “two sigma” claim. The method is teachable. What never scaled was the cost: one skilled adult, hour after hour, for a single learner at a time. That is the supply and methodology problem, and no amount of dedication has ever made it cheap.

That cost is the real ceiling. The teachers in those stories gave more hours than any job could fairly ask, carried by sheer enthusiasm, and that is exactly why what they did is so hard to repeat. You cannot ask every teacher to work to exhaustion, or to spread the same attention across thirty students that those tutors gave to one. This is the line AI moves, and it moves in the teacher’s hands, not behind their back. The parts of great tutoring that ate those hours are the parts that finally scale: the patient drilling, the instant feedback, the worked example on demand, for every student at once and at almost no cost. Hand those to the tool and a teacher’s scarcest hours come back, to spend where only a person can: judgement, mentoring, and the design of the lesson.⁴ Used this way, AI does not run the classroom. It lets an ordinary teacher give a whole room something closer to the attention only a few students ever got.

The same tool cuts the other way too. In one careful trial with high-school mathematics students, an AI helper that handed over answers during practice raised their scores while they had it, then left them about 17 percent worse on the exam they sat alone, worse than students who never used it at all. Rebuild that same AI to withhold answers and ask questions instead, and the harm vanished.⁵ Same machine, opposite result. The only thing that changed was where it sat in the learning.

The reason is simple, once you see it. Learning happens when the student does the hard part themselves: the effortful wrestling that researchers call productive struggle. An AI that does the hard part for them feels like help and is actually the opposite. Worse, it feels like learning while it happens. The work goes smoothly, the answers come out right, and the student walks away sure they have understood. The exam they sit alone says otherwise. The feeling of learning is a poor gauge. In one study, students in harder classes felt they learned less, even as they learned more.²⁸ Smooth, easy practice can feel most productive exactly where it teaches the least. That is the effortless trap, and what it leaves behind is an illusion of learning: a confident sense of mastery that collapses the moment the support is pulled away. An AI built the other way, one that keeps students doing the hard part themselves but gives them more attempts at it and quicker, sharper feedback on each, is the tutor we have been missing.

A controlled study makes the split concrete. Software engineers given an AI assistant breezed through a coding task, then scored far below the unaided group on a quiz about the very code they had just produced, worst of all on finding its errors.³⁰ Strong performance, little learning.

If the question is where, we have to look inside the learning itself. Picking up a new idea runs through six moves: Prime, Probe, Point, Attach, Strengthen, and Test. AI earns its place in some of them and wrecks others. What follows walks each move and shows where the tool helps and where it has to step back. One test carries the whole thing: if letting AI in makes the task feel effortless, it is probably in the wrong place.

1. The six moves

Picture what a student knows as a map: dots for ideas, lines for the links between them. Learning means growing a new dot and tying it firmly into the map. That tying-in is what makes knowledge deep: an idea woven into what you already know, one you can reach for and use. Skip it and you get the other kind, a fact sitting alone, linked to nothing, already fading out of memory. The six moves are how the first kind happens, in order. The figure below marks, for each move, what the teacher does, what AI may or may not do, and how the student’s knowledge map changes. Step through them.

Prime

The teacher sparks interest and asks the class to recall a few things they already know. The right corner of the map warms up.

Teacher: Hook their attention; ask a warm-up question.
AI: Can write the hook, an analogy, or a quick quiz.
Why it works: Wanting to learn, and waking up old knowledge, makes the next step land.

Probe

A hard problem is set with no help. The student hunts through the map for ideas that might fit, some right, some wrong. The hunt is the struggle.

Teacher: Pose a real problem before teaching the method.
AI: Best kept out here. Giving the answer quietly cancels the learning.
Why it works: Wrestling first, even unsuccessfully, makes the eventual explanation stick far better.

Point

Through questions, not answers, the teacher drops the wrong guesses and sharpens the right ones. Those concepts grow links that reach out, ready but empty.

Teacher: Guide with questions; never hand over the answer.
AI: A guarded helper: hints and nudges only, no solutions.
Why it works: The student stays the thinker; their map gets primed for what's coming.

Attach

Now the worked example arrives, a small new piece that snaps onto the waiting links. One older idea shifts a little to make room for it.

Teacher: Show one clean example, right when they're ready.
AI: A strong, safe use: generate and explain examples.
Why it works: After the struggle, the explanation has somewhere to land, and the old map quietly reshapes to fit it.

Strengthen

A second, slightly different problem puts the new piece to work. Its links thicken and weave into the nearby map. Fragile becomes firm.

Teacher: Set a fresh problem so the new idea gets used.
AI: Can generate endless practice and instant feedback.
Why it works: Using knowledge, not just seeing it, is what makes it last and connect.

Test

Finally the student faces the new idea alone, with no help. It holds up on its own. That standing-alone is the proof that learning really happened.

Teacher: Check the new ability unaided, at a point that matters.
AI: Removed. This moment must show the student, not the tool.
Why it works: Real learning is what survives when the scaffolding is taken away.

a known concept, restinga concept lit up by the searcha link reaching out, nothing on the end yetthe new idea, just attachedan old idea shifting to make room

Figure 2. The six moves of learning, as one growing knowledge map. Scope: one idea, on first encounter, in a single pass; review and spacing sit a layer above.Download the one-page guide.

Step through it interactively, one move at a time

Prime

Intervention (teacher)

AI on this move

Evidence

dormant conceptactivated / searchedprimed edge (ready, empty)new knowledge attachedshifts to accommodate

This is exactly what László Rátz was doing when he set a hard problem before he taught the method. The struggle at Probe is not the teacher being slow to help. It is the learning, starting.

2. The one rule

The rule is simple. AI belongs wherever it adds feedback, practice, or real-world realism without hiding the evidence that the student can think and perform on their own. That gives a clean placement. Keep the first hard attempt (Probe) and the final check (Test) AI-free. Allow a guarded AI for hints, examples, and drill in the moves between (Point, Attach, Strengthen). Open it up fully for authentic work once the student has shown they can do it unaided; support that helps a novice becomes redundant, even harmful, as competence grows, so the scaffold must fade.¹³ Tight at the ends, loose in the middle: attempt, then support, then prove, with Prime as the warm-up before the loop begins.

If letting AI in makes the task feel effortless, it is in the wrong place.

That one line does more work than any policy document, but read it precisely. The point is not effort for its own sake. Busywork is effortful too, and it teaches nothing. What matters is effort on the skill you are trying to build; effort spent anywhere else, however hard, is wasted. AI belongs wherever it clears away the effort that is not the skill, the looking-up, the formatting, the dead ends, and it has to stay out wherever it would do the part that is. So the test is always the same: does the hard part the student is doing build the skill, or not? That, task by task, tells you where the tool goes, without needing a separate rule for every situation.

The contrast that proves it

Look closer at the high-school mathematics trial, because the detail is the whole lesson.⁵ Both groups used versions of the same underlying AI; the gap between the versions is what mattered. The unguarded one, the version that gave answers, is what sank the solo exam, and it did so invisibly, because the practice it produced felt like progress the whole time. The guarded one, hints and questions only, erased the harm. This distinction matters: the practice-time gains and the unaided-exam loss came from different versions of the same underlying tool. So the lesson is not that AI hurts learning. It is narrower, and more useful: answer-giving where the student should struggle hurts learning. A guarded design in the same spot does not.

That guarded version is the AI doing what good teachers have always done at Point: question, do not tell. It is the Socratic move Aristotle used with Alexander, now available to every student at once, instead of to one prince.

3. The two menus

The picture says what has to happen at each move. The menus say how. Think of them as a shelf to pick from, not a sequence to march through. Menu A is classical teaching, everything good teachers have always done, no technology required. Menu B is the AI tools that can stretch or speed up the same moves. Read them side by side. For most moves, an AI tool does not replace a classical one. It scales it: more practice and faster feedback for every student at once. Or it buys back your scarce human hours for the things only you can do: judgement, encouragement, and guarding the struggle.

Effect strength is a rough guide from research, not a promise: strong = repeatedly shown to work well in teaching; strong learning move = the teaching move is strong, but the AI version still needs checking; solid = good support; context = depends heavily on how it is used; required = a rule for a fair check, not an effect size. The research uses different kinds of evidence, so the tables keep the rating simple and put cautions where the AI evidence is still young.

Menu A · classical interventions (no AI needed)

**Table 1.** Menu A: what good teachers have always done
Technique & what it is	Effect	Why it works on the student's map	How it fits with other tools
1. Prime, spark interest, wake up prior knowledge
Hook / curiosity gapOpen with a puzzle, story, surprising fact, or real-world stake that makes the question feel worth answering.	Context	Attention and motivation decide how much effort the student spends, the fuel for every later step.	Sets up Probe. Pairs with belonging; pick the hook that actually resonates with your class.
Activate prior knowledgeA quick recall question or discussion of what they already know about the topic.	Strong	Lights up the region of the map the new idea must attach to; you can't anchor to dots that are asleep.	Doubles as retrieval practice. The cheapest high-value move in the lesson.
Belonging & high expectationsSignal that this is hard, that it matters, and that you believe students can do it.	Solid	Reduces the fear that makes students disengage; a student who feels they belong will risk the struggle.	Underlies every step. Especially protects the Probe, where struggle can feel like failure.
2. Probe, protected struggle before any telling
Productive failureSet a hard problem before teaching the method; let students attempt and get stuck.	Strong	The search for a solution primes the map so the later explanation lands deeply, not superficially.	The core of this phase. Give a real attempt before any answer or explanation. If the gap is too wide, redesign the task rather than rescue too soon.
Open-ended challengeA non-routine problem with no obvious single path, slightly beyond current ability.	Solid	Working just beyond current comfort is where growth happens, as long as the task is still reachable.	Calibrate difficulty: too easy bores, too hard defeats. Peer work can share the load.
Think-pair-shareStudents think alone, then compare with a partner, then share with the class.	Solid	Forces every student to generate an attempt before hearing anyone else's, preserves the struggle for all.	A cheap way to make struggle universal, not just for the keen few. Leads into Point.
3. Point, guide with questions, withhold the answer
Questioning, not tellingGuide with probing questions instead of explanations; surface edge cases; ask for reasons.	Strong	Keeps the student as the thinker; their own reasoning sharpens which ideas to keep, and primes them.	The heart of this phase. Never an answer-giver.
Hints & scaffolding (then fade)Give the smallest hint that unblocks, then progressively remove support as the student copes.	Strong	Just enough support to keep struggle productive rather than hopeless; removing it builds independence.	Fading is essential: support that helps a novice harms an expert.
Formative feedbackGive timely, specific information about what is wrong, what is improving, and what to try next.	Strong	Feedback keeps the search from becoming blind and helps the student repair the exact weak link.	Best when it points, not solves. In Strengthen it becomes the correction loop for practice.
Peer instructionPose a concept question; students vote, discuss in pairs, then vote again.	Strong	Explaining to a peer forces students to articulate reasoning, which exposes and repairs gaps.	Combines Point + early Strengthen; works well in large classes.
4. Attach, one clean example, exactly when ready
Worked exampleA single, clearly stepped model of how an expert solves the problem.	Strong	Once the struggle has opened the question, a clear example snaps in as the missing piece, with little wasted effort.	Works best after a real attempt, not instead of one. For very step-heavy skills it can come a little earlier.
Explicit instruction / consolidationAfter students have tried, state the clean rule, compare solution paths, and connect the messy attempts to the formal idea.	Strong	The struggle only pays off when it is consolidated; this is where the teacher turns scattered attempts into usable knowledge.	The teaching half of productive failure. Keeps Probe from becoming unguided discovery.
Think aloud as an expertMake expert thinking visible; narrate your reasoning, including false starts and choices.	Solid	Shows not just the answer but the moves of thought, which is the part students cannot see otherwise.	Model, coach, then fade support. The human version of a worked example, richer but slower.
Name what shiftsExplicitly point out which idea the student already held now has to change or stretch.	Context	Learning often means re-shaping an old idea, not just adding a new one, saying so prevents confusion.	The old-idea adjustment step. Cheap, and prevents the new idea sitting unconnected and inert.
5. Strengthen, use it, vary it, weave it in
Deliberate practiceFocused, repeated practice on the specific new skill, with immediate correction.	Strong	Repeated use thickens the new links until the idea comes automatically, freeing mental room for harder work.	The teacher's job is to diagnose the right weakness and keep the practice focused.
Mixed practiceMix the new problem type with others rather than repeating the same kind again and again.	Solid	Forces the student to choose which idea applies, building the judgement, not just the procedure.	Feels harder, teaches more. Pairs with recall and spacing in the layer above this guide.
Spaced / distributed practiceReturn to the idea after time has passed, with recall required each time.	Strong	Spacing makes retrieval harder in the moment, but more durable later; the link has to be rebuilt, not merely reread.	Lives above a single lesson, but Strengthen is its natural home.
Self-explanationAsk students to explain, in their own words, why a step works.	Solid	Generating the explanation forces links between the new piece and the rest of the map, deeper integration.	Cheap and powerful.
Peer tutoringHave students teach or coach a peer through the new idea, explaining it and fielding questions.	Strong	Teaching forces the tutor to organise and justify the idea, which deepens their own map and exposes the gaps in it.	Consolidation by use, like self-explanation but social. Szubartowski's signature lever.
6. Test, unaided, fair, real
Recall / testingHave students recall and apply the idea from memory, with no notes or help.	Strong	Pulling knowledge out, not just re-reading it, is itself one of the strongest ways to cement it.	Doubles as learning and evidence; the final check itself stays unaided.
Oral / defence / explain-backStudent explains or defends their solution live, answering follow-ups.	Solid	Real-time questioning is hard to fake, it reveals whether the idea is truly the student's own.	A hard-to-fake check. Use at meaningful points, not everywhere, it's time-costly.
Process evidenceLook at drafts, dead ends, version history, the trail, not just the polished result.	Context	Shows the thinking happened, not just that a clean final product exists.	Complements, doesn't replace, the unaided check.

Sources behind these: productive failure (Kapur 2008; Sinha & Kapur 2021)^{6, 7}; worked examples and cognitive load (Sweller & Cooper 1985; Kirschner, Sweller & Clark 2006)^{10, 11}; retrieval and spacing (Roediger & Karpicke 2006; Dunlosky et al. 2013)^{12, 9}; active learning and peer instruction (Freeman et al. 2014; Crouch & Mazur 2001)^{14, 15}; belonging (Walton & Cohen 2011)⁸; the tutoring benchmark (VanLehn 2011)²; self-explanation and learning by teaching (Chi 2009; Cohen, Kulik & Kulik 1982; Roscoe & Chi 2007)^{19, 18, 17}; cognitive apprenticeship (Collins, Brown & Newman 1989)¹⁶.

Menu B · AI interventions, mapped to the same moves

**Table 2.** Menu B: AI tools, mapped to the same six moves
AI technique & what it is	Effect	Why it works, and the catch	What it replaces / supports
1. Prime, generate the spark (low risk)
AI-generated hook & analogyAsk AI for a vivid opening, a real-world example, or an analogy tuned to the class's interests.	Context	Lifts motivation cheaply. Catch: pick what actually resonates, AI doesn't know your room.	Complements the human hook; frees teacher time. Safe, no struggle is touched yet.
AI-generated recall quizAuto-create a quick warm-up quiz on prerequisite ideas.	Strong learning moveAI version must be checked	Wakes up prior knowledge for every student, quickly and at different levels. Same learning move as classical recall, but the items need checking.	Scales a high-value warm-up. Substitutes the teacher's prep time, not the recall itself.
2. Probe, mostly keep AI out
Access-timing gateA rule or tool that unlocks AI only after the student has made an independent attempt.	Solid	Protects the generative struggle, then allows help. Mirrors productive failure exactly.	Turns AI from a struggle-killer into a struggle-respecter. Complements every later AI use.
3. Point, the sweet spot: a guarded question-asking tutor
Answer-withholding AI tutorAn AI prompted to give hints, ask questions, and explain errors, but never the finished solution.	Strong	Hints-not-answers helped practice without the later damage caused by answer-giving.	Scales good questioning to every student at once. The single most valuable AI placement.
Error-spotting & misconception feedbackAI reads the student's own attempt and points to where the reasoning went wrong, without fixing it.	Solid	Targets the individual's actual gap, feedback density no single teacher can match. Catch: must not solve it.	Complements the teacher's feedback; covers the many students one teacher can't reach in the moment.
Related prior problemAI points back to an earlier problem with a similar deep structure and asks what is similar and different.	Context	Related cases can help students transfer an old idea to a new setting. Catch: it must not show the current solution.	A guarded hint, not a struggle-killer. Complements teacher questioning; the target problem stays the student's own.
AI helping the human tutorAI suggests good questions to a (human) tutor or teacher in real time, rather than talking to the student.	Solid	Raises the floor of human feedback quality, especially for less-experienced tutors (Tutor CoPilot).	Keeps the human in charge; AI coaches the coach.
4. Attach, a safe, strong generator
On-demand worked examplesAI generates a clean, stepped example exactly when the student is ready for one.	Strong learning moveAI version must be checked	Tailored examples and alternative explanations can help when the example is correct and well-timed.	Scales worked examples. Substitutes prep; the teacher still times the reveal.
Alternative explanations / re-framingsAsk AI to explain the same idea a different way, or at a different level.	Solid	Finds the framing that clicks for a given student, personalised in a way one lecture cannot be.	Complements the teacher's single explanation with many. Useful when an old idea has to bend.
5. Strengthen, scale practice and feedback
Targeted practice setsAI generates practice items tuned to the student's weak spots, with instant marking.	Strong learning moveAI version must be checked	More practice, at the right level, can help students use the idea repeatedly. Catch: keep the student doing the work, and check item quality.	Scales deliberate practice. Frees the teacher to diagnose, not generate.
Instant formative feedbackImmediate, specific feedback on each attempt, around the clock.	Solid	Tightens the practice loop, faster correction means faster strengthening of the right links.	Complements human feedback by covering volume and timing; teacher handles the nuanced cases.
Spaced-review schedulingAI builds and delivers a personalised review schedule for the new material.	Solid	Automates useful review over time. Catch: review must require recall, not re-reading.	Substitutes manual scheduling; bridges this lesson to long-term retention.
Teach-the-AIThe student explains the idea to, or corrects the mistakes of, an AI playing a confused learner.	Context	Can recreate the value of teaching someone else when no peer is free. Catch: evidence is still early, and the student must stay the teacher.	Scales peer tutoring to every student. Complements human peer work; keep the student in the explaining role.
6. Test, AI steps back; keep the check fair
AI-generated assessment itemsUse AI to draft varied test or recall questions, then have the teacher check them.	Context	Saves prep and widens question variety. Catch: the test delivery itself must be AI-free to be valid.	Complements design; the actual unaided check stays human-run.
AI-assisted process reviewAI helps the teacher scan drafts, version history, or logs for signs of genuine engagement.	Context	Makes process evidence practical to review at scale. Catch: a support, never sole proof of misconduct.	Complements oral or unaided checks. Detectors alone are unreliable; judgement comes from design.
Removed at the unaided checkThe deliberate absence of AI at the moment that certifies unaided ability.	Required	The whole model rests on one AI-free point that shows the student can think without the tool.	Non-negotiable. Everything else in Menu B exists to get the student ready for this.

Sources behind these: the guardrail result (Bastani et al. 2025; the answer-giving arm left students about 17 percent worse on the unaided exam, the hints-only arm erased the harm)⁵; engineered tutoring gains (Kestin et al. 2025; roughly double the learning of an active-learning class, in less time)⁴; AI coaching human tutors (Tutor CoPilot, Wang et al. 2024, preprint; about +4 to +9 points at roughly $20 per tutor per year)²²; access-timing (Rotter et al. 2026, preprint; unlocking AI only after an independent attempt beat both free access and no access)²³; assessment security and evaluative judgement (Dawson 2021; Bearman et al. 2024)^{20, 21}. Cautions: metacognitive laziness (Fan & Gasevic 2025)²⁴; reduced critical effort (Lee et al. 2025; Kosmyna et al. 2025, preprint, read with its published critique)^{25, 26, 27}. The honest summary: structured, answer-withholding AI use tends to help; unstructured answer-giving can harm.

Szubartowski’s signature move was peer tutoring, students coaching each other through a hard idea. It sits in Menu A under Strengthen: teaching forces the tutor to organise the idea and justify it, which deepens their own map. Menu B’s “teach-the-AI” row is the cheap, universal version of the same thing, the student explaining the idea to an AI that plays a confused learner. The evidence there is still early, so keep the student in the teaching seat.

One more thread is worth pulling. The masters in the opening had something most teachers never could: a steady supply of hard, well-chosen problems, and the time to feed them out one at a time. Rátz kept a problem journal. The olympiad pipelines had a curated bank. That problem-rich environment used to need a rare curator. AI’s clearest, best-evidenced win is making it cheap and universal: endless calibrated drills and on-demand examples, for every student. That is the access promise, made concrete.

A worked example

Survivorship bias, run through the six moves

Enough abstraction. Here is one real idea, survivorship bias, built inside a single student’s head, then the same lesson from the teacher’s side. The first card is the student view: how the idea lands in the map, move by move, with the teacher’s and AI’s part at each step. The second is the lesson view: how you would actually build it, and the one decision that carries the whole frame, where you let AI in.

Student view, the learner's mapStep 1 / 6

dormant / prunedactive nowknown, in playan old idea shifting to make rooma link reaching out, nothing on the end yetnew knowledge

Prime

Teacher

AI here

Figure 3. The student view: how one idea, survivorship bias, lands in a single learner’s knowledge map, move by move. Step through it.

Table 3. The lesson view: the same six moves as a lesson plan, with what the teacher does and where AI goes at each.

Lesson view, how you build it

The same lesson, from the educator's side

No student map here, that is the other view. This is the design surface: at each move, what you do, and the one decision that carries the whole frame, where AI goes.

1. Primewake priors

Set the puzzle, surface what they already know.

Pose 1943: bombers come home riddled with holes. Where do you add armor?

AI in

No struggle has started. Let it draft the hook, the image, the framing.

2. Probeforce the attempt

Make them commit before any help. The natural wrong answer is the point.

"Where would you put the armor?" Almost everyone says: on the holes. Hold back. Do not rescue.

AI out

This search is the part that builds the skill. Ask the bot now and it is skipped.

3. Pointre-aim, don't answer

One question that turns the search, without handing over the conclusion.

Ask: what about the planes that did not come back?

AI gated

May ask the same question or surface a related solved case. Never the answer. A nudge, not a rescue.

4. Attachreveal, name, flip

Give the data, name the idea, replace the wrong rule.

The missing planes were hit in the engines. Name it: survivorship bias. Flip the rule, armor the gaps, not the holes.

AI in

The struggle is done. Fine to scaffold: a clean statement, a tidy diagram.

5. Strengthentransfer, thicken

Three fresh cases, each hiding the same trap. Name where the missing data sits.

The studied startups, the ones that failed doing the same things are gone, so we copy habits that never caused the success.
The five-star reviews, unhappy buyers returned the thing and never reviewed, so only the satisfied get counted.
The smoker who reached 90, the smokers who died young are not around to be counted against him.

AI in

Great at volume and calibration. Let it scale varied practice and instant feedback.

6. Testprove it is theirs

A new case, unaided. Standing alone is the proof.

A fund advertises the ten best performers in its family over twenty years. The funds that did badly were quietly closed and dropped from the record, so the track record shows only the survivors and overstates what a real investor would have earned. The student should spot that, alone.

AI out

If the skill is the student's, it shows here. Help now proves nothing.

Read the right column top to bottom: AI is out only at Probe and Test, the one struggle and the one proof that have to be the student's own. Everywhere else it earns its place.

4. Conclusion: limits, and the promise

The scope here is deliberately narrow. This is a frame, the ground floor that course redesign builds on, not the redesign itself. It covers how a student learns one idea, on first encounter, in a single pass. Long-term memory needs review and spacing on top of it, a second layer that reinforces the six moves without replacing them. What is worth teaching in the first place, and how to build a whole curriculum around it, are larger questions that sit above this one. The sequence assumes what it cannot itself supply: a motivated, engaged student. Securing that, and designing the whole experience around it, is the educator’s craft, the part AI amplifies but doesn’t replace, the teacher pacing a lesson the way a game designer paces a player, a chef sequences a tasting menu, or a guide leads you through a country worth seeing.

The AI evidence is young. The studies run from roughly 2023 to 2026, they are mixed, and several of the most relevant are still preprints, flagged as such in the menus and references. The weight in this frame is carried by settled cognitive and social science, productive failure, worked examples, retrieval, belonging, the tutoring benchmark, not by any single AI result.

Step back, though, and the shape of the chance is clear. For most of history, the teaching that made a von Neumann, or gave Helen Keller her first word, was the luckiest accident in a child’s life: the right teacher, met at the right moment, by the very few. What is new is not a machine that teaches. It is that an ordinary teacher can now hand off the parts that used to eat their week and keep the parts only they can do, and in doing so offer a whole room something close to what once reached a single prince. The tool will not do it on its own. Placed well, by someone who knows exactly where the struggle has to stay, it could let far more learners feel, at least once, what it is to be truly taught. That is the work worth getting right.

And the stakes run past the classroom. The competence a student builds the hard way, durable, transferable, theirs, is also the quiet ground of what one of us has called cognitive sovereignty: the capacity for autonomous thought in an age of AI systems that are glad to do the thinking for you.²⁹ Placed wrong, the tool trades that ground away one effortless task at a time, and across a generation the loss stops being personal. Placed right, it does the opposite: it hands more people than ever the means to think for themselves. The same tool, and the same choice, all the way up.

The full argument, with the evidence treatment and the academic positioning, is in the preprint.

Read the preprint

References

Kawecki, M. (2025). Mistrz (documentary on Ryszard Szubartowski, III LO Gdynia). YouTube, 1 July 2025. A separate dramatized feature film was announced by Netflix in 2026. Illustration, not evidence.
VanLehn, K. (2011). The relative effectiveness of human tutoring, intelligent tutoring systems, and other tutoring systems. Educational Psychologist, 46(4), 197–221.
Bloom, B. S. (1984). The 2 sigma problem: The search for methods of group instruction as effective as one-to-one tutoring. Educational Researcher, 13(6), 4–16.
Kestin, G., et al. (2025). AI tutoring outperforms in-class active learning: an RCT introducing a novel research-based design in an authentic educational setting. Scientific Reports.
Bastani, H., et al. (2025). Generative AI without guardrails can harm learning: Evidence from high school mathematics. Proceedings of the National Academy of Sciences (PNAS). doi:10.1073/pnas.2422633122.
Kapur, M. (2008). Productive failure. Cognition and Instruction, 26(3), 379–424.
Sinha, T., & Kapur, M. (2021). When problem solving followed by instruction works: Evidence for productive failure. Review of Educational Research, 91(5), 761–798.
Walton, G. M., & Cohen, G. L. (2011). A brief social-belonging intervention improves academic and health outcomes of minority students. Science, 331(6023), 1447–1451.
Dunlosky, J., et al. (2013). Improving students’ learning with effective learning techniques. Psychological Science in the Public Interest, 14(1), 4–58.
Sweller, J., & Cooper, G. A. (1985). The use of worked examples as a substitute for problem solving in learning algebra. Cognition and Instruction, 2(1), 59–89.
Kirschner, P. A., et al. (2006). Why minimal guidance during instruction does not work. Educational Psychologist, 41(2), 75–86.
Roediger, H. L., & Karpicke, J. D. (2006). Test-enhanced learning: Taking memory tests improves long-term retention. Psychological Science, 17(3), 249–255.
Kalyuga, S. (2007). Expertise reversal effect and its implications for learner-tailored instruction. Educational Psychology Review, 19, 509–539.
Freeman, S., et al. (2014). Active learning increases student performance in science, engineering, and mathematics. PNAS, 111(23), 8410–8415.
Crouch, C. H., & Mazur, E. (2001). Peer instruction: Ten years of experience and results. American Journal of Physics, 69(9), 970–977.
Collins, A., et al. (1989). Cognitive apprenticeship: Teaching the crafts of reading, writing, and mathematics. In Knowing, Learning, and Instruction. Lawrence Erlbaum.
Roscoe, R. D., & Chi, M. T. H. (2007). Understanding tutor learning. Review of Educational Research, 77(4), 534–574.
Cohen, P. A., et al. (1982). Educational outcomes of tutoring: A meta-analysis. American Educational Research Journal, 19(2), 237–248.
Chi, M. T. H. (2009). Active-constructive-interactive: A conceptual framework. Topics in Cognitive Science, 1(1), 73–105.
Dawson, P. (2021). Defending Assessment Security in a Digital World. Routledge.
Bearman, M., et al. (2024). Developing evaluative judgement for a time of generative artificial intelligence. Assessment & Evaluation in Higher Education.
Wang, R. E., et al. (2024). Tutor CoPilot: A human-AI approach for scaling real-time expertise. arXiv:2410.03017. Preprint.
Rotter, J., et al. (2026). Access timing as scaffolding: A reinforcement learning approach to GenAI in education. arXiv:2605.15850. Preprint.
Fan, Y., et al. (2025). Beware of metacognitive laziness: Effects of generative artificial intelligence on learning motivation, processes, and performance. British Journal of Educational Technology.
Lee, H.-P., et al. (2025). The impact of generative AI on critical thinking: Self-reported reductions in cognitive effort and confidence effects from a survey of knowledge workers. Proceedings of CHI 2025.
Kosmyna, N., et al. (2025). Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant for essay writing task. arXiv:2506.08872. Preprint; contested methodology, read with its published critique (ref 27).
Stankovic, M., et al. (2025). Comment on: Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant for essay writing tasks. arXiv:2601.00856. Preprint.
Deslauriers, L., et al. (2019). Measuring actual learning versus feeling of learning in response to being actively engaged in the classroom. PNAS, 116(39).
Brcic, M. (2025). The Memory Wars: AI memory, network effects, and the geopolitics of cognitive sovereignty. arXiv:2508.05867. Preprint.
Shen, J. H. & Tamkin, A. (2026). How AI impacts skill formation. arXiv:2601.20245. Preprint.

1. The six moves

Prime

Probe

Point

Attach

Strengthen

Test

Interactive six-move model

2. The one rule

The contrast that proves it

3. The two menus

Survivorship bias, run through the six moves

4. Conclusion: limits, and the promise

Read more

References