Why ChatGPT Can't Tutor You for MRCGP (and What Actually Does)

ChatGPT explains. It does not tutor. For MRCGP AKT preparation, that distinction costs you retention — and you will not notice the cost until the exam, when the fluent explanations you read are nowhere to be found in your memory.

A question-first tutor that diagnoses your specific misconception, makes you retrieve before it explains, and stays anchored to the AKT blueprint and NICE/CKS guidance is what actually moves the needle. An explanation machine that does the cognitive work for you feels helpful in the moment and is measurably harmful over the weeks of revision that precede the exam.

The Scene Every GP Trainee Knows

You are working through an AKT practice bank. You get a question wrong — something about diabetes management thresholds in CKD, or first-line antihypertensive choice in a specific comorbidity combination, or the monitoring interval for a newly prescribed medication. You copy the question into ChatGPT. Within seconds, you receive a 500-word explanation: clear, well-structured, clinically fluent. It explains the correct answer, discusses the relevant NICE guideline, mentions the key pharmacological considerations, covers the common mistakes, and even adds context about why the distractors are wrong.

You read it. You nod. You feel smarter. You move on to the next question.

Next week, a similar question appears — same underlying concept, different clinical scenario. You stare at the options. The explanation was so clear — you remember reading it. You can almost see the paragraph. But you cannot recall the actual threshold. You cannot reconstruct the reasoning. You cannot produce the answer from memory. You recognise the topic but cannot retrieve the knowledge. You get it wrong again.

This is not a failure of your memory. It is a predictable consequence of your study method. You read an explanation instead of retrieving an answer — and reading is not retrieval.

Explanation Is Not Learning

The evidence is now clear: when students use AI as an answer machine — receiving explanations without first attempting retrieval — they perform worse on subsequent assessments than students who never had AI access at all. In the Bastani et al. PNAS study, the deficit was 17%. In the 45-day retention study, ChatGPT study-aid users scored significantly lower than traditional-method peers on a surprise test.

The mechanism: the fluent explanation bypasses the effortful retrieval that creates durable memory. Reading an explanation feels like learning — the comprehension is genuine, the logic is clear, the knowledge feels accessible. But comprehension in the moment is not the same as retrieval under exam conditions. The felt-vs-actual gap means you cannot detect the failure until the exam reveals it.

The Dartmouth/Geisel npj Digital Medicine study (2025) confirmed the alternative: Socratic tutoring — where the AI asks questions rather than providing answers — transforms a passive answer service into an active learning partner that promotes long-term retention. The difference is not sophistication. It is direction: does the AI do the cognitive work, or does the trainee?

What a Real Tutor Does Differently

A tutor does not start by explaining. It starts by diagnosing.

It identifies your specific misconception. Not "you got diabetes management wrong" — which is a topic, not a diagnosis, and does not tell you what to fix. Rather: "you applied the eGFR threshold for metformin continuation incorrectly — you used the threshold for initiation when the patient was already established on the drug." That is a specific, correctable error. Correcting it changes how you approach not just this question but every similar question involving renal-adjusted prescribing.

It makes you retrieve before explaining. Before telling you the correct threshold, it asks: "What do you think the eGFR threshold is for continuing metformin in an established patient?" You have to search your memory. You have to produce a response. The effort — even if you get it wrong, especially if you get it wrong — strengthens the memory trace. Only after you have attempted does the tutor provide the correction, grounded in the specific NICE/CKS recommendation.

It stays anchored to the exam. The tutor is not drawing from general internet knowledge about diabetes management globally. It is grounded in the MRCGP AKT curriculum, the official question explanation, and NICE/CKS guidance for UK primary care. The answer it gives is the answer the AKT expects — not a generic international consensus summary that may differ from UK practice on specific thresholds, drug choices, or referral criteria.

Worked Example: One AKT Question, Two Responses

The question: A 68-year-old man with type 2 diabetes and CKD stage 3b (eGFR 38) is currently on metformin 1g BD. His recent HbA1c is 64 mmol/mol. He has no symptoms of hyperglycaemia. What is the most appropriate next step in management?

ChatGPT's response:

A 500-word explanation covering: the classification of CKD stage 3b, the NICE guideline on metformin use in renal impairment, the eGFR thresholds for dose reduction and discontinuation, the rationale for considering SGLT2 inhibitors in CKD (including renal and cardiovascular benefits), a discussion of the HbA1c target in an older patient with CKD, a comparison of alternative second-line agents (DPP-4 inhibitors, GLP-1 receptor agonists), and a summary of the key learning points.

Clear. Comprehensive. Clinically accurate. The trainee reads it, understands it, and moves on. The cognitive work was done by ChatGPT. The trainee absorbed it passively.

The iatroX Socratic Tutor exchange:

Tutor: You selected option C. Before I explain, can you tell me what you think the eGFR threshold is for continuing metformin in an established patient?

Trainee: I think it's 30?

Tutor: Close — eGFR 30 is actually the threshold below which metformin should be stopped entirely. Between eGFR 30 and 45, metformin can be continued but the dose should be reviewed and potentially reduced. This patient's eGFR is 38 — which zone does that put them in?

Trainee: The "continue but review dose" zone?

Tutor: Exactly. So metformin can continue, but the dose may need reducing. Now, the patient's HbA1c is 64 mmol/mol — above the typical individualised target. Given that, what does NICE recommend as the next step: adjust the metformin dose, or add a second agent?

Trainee: Add a second agent?

Tutor: Good. The HbA1c is above target, so treatment intensification is more appropriate than just dose adjustment. And for a patient with CKD stage 3b and type 2 diabetes, which class of second-line agent has both glycaemic and renal protective evidence?

Trainee: An SGLT2 inhibitor?

Tutor: Correct — that maps to option A, which is the answer. NICE CKS recommends considering an SGLT2 inhibitor as add-on therapy in type 2 diabetes with CKD, given the evidence for renal and cardiovascular benefit. The key learning points: this patient's eGFR is in the "continue but review" range for metformin (not the stop range), the HbA1c above target indicates intensification rather than dose reduction, and SGLT2 inhibitors have specific evidence supporting their use in CKD.

Four turns. Three retrieval events (eGFR threshold, dose zone identification, second-line agent selection). One corrected misconception (the trainee confused the "stop" threshold with the "review" threshold). One reinforced clinical principle (SGLT2 inhibitors in CKD). The trainee did the cognitive work. The tutor guided it.

Compare: the ChatGPT explanation covered the same content in more detail. But the trainee who read it produced nothing from memory. The trainee who worked through the Socratic exchange produced three answers, corrected one error, and built three retrieval-strengthened memory traces. Next week, one of them will remember the eGFR threshold. The evidence says it will be the second one.

The Legitimate Exception

There are moments when you want the answer fast. The night before the AKT. A concept you have revised repeatedly and need to confirm. A quick check during a clinical session. The "just explain it" override exists in the iatroX Socratic Tutor — switching from question-first to direct explanation for time-pressured moments. It is not the default, because the evidence says question-first produces durable learning. But it exists, because the briefs specify it and because real revision involves real time pressure.

Try It

The MRCGP AKT Q-bank is part of iatroX's free UK core exam offering. The Socratic Tutor is a Pro feature inside the Q-bank. Try the same question in both modes — and notice which one you actually remember next week.

This article is templatable. The same structure works for PLAB 1, MRCP Part 1, MRCPCH, MRCPsych, FRCA, GPhC CRA — swap the exam, swap the worked example, keep the evidence and the principle.

Start MRCGP AKT revision with the Socratic Tutor →