TranscriptAgent
Try it free
TRANSCRIPTAGENT.AI · transcript analysis

from "Test Prep" to Real Growth: A New Way to Look at Interim Assessments

Channel: Khan Academy Published: 2026-05-26 14:28
Khan Academy

This is a Khan Academy product/demo webinar about new interim assessments, focused on moving beyond multiple-choice toward more authentic, AI-assisted, and actionable assessment experiences. Lauren Deeders explains the product rationale and theory of action; Peter Jacobson demos the student experience, explain-your-thinking conversations, accessibility features, scoring architecture, and future personalized recommendations.

Watch on YouTube ›

Get the market thesis, key claims, assets, contradictions, and follow-up questions from any financial video — then unlock a version personalized to your portfolio, watchlist, and favorite speakers.

Detailed summary

This transcript is a product launch / walkthrough for Khan Academy’s new interim assessments, framed as part of a “re-imagined” Khan Academy. The core thesis is that traditional assessments are too multiple-choice heavy, too slow to produce useful action, too opaque to build trust, and too weak at capturing real student thinking. Lauren Deeders argues that Khan Academy is using AI plus psychometrics to build a more authentic, efficient, and open assessment system that can better measure what students know and can do. Lauren’s setup is that Khan Academy already has formative tools like unit tests, quizzes, and course challenges, but interim assessments fill a different need: a more comprehensive assessment and learning system. …

🔒 The full detailed summary continues — read all of it free with an account. Read the full summary →

Main takeaways

  1. Khan Academy is launching interim assessments as part of a broader “re-imagined” product strategy.
  2. The product is positioned as a response to four assessment pain points: overreliance on multiple choice, weak actionability, lack of trust/transparency, and poor time efficiency.
  3. AI is used selectively to enable conversational assessment, not to replace assessment rigor.
  4. The “explain your thinking” flow is the key differentiator: it probes reasoning, supports partial credit, and exposes student thinking.
  5. The system includes accessibility features and adaptive pathways, showing the product is meant to be usable at scale.
  6. Teacher and district reporting is a central selling point: individual and class-level summaries are meant to be actionable.
  7. Khan Academy plans personalized practice recommendations tied to assessment results, while keeping teachers in control.
  8. The launch is staged: math first, ELA in pilot, then more grades and subjects later.

Market read by horizon

Short term

Immediate setup is product validation: watch whether the explain-your-thinking flow feels natural, fast, and non-punitive in early district use. The main near-term risk is friction from mandatory responses and any sign the AI prompts students too much or too little.

  • The immediate watch item is rollout/early access for district partners next school year, which is framed as no-charge for current partners.
Show more
  • Near-term product risk is usability: students currently cannot skip items, so early friction or fatigue matters.
  • The explain-your-thinking prompt is the tactical centerpiece; its quality will determine whether the AI interaction feels helpful or intrusive.
Mid term

Over the next few months, the base case is a phased rollout in math first, with ELA still being tuned and district alignment doing most of the work. The key confirmation signal will be whether teacher-facing summaries translate into actionable instruction and repeat usage.

  • Over the next several weeks/months, the key question is whether conversational assessment consistently reveals more student understanding than one-shot short response.
Show more
  • If learner-insight summaries and rollups prove actionable for teachers, the product can shift from novelty to routine diagnostic use.
  • Adoption will likely depend on whether the assessment blueprints feel comparable enough across districts while still allowing local alignment.
Long term

The long-run thesis is a shift toward assessment as an always-on learning system rather than a closed testing event. If the model works, it weakens the old tradeoff between rigor, transparency, and actionability by using AI to surface reasoning at scale.

  • Structurally, Khan Academy is arguing for a shift from closed, test-security-driven assessment systems to larger, more open item banks with better transparency.
Show more
  • The lasting thesis is that assessments can become part of a learning loop rather than a separate measurement event.
  • If successful, this could support a broader regime where AI helps surface reasoning, not just answers, across subjects and grade bands.
Unlock the full horizon read See the full short-term, mid-term, and long-term implications with confirmation and invalidation signals. Unlock horizon read

Key claims (10)

NEUTRAL education technology product launch Khan Academy

Khan Academy is building brand new interim assessments as part of Re-Imagined Khan Academy.

The presenters repeatedly frame the session as a preview of new interim assessments in the reimagined product.

BEARISH assessment design interim assessments

The company believes multiple-choice-only assessment is insufficient because it does not capture real-world application or nuanced understanding.

Lauren says multiple choice is efficient but too binary and not representative of real use.

BULLISH AI in education AI assessment

Conversational assessment can mimic the kinds of teacher-student probing used to understand student thinking.

Lauren explicitly says AI can prompt and probe like a teacher asking follow-up questions.

Unlock 7 more claims See the full bullish, bearish, and counter-consensus argument map extracted from the transcript. Unlock all claims

Speakers

HOST Denise GUEST Lauren Deeders GUEST Peter Jacobson

Interview (14 Q&A)

interim assessment purpose

Why are you developing interim assessments, and why now?

Lauren explains that Khan Academy already has formative assessments like unit tests and quizzes, but they saw an opportunity with AI to create a comprehensive assessment and learning system for interim assessments. They identified pain points including: too much multiple-choice, lack of actionability, lack of trust (guarded item banks), and assessments that don't fit in a class period. They aim to address these with authentic assessments (interactive items, short response, conversational assessments), actionable score reports, open item banks, and efficiency.

explain your thinking

How does the explain your thinking feature work in Khan Academy's assessments?

Peter walks through a demo of the explain your thinking feature. It starts with a two-part question where the student first answers a math or ELA item using standard input widgets, then enters a conversation with an AI persona called Conductor. The AI probes the student's conceptual understanding (e.g., asking why the tangent of 30.5° is the same in all right triangles with that angle). If the student demonstrates sufficient understanding, Conductor ends the conversation. If not, it continues probing. The system uses three AI agents: one to score the response, one to generate the conversation, and a self-critique agent that checks the other AI's work to ensure it doesn't give away the answer.

item types

What kinds of item types are available in Khan Academy's new assessment experience?

Peter shows several item types including numerical input, geometry proofs using a drop-down format, expression widgets with a number pad for complex equations, multi-check or single-select multiple choice, the explain your thinking conversational AI item, short response items, and interactive graph items that are accessible via keypad for screen readers.

Unlock the full interview (11 more Q&A) Every question, answer summary, and YouTube timestamp. Unlock full Q&A

Where this transcript pushes against consensus

  • The claim that AI can reliably mimic teacher-student assessment conversations is plausible but not yet demonstrated at scale in this transcript.
  • The assertion that open item banks will solve trust/security issues is directional, but tradeoffs around item reuse and comparability remain unresolved.
  • They say conversational assessment improves learning outcomes, but the causal evidence is described as a theory of action rather than proven results.
  • The ELA use case is explicitly described as early and experimental, so any broad implication there is premature.
  • The promise of grade-level content for all students is compelling, but the operational details and efficacy are not yet shown.

Topics

interim assessmentsconversational assessmentAI in educationstudent reasoningpsychometricsteacher diagnosticsadaptive testingaccessibilitypersonalized recommendationsdistrict blueprints

Create your free research agent

Unlock the full claims, asset map, scores, related transcripts, follow-up questions, and AI chat — shaped around your portfolio, watchlist, favorite speakers, and risks.

  • Full claims and asset map
  • Personalized relevance to your watchlist
  • Follow-up questions you can track
  • Related transcripts from your workspace
  • AI chat about this video
Create your free research agent
TRANSCRIPTAGENT.AI