Emmanuel A. Olowe — University of Edinburgh
The Setting
"My oscilloscope shows a flat line." The hardware is real, the deadline is tonight, and no instructor is online.
The Dilemma
"Phase shift occurs in circuits. Check your connections."
"Given your measured phase of −45° at 10 kHz, τ = RC ≈ 15.9 µs..."
Key Insight:
Determine the right kind of help, then deliver it through the right tier and overlay.
Security & Control
"Give students OpenAI API keys? One leaked key = $10,000 bill"
Operational Layer
Educational Foundation
Sarah's struggling with phase shifts in her circuit.
| Student State | Model | Overlay |
|---|---|---|
| Outside ZPD (basic) | Local model | Direct answer |
| Within ZPD | Policy-selected | Hint + follow-up |
| ZPD ceiling | Premium model | Socratic questioning |
Decision happens in milliseconds — student never sees the machinery
Intent Matching
Pre-computed embeddings for 89 representative tasks
Routing Brain
Architecture
Decision Engine
All weights configurable in policy.json — hot-reloaded without restart
Cost 40% — primary goal is saving money
Latency 25% — students notice slowness
Health 20% — auto-deprioritize failing models
| Route | Model | Cost | Used For |
|---|---|---|---|
| Local | gpt-oss-20b | $0.00 | Simple Q&A |
| Premium | GPT-5 Mini | $0.25/$2 | Escalated reasoning |
"What is a resistor?"
→ local model wins (short, no images, simple)
"12,000-token circuit analysis + image + SCPI error"
→ premium model wins (large context, high reasoning)
Equity & Safety
| Guardrail | Purpose |
|---|---|
| Budget caps | Stop a small number of students from consuming premium capacity |
| Escalation rules | Promote hard troubleshooting and high-effort tasks at the right moment |
| Human override | Allow instructors to approve or block high-cost help explicitly |
privacy_mode = features_only keeps routing decisions on metadata when neededSarah's Request Journey
socratic_troubleshoot → triage checklist in 21.7 s
| Path | Plan latency | What the student notices |
|---|---|---|
| Cache hit | <10 ms P95 | Usually negligible |
| Cache miss | ~82 ms CPU | Usually hidden by provider latency |
Routed (EduRouter) — 21.7 s avg
GPT-5 Mini direct — 23.2 s avg
gpt-oss-20b direct — 26.1 s avg
EduRouter steers 75% to local tier — fastest end-to-end despite local model's higher raw latency.
SCPI escalation path → GPT-5 Mini + socratic_troubleshoot. Sarah receives a triage checklist, confirms probe settings, corrects her measurement, and submits on time.
Evaluation
| Prompt | Keyword route | Embedding route |
|---|---|---|
| "Help me work through this" | Too generic | Matches proof-style help |
| "I'm lost with phase measurement" | Weak escalation signal | Promotes troubleshooting path |
| "Quick check — is this normal?" | Over-escalates | Stays on cheaper tier |
Results
Cost Reduction
Local model — 75% · simple chat, basic Q&A
Premium model — 25% · escalated or higher-demand cases
The routed system does not flatten everything to the cheapest model.
Discussion
/route/feedback auto-adjusts weightsSummary
"My oscilloscope shows a flat line"
Infrastructure-level routing intelligence — grounded in scaffolding theory, embedding-based intent detection, and declarative governance — makes AI tutoring simultaneously cheaper, more pedagogically appropriate, and more equitable.
Emmanuel A. Olowe · University of Edinburgh
e.a.olowe@sms.ed.ac.uk
IEEE EDUCON 2026