- Published on
An 8B-parameter model matches GPT-4o across five knowledge-intensive benchmarks and beats it on two. It does this by replacing the entire retrieval orchestration layer—confidence classifiers, query routers, rerankers, fusion logic—with four special tokens. No external components. When it fails, you read a transcript. That's the whole debugging story.