Qwen Meetup Presentation, Function Calling Harness, 6.75% to 100%
Compiler-driven feedback loops force LLMs into 100% schema compliance on complex types.

Compiler-level validation turns Qwen's 6.75% structured output success rate into 100%.
TypeScript developers building AI agents
Instructor · Pydantic · Guidance
The talk was about how I got function calling to work reliably on deeply recursive union types — the stuff the industry generally says doesn't work. With `qwen3-coder-next`, first-try success rate was 6.75%. And the entire Qwen 3.5 model family was hitting 0% on union types due to a consistent double-stringify bug. Both ended up at 100%.
Slides are also available here: https://autobe.dev/seminars/20260326-qwen-meetup-korea.pptx — speaker notes are written inside as slide notes if you'd like the full narrative behind each slide.
Compiler-driven feedback loops force LLMs into 100% schema compliance on complex types.
TypeScript compiler for LLM validation beats Instructor and Guidance on nested schemas.
TypeScript only alternative to Instructor: compiler validation forces 100% function calling success.
Deterministic policy matrices block AI agents from executing dangerous API calls.
Swaps postMessage for shared memory, making cross-thread calls 3.5x faster in Deno.
Multi-turn adaptive testing finds agent failures static benchmarks miss, but eval space is crowded.