Gemma 4 26B A4B: the strongest local baseline so far
The best local LM Studio result in the current public table: not perfect, but unusually solid across both scanned invoices and agentic paperwork folders.
61.1% Practical score
4/9 Resolved
7/9 Core pass
SVG sample failed separately Visual sample
What Worked
- Strongest local practical score in the frozen v1 paperwork suite.
- Seven of nine cases reached core-pass level, which matters more than one-off formatting wins.
- Handled the mixed paperwork workflow better than smaller local candidates.
Where It Broke
- Still missed strict resolution on several cases because proof and exact artifact details matter.
- The separate City Plan SVG sample did not produce a usable visual artifact.
- Near misses still need manual review before treating outputs as dependable.
Readout
This is the local model to beat right now, especially if the use case is private document triage with human review.