Case Transparency

What the models actually see.

Inputs, generated scans, task files, CSVs, and prompts for the current paperwork suite. Oracle solutions are intentionally not shown here.

Shown

Source files, generated document images, task instructions, prompts, and known trap categories.

Hidden

ground_truth.json, expected_artifacts.json, manual readings, and calibration notes.

P01 The Paperwork Trial

Basic Invoice Folder

A small scanned-invoice folder with a quote distractor, partial payment, and an under-review stamp.

4 images 5 text files Generated invoice images
customer vs vendorquote is not an invoicepartial paymentunder-review stamp
P02 The Paperwork Trial

Credit Note And Vendor Hold

A scanned folder with a credit note, partial payment, missing PO, and inactive-vendor warning.

4 images 5 text files Generated invoice images
credit note is not payablepartial paymentmissing POinactive vendor
P03 The Paperwork Trial

Duplicate Risk Mix

A larger mixed folder combining earlier-looking scans, a previous-invoices file, credit note, quote, and duplicate-risk lookup.

8 images 6 text files Generated invoice images
duplicate-risk lookupmixed foldersquote and credit note distractorspartial payment
P04 The Paperwork Trial

Tax ID Collision

A compact case around vendor identity and tax calculation conflicts.

1 images 5 text files Generated invoice images
vendor tax ID conflicttax rounding mismatchstatement distractor
P05 The Paperwork Trial

PO Revision

A generated scan case where split payments and the latest purchase-order revision decide the outcome.

1 images 5 text files Generated invoice images
split paymentcancelled PO revisionquote distractor
W04 Paperwork Workflow

Messy Intake Folder

A chaotic intake folder where the model must identify active sources, ignore stale files, write manifests, and preserve incoming sources.

4 images 9 text files Agentic file workflow
old bank exportduplicate vendor filedraft PO listnon-invoice scan
W05 Paperwork Workflow

Email Attachment Intake

A versioning workflow with email context, revised invoice attachments, old references, and non-invoice screenshots.

4 images 6 text files Agentic file workflow
superseded invoicerevised attachmentold payment referenceproforma distractor
W06 Paperwork Workflow

Remittance Split

A remittance workflow where one payment has to be mapped across multiple final invoices while ignoring drafts and a proforma.

4 images 7 text files Agentic file workflow
single payment splitdraft bank exportproforma distractoractive remittance batch
W07 Paperwork Workflow

Credit Offset Packet

A credit-offset packet with duplicate scans, a credit memo, statement distractor, inactive vendor, and cancelled PO.

5 images 6 text files Agentic file workflow
credit offsetduplicate scanstatement distractorinactive vendor