W07 Paperwork Workflow Agentic file workflow

Credit Offset Packet

A credit-offset packet with duplicate scans, a credit memo, statement distractor, inactive vendor, and cancelled PO.

Transparency boundary

This page shows the files the model can inspect. It does not show the hidden oracle solution.

Excluded: ground_truth.json, expected_artifacts.json, manual readings, and calibration notes.

What This Tests

credit offsetduplicate scanstatement distractorinactive vendorcancelled PO

Files Shown

5 generated image files and 6 text/CSV/task files.

All files are synthetic benchmark material.

Generated Images

Scans and visual attachments visible to the model
W07 input incoming/attachments/credit_memo_4410.png
incoming/attachments/credit_memo_4410.png
W07 input incoming/attachments/invoice_4409.png
incoming/attachments/invoice_4409.png
W07 input incoming/attachments/invoice_4409_duplicate_scan.png
incoming/attachments/invoice_4409_duplicate_scan.png
W07 input incoming/attachments/invoice_4411_cancelled_po.png
incoming/attachments/invoice_4411_cancelled_po.png
W07 input incoming/attachments/statement_page.png
incoming/attachments/statement_page.png

Task And Source Files

No oracle solution files
incoming/README_TASK.md
# Paperwork Workflow Case 07: Credit Offset Packet

Audit this synthetic intake folder. This is benchmark bookkeeping, not financial or tax advice.

You must not modify anything under incoming/.

Case ID for audit_result.json: P3-WORK-07

Steps:
1. Identify active source files and ignore duplicate scans, statements, and credit memos as non-invoice documents.
2. Use the credit memo to explain why the bank payment is lower than the approved invoice gross total.
3. Reject invoices from inactive vendors or cancelled purchase orders.
4. Create work/attachment_index.json.
5. Create work/final_document_set.json.
6. Create work/payment_reconciliation.json.
7. Create normalized invoice text files under work/normalized/.
8. Create audit_result.json.
9. Create proof.txt containing only the final proof_code.

Required audit_result.json keys:
case_id, approved_invoice_ids, review_invoice_ids, reject_invoice_ids,
ignored_document_ids, total_approved_gross_cents, warnings_by_invoice,
evidence, proof_code

Required audit_result.json values and formats:
- case_id must be "P3-WORK-07"
- ignored_document_ids must be ["CM-4410", "INV-4409-DUP", "STATEMENT-JUN"]
- warnings_by_invoice must use only these short warning codes: inactive_vendor, cancelled_po
- evidence must be an array of source path strings, not a nested object

Proof code formula:
total_approved_gross_cents
+ numeric parts of approved/review/reject invoice IDs
+ numeric part of the credit memo ID
+ 97 * total warning count

Use invoice IDs only for approved/review/reject invoice IDs. Do not include duplicate scans, credit memos, or statement IDs in the invoice-ID sum.

Required work/attachment_index.json shape:
{"attachments":[{"attachment_path":"...","document_id":"...","document_type":"...","use":"..."}]}

Required work/final_document_set.json shape:
{
  "approved_invoice_ids": [...],
  "credit_document_ids": [...],
  "ignored_document_ids": [...],
  "rejected_invoice_ids": [...],
  "source_bank_export": "..."
}

Required work/payment_reconciliation.json shape:
{
  "bank_payment_cents": 123,
  "credit_offset_cents": 123,
  "gross_invoice_cents": 123,
  "matched_invoice_id": "...",
  "payment_reference": "..."
}
incoming/ap_notes.txt
AP note:
- INV-4409 is the final invoice for Flare Tooling.
- CM-4410 is a credit memo and reduces the payment, but it is not an invoice.
- invoice_4409_duplicate_scan.png is a duplicate scan of INV-4409.
- INV-4411 belongs to a vendor on hold and PO-7788 is cancelled.
incoming/bank_export_june.csv
date,description,amount_cents
2026-06-03,PAY FLARE INV-4409 LESS CM-4410,28000
incoming/purchase_orders.csv
po_id,vendor_id,status,approved_invoice_id
PO-7701,V-FLARE,approved,INV-4409
PO-7788,V-OLD,cancelled,INV-4411
incoming/vendor_master.csv
vendor_id,vendor_name,status
V-FLARE,Flare Tooling,active
V-OLD,Old Gate Parts,inactive
model_prompt.md
Audit the synthetic credit offset intake folder in `incoming/`.

Write these files:
- `work/attachment_index.json`
- `work/final_document_set.json`
- `work/payment_reconciliation.json`
- `work/normalized/INV-4409.txt`
- `work/normalized/INV-4411.txt`
- `audit_result.json`
- `proof.txt`

Use `incoming/README_TASK.md` as the full task specification. Use the generated image attachments, AP note, bank export, vendor master, and purchase orders.

Important:
- Use credit memo CM-4410 as payment-offset evidence, not as an invoice.
- Ignore the duplicate scan of INV-4409.
- Reject INV-4411 because its vendor is inactive and its PO is cancelled.
- Do not modify anything under `incoming/`.
- This is benchmark bookkeeping, not financial, tax, or legal advice.
- `proof.txt` must contain only the numeric proof code.