Government Document Intelligence
Automated extraction of 167 fields per document with >95% accuracy in under 60 seconds
167 fields per document
>95% accuracy

Context
A state government agency relied on manual data entry for scanned 4-page forms - 15-20 minutes per document with 2-5% error rates across 167 fields.
Constraint
Documents contained PII requiring strict protection. Scan quality varied (skewed, poor print, handwritten). Regulatory compliance was mandatory.
Intervention
Built a multi-engine OCR system in Rust with consensus merging, 65-zone coordinate-based PII redaction, and a production Axum API - with 663 tests.
Outcome
167 fields extracted at >95% accuracy in under 60 seconds (16x faster), full PII compliance, 663 automated tests - delivered in 15 weeks.
Architecture
From scanned form to structured, validated data
Document Preprocessing
Multi-Engine Extraction
PII Protection
Production API
Tech Stack
Core Language
Rust
Cloud
AWS (ECS Fargate, Textract, S3, Secrets Manager)
API Framework
Axum
Document Processing
TIFF/image processing, multi-engine OCR
Deployment
Docker, ECS Fargate
Testing
663 automated tests
Results
Planning a Similar Mandate?
A direct working session about the problem, the constraints, and the fastest credible path forward.
We respond within 4 hours during business hours
