This is the pipeline at its most demanding: insurance documents in, clean records in the agency's management system out. Archway Computers brought me in to automate the document-heavy back office of six independent US insurance agencies (Clifford & Bradford, Bailey Family, PolicyWatch, Sterling, Marathon, and Nau). Producers were stuck hand-reading quotes, endorsements, policies, and forms and re-keying them across a patchwork of AMS and CRM systems. I built the document-processing pipelines that read those documents, validate what comes out, and push structured data into the right place, so producers spend their time selling instead of typing. I can't share every detail under the engagement, but here is the shape of the work.
The problem
These agencies weren't short on software. Between them they ran AMS360, Applied Epic, EZLynx, and Dynamics CRM, and they were underusing systems they already paid for, because feeding those systems was manual. A producer read each incoming document by hand and typed the data into the AMS before a quote could move toward bind. The documents themselves fought back: scanned, photographed, hand-filled, and laid out differently by every carrier. At volume, that reading-and-re-keying was a constant drag on the lead-to-bind gap, and it was exactly the repetitive work that burns out good producers. Multiply it across six agencies on four different stacks and the manual approach simply didn't scale.
What I built
A set of document-processing pipelines, orchestrated in n8n, that own the messy middle between a raw document and a clean record in the system of record. The same backbone runs at every agency; only the endpoints change.
Ingestion and extraction
Each pipeline takes a document in whatever form it arrives (emailed PDF, scan, or phone photo) and extracts the fields that matter. I lean on strong prompting with capable models first, plus OCR and cleanup for the photographed and hand-filled documents, where layouts are inconsistent and handwriting is in the mix.
Validation and schema enforcement
Extraction is only useful if you can trust it. Every output is checked against a strict schema (Zod): if a record doesn't validate, it doesn't pass. Low-confidence or failed documents drop into a review queue for a human instead of silently writing bad data into a policy system, where a wrong number has real consequences.
Mapping and routing into the AMS
Validated data is mapped into the exact shape each agency's AMS or CRM expects, then written in. Where a system exposed a clean API, I used it; where it didn't, browser automation with Playwright drove the interface the way a person would, reliably and unattended.
Agents and the work downstream
With clean data in place, AI agents handle the steps that used to sit on a producer's desk: the follow-on actions, the routing, the busywork between a document landing and a quote moving forward.
Observability and reliability
Nothing is a black box. Every pipeline run is logged, so a failure is visible and traceable rather than a mystery. I built a gold-standard test set early and regression-tested every change against it, so improving one document type couldn't quietly break another. That discipline matters more in insurance than almost anywhere: the cost of a silent error is a mis-bound policy.
How I approached it
My default was the simplest implementation that works: strong prompting and a clean API call before anything custom. The hard part here was never a single model call; it was making the whole thing reliable across six agencies, four management systems, and documents that never look the same twice.
The results
Thousands of documents a month now flow through these pipelines instead of through a producer's hands. The manual reading-and-re-keying that bottlenecked every quote is automated, validated data lands in the systems the agencies already pay for, and they move from lead to bind without the back-office drag. Producers are freed for the work that actually needs a person. Specific figures stay with the client under the engagement.
Built with: n8n, Playwright, LLM-based extraction and OCR, Zod schema validation, AMS360, Applied Epic, EZLynx, and Dynamics CRM.
Have a document-heavy process slowing your team down? Tell me your stack and goals and I'll send back a build plan and timeline.