Classify and Split Documents Automatically
Turn mixed page streams into structured, auditable documents using deterministic rules and governed AI. Deploy on-prem, private cloud, or SaaS—without changing your downstream workflows.
Stop manual sorting and costly mis-splits
Manual intake breaks down when documents arrive as mixed PDFs, scanned batches, and email attachments. Teams lose time separating pages, guessing document types, and fixing boundary errors—creating downstream exceptions, compliance risk, and unpredictable SLAs.
IRIS Classification & Splitting applies a governed, hybrid approach: deterministic separation (barcodes, patch codes, value-change rules, fingerprints) where you need speed and certainty, and AI-driven classification where variability is high. Low-confidence cases are routed to verification so every decision can be corrected, tracked, and continuously improved.
Designed for hybrid workflows, APIs, and governance
IRIS implements Classification & Splitting as a horizontal capability across desktop capture, server platforms, cloud microservices, and SDKs—so you can standardize intake logic and reuse it across teams and channels. Deterministic methods (barcodes/patch codes/fingerprints/value-change expressions) deliver low-latency, high-confidence separation, while AI engines (e.g., NClassify/XClassify-based approaches) handle long-tail variability with confidence scoring and controlled thresholds.
For modern architectures, IRIS services integrate through REST APIs and SDK patterns, and in cloud deployments results are carried through queue-based pipelines with companion metadata—making every classification and split decision inspectable, debug-able, and audit-friendly. This enables “rules → AI → verify” orchestration without turning AI into a black box, uncontrolled.
Your Questions Answered
Is this ‘AI-only’, or can we keep deterministic control?
You can start fully deterministic (barcodes, patch codes, fingerprints) and layer AI only where variance demands it. Hybrid orchestration keeps outcomes explainable and operationally governed.
What happens when classification confidence is low?
Low-confidence documents can be routed to verification queues for human review. Corrections can feed continuous improvement, while preventing bad data from reaching ERP/ECM.
Can it handle email attachments and mixed PDFs, not just scans?
Yes. The capability is designed for multi-channel ingestion (scan, email, folders, cloud) and applies the same classification/splitting logic across sources.
Do we need to redesign our downstream workflow to use it?
No. Outputs are designed to integrate into existing export and validation flows. The goal is to improve intake structure while preserving downstream systems and controls.
Can we deploy on-prem for data residency requirements?
Yes. IRIS supports on-prem, private cloud, SaaS, and hybrid models—so you can align deployment to compliance, security, and operational constraints.
A practical path to governed automation
Start deterministic, add AI where it adds value, and keep humans for outliers—measurably.
Assess document families
Identify top inbound flows (mailroom, AP, HR/legal) and define target document classes and boundary rules.
Configure deterministic separation
Apply barcodes/patch codes, fingerprints, blank detection, and value-change rules for fast, reliable splitting.
Introduce AI classification
Use AI models for high-variance sets; tune thresholds and confidence routing to minimize exceptions.
Verify and govern
Route exceptions to review, capture audit trails, and feed corrections back into continuous improvement.
Measure and scale
Monitor KPIs (throughput, % auto-classified, exception rate) and scale services or stations as volume grows.
Make intake faster and correct
See how hybrid rules + AI improves throughput without losing control.