| Agent classifier | Classifies documents into predefined classes using LLMs. The platform handles the LLM call and grounds the result in the document content. This step is recommended for most classification needs. | Agent classifier, linked to a classification schema JSON file |
| Agent extract | Extracts structured data from documents using LLMs. The platform handles the LLM call and ensures extracted values are grounded in the source document. This step is recommended for most extraction needs. | Agent extract, linked to an extraction schema JSON file. |
| Apply checkpoint | Evaluates extracted and refined data against validation rules. Files failing validation are routed to human review. Multiple checkpoints can be used in a single flow for staged review. | Checkpoint, linked to a validations configuration. |
| Apply classifier | Runs rule-based split classification using a custom Python classifier. Use for code-driven, deterministic class assignments, rule-based routing, or cases where external context is needed before classification. | Classifier, linked to a classifier file. |
| Apply refiner | Post-processes extracted data using refiner logic. Used to clean, transform, standardize, and enrich output before it reaches validation or downstream systems. | refiner |
| Map records | Specify how multipage documents are parsed into separate records. | |
| Map UDF | Applies a UDF to each record individually. Used for per-document transformations, lookups, or custom processing logic. | UDF |
| Process case | Run a case program against a packet of input files for packet processing, also called case management, and populate cross-class fields using a refiner program. | refiner |
| Process files | Digitizes input documents using OCR. This is typically the first step in any flow, converting raw files (PDF, TIFF, images) into machine-readable text that downstream steps can process. | Reader |
| Unified extractor | Extract structured fields for a packet of records using the unified extraction pipeline and configured schema. | |