Flow steps

Availability: Enterprise, Single-tenant

Combine flow steps to create your document processing pipeline. Most steps link to a module: a folder containing the artifacts that support the step's actions, such as schema files, validation files, custom function files (also called user-defined functions, or UDFs), and refiner programs.
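As a rough illustration, a flow's module folders might be organized like this. The folder and file names below are assumptions for illustration only; your project's actual layout depends on which steps you use:

```text
my_flow/
├── modules/
│   ├── agent_extract/
│   │   └── extraction_schema.json    # schema consumed by the Agent extract step
│   ├── checkpoint/
│   │   └── validations.json          # rules evaluated by the Apply checkpoint step
│   └── udfs/
│       └── custom_functions.py       # user-defined functions for Map UDF / Reduce UDF steps
```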

Map steps

Use map steps to apply processing to each record or file as it moves through your pipeline.

| Step | Purpose | Linked module |
| --- | --- | --- |
| Agent classifier | Classifies documents into predefined classes using LLMs. The platform handles the LLM call and grounds the result in the document content. This step is recommended for most classification needs. | Agent classifier, linked to a classification schema JSON file |
| Agent extract | Extracts structured data from documents using LLMs. The platform handles the LLM call and ensures extracted values are grounded in the source document. This step is recommended for most extraction needs. | Agent extract, linked to an extraction schema JSON file |
| Apply checkpoint | Evaluates extracted and refined data against validation rules. Files failing validation are routed to human review. Multiple checkpoints can be used in a single flow for staged review. | Checkpoint, linked to a validations configuration |
| Apply classifier | Runs rule-based split classification using a custom Python classifier. Use for code-driven, deterministic class assignments, rule-based routing, or cases where external context is needed before classification. | Classifier, linked to a classifier file |
| Apply refiner | Post-processes extracted data using refiner logic. Used to clean, transform, standardize, and enrich output before it reaches validation or downstream systems. | Refiner |
| Map records | Specifies how multipage documents are parsed into separate records. | — |
| Map UDF | Applies a UDF to each record individually. Used for per-document transformations, lookups, or custom processing logic. | UDF |
| Process case | Runs a case program against a packet of input files for packet processing (also called case management) and populates cross-class fields using a refiner program. | Refiner |
| Process files | Digitizes input documents using OCR. This is typically the first step in any flow, converting raw files (PDF, TIFF, images) into machine-readable text that downstream steps can process. | Reader |
| Unified extractor | Extracts structured fields for a packet of records using the unified extraction pipeline and configured schema. | — |
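The Agent extract step reads its field definitions from the linked extraction schema JSON file. The exact schema format is platform-specific; the fragment below is an illustrative sketch only, and the field names and keys shown are assumptions:

```json
{
  "fields": [
    {"name": "invoice_number", "type": "string", "description": "The invoice identifier"},
    {"name": "invoice_date", "type": "date", "description": "Date the invoice was issued"},
    {"name": "total_amount", "type": "number", "description": "Grand total including tax"}
  ]
}
```

Each field's description guides the LLM toward the value to extract, while the platform grounds the returned value in the source document.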

Filter step

Use the filter step to pass through only the results that match parameters you define.
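Conceptually, a filter step applies a predicate to each result and keeps only the matches. The sketch below is illustrative only; the record shape and the idea of expressing the filter as a Python predicate are assumptions, not the platform's actual filter configuration:

```python
# Illustrative sketch: filtering a stream of classified records down to one class.
# The record structure here is an assumption for demonstration purposes.

def keep_invoices(record: dict) -> bool:
    """Predicate: keep only records classified as invoices."""
    return record.get("class") == "invoice"

records = [
    {"class": "invoice", "id": 1},
    {"class": "receipt", "id": 2},
    {"class": "invoice", "id": 3},
]

# Apply the predicate to each record, dropping non-matches from the stream.
filtered = [r for r in records if keep_invoices(r)]
```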

Reduce steps

Use reduce steps to combine multiple records or branches in your pipeline into a single, reduced output.

| Step | Purpose | Linked module |
| --- | --- | --- |
| Combine | Combines branches into a single flow output. | — |
| Reduce UDF | Applies a UDF across all records in a batch. Used for aggregations, cross-document comparisons, or batch-level output generation. See Custom functions in flow for details. | UDF |
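To illustrate the map/reduce distinction: a map UDF sees one record at a time, while a reduce UDF receives the whole batch and can aggregate across it. The sketch below shows a reduce-style aggregation; the function signature and record structure are assumptions for illustration, so consult Custom functions in flow for the platform's actual UDF contract:

```python
# Sketch of a reduce-style UDF that aggregates across every record in a batch.
# The entry-point signature and record shape are assumed for demonstration.

def reduce_udf(records: list[dict]) -> dict:
    """Sum the extracted 'total_amount' field across all records in the batch."""
    batch_total = 0.0
    for record in records:
        value = record.get("fields", {}).get("total_amount")
        if value is not None:
            batch_total += float(value)
    # Emit a single batch-level output, the defining trait of a reduce step.
    return {"record_count": len(records), "batch_total": batch_total}
```

A map UDF doing per-record cleanup would instead take a single record and return its transformed copy.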