For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Logo
AI Hub
OverviewApp editorFlow editorAdminAPI & SDK
OverviewApp editorFlow editorAdminAPI & SDK
  • Flow editor
    • About flows
    • Creating flows
    • Flow step reference
      • Process files
      • Map records
      • Agent classifier
      • Apply classifier
      • Apply checkpoint
      • Creating validation checkpoints
      • Agent extract
      • Apply refiner
      • Creating refiner programs
      • Process case
    • Using custom functions
    • Flow guides
    • Reviewing flow results
AI Hub
On this page
  • Split PDF, TIF, and TIFF files
Flow editorFlow step reference

Apply classifier

Was this page helpful?
Built with
Enterprise Single-tenant

The Apply classifier step runs rule-based split classification using a custom Python classifier. Use it when you want code-driven, deterministic class assignments, for example rule-based routing, keyword or layout heuristics, or cases where external context is needed before classification. The step links to a classifier module (a classifier file) in your flow.

For classification that uses a large language model with a schema you define, use the Agent classifier step.

Split PDF, TIF, and TIFF files

The Split PDF and TIF source files option, if enabled, splits and groups pages in PDF, TIF, and TIFF files into separate records according to their class. Split documents are assigned file names that indicate the original document, the assigned class, and the range of pages included.

For example, if pages 1–5 of a 10-page PDF file named input.pdf are classified as class_name_1 and pages 6–10 are class_name_2, then:

  • labeled_outputs/class_name_1/input.class_name_1-1-5.pdf contains pages 1–5 of input.pdf

  • labeled_outputs/class_name_2/input.class_name_2-6-10.pdf contains pages 6–10 of input.pdf