Running and deploying apps | Instabase AI Hub Documentation

The Hub displays all apps available to you, including prebuilt apps, apps you created, apps shared within your organization, and advanced apps customized for your enterprise. Open any app to run it or see results from previous runs.

Running apps

You can run accessible apps from the Hub on demand.

From the Hub, select the app you want to run, then click Run app.

If the app includes sample files, you can optionally preview app functionality with a sample run by selecting Use sample files. Otherwise, select Upload files.
If you’re an organization member, verify the workspace you want to run the app in.

Run results are available only in the selected workspace, and are viewable by all members of that workspace.
If the app includes custom or secret keys, specify runtime values for the keys in the Advanced settings section.
Select files to process and click Run.

When the run completes, view results from the App runs tab.

Deleting app runs

You can delete app runs individually or in bulk to satisfy data retention or confidentiality requirements.

Deleting app runs permanently deletes associated results, log files, and database entries. By default, source files uploaded for the run are also deleted, but you can opt to keep them.

From the Hub, open the app you want to delete runs from.
Select the App runs tab.
Indicate which runs you want to delete:
- Hover over a single run, click the overflow icon , and select Delete run.
- Select the checkboxes for multiple runs and click Actions > Delete runs.
Select if you want to keep source files and click Delete.

You can share apps you create with other AI Hub users by managing version visibility for individual app versions.

App versions may be either private—visible only to you and organization admins—or shared. Sharing functionality differs based on your AI Hub subscription.

Community App sharing is enabled with the app URL. Any AI Hub user with the URL can view and use shared versions of your app. Community apps aren’t shared in the Hub.
Commercial & Enterprise App sharing is enabled through organization membership. Any member of your organization can view and use shared versions of your app and see run results in any workspace they have access to.

The account that initiates an app run is responsible for any consumption units used.

Version visibility is indicated on each app’s Versions tab. To change version visibility for a given app version, hover over the version and click the edit icon .

Creating deployments

Commercial & Enterprise

Deployments let you configure an app to run at scale with automation, integration, and human review.

In Workspaces, select the Deploy tab, then click Add deployment.
Specify options for your deployment, then click Save.
- Name — Specify a unique name to help users differentiate the deployment across all workspaces they have access to.
- Description — Specify an optional description for the deployment.
- App — Select the app and app version that you want to run at scale for this deployment. Available apps include all apps that are accessible to you, whether prebuilt, shared within your organization, or created by you.
- Workspace — Select the workspace where you want to run the deployment and store run results. If you enable reviews, only members of this workspace can review results.
- Runtime configurations — If the selected app includes custom or secret keys, specify runtime values for the keys. You can’t delete keys or modify key names or types specified in the app, but you can add new keys for use in deployment integrations.
- Integrations — Configure pre- and post-processing options, either pulling file or folders from upstream systems or sending results to downstream systems. For details, see Configuring integrations.
- Notifications — Configure email or webhook notifications when runs start, complete, fail, or when items are queued for review. For details, see Configuring notifications.
- Review — To manually verify results that fail validation, enable human review, then select a review strategy.
  - Review by file — Sends only files that fail validation for human review, and assigns reviews by file.
  - Review by run — Sends entire runs for human review if any file fails validation, and assigns reviews by run.
  Enterprise Enterprise organizations can configure additional review options:
  - Review queue — Assign a group within the deployment workspace to conduct initial reviews. You can select whether reviews are assigned manually or round robin, with reviews assigned to group members in turn. If you select round robin assignment, admins and managers are excluded from reviews by default, but you can optionally include them.
  - Escalation queue — Assign a group within the deployment workspace to review files flagged for further evaluation. Like review queues, you can select assignment method and optionally include admins and managers in reviews.
    
    Queue options aren’t available in personal workspaces.
  - Service-level agreement — Specify efficiency targets for human review in minutes, hours, or days. Timing begins when a deployment run begins, and the SLA is satisfied on a given file when it’s marked as reviewed. The Review tab indicates time remaining against the SLA to help reviewers prioritize.
- Data retention — To automatically delete future deployment runs on a specified cadence, enable automatic run cleanup. Specify how long to keep run data (default 90 days) and indicate whether to delete source files.

Configuring integrations

Use integrations to pull files or folder contents from upstream systems for processing or to send results to downstream systems.

Google Drive isn’t supported for upstream or downstream integrations.

Upstream integrations

Connected drive — Pull files or folders from a connected drive directory on a set schedule or any time new contents are detected. When you add a connected drive as a new integration, all existing contents are processed. If the integration runs when new files are detected, the drive is checked for new content every 3 minutes.

Each file or folder is processed as a single run.

Contents are copied from the selected directory to a parallel processing folder in your default drive that’s created to manage file handling. By default, the input folder is cleared when processing begins. For scheduled runs, you can opt to retain contents in the connected drive to address certain advanced use cases, but be aware that this can trigger duplicate runs.
Connected mailbox — Pull files from a connected mailbox on a set schedule. When you add a connected mailbox as a new integration, it processes existing emails received after a specified start date. If no date is specified, it processes emails from one schedule interval ago. For example, if the integration is scheduled to run daily, it begins processing emails from one day ago.

Use integration configuration options to limit input to specified comma-separated values. For example, depending on mailbox type, you can pull emails based on header details (from, to, subject) or key words (contains). When using more than one filter field, fields are combined with AND logic, while multiple values within a filter are combined with OR logic.

By default, email attachments are treated as separate files, but you can disable Process attachments to exclude them. Inline images are treated as part of the email body.

Downstream integrations

Results are sent only after required reviews are closed. During configuration, you can test the connection by sending results from a previous app run to your downstream integration.

Email — Send results to an email address in CSV, XLSX, or JSON format. In projects with classes, separate CSV files are generated for each class.
Connected drive — Send results in CSV, XLSX, or JSON format to a connected drive. In projects with classes, separate CSV files are generated for each class.
Custom function — Send results in JSON format using a custom Python function.

Integration function

For advanced integrations, you can write a custom integration function in Python.

For example, you might use an integration function to send results to a webhook:

1 import requests
2 
3 # Construct a list of records information
4 concise_records = []
5 for record in results['records']:
6     concise_record = {
7         "fields": record.get("results"),  # Note: This might be intended to be "fields" or a similar key
8         "classification_label": record.get("classification_label"),
9         "record_index": record.get("record_index")
10     }
11     concise_records.append(concise_record)
12 
13 # Post endpoint call template
14 url = "https://example.com/my_own_webhook"
15 response = requests.post(url, json=concise_records)
16 if response.status_code == 200:
17     print("POST request successful")
18 else:
19     print(f"POST request failed with status code {response.status_code}")
20     return None

Integration functions accept these parameters:

Parameter	Required?	Description
`results`	Required	Results of the app run in JSON format. Individual documents within the app run are exported as `records[0].results`.
`keys`	Optional	Access runtime configurations and secrets. Use `keys['custom']['<key-name>']` for custom values and `keys['secret']['<key-name>']` for org-defined secrets.

For additional guidance about custom functions, see Writing custom functions.

Configuring notifications

Notifications inform you when a deployment run starts, completes, fails, or when items are queued for review.

Runs are considered complete when they finish processing without requiring review, or when runs with required reviews are closed.

Supported notifications include:

Email — Send a rich text notification to specified email addresses when runs reach designated checkpoints. Messages include a link to access run results or reviews, as applicable. You can preview and test notification emails, but you can’t change the subject or content of messages.
Webhook — Send HTTP POST requests to a specified endpoint URL when runs reach designated checkpoints. Payloads contain event details like timestamp, run ID, status, and contextual information. You can add custom headers, preview the payload format, and send test notifications to validate the integration.

Running deployments

Commercial & Enterprise

While deployments are most beneficial when automated with upstream integrations, you can run them on demand if necessary.

In Workspaces, select the Deploy tab, then click the name of the deployment you want to run.
Click Run deployment.
If the deployment includes custom or secret keys, specify runtime values for the keys in the Advanced settings section as needed.

You can use the runtime values specified in the deployment as-is, or you can override the values by entering a different value.
Select files to process.
Click Run.

When the run completes, click the run ID to view results.

Monitoring deployments

Commercial & Enterprise

Deployment metrics help you monitor consumption, handling time, and automation rates, giving you insight into deployment efficiency.

In Workspaces, on the Deploy tab, you can enable Show automation metrics to display key metrics and trends over the past 7 days for each deployment.

Documents processed shows the total number of documents processed from submission to completion of any reviews.
Avg handling time shows the average time to process a document from submission to when the run is complete or, if human review is required, when the document is marked reviewed.
Avg automation rate shows the average percent of all fields extracted accurately as measured by unmodified human review results.

To see additional metrics with visualizations, click the name of a deployment, then select its Metrics tab.

The deployment metrics page reiterates the key metrics shown in Workspaces. To display an alert when these metrics deviate more than a specified amount, click Configure alert. Hover over any metric type and click the edit icon to add or change an alert.

The detailed report provides in-depth information about deployment metrics over the period you specify: last 6 hours, last 24 hours, last 7 days, or last 30 days. You can download the detailed report as a ZIP file containing CSV files for individual metrics.

Consumption metrics

Consumption indicates how many documents, pages, or runs were processed by a deployment. If the deployment classifies documents, you can filter by class to see consumption for specific document types.

Handling time metrics

Handling time measures the average time to process a document from submission to when the run is complete or, if human review is required, when the document is marked reviewed.

The main handling time chart displays average human review processing time versus average total processing time (including human review) for documents or runs. Data is plotted across the time range you specify, with yellow representing human review and blue showing the total. Spikes in the chart indicate longer processing times, which might represent anomalies or particularly complex cases. Use this chart to quickly gain insight into trends over time and to understand processing efficiency for automation and human review.

The Handling time distribution chart presents a histogram of processing times for documents or runs. Use the toggle to display total handling time or human review times only. The x-axis shows time intervals in minutes, while the y-axis displays the number of runs or documents. The chart includes key statistics such as mean handling time and a trimmed mean that excludes outliers above a specified percentile. A vertical red line represents the percentile cutoff. Use this chart to understand the distribution of handling times, identify common durations and outliers, and assess overall efficiency.

The Handling time by class chart lets you compare processing times across document types. Use the toggle to display total handling time or human review times only. Additionally, you can search by class name or sort the data by various criteria. A vertical dashed line indicates the overall average handling time across all classes. Use this chart to identify classes that require more processing time, which might suggest the need for app improvements or additional human review bandwidth.

Automation metrics

Automation measures how accurately fields are processed as measured by unmodified human review results.

The Automation accuracy by field chart shows the automation state of individual fields. You can search by field name or sort the data by various criteria. Use the toggle to show runtime accuracy, which is the percent of validated fields that were extracted correctly as measured by unmodified human review results. Use this chart to measure validation accuracy based on human review outcomes.

The Extraction automation rate | All fields chart shows the percent of all fields that were extracted accurately as measured by unmodified human review results. Unlike runtime accuracy, automation rate includes fields without validation rules, and fields that failed extraction. High automation rates indicate fields that are extracted accurately without needing human intervention. Low automation rates indicate fields that are extracted incorrectly or that require human correction. You can search by field name or sort the data by various criteria. Use this chart to compare automation success across fields and identify fields that require improvements. If automation rates differ from runtime accuracy, it indicates fields that have no validation rules or that failed extraction.

The Extraction automation rate chart shows the automation rate for a specific field over time. The x-axis shows the specified time range, while the y-axis displays the automation rate. The graph includes two lines: one representing the automation rate for the selected field and another showing the average automation rate across all fields. Use this chart to visualize performance over time, particularly for lower performing fields identified in the adjacent chart.

Automation states

Automation state evaluates the effectiveness of automation through validation rules and human review.

Automation state includes two key measures:

Validation outcome (valid or invalid) indicates whether a field passed validation rules. Fields are also considered valid if no validation rules apply.
Human review outcome (unmodified or modified) indicates whether a field was changed during human review.

Combined, these measures provide four automation states:

Valid and unmodified (dark green) — Result passed validation and wasn’t corrected in human review. This state indicates a high degree of extraction accuracy.
Invalid and unmodified (lighter green) — Result failed validation but wasn’t corrected because it was actually valid. This state indicates effective human review, but suggests a need to improve validation rules.
Invalid and modified (yellow) — Result failed validation and was corrected in human review. This state indicates both effective validation and effective human review, but suggests a need to improve extraction accuracy.
Valid and modified (red) — Result passed validation but was corrected in human review. This state indicates effective human review, but suggests a need to improve validation rules.

Viewing logs

Logs provide detailed insights into app and deployment runs, helping you troubleshoot issues, monitor performance, and understand how your documents are being processed.

You can access logs from the runs page of any app or deployment. To view or download a log, hover over the run you want to investigate, click the overflow icon , then select View logs or Download logs. You can download logs in CSV or JSON format.

Each log entry includes a timestamp, log level, and detailed message. Log levels indicate the severity and type of information.

INFO — General operational information, such as processing status and model calls.
WARNING — Potential issues that don’t stop execution but might require attention.
ERROR — Serious problems that might cause failures or unexpected behavior.

When troubleshooting, focus on error and warning messages first, as they often indicate the root cause of issues. Info messages provide context about standard operations and can help trace document flow through your application.

If an app uses custom or secret keys, logs include a Runtime configurations tab showing the key values used. For secret keys, only the key names are displayed, not the actual secret values.

Using advanced automation apps

Enterprise

Advanced automation apps are custom apps created by Instabase to address complex enterprise use cases.

Advanced automation apps are available from the Hub and tagged with Advanced. You can test, run, and deploy advanced automation apps just like any other app, but you can’t edit them or access an underlying automation project. Similarly, advanced automation apps are created with a specific AI runtime version, but you can’t update AI runtime for advanced automation apps like you can with standard apps.

If required for your use case, advanced automation apps might be designed with multiple review checkpoints. In this case, each review must be closed before the run can proceed or complete.

Running apps

Deleting app runs

Sharing apps

Creating deployments

Configuring integrations

Integration function

Configuring notifications

Running deployments

Monitoring deployments

Consumption metrics

Handling time metrics

Automation metrics

Automation states

Viewing logs

Using advanced automation apps