App runs endpoint

The App runs endpoint lets you create an app run for an AI Hub app and returns a run ID. A deployment-specific version lets you run deployments, also returning a run ID. You can then use the run ID to check the status of an app or deployment run and, when complete, get the results.

curl examples reference the API_ROOT, API_TOKEN, and IB_CONTEXT values as variables. If you don’t set up these variables, define them in the request.
For Python request examples, it’s assumed you installed the AI Hub SDK. Python examples include code initializing the API client, with values you must define.
All endpoints support the IB-Context header. Organization members must set the IB-Context header as their organization ID to complete the request with their organization account. While optional, it’s a best practice to include the IB-Context header in all requests.
All endpoints use the standard HTTP response status codes. For each endpoint, some common status codes are listed.

The AI Hub interface includes an API runner tool you can use to generate end-to-end code samples for running an app or deployment by API.

Use the API runner tool with an app or deployment...

Run app

Method	Syntax
POST	`API_ROOT/v2/apps/runs`

Description

Run an app by its name or app ID by sending a POST request to API_ROOT/v2/apps/runs. The input for the run is specified using a batch ID or an input file path. To run deployments by API, use the Run deployment endpoint.

Any specified input or output is validated against the context set by the IB-Context header. For example, if the context is set to your community account, but the batch ID used as input for the run is stored in your organization, the call fails.

You can find an app ID by opening the app in AI Hub and clicking App details in the left panel. The app ID is also listed in the app URL, such as https://aihub.instabase.com/hub/apps/**528c36e8-ac5b-490d-a41b-7eec9c404b87**.

Request body

Parameters are required unless marked as optional.

Parameter	Type	Description	Values
`app_id`	string	Required unless using `app_name`. The app ID of the AI Hub app through which you want to process the files.	A valid app ID of a prebuilt or custom AI Hub app.
`app_name`	string	Required unless using `app_id`. The name of the AI Hub app through which you want to process the files.	A valid name of a prebuilt or custom AI Hub app.
`owner`	string	Optional. The account that generated the app. If not specified, defaults to your AI Hub username.	For custom AI Hub apps belonging to you, don’t specify a value. For public AI Hub apps published by Instabase, specify `instabase`.
`batch_id`	string	Required unless using `input_dir`. The batch ID of a batch created with the Batches endpoint. All files uploaded to the batch are used as input for the run.	A valid integer batch ID.
`input_dir`	string	Required unless using `batch_id`. The path of the input folder in a connected drive or Instabase Drive.	A complete path to the input folder. See specifying file paths.
`version`	string	Optional. Which version of the app to use. If not specified, defaults to the latest production version.	A valid semantic version string that exists for the app.
`output_workspace`	string	Optional. The workspace in which to run the app. The output is saved to the default drive of the specified workspace. If not defined, the default is: - Community accounts: Runs in and saves to the personal workspace’s Instabase Drive (`<USER-ID>/my-repo/Instabase Drive`). - Organization accounts: Runs in and saves to the organization’s default drive (`<ORGANIZATION-ID>/<USER-ID>/<default-drive>`).	A valid name of a workspace in the specified community or organization context, such as `shared-workspace-accounting`.
`output_dir`	string	Optional. Defines a specific location for the output to be saved in a connected drive or Instabase drive. If defined, overrides the `output_workspace` value.	A complete path to the output folder. See specifying file paths.
`settings`	dict	Optional. Advanced settings for your run.
`settings/runtime_config`	dict	Optional. A dictionary containing the runtime configuration for the app run, for use in validation functions. See runtime config for details.
`settings/webhook_config`	dict	Optional. Configure the webhook URL that’s called on app run completion. See webhook parameters.
`settings/webhook_config/url`	string	Optional. The webhook URL to which a HTTP request is sent when the run is completed.
`settings/webhook_config/headers`	dict	Optional. Configure the headers that are sent alongside the HTTP request. The format is `{ "<HTTP-HEADER>": "<VALUE>"}`.

Response status

Status	Meaning
200 OK	Run started successfully.

Response schema

The response body is a JSON object known as a run object. The run object provides information about the run and its progress, but not the run results. You can use the run’s id to poll the Run status endpoint and the Run results endpoint.

Key	Type	Description
`id`	string	Run ID of the run.
`status`	string	Status of the run. Possible values are `COMPLETE`, `FAILED`, `CANCELLED`, `RUNNING`, or `STOPPED_AT_CHECKPOINT`.
`msg`	string	Optional. Message about the run.
`start_timestamp`	integer	When the run started, in Unix time.
`finish_timestamp`	integer	When the run finished, in Unix time. `null` if run is still in progress.

Examples

Request (curl)

$ curl "${API_ROOT}/v2/apps/runs" \
>   -H "Authorization: Bearer ${API_TOKEN}" \
>   -H "IB-Context: ${IB_CONTEXT}"\
>   -H "Content-Type: application/json" \
>   -d '{
>         "batch_id": "<BATCH-ID>",
>         "app_name": "<APP-NAME>"
>       }'

Request (Python SDK)

1 from aihub import AIHub
2 
3 client = AIHub(api_root="<API-ROOT>",
4                api_key="<API-TOKEN>",
5                ib_context="<IB-CONTEXT>")
6 result = client.apps.runs.create(app_name='<APP-NAME>',
7                                  batch_id='<BATCH-ID>')

Response

1 {
2   "id": "<RUN-ID>",
3   "status": "RUNNING",
4   "msg": "",
5   "start_timestamp": 1709592306000,
6   "finish_timestamp": null,
7 }

Run deployment

Method	Syntax
POST	`API_ROOT/v2/aihub/deployments/<DEPLOYMENT-ID>/run`

Description

Run a deployment by sending a POST request to API_ROOT/v2/aihub/deployments/<DEPLOYMENT-ID>/run, using the request URL to specify the ID of the deployment. The input for the run is specified using a batch ID or an input file path.

You can find a deployment’s ID by opening the deployment in AI Hub and looking at the site URL, such as https://aihub.instabase.com/deployments/**01902d6f-bb35-74cb-bd27-c09b38bbf20a**/runs.

Request body

Parameters are required unless marked as optional.

Parameter	Type	Description	Values
`batch_id`	string	Required unless using `input_dir`. The batch ID of a batch created with the Batches endpoint. All files uploaded to the batch are used as input for the run.	A valid integer batch ID.
`input_dir`	string	Required unless using `batch_id`. The path of the input folder in a connected drive or Instabase Drive.	A complete path to the input folder. See specifying file paths.
`output_workspace`	string	Optional. The workspace in which to run the app. The output is saved to the default drive of the specified workspace. If not defined, the default is: - Community accounts: Runs in and saves to the personal workspace’s Instabase Drive (`<USER-ID>/my-repo/Instabase Drive`). - Organization accounts: Runs in and saves to the organization’s default drive (`<ORGANIZATION-ID>/<USER-ID>/<default-drive>`).	A valid name of a workspace in the specified community or organization context, such as `shared-workspace-accounting`.
`output_dir`	string	Optional. Defines a specific location for the output to be saved in a connected drive or Instabase drive. If defined, overrides the `output_workspace` value.	A complete path to the output folder. See specifying file paths.
`settings`	dict	Optional. Advanced settings for your run.
`settings/runtime_config`	dict	Optional. A dictionary containing the runtime configuration for the run, for use in validation functions. See runtime config for details.
`settings/webhook_config`	dict	Optional. Configure the webhook URL that’s called on run completion. See webhook parameters.
`settings/webhook_config/url`	string	Optional. The webhook URL to which a HTTP request is sent when the run is completed.
`settings/webhook_config/headers`	dict	Optional. Configure the headers that are sent alongside the HTTP request. The format is `{ "<HTTP-HEADER>": "<VALUE>"}`.

Response status

Status	Meaning
202 ACCEPTED	Successfully initiated an asynchronous operation to run the deployment.

Response schema

The response body contains the run ID, that you can use to poll the Run status endpoint and the Run results endpoint.

Key	Type	Description
`job_id`	string	Run ID of the run.

Examples

Request (curl)

$ curl "${API_ROOT}/v2/aihub/deployments/<DEPLOYMENT-ID>/run" \
>   -H "Authorization: Bearer ${API_TOKEN}" \
>   -H "IB-Context: ${IB_CONTEXT}"\
>   -H "Content-Type: application/json" \
>   -d '{
>         "batch_id": "<BATCH-ID>",
>       }'

Response

1 {
2   "job_id": "<RUN-ID>",
3 }

Run status

Method	Syntax
GET	`API_ROOT/v2/apps/runs/<RUN-ID>`

Description

Get the status of a run by sending a GET request to API_ROOT/v2/apps/runs/<RUN-ID>. The run’s id value is returned in the response of the initial run.

Request body

There is no request body. Use the request URL to specify a run id.

Response status

Status	Meaning
200 OK	Success.

Response schema

Key	Type	Description
`id`	string	Run ID of the run.
`status`	string	Status of the run. Possible values are `COMPLETE`, `FAILED`, `CANCELLED`, `RUNNING`, or `STOPPED_AT_CHECKPOINT`.
`msg`	string	Optional. Message about the run.
`start_timestamp`	integer	When the run started, in Unix time.
`finish_timestamp`	integer	When the run finished, in Unix time. `null` if run is still in progress.

Examples

Request (curl)

$ curl "${API_ROOT}/v2/apps/runs/<RUN-ID>" \
>   -H "Authorization: Bearer ${API_TOKEN}"\
>   -H "IB-Context: ${IB_CONTEXT}"

Request (Python SDK)

1 from aihub import AIHub
2 
3 client = AIHub(api_root="<API-ROOT>",
4                api_key="<API-TOKEN>",
5                ib_context="<IB-CONTEXT>")
6 status = client.apps.runs.status('<RUN-ID>')

Response

1 {
2   "id": "<RUN-ID>",
3   "status": "COMPLETE",
4   "msg": "Completed",
5   "start_timestamp": 1709592306000,
6   "finish_timestamp": 1709592306500,
7 }

Run results

Method	Syntax
GET	`API_ROOT/v2/apps/runs/<RUN-ID>/results`

Description

Get the results of a completed run by sending a GET request to API_ROOT/v2/apps/runs/<RUN-ID>/results. The run id value is returned in the response of the initial run.

You can use this endpoint to get the results of a deployment run. Use the run ID returned by the deployment run.

Query parameters

Query parameter	Description
`include_review_results`	Optional. Whether to include human review details in the results. When set to `true`, details such as review status and edits at the run or document level and the extracted field level are included. See the `include_review_results` section of the response schema for details.
`include_confidence_scores`	Optional. Whether to include confidence scores in the results. When set to `true`, classification confidence scores at the run or document level and extraction confidence scores at the extracted field level are included. See the `include_confidence_scores` section of the response schema for details.
`include_validation_results`	Optional. Whether to include validation status in the results. When set to `true`, validation results at the run or document level and extracted field level are included. See the `include_validation_results` section of the response schema for details.
`include_source_info`	Optional. Whether to include source information in the results. When set to `true`, source details such as the image path of the generated image are included in the results. See the `include_source_info` section of the response schema for details.
`file_offset`	Optional, defaults to `0`. The initial file index to start returning results from.

Response status

Status	Meaning
200 OK	Results successfully retrieved.

Response schema

The response body is a JSON object containing the results of the run.

Key	Type	Description
`batch_id`	integer	Optional. The batch ID.
`files`	list	A list of files processed during the run.
`files/original_file_name`	string	The original name of the file processed.
`files/documents`	list	An array containing each document within the file.
`files/documents/fields`	list	A list containing the extracted fields from the document, each with its field name, extracted value, and type. See `<DOCUMENT-FIELD>` structure below.
`files/documents/class_name`	string	The classification label of the document. Can be `null` if classification is not applicable.
`files/documents/post_processed_paths`	list	An array of strings, each representing a path to post-processed documents.
`has_more`	boolean	Indicates whether additional results are available beyond those included in the current response.

For each extracted field, there is a <DOCUMENT-FIELD> object structure:

Key	Type	Description
`field_name`	any	The name of the field.
`value`	any	The extracted value of the field.
`type`	string	The type of the field.

Depending on the included query parameters, the following information can also be included in the results.

include_review_results

The following fields are included in the response when using the include_review_results query parameter.

Run or document level:

Key	Type	Description
`review_completed`	boolean	Indicates if the run or document has completed review.
`files/documents/review_completed`	boolean	Indicates if the document has been marked as reviewed.
`files/documents/class_edit_history`	list	An array containing the history of edits to the document’s class.
`files/documents/class_edit_history/timestamp`	string	Datetime string of the class edit history event.
`files/documents/class_edit_history/user_id`	string	User ID of the user who edited the class.
`files/documents/class_edit_history/modifications`	list	List of class modifications made in this single edit history event.
`files/documents/class_edit_history/modifications/message`	string	Message of class edit history modification.

The review process can include manually correcting values. The Run results endpoint doesn’t support returning the original and corrected values.

Extracted field level:

Key	Type	Description
`edit_history`	list	An array containing the history of edits to the document’s field.
`files/edit_history/timestamp`	string	A datetime string of the field edit history event.
`files/edit_history/user_id`	string	User ID of user who edited the field’s value.
`files/edit_history/modifications`	list	List of field modifications made in this single edit history event.
`files/edit_history/modifications/message`	string	Message of field edit history modification.

include_confidence_scores

The following fields are included in the response when using the include_confidence_scores query parameter.

Run or document level:

Key	Type	Description
`files/documents/classification_confidence/ocr`	float	The classification model’s confidence to classify the document. Takes values from [0,1].

Extracted field level:

Key	Type	Description
`confidence/model`	float	The model’s confidence in the extracted value. Takes values from `[0,1]`.

include_validation_results

The following fields are included in the response when using the include_validation_results query parameter.

Run or document level:

Key	Type	Description
`files/validations/final_result_pass`	boolean	Indicates if the document has passed all validation rules.

Extracted field level:

Key	Type	Description
`validations/valid`	boolean	Indicates if the document has passed validation rules pertaining to this field.
`validations/alerts`	list	Any alerts for field-level validation rules. Populated only if the `validations/valid` value is `false`.

include_source_info

The following fields are included in the response when using the include_source_info query parameter.

Run or document level:

Key	Type	Description
`files/documents/post_processed_paths`	list	An array of strings, each representing a path to post-processed documents.

Extracted field level:

Key	Type	Description
`source_coordinates/top_x`	float	Top-left x coordinate of the bounding box.
`source_coordinates/top_y`	float	Top-left y coordinate of the bounding box.
`source_coordinates/bottom_x`	float	Bottom-right x coordinate of the bounding box.
`source_coordinates/bottom_y`	float	Bottom-right y coordinate of the bounding box.
`source_coordinates/page_number`	integer	Zero-indexed page number of the bounding box.

Examples

Simple request (curl)

A simple request returns extracted field names and values:

$ curl "${API_ROOT}/v2/apps/runs/<RUN-ID>/results" \
> -H "Authorization: Bearer ${API_TOKEN}"`
> -H "IB-Context: ${IB_CONTEXT}"

Simple request (Python SDK)

1 from aihub import AIHub
2 
3 client = AIHub(api_root="<API-ROOT>",
4                api_key="<API-TOKEN>",
5                ib_context="<IB-CONTEXT>")
6 results = client.apps.runs.results('<RUN-ID>')

Response (simple request)

1 {
2   "has_more": false,
3   "batch_id": "9706",
4   "files": [
5     {
6       "original_file_name": "test.png",
7       "documents": [
8         {
9           "class_name": "Wage And Tax Statement",
10           "fields": [
11             {
12               "value": "123 STREET RD ANYWHERE, USA,12345",
13               "type": "TEXT",
14               "field_name": "employers_address_and_ZIP_code"
15             }
16           ],
17           "doc_id": null
18         }
19       ]
20     },
21     {
22       "original_file_name": "test2.png",
23       "documents": [
24         {
25           "class_name": "Driver License",
26           "fields": [
27             {
28               "value": "Male",
29               "type": "TEXT",
30               "field_name": "sex"
31             }
32           ],
33           "doc_id": null
34         }
35       ]
36     }
37   ]
38 }

Request with query parameters (curl)

This request includes the include_confidence_scores and include_source_info query parameters. Confidence scores and source information (coordinates) are returned:

$ curl "${API_ROOT}/v2/apps/runs/<RUN-ID>/results?include_confidence_scores=true&include_source_info=true" \
>   -H "Authorization: Bearer ${API_TOKEN}"`
>   -H "IB-Context: ${IB_CONTEXT}"\

Request with query parameters (Python SDK)

1 from aihub import AIHub
2 
3 client = AIHub(api_root="<API-ROOT>",
4                api_key="<API-TOKEN>",
5                ib_context="<IB-CONTEXT>")
6 results = client.apps.runs.results('<RUN-ID>',
7                                    include_confidence_score=True,
8                                    include_source_info=True)

Response (request with query parameters)

1 {
2   "has_more": false,
3   "batch_id": "9706",
4   "files": [
5     {
6       "original_file_name": "test.png",
7       "documents": [
8         {
9           "class_name": "Wage And Tax Statement",
10           "fields": [
11             {
12               "value": "123 STREET RD ANYWHERE, USA,12345",
13               "type": "TEXT",
14               "confidence": {
15                 "model": 0.59005505
16               },
17               "source_coordinates": [
18                 {
19                   "top_x": 36.0,
20                   "top_y": 181.0,
21                   "bottom_x": 84.0,
22                   "bottom_y": 212.0,
23                   "page_number": 0
24                 },
25                 {
26                   "top_x": 90.0,
27                   "top_y": 182.0,
28                   "bottom_x": 198.0,
29                   "bottom_y": 211.0,
30                   "page_number": 0
31                 }
32               ],
33               "field_name": "employers_address_and_ZIP_code"
34             }
35           ],
36           "post_processed_paths": [
37             "instabase-org/orguser/fs/S3 Drive/app-runs/dd97-42cb-8b8e-28a63066887e/4180-8508-b103afb67185/s1_process_files/images/test.png.PNG"
38           ],
39           "doc_id": null
40         }
41       ]
42     },
43     {
44       "original_file_name": "test2.png",
45       "documents": [
46         {
47           "class_name": "Driver License",
48           "fields": [
49             {
50               "value": "Male",
51               "type": "TEXT",
52               "confidence": {
53                 "model": 0.691
54               },
55               "source_coordinates": [],
56               "field_name": "sex"
57             }
58           ],
59           "post_processed_paths": [
60             "instabase-org/orguser/fs/S3 Drive/app-runs/dd97-42cb-8b8e-28a63066887e/4180-8508-b103afb67185/s1_process_files/images/test2.png.PNG"
61           ],
62           "doc_id": null
63         }
64       ]
65     }
66   ]
67 }

Runtime config

You can use the runtime_config setting to specify key-value pairs to pass into the run at runtime. These values are propagated to downstream processes and can be referenced in validation functions.

For example, if your validation function’s behavior varies based on time of year, you might use runtime_config to pass in the current date, for example:

1 {
2   "current_date": "06/01/2024",
3   ...
4 }

Webhook parameters

You can use the webhook_config setting to ensure your application is notified when a run completes. AI Hub POSTs JSON-encoded data of the format below to the webhook endpoint:

$ # body
> {
> "status": <string>,
> "msg": <string>,
> "job_id": <string>,
> "input_dir": <string>,
> "output": <string>
> }

The response body contains the following fields:

status: "OK" | "ERROR"
msg: (optional) Error message. Present only if status is ERROR.
job_id: A unique identifier for the run.
input_dir: Input directory.
output: The full path to the root output folder.

To acknowledge receipt of the event, your endpoint must return a 2xx HTTP status code. All response codes outside this range, including 3xx codes, indicate to AI Hub that you did not receive the event.

If AI Hub does not receive a 2xx HTTP status code, the notification attempt is repeated up to 7 times.

Specifying file paths

When running an app using the v2/apps/runs endpoint, there are two methods to specify the app’s input:

Batch: Create a batch using the Batches endpoint, upload files from your local filesystem, then use the batch_id parameter to specify the batch. All files in the batch are processed.
Connected drive: Use the input_dir parameter to specify a file path to an input folder in a connected drive. All files in the input folder are processed.

When specifying a file path in a connected drive, the format varies if you have a community or organization account.

Community account: /<USER-ID>/my-repo/fs/<DRIVE-NAME>/<FOLDER>/
Organization account: /<ORGANIZATION-ID>/<WORKSPACE>/fs/<DRIVE-NAME>/<FOLDER>/

Value	Description
`<USER-ID>`	Your user ID. You can find this on the Settings > APIs page, under User ID.
`<ORGANIZATION-ID>`	Your organization ID. You can find this on the Settings > APIs page, under Organization ID.
`<WORKSPACE>`	This value reflects the workspace name (shared workspaces) or your user ID (personal workspaces). You can find this value by selecting the workspace in Workspaces, then looking at the `workspace` query string in the URL. For example: - https://aihub.instabase.com/workspaces/create?workspace=New_Workspace - https://aihub.instabase.com/workspaces/create?workspace=john.doe_gmail.com
`<DRIVE-NAME>`	The drive’s display name. You can see a drive’s display name in by opening the Data Sources panel in a workspace where the drive is connected. For example, the Instabase Drive’s display name is `Instabase Drive`.
`<FOLDER>`	The exact name of the folder, as displayed in the filesystem.

$	curl "${API_ROOT}/v2/apps/runs" \
>	-H "Authorization: Bearer ${API_TOKEN}" \
>	-H "IB-Context: ${IB_CONTEXT}"\
>	-H "Content-Type: application/json" \
>	-d '{
>	"batch_id": "<BATCH-ID>",
>	"app_name": "<APP-NAME>"
>	}'

1	from aihub import AIHub
2
3	client = AIHub(api_root="<API-ROOT>",
4	api_key="<API-TOKEN>",
5	ib_context="<IB-CONTEXT>")
6	result = client.apps.runs.create(app_name='<APP-NAME>',
7	batch_id='<BATCH-ID>')

1	{
2	"id": "<RUN-ID>",
3	"status": "RUNNING",
4	"msg": "",
5	"start_timestamp": 1709592306000,
6	"finish_timestamp": null,
7	}

$	curl "${API_ROOT}/v2/aihub/deployments/<DEPLOYMENT-ID>/run" \
>	-H "Authorization: Bearer ${API_TOKEN}" \
>	-H "IB-Context: ${IB_CONTEXT}"\
>	-H "Content-Type: application/json" \
>	-d '{
>	"batch_id": "<BATCH-ID>",
>	}'

$	curl "${API_ROOT}/v2/apps/runs/<RUN-ID>" \
>	-H "Authorization: Bearer ${API_TOKEN}"\
>	-H "IB-Context: ${IB_CONTEXT}"

1	{
2	"id": "<RUN-ID>",
3	"status": "COMPLETE",
4	"msg": "Completed",
5	"start_timestamp": 1709592306000,
6	"finish_timestamp": 1709592306500,
7	}

$	curl "${API_ROOT}/v2/apps/runs/<RUN-ID>/results" \
>	-H "Authorization: Bearer ${API_TOKEN}"`
>	-H "IB-Context: ${IB_CONTEXT}"

1	{
2	"has_more": false,
3	"batch_id": "9706",
4	"files": [
5	{
6	"original_file_name": "test.png",
7	"documents": [
8	{
9	"class_name": "Wage And Tax Statement",
10	"fields": [
11	{
12	"value": "123 STREET RD ANYWHERE, USA,12345",
13	"type": "TEXT",
14	"field_name": "employers_address_and_ZIP_code"
15	}
16	],
17	"doc_id": null
18	}
19	]
20	},
21	{
22	"original_file_name": "test2.png",
23	"documents": [
24	{
25	"class_name": "Driver License",
26	"fields": [
27	{
28	"value": "Male",
29	"type": "TEXT",
30	"field_name": "sex"
31	}
32	],
33	"doc_id": null
34	}
35	]
36	}
37	]
38	}

$	# body
>	{
>	"status": <string>,
>	"msg": <string>,
>	"job_id": <string>,
>	"input_dir": <string>,
>	"output": <string>
>	}