Limitations and file processing
AI Hub usage is subject to these limits.
- Instabase Drive includes up to one terabyte of storage for AI Hub projects and documents. Organizations can connect external storage if Instabase Drive storage is insufficient.
- Upload files of up to 50 MB or 800 pages each.
- Upload up to 100 MB per upload batch.
- Upload up to 100 files in a Converse conversation.
- Upload up to 500 files in a Build project.
- Process up to 1,000 files in an app run.
- AI Hub is supported on the two most recent versions of Chrome, Edge, Firefox, Opera, and Safari.
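The upload limits above can be checked locally before sending files. The following is a minimal sketch, not part of AI Hub: the `Upload` record and `check_batch` function are hypothetical names, 1 MB is assumed to mean 1,048,576 bytes, and the size and page limits are assumed to both apply to every file.

```python
from dataclasses import dataclass

MB = 1024 * 1024  # assumption: the limits above do not say whether MB is binary or decimal

MAX_FILE_BYTES = 50 * MB
MAX_FILE_PAGES = 800
MAX_BATCH_BYTES = 100 * MB
MAX_FILES = {"converse": 100, "build": 500}  # per conversation / per project


@dataclass
class Upload:
    """Hypothetical description of one file queued for upload."""
    name: str
    size_bytes: int
    pages: int


def check_batch(files, target, already_uploaded=0):
    """Return a list of problems; an empty list means the batch fits the limits."""
    problems = []
    for f in files:
        if f.size_bytes > MAX_FILE_BYTES:
            problems.append(f"{f.name}: larger than 50 MB")
        if f.pages > MAX_FILE_PAGES:
            problems.append(f"{f.name}: more than 800 pages")
    # The batch limit applies to the combined size of all files in one upload.
    if sum(f.size_bytes for f in files) > MAX_BATCH_BYTES:
        problems.append("batch: larger than 100 MB in total")
    # The file-count limit depends on where the files are going.
    if already_uploaded + len(files) > MAX_FILES[target]:
        problems.append(f"{target}: more than {MAX_FILES[target]} files")
    return problems
```

For example, `check_batch([Upload("a.pdf", 10 * MB, 12)], "converse")` returns an empty list, while a 60 MB file is reported as too large.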
Supported file types
These file types are supported for Converse and Build import.
.bat, .bashc, .c, .cc, .chtml, .cmake, .cmd, .cpp, .cs, .css, .csv, .cxx, .cy, .dockerfile, .doc, .docx, .eml, .gdoc, .go, .gsheet, .gslides, .h++, .hpp, .html, .java, .jpeg, .jpg, .js, .json, .mht, .mhtml, .mkfile, .msg, .pdf, .perl, .php, .plsql, .png, .pptx, .py, .pxi, .pyx, .r, .rd, .rs, .rtf, .ruby, .tif, .tiff, .ts, .txt, .xls, .xlsx, .xml, .yaml, .yml, .zsh
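To filter a folder before import, a file name can be compared against the extensions listed above. This is a sketch only: `is_supported` is a hypothetical helper, and it assumes that matching is case-insensitive and looks only at the final suffix, which this page does not state.

```python
from pathlib import PurePath

# Extensions copied unchanged from the list above.
SUPPORTED = set("""
.bat .bashc .c .cc .chtml .cmake .cmd .cpp .cs .css .csv .cxx .cy
.dockerfile .doc .docx .eml .gdoc .go .gsheet .gslides .h++ .hpp .html
.java .jpeg .jpg .js .json .mht .mhtml .mkfile .msg .pdf .perl .php
.plsql .png .pptx .py .pxi .pyx .r .rd .rs .rtf .ruby .tif .tiff .ts
.txt .xls .xlsx .xml .yaml .yml .zsh
""".split())


def is_supported(filename):
    """Return True if the file's final suffix appears in the supported list."""
    # Assumption: case-insensitive match on the last suffix only.
    return PurePath(filename).suffix.lower() in SUPPORTED
```

With these assumptions, `is_supported("report.PDF")` is true and `is_supported("archive.zip")` is false.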
In commercial and enterprise Build projects with file splitting enabled, multipage files can include multiple documents. For best results in all other projects and conversations, use one file for each document.
Processed Build results can be exported in CSV or Excel format.
Digitization details
When you upload files to AI Hub, the default digitization process includes these steps.
- Optical character recognition (OCR) is performed on both typed and handwritten text.
- Page rotation, skew, and warp are corrected.
- Signatures, checkboxes, and barcodes (both numeric and non-numeric formats) are detected, and appropriate markers are added to the text space.
- PDF layers are flattened to include all text and image elements.
- Email attachments are separated and treated as individual files. Inline images are treated as part of the email body.
- Excel files are processed in their native format, unless you disable Process spreadsheets natively in digitization settings for your conversation or project. Native processing offers better results for wide tables, but doesn’t support embedded objects, such as charts, or source highlighting in results.
Excel limitations
Excel spreadsheets are subject to these limitations.
Upload limitations
- Files must be less than 10 MB.
- Files can contain multiple small- to medium-sized tables on one sheet (totaling up to 200 rows and 30 columns), or one large table of up to 400 columns.
Extraction limitations
Total extracted results are limited to 80,000 cells. For example:
- If extracting 400 columns, you can retrieve up to 200 rows (400 × 200 = 80,000).
- If extracting 10 columns, you can retrieve up to 8,000 rows (10 × 8,000 = 80,000).
You can adjust the number of columns and rows as needed within the 80,000 cell limit.
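The trade-off between columns and rows is plain integer arithmetic. The sketch below uses hypothetical helper names (`max_rows`, `within_limit`) and models only the 80,000-cell extraction limit, not the separate upload limits.

```python
CELL_LIMIT = 80_000  # total extracted cells, from the limit stated above


def max_rows(columns):
    """Largest whole number of rows that fits the cell limit for a column count."""
    if columns < 1:
        raise ValueError("columns must be at least 1")
    return CELL_LIMIT // columns


def within_limit(columns, rows):
    """Return True if a columns-by-rows extraction stays within the cell limit."""
    return columns * rows <= CELL_LIMIT


print(max_rows(400))  # 200
print(max_rows(10))   # 8000
```

Because the division rounds down, a column count that does not divide 80,000 evenly leaves a few cells unused; 30 columns, for instance, allows 2,666 rows.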
Unsupported features
- Advanced Excel features such as macros and data validation.
- Triangular or nested tables.
- Tables with multirow or frozen headers.
- Tables with empty rows or columns.