Testing apps | Instabase AI Hub Documentation

Commercial & Enterprise

Accuracy testing ensures automation apps meet your accuracy requirements before deployment. Through systematic comparison of app results against verified ground truth values, you can measure performance, identify improvement areas, and confidently deploy apps that deliver reliable results.

How accuracy testing works

Accuracy testing compares app run results to verified values for a set of documents, showing you exactly how accurate your app is and where improvements are needed.

Ground truth datasets are collections of documents with verified, correct values that serve as the benchmark for measuring app performance. These datasets allow you to test consistently as you iterate on your app.

The testing workflow

Follow these high-level steps to implement accuracy testing for automation apps.

Create ground truth datasets or update existing datasets associated with your app.

Verify ground truth values for any new or updated datasets.

You can run accuracy tests against outdated datasets, but doing so typically lowers accuracy metrics, because results aren’t aligned with existing ground truth values.
Conduct accuracy testing on the new app version.

View or compare accuracy tests and examine error patterns to identify areas for improvement.
Review accuracy metrics and identify specific areas where your app needs improvement.
Make changes to your app based on test results, then test again to measure improvements.

Repeat this cycle until your app meets your accuracy thresholds, then you’re ready to deploy.

Best practices

Test early and often — Create ground truth datasets as soon as you have a working app, then test with each iteration to track improvements.

Use representative data — Ensure your ground truth datasets include documents similar to your production use case.

Separate development and validation data — Use project files for initial testing, then validate with completely new documents to assess real-world performance.