Supported LLM providers

AI Hub supports specific large language model providers and APIs for use with AI Hub deployments. This list is subject to change as new models become available or as requirements evolve.

Customers who elect to provision their own LLM endpoints must ensure compliance with these specifications to maintain compatibility and receive full support under applicable Service Level Agreements.

Customers are responsible for ensuring their LLM endpoints remain compatible with the latest specifications. Use of unsupported models or endpoints that don’t maintain complete API compatibility isn’t supported and is excluded from any Service Level Agreements or performance warranties.

For questions or clarifications regarding supported providers and models, contact the support team.

General requirements

All customer LLM endpoints must provide official models from supported providers. In addition:

  • Endpoints must maintain complete API compatibility, including all available parameters, options, model variants, and authentication methods.

  • Custom or alternative models not listed in this documentation aren’t supported.

  • Models and required capabilities might change over time.

Supported providers and APIs

Google Gemini

Available in agent mode with AI runtime 2.x.

AI runtime 2.x operates exclusively in agent mode and supports only Google Gemini models:

  • LLM Models:
    • gemini-2.5-flash
    • gemini-2.5-pro
  • Access Methods:
    • Google Cloud Vertex AI

Legacy Mode (AI Runtime 1.0.13)

AI runtime 1.0.13 is the final release in the 1.x series and will expire according to the standard timeline, unless otherwise requested. Contact Instabase Support to request timeline adjustments.

AI runtime 1.0.13 continues to support the following model providers in legacy mode:

OpenAI API
  • LLM Models:
    • gpt-3.5-turbo-16k (deprecated)
    • gpt-3.5-turbo-1106 (deprecated)
    • gpt-4o-2024-05-13
    • gpt-4o-2024-08-06
    • gpt-4o-2024-11-20
    • gpt-4o-mini-2024-07-18
  • Embedding Models:
    • text-embedding-ada-002
  • Moderation:
    • omni-moderation-latest
Azure OpenAI API
  • LLM Models:
    • See OpenAI LLM Models above
  • Embedding Models:
    • See OpenAI Embedding Models above
AWS Bedrock (Anthropic Claude)
  • LLM models Cross-region inference endpoints are supported
    • anthropic.claude-3-5-sonnet-20241022-v2:0
    • anthropic.claude-3-5-sonnet-20240620-v1:0
    • anthropic.claude-3-5-haiku-20241022-v1:0
    • anthropic.claude-3-7-sonnet-20250219-v1:0
  • Embedding models:
    • amazon.titan-embed-text-v2:0