Supported LLM providers

AI Hub supports specific large language model providers and APIs for use with AI Hub deployments. This list is subject to change as new models become available or as requirements evolve.

Customers who elect to provision their own LLM endpoints must ensure compliance with these specifications to maintain compatibility and receive full support under applicable Service Level Agreements.

Customers are responsible for ensuring their LLM endpoints remain compatible with the latest specifications. Use of unsupported models or endpoints that don’t maintain complete API compatibility isn’t supported and is excluded from any Service Level Agreements or performance warranties.

For questions or clarifications regarding supported providers and models, contact the support team.

General requirements

All customer LLM endpoints must provide official models from supported providers. In addition:

  • Endpoints must maintain complete API compatibility, including all available parameters, options, model variants, and authentication methods.

  • Custom or alternative models not listed in this documentation aren’t supported.

  • Models and required capabilities might change over time.

Supported providers and APIs

OpenAI API

Supported as of AI Runtime 1.0.8:

  • LLM Models:
    • gpt-3.5-turbo-16k
    • gpt-3.5-turbo-1106
    • gpt-4o-2024-05-13
    • gpt-4o-2024-08-06
    • gpt-4o-2024-11-20
    • gpt-4o-mini-2024-07-18
  • Embedding Models:
    • text-embedding-ada-002
  • Moderation:
    • text-moderation-stable

Azure OpenAI API

Supported as of AI Runtime 1.0.8:

  • LLM Models:
    • See OpenAI LLM Models above
  • Embedding Models:
    • See OpenAI Embedding Models above

AWS Bedrock (Anthropic Claude)

Supported as of AI Runtime 1.0.8:

  • LLM models Cross-region inference endpoints are supported
    • anthropic.claude-3-5-sonnet-20241022-v2:0
    • anthropic.claude-3-5-sonnet-20240620-v1:0
    • anthropic.claude-3-5-haiku-20241022-v1:0
    • anthropic.claude-3-7-sonnet-20250219-v1:0
  • Embedding models:
    • amazon.titan-embed-text-v2:0