AI Gateway

AI Gateway documentation

Spice provides a high-performance, OpenAI API-compatible AI Gateway optimized for managing and scaling large language models (LLMs). Additionally, Spice offers tools for Enterprise Retrieval-Augmented Generation (RAG), such as SQL query across federated datasets and an advanced search feature (see Search).

Spice supports full OpenTelemetry observability, enabling detailed tracking of data flows and requests for full transparency and easier debugging.

Supported Models

Spice supports a variety of LLMs, including OpenAI, Azure OpenAI, Anthropic, Groq, Hugging Face, and more (see Model Providers for all supported models).

Core Features

Custom Tools: Equip models with tools to interact with the Spice runtime.
System Prompts: Customize system prompts and override defaults for v1/chat/completion.

For detailed configuration and API usage, refer to the API Documentation.

Example: Configuring an OpenAI Compatible Model

To use a language model hosted on OpenAI (or compatible), specify the openai path and model ID in from.

Example spicepod.yml:

models:
  - from: openai:gpt-4o-mini
    name: openai
    params:
      openai_api_key: ${ secrets:SPICE_OPENAI_API_KEY }

  - from: openai:llama3-groq-70b-8192-tool-use-preview
    name: groq-llama
    params:
      endpoint: https://api.groq.com/openai/v1
      openai_api_key: ${ secrets:SPICE_GROQ_API_KEY }

For details, see OpenAI (or Compatible) Language Models.

Last updated 1 year ago

Was this helpful?

hashtagSupported Models​arrow-up-right

hashtagCore Features​arrow-up-right

hashtagExample: Configuring an OpenAI Compatible Model​arrow-up-right

Supported Models

Core Features

Example: Configuring an OpenAI Compatible Model