Guides

Managed Categories and Resources

Overview

Pay-i manages the Categories and Resources defined below. The costs, across all unit types, for these resources are automatically kept up to date. Further, Pay-i maintains the history of these prices over time so you can see how price changes affect your business.

Support for significantly more Resources and Providers are in active development. If there is a specific Resource or Provider that you'd like to see us support and maintain automatically please contact [email protected].

If you want to define your own, see Custom Categories and Resources.

When using the Ingest API with these resources, first use the Get all Resources for a Category API to see the complete list of resources and their available unit types.

Model Versions and Time-Based Resolution

In the tables below, you'll find both model aliases (like gpt-4o) and specific versioned models (like gpt-4o-2024-08-06). Model aliases are simplified, non-versioned names that point to specific versioned models based on the timestamp of your request.

How Time-Based Resolution Works

Pay-i uses a time-based partition system for model resolution:

  1. Each specific model version (like gpt-4o-2024-08-06) has an associated release date
  2. When you use a model name in your code, Pay-i automatically resolves it to the correct version based on the timestamp of the event in your request, not when you call the ingest API
  3. The combination of (model name, event timestamp) uniquely resolves to exactly one resource version
  4. Model versions are partitioned in time based on their release dates, with the cutoff being midnight UTC

This is especially important for historical data backfilling: when ingesting past events, model names map to the versions that were current at the time of the original requests, not to the current versions.

Model naming conventions vary across providers. Some providers (like OpenAI, Anthropic, and Google Vertex AI) offer model aliases, while others (like Azure OpenAI and AWS Bedrock) typically use specific model version identifiers.

Both approaches have their own benefits and tradeoffs:

  • Using model aliases:

    • No need to update code when newer versions are released
    • Simplified usage without remembering specific version dates
    • Important Note: Model aliases are controlled by the provider (e.g., OpenAI, Anthropic), not by Pay-i. When a provider changes what version an alias points to (e.g., OpenAI updating gpt-4o to point to a newer version), your code automatically uses that new version on your next API call. This happens without notification or your explicit approval.
  • Using specific model versions:

    • Explicit control over which model version you're using
    • Consistency in behavior across deployments
    • Version locking for compliance or quality control
    • You decide when to upgrade to newer versions, not the provider

You can use either approach in your code based on your specific requirements:

# Using a model name (automatically resolves based on timestamp)
client.ingest.units(
    category="system.openai",
    resource="gpt-4o",  # Will resolve to the appropriate version based on timestamp
    units={"text": {"input": 10, "output": 50}}
)

# Using a specific versioned model
client.ingest.units(
    category="system.openai",
    resource="gpt-4o-2024-08-06",  # Explicitly using this specific version
    units={"text": {"input": 10, "output": 50}}
)

Category: system.openai

Pay-i supports OpenAI models including GPT-4o, GPT-4.1, GPT-4, GPT-3.5 Turbo, and the O-series models, as well as text embeddings and DALL-E image generation. These models offer varying capabilities from basic text completion to multimodal interactions with images and video.

For the most current information on OpenAI models and their capabilities, see OpenAI's Models documentation.

Resource FamilyModel Versions
gpt-4ogpt-4o, gpt-4o-2024-08-06, gpt-4o-2024-05-13
gpt-4o-minigpt-4o-mini, gpt-4o-mini-2024-07-18
gpt-4o-realtimegpt-4o-realtime-preview, gpt-4o-realtime-preview-2024-12-17, gpt-4o-realtime-preview-2024-10-01 (Only trackable via manual Ingest API, not available through auto-instrumentation)
gpt-4o-mini-realtimegpt-4o-mini-realtime-preview, gpt-4o-mini-realtime-preview-2024-12-17
chatgpt-4ochatgpt-4o-latest (model used by the ChatGPT application)
gpt-4.1gpt-4.1, gpt-4.1-2025-04-14
gpt-4.1-minigpt-4.1-mini, gpt-4.1-mini-2025-04-14
gpt-4.1-nanogpt-4.1-nano, gpt-4.1-nano-2025-04-14
gpt-4gpt-4, gpt-4-0125-preview, gpt-4-1106-preview, gpt-4-0613, gpt-4-0314
gpt-4-visiongpt-4-vision-preview, gpt-4-1106-vision-preview (deprecated as of 2024-12-06)
gpt-4-turbogpt-4-turbo, gpt-4-turbo-2024-04-09
gpt-4-turbo-previewgpt-4-turbo-preview, gpt-4-0125-preview, gpt-4-1106-preview
gpt-3.5-turbogpt-3.5-turbo, gpt-3.5-turbo-0125, gpt-3.5-turbo-1106, gpt-3.5-turbo-0613, gpt-3.5-turbo-0301
o1o1, o1-2024-12-17
o1-previewo1-preview, o1-preview-2024-09-12
o1-minio1-mini, o1-mini-2024-09-12
o3o3, o3-2025-04-16
o3-minio3-mini, o3-mini-2025-01-31
o4-minio4-mini, o4-mini-2025-04-16
text-embeddingtext-embedding-3-small, text-embedding-3-large, text-embedding-ada-002
dall-e-2dall-e-2-1024x1024-standard, dall-e-2-512x512-standard, dall-e-2-256x256-standard
dall-e-3dall-e-3-1024x1024-hd, dall-e-3-1024x1792-hd, dall-e-3-1792x1024-hd, dall-e-3-1024x1024-standard, dall-e-3-1024x1792-standard, dall-e-3-1792x1024-standard

Category: system.anthropic

Pay-i integrates with Anthropic's Claude family of models, including the latest Claude 3.7 Sonnet, Claude 3.5 (Sonnet and Haiku), and the earlier Claude 3 (Opus, Sonnet, and Haiku) versions. Claude models are known for their high-quality reasoning, instruction following, and content generation capabilities.

For complete details on Anthropic's models and their specifications, see Anthropic's Claude model documentation.

Resource FamilyModel Versions
claude-3-7-sonnetclaude-3-7-sonnet-latest, claude-3-7-sonnet-20250219
claude-3-5-sonnetclaude-3-5-sonnet-latest, claude-3-5-sonnet-20240620, claude-3-5-sonnet-20241022
claude-3-5-haikuclaude-3-5-haiku-latest, claude-3-5-haiku-20241022
claude-3-opusclaude-3-opus-20240229
claude-3-sonnetclaude-3-sonnet-20240229
claude-3-haikuclaude-3-haiku-20240307

Category: system.azureopenai

Pay-i supports Azure-hosted versions of OpenAI models, including GPT-4o, GPT-4, GPT-3.5 Turbo, and text embeddings. Azure OpenAI resources offer deployment options with different pricing based on deployment configuration.

For the most up-to-date information on Azure OpenAI models and deployment options, see Azure OpenAI Service models documentation.

Resource FamilyModel Versions
gpt-4ogpt-4o-2024-05-13, gpt-4o-2024-08-06, gpt-4o-2024-11-20
gpt-4o-minigpt-4o-mini
gpt-4-turbogpt-4-turbo-2024-04-09, gpt-4-1106-preview, gpt-4-0125-preview
gpt-4gpt-4-0613, gpt-4-32k-0613
gpt-35-turbogpt-35-turbo-0125, gpt-35-turbo-1106, gpt-35-turbo-0613, gpt-35-turbo-0301, gpt-35-turbo-16k-0613
o1o1
o1-previewo1-preview
o1-minio1-mini
o3-minio3-mini
text-embeddingtext-embedding-3-large-1, text-embedding-3-small-1, text-embedding-ada-002-1, text-embedding-ada-002-2
dall-e-2dall-e-2-1024x1024-standard, dall-e-2-512x512-standard, dall-e-2-256x256-standard
dall-e-3dall-e-3-1024x1024-hd, dall-e-3-1024x1792-hd, dall-e-3-1792x1024-hd, dall-e-3-1024x1024-standard, dall-e-3-1024x1792-standard, dall-e-3-1792x1024-standard

Category: system.aws.bedrock

Pay-i integrates with AWS Bedrock's foundation model ecosystem, including Anthropic Claude models, Meta's Llama 3 (versions 1, 2, and 3) family, and Amazon's own Nova models.

For detailed information on AWS Bedrock models and capabilities, see AWS Bedrock model reference documentation.

Resource FamilyModel Versions
anthropic.claude-3-7anthropic.claude-3-7-sonnet-20250219-v1:0, us.anthropic.claude-3-7-sonnet-20250219-v1:0
anthropic.claude-3-5anthropic.claude-3-5-sonnet-20241022-v2:0, anthropic.claude-3-5-haiku-20241022-v1:0, anthropic.claude-3-5-sonnet-20240620-v1:0, us.anthropic.claude-3-5-sonnet-20241022-v2:0, us.anthropic.claude-3-5-haiku-20241022-v1:0, us.anthropic.claude-3-5-sonnet-20240620-v1:0, eu.anthropic.claude-3-5-sonnet-20240620-v1:0, apac.anthropic.claude-3-5-sonnet-20241022-v2:0, apac.anthropic.claude-3-5-sonnet-20240620-v1:0
anthropic.claude-3anthropic.claude-3-opus-20240229-v1:0, anthropic.claude-3-sonnet-20240229-v1:0, anthropic.claude-3-haiku-20240307-v1:0, us.anthropic.claude-3-opus-20240229-v1:0, us.anthropic.claude-3-sonnet-20240229-v1:0, us.anthropic.claude-3-haiku-20240307-v1:0, eu.anthropic.claude-3-sonnet-20240229-v1:0, eu.anthropic.claude-3-haiku-20240307-v1:0, apac.anthropic.claude-3-sonnet-20240229-v1:0, apac.anthropic.claude-3-haiku-20240307-v1:0
meta.llama3-3meta.llama3-3-70b-instruct-v1:0, us.meta.llama3-3-70b-instruct-v1:0
meta.llama3-2meta.llama3-2-90b-instruct-v1:0, meta.llama3-2-11b-instruct-v1:0, meta.llama3-2-3b-instruct-v1:0, meta.llama3-2-1b-instruct-v1:0, us.meta.llama3-2-90b-instruct-v1:0, us.meta.llama3-2-11b-instruct-v1:0, us.meta.llama3-2-3b-instruct-v1:0, us.meta.llama3-2-1b-instruct-v1:0
meta.llama3-1meta.llama3-1-405b-instruct-v1:0, meta.llama3-1-70b-instruct-v1:0, meta.llama3-1-8b-instruct-v1:0, us.meta.llama3-1-405b-instruct-v1:0, us.meta.llama3-1-70b-instruct-v1:0, us.meta.llama3-1-8b-instruct-v1:0
meta.llama3meta.llama3-70b-instruct-v1:0, meta.llama3-8b-instruct-v1:0
amazon.novaamazon.nova-pro-v1:0, amazon.nova-lite-v1:0, amazon.nova-micro-v1:0

Category: system.google.vertex

Pay-i supports Google Vertex AI's Gemini models, including Gemini 1.5 (Pro and Flash variants) and the newer Gemini 2.0 models. These models offer multimodal capabilities, handling text, image, audio, and video inputs with strong reasoning and instruction-following abilities.

For complete details on Google Vertex AI's Gemini models and their capabilities, see Google Cloud Gemini documentation.

Resource FamilyModel Versions
google.gemini-1.5-progoogle.gemini-1.5-pro, google.gemini-1.5-pro-002, google.gemini-1.5-pro-001
google.gemini-1.5-flashgoogle.gemini-1.5-flash, google.gemini-1.5-flash-002, google.gemini-1.5-flash-001
google.gemini-2.0-flash-litegoogle.gemini-2.0-flash-lite, google.gemini-2.0-flash-lite-001
google.gemini-2.0-flashgoogle.gemini-2.0-flash, google.gemini-2.0-flash-001