メインコンテンツへスキップ

Work with AI data

DoiT supports AI services provided by the following: AWS (Amazon Bedrock), Google Cloud (Vertex AI), Microsoft Azure (Azure Machine Learning), Databricks, Anthropic, and OpenAI.

After connecting your GenAI providers, you can start analyzing and monitoring your AI cost and usage.

AI data in Cloud Analytics

DoiT Cloud Analytics refreshes AI data on an hourly basis.

Limitations

  • Databricks: Currently, only the cost data is available. The token usage data will be supported soon.

  • OpenAI: OpenAI cost data reflects final charges while usage data is for monitoring activity. This means that the costs shown in reports may not always match your actual usage. If you need more details, check the OpenAI API Usage Dashboard.

    • OpenAI audio input tokens: OpenAI Costs API aggregates the costs for cached and non-cached audio input tokens together. Because of this, GenAI Intelligence does not separate OpenAI audio SKUs into cached and non-cached – the cost for audio input tokens is always combined.
  • Anthropic (Analytics API): Cost and usage data is available only for usage-based Enterprise plans. See Claude Enterprise Analytics API.

You can get AI data through dimensions and metrics. See below for the mapping between the DoiT and AI terminologies.

Basic metrics

DoiT termAI termAI definition
costcostThe total cost for a specific resource or usage.
usageusageThe usage for tokens. In addition, the usage metric includes consumption based on seconds, bytes, characters, and so on, depending on the specific service, model, or operation being tracked.

Standard dimensions

The table below lists the standard dimensions available for all AI models that are integrated with the DoiT platform.

DoiT termDescription
Billing AccountThe unique identifier for a specific organization in your AI account.
Project ID/Account IDThe unique identifier for your AI provider workspace.
Service/Service ID/SKU DescriptionThe high-level description of the type of AI product or capability used. For example, Completions API, Embeddings API, Claude haiku3.5 Usage - Input tokens, Web Search Usage.
SKU ID/SKU descriptionThe specific, granular and billable unit within the AI service that you are using. For example, chatgpt-4o-latest, input, text-embedding-3-large, claude-3-5-haiku-20241022, input_tokens, claude-sonnet-4-20250514, output_tokens.
UnitThe primary unit of usage. For most providers the primary unit is token, but they also use other units depending on the AI service you are consuming.
OperationA distinct, billable action performed by an AI service. For example, input, cached input, web_search, image.
ProviderThe name of the AI provider: Amazon Web Services, Anthropic, Anthropic (Analytics API), Azure, Databricks, Google Cloud, OpenAI.

GenAI labels

Below are the GenAI system labels that you can use in the DoiT platform. The GenAI system labels are grouped in the GenAI section.

LabelDescriptionProvider
API Key IDThe unique identifier of the AI key.Anthropic, OpenAI
API Key NameThe name of the AI API key.Anthropic, OpenAI
Base ModelThe identifier of an AI model offering. Example: gpt-4.1, claude-haiku-4.5.Anthropic, Anthropic (Analytics API), OpenAI
Billing CategoryHow the request was billed by Cursor. One of the following: Usage-based, Included in Business, Free, Aborted, Not Charged, or Errored, Not Charged.Cursor
CachedIndicates whether the operation used cached tokens for cost optimization: true, false.Anthropic, Anthropic (Analytics API), Azure, OpenAI
Coding agentIndicates whether the usage was generated through Claude Code, Anthropic's agentic coding tool: true, false.Anthropic (Analytics API), Cursor
Consumption ModelThe pricing model used for AI services: PAYG (Pay-As-You-Go) or Provisioned Throughput.Azure, GCP
Context WindowA context window restriction applied on the AI model. One of the following: 0-200k, 200k-1MAnthropic, Anthropic (Analytics API)
Cost typeThe billable cost component the spend belongs to: tokens (model token usage), web_search (server-side web search), or code_execution (server-side code execution).Anthropic (Analytics API)
FeatureThe type of AI capability or service feature being used. Vertex AI examples include Model Serving, Model Serving via Model Garden, Vertex Colab, Metadata storage. Microsoft Azure examples include Model Serving, Audio Generation, Embeddings.Azure, GCP
GenAI SpendThe costs of any generative AI workloads irrespective of AI provider.Anthropic, Anthropic (Analytics API), AWS, Azure, Cursor, Databricks, GCP, OpenAI
Inference GeoThe geographic region where inference was processed, used to track data residency controls. One of the following: global, us, or not_available (region unset).Anthropic (Analytics API)
Input TokensThe number of tokens sent to the AI model in the request (the prompt and context).Cursor
InvokationIndicates what initiated the request, for example human for activity initiated directly by a user.Anthropic (Analytics API), Cursor
Is ChargeableIndicates whether the request incurred a charge: true (billable) or false (free or included in quota).Cursor
Is HeadlessIndicates whether the request came from a background (headless) agent rather than an interactive editor session: true, false.Cursor
Is Model ServingIndicates whether the service is actively serving a model: true, false. Available for Vertex AI and Amazon Bedrock.Azure, GCP
Max ModeIndicates whether Cursor's Max mode was enabled for the request: true, false.Cursor
Media FormatFor models that support multiple media types, the media format distinguishes whether the service was processing audio, images, or text. For example, audio.Anthropic, AWS, OpenAI
ModelThe identifier of an AI model offering. Example: gpt-4o-audio-preview, claude-sonnet-4-6.Anthropic, Anthropic (Analytics API), AWS, Azure, Cursor, Databricks, GCP, OpenAI
Model FamilyA group of related AI models that share a common architecture and training methodology. For example, Claude, Gemini, GPT-5, Mistral.Anthropic, Anthropic (Analytics API), AWS, Azure, Databricks, GCP, OpenAI
Model VersionThe version of an AI model offering. For example, 2024-12-17.Anthropic, OpenAI
Output TokensThe number of tokens generated by the AI model in its response.Cursor
Organization NameThe unique identifier for the AI organization. For Anthropic (Analytics API), this is the Organization Label of the connection.Anthropic, Anthropic (Analytics API), AWS, Azure, OpenAI
ProductThe Claude product surface that produced the usage or cost: chat, claude_code, cowork, office_agent, claude_in_chrome, claude_design, or other (usage that cannot be attributed to a known surface).Anthropic (Analytics API)
ResolutionThe resolution of an image in a single usage. Used in OpenAI models that support image processing.OpenAI
Service TierService tiers are used in Anthropic API to prioritize API availability for specific workflows. Can be Priority, Standard, and Batch.Anthropic, Anthropic (Analytics API)
SpeedThe inference mode used to process the request: standard, fast.Anthropic (Analytics API)
Token typeThe category of token the usage applies to (when the cost type is tokens): uncached_input_tokens, output_tokens, cache_read_input_tokens, or cache-creation tokens (cache_creation.ephemeral_1h_input_tokens, cache_creation.ephemeral_5m_input_tokens).Anthropic (Analytics API)
Unit CategoryThe billing unit type for Azure AI services. Values include Commitment, Tokens, Batch, Time or Time Short, Period, and Request.Azure
Usage TypeThe direction of token flow in the AI model interaction: input (tokens sent to the model) or output (tokens generated by the model).Anthropic, Anthropic (Analytics API), AWS, Azure, Databricks, GCP, OpenAI
User EmailThe email address of the user who generated the usage. Null when unavailable or when the account has been deleted.Anthropic (Analytics API), Cursor
User IDThe unique identifier of a specific user in your AI organization.Anthropic, Anthropic (Analytics API), OpenAI
User NameThe name of a specific user in your AI organization.Anthropic, Anthropic (Analytics API), OpenAI
WorkspaceWorkspace name in Anthropic console.Anthropic
Workspace IDID of the workspace in Anthropic console.Anthropic

Example reports

The GenAI Intelligence dashboard contains several preset report widgets to help you jump start the GenAI spend and usage analysis. You can adjust the configurations of a preset report or create your own from scratch to dive deeper into GenAI data.

Cost by usage type

The example below groups costs by the normalized Usage Type dimension across AI providers.

GenAI usage cost per usage type report

Amazon Bedrock cost by user

Standard AWS billing aggregates Bedrock costs by model and region. For granular cost tracking and allocating, AWS recommends Application Inference Profiles (see AWS re:Post article: How to Track and Limit Amazon Bedrock Usage by User).

To track Bedrock usage by user in the DoiT platform:

  1. Create an AIP for each user via the Bedrock console or API.

  2. Apply custom cost allocation tags and use the AIP ARN in your inference API calls instead of the base model ARN.

  3. Activate tags in the AWS Billing and Cost Management console.

  4. Activate AWS cost allocation tags in the DoiT platform. Depending on your account type, you may need to reactivate your tags.

After a tag appears in AWS billing data, it can take 6–8 hours before the tag shows up in Cloud Analytics reports; for a new tag, it can take up to 24 hours. See Limitations for latency and other restrictions.

The example below shows top five users by Amazon Bedrock costs in the last three weeks, grouped by a custom cost allocation tag.

Amazon Bedrock top users by cost