Skip to main content

Billing data schema

This page provides the schema definition for the billing data available in DoiT Cloud Intelligence. The schema defines the structure of the billing data tables available for the CloudFlow SQL node.

You can query your data using the provider-specific table names.

Cloud providers​

You can query billing data from the following cloud providers, depending on the cloud environment you are using.

AWS​

The following table defines the data for the aws_dci table.

NameTypeDescriptionSample Values
service_descriptionSTRINGThe human-readable name of the AWS service being billed.Amazon Elastic Compute Cloud
service_idSTRINGThe technical identifier (code) for the AWS service.AmazonEC2
sku_descriptionSTRINGThe detailed, human-readable description of the priced SKU or usage type.EU (Ireland) On Demand Linux m5.large Instance Hour
sku_idSTRINGThe AWS SKU or rate code used to price this line item.ABCDE12345FGHI67890
operationSTRINGThe specific AWS API or usage operation that generated the cost.RunInstances:0002
descriptionSTRINGA short text description of this charge line, often detailing the usage context.Compute instance usage
usage_start_timeTIMESTAMPThe start time of the metered usage window for this row.2025-12-01 10:00:00.000000 UTC
usage_end_timeTIMESTAMPThe end time of the metered usage window for this row.2025-12-01 11:00:00.000000 UTC
project_idSTRINGThe AWS Account ID or unique identifier for the account.123456789012
project_nameSTRINGThe display name associated with the AWS account.Production-Web-Account
project_numberSTRINGThe numerical identifier for the AWS account (same as project_id).123456789012
project_labelsREPEATED RECORDUser-defined tags attached at the account or cost allocation level.See nested fields below.
project_labels.keySTRINGThe key of a custom label.Owner
project_labels.valueSTRINGThe value of a custom label.Finance-Team
labelsREPEATED RECORDUser-defined tags on the specific resource or usage line item.See nested fields below.
labels.keySTRINGThe key of a custom label.environment
labels.valueSTRINGThe value of a custom label.staging
system_labelsREPEATED RECORDAWS-generated metadata or tags (e.g., resource type, tenancy).See nested fields below.
system_labels.keySTRINGThe key of a system label.aws:createdBy
system_labels.valueSTRINGThe value of a system label.Terraform
regionSTRINGThe AWS region where the usage ran or is billed.eu-west-1
zoneSTRINGThe Availability Zone where the resource was located (when available).eu-west-1a
cost_typeSTRINGType of cost (e.g., Usage, Recurring Fee, Tax, Credit, Refund).Usage
resource_idSTRINGThe AWS specific ID of the underlying resource (e.g., EC2 Instance ARN).arn:aws:ec2:eu-west-1:123...
usage.amount_in_pricing_unitsFLOATQuantity used for pricing (e.g., hours, GB, requests) in provider units.1.0
usageFLOATThe usage quantity aggregated for reporting purposes.1.0
costFLOATThe net cost for this row in the billing currency after pricing and discounts.0.096
creditSTRINGThe credit amount aggregated for reporting purposes (often represented as a negative cost).-0.015

Azure​

The following table defines the data for the azure_dci table.

NameTypeDescriptionSample Values
service_descriptionSTRINGThe human-readable name of the Azure service being billed.Virtual Machines
service_idSTRINGThe technical identifier for the Azure service.microsoft.compute
sku_descriptionSTRINGThe detailed, human-readable description of the priced SKU or usage type.Standard, D2, v3, VM
sku_idSTRINGThe Azure SKU or rate code used to price this line item.DZN-0036
operationSTRINGThe specific API or usage operation that generated the cost.VirtualMachines/allocate
descriptionSTRINGA short text description of this usage or charge line.Windows, D2,v3, Instance, Hour
usage_start_timeTIMESTAMPStart time of the metered usage window for this row.2025-12-01 10:00:00.000000 UTC
usage_end_timeTIMESTAMPEnd time of the metered usage window for this row.2025-12-01 11:00:00.000000 UTC
project_idSTRINGThe Azure Subscription ID where the usage occurred.12345678-abcd-efgh-ijkl-123456789012
project_nameSTRINGThe display name associated with the Azure Subscription or Resource Group.Production-Web-Subscription
project_numberSTRINGThe numerical identifier for the subscription or project.987654321012
project_labelsREPEATED RECORDUser-defined tags applied at the subscription or resource group level.See nested fields below.
project_labels.keySTRINGThe key of a custom label.Owner
project_labels.valueSTRINGThe value of a custom label.DevOps
labelsREPEATED RECORDUser-defined tags on the specific resource or usage line item.See nested fields below.
labels.keySTRINGThe key of a custom label.CostCenter
labels.valueSTRINGThe value of a custom label.CC1001
system_labelsREPEATED RECORDAzure-generated metadata or tags (e.g., resource type).See nested fields below.
system_labels.keySTRINGThe key of a system label.resourceType
system_labels.valueSTRINGThe value of a system label.Microsoft.Compute/virtualMachines
regionSTRINGThe Azure region where the usage ran or is billed.eastus
zoneSTRINGThe Availability Zone where the resource was located (when available).1
cost_typeSTRINGType of cost (e.g., Usage, Fee, Tax, Discount).Usage
resource_idSTRINGThe Azure specific ID of the underlying resource (e.g., VM Resource ID)./subscriptions/123/resourceGroups/rg-prod/...
usage.amount_in_pricing_unitsFLOATQuantity used for pricing (e.g., hours, GB, requests) in provider units.1.0
report.usageFLOATThe usage quantity aggregated for reporting purposes.1.0
report.costFLOATThe net cost for this row in the billing currency after pricing and discounts.0.0450
report.creditSTRINGThe credit amount aggregated for reporting purposes (often represented as a negative cost).-0.0050

Google Cloud​

The following table defines the data for the gcp_dci table.

NameTypeDescriptionSample Values
billing_account_idSTRINGThe unique ID of the Google Cloud billing account responsible for the charges.01A2B3-4C5D6E-7F8G9H
service_descriptionSTRINGThe human-readable name of the cloud service consumed.Compute Engine
service_idSTRINGThe technical identifier for the Google Cloud service.6F81-5844-456A
sku_descriptionSTRINGThe detailed, human-readable description of the billable unit.N1 Predefined Instance Core running in Americas
sku_idSTRINGThe unique identifier for the specific billable product or usage unit.B53E-74B8-60C9
usage_start_timeTIMESTAMPThe start of the granular time interval for which the usage was recorded.2025-12-01 10:00:00.000000 UTC
usage_end_timeTIMESTAMPThe end of the granular time interval for which the usage was recorded.2025-12-01 11:00:00.000000 UTC
project_idSTRINGThe unique ID of the Google Cloud project where the usage occurred.my-production-project-12345
project_nameSTRINGThe user-defined name of the project.My Production Application
project_numberSTRINGThe project's unique numerical identifier.123456789012
project_labelsREPEATED RECORDCustom tags applied directly to the project.See nested fields below.
project_labels.keySTRINGThe key of a custom label.environment
project_labels.valueSTRINGThe value of a custom labelprod
labelsREPEATED RECORDCustom user-defined labels/tags applied to the resource for cost allocation.See nested fields below.
labels.keySTRINGThe key of a custom label.team
labels.valueSTRINGThe value of a custom label.frontend
system_labelsREPEATED RECORDLabels automatically applied by Google Cloud for resource categorization.See nested fields below.
system_labels.keySTRINGThe key of a system label.resource_type
system_labels.valueSTRINGThe value of a system label.instance
countrySTRINGThe country where the resource is hosted.US
regionSTRINGThe specific regional location.us-central1
zoneSTRINGThe specific zone within the region.us-central1-a
cost_typeSTRINGThe nature of the financial transaction.Usage
resource_idSTRINGThe specific identifier of the resource itself (e.g., a VM instance ID).0123456789012345678
resource_global_idSTRINGA unique, globally scoped identifier for the resource.gcp-us-central1-12345
usage.amount_in_pricing_unitsFLOATThe amount of usage normalized to the price unit.3600.0
kubernetes_cluster_nameSTRINGThe name of the GKE or Anthos cluster associated with the usage.production-cluster-a
kubernetes_namespaceSTRINGThe specific Kubernetes namespace where the usage occurred.default
seller_nameSTRINGIdentifies the seller, usually Google or a third-party marketplace vendor.Google
subscription.instance_idSTRINGThe ID of the specific subscription instance that covers this usage.sub-a1b2c3d4
transaction_typeSTRINGThe specific type of billing transaction.USAGE
usageFLOATThe usage quantity aggregated for reporting purposes.1.0
costFLOATThe net cost for this row in the billing currency after pricing and discounts.0.0245
creditsSTRINGThe credit amount aggregated for reporting purposes (often represented as a negative cost).-0.0055

Third-party platforms​

You can query billing data from the following third-party platforms, depending on the third-party platforms you have connected to DoiT Cloud Intelligence.

Databricks, Datadog, MongoDB, Snowflake​

The following table defines the data for the databricks_dci, datadog_dci, mongodb_dci and snowflake_dci tables.

Column NameColumn TypeDescriptionSample Values
etlREPEATED RECORDA nested field containing processing metadata from the ETL (Extract, Transform, Load) pipeline.See nested fields below.
etl.tsTIMESTAMPThe primary timestamp of when the record was processed by the ETL pipeline.2026-02-02T10:00:00Z
etl.session_idSTRINGThe unique string identifier linking this record to the ETL processing session.sess_001
etl.run_start_timeTIMESTAMPThe specific time the ETL job responsible for this record began execution.2026-02-02T09:30:00Z
billing_account_idSTRINGThe unique financial account ID (e.g., Databricks workspace billing, Snowflake account).DB-12345-ABCDE
service_descriptionSTRINGThe human-readable name of the specific service consumed.Databricks SQL Warehouse Compute
service_idSTRINGThe unique technical identifier for the platform service.databricks_sql_compute
sku_descriptionSTRINGThe detailed, human-readable description of the billable unit.Databricks DBU Standard (SQL)
sku_idSTRINGThe unique identifier for the specific billable product or usage unit.DBU_STANDARD_SQL
usage_date_timeDATETIMEThe primary date and time when the resource consumption took place.2025-12-01 10:00:00
usage_start_timeTIMESTAMPThe start of the granular time interval for which the usage was recorded.2025-12-01 10:00:00.000000 UTC
usage_end_timeTIMESTAMPThe end of the granular time interval for which the usage was recorded.2025-12-01 11:00:00.000000 UTC
projectRECORDA nested field containing metadata about the organizational container where the usage occurred.See nested fields below.
project.idSTRINGThe project's unique short identifier (e.g., Workspace ID).123456789012
project.nameSTRINGThe user-defined name of the project or account.Production-Web-Account
project.numberSTRINGThe numerical identifier of the project.123456789012
project.ancestry_numbersSTRINGThe numerical hierarchy path of the resource within the organization.123456789012:Data Analytics
project.project_labelsREPEATED RECORDCustom tags applied directly to the project or account.See nested fields below.
project.project_labels.keySTRINGThe key of a custom label.Data-Tier
project.project_labels.valueSTRINGThe value of a custom label.Gold
labelsREPEATED RECORDCustom user-defined tags/labels applied to the resource or usage event.See nested fields below.
labels.keySTRINGThe key of a custom label.team
labels.valueSTRINGThe value of a custom label.frontend
system_labelsREPEATED RECORDLabels automatically applied by the platform for categorization.See nested fields below.
system_labels.keySTRINGThe key of a system label.cloud.region
system_labels.valueSTRINGThe value of a system label.us-east-1
locationRECORDA nested field containing geographical details about where the service was executed or data stored.See nested fields below.
location.countrySTRINGThe country where the resource is hosted.US
location.regionSTRINGThe specific regional location.us-central1
location.zoneSTRINGThe specific zone within the region.us-central1-a
export_timeTIMESTAMPThe time this specific billing record was exported or finalized by the platform.2025-12-02 05:30:00.000000 UTC
costFLOATThe total cost of the usage line item before any applicable taxes or adjustments.1.45
currencySTRINGThe currency in which the cost is reported.USD
currency_conversion_rateFLOATThe rate used to convert the billing currency to the reporting currency.1.0
usageRECORDA nested field describing the quantity consumed and its units.See nested fields below.
usage.amountFLOATThe raw quantity of the service consumed in its base measurement.3600.0
usage.unitSTRINGThe base unit of measurement for the raw usage amount.seconds
usage. amount_in_pricing_unitsFLOATThe usage amount converted into the specific units used for billing.1.0
usage.pricing_unitSTRINGThe unit of measure used to calculate the final cost on the invoice.hour
invoice.monthSTRINGThe specific billing period to which the usage and costs are attributed. This is usually formatted as YYYYMM (Year and Month).202602
cost_typeSTRINGThe nature of the financial transaction.USAGE
reportRECORDA list of aggregated reporting metrics (cost, usage, savings) for this line item.See nested fields below.
report.costFLOATThe final calculated cost after all credits and discounts have been applied.12.45
report.usageFLOATThe total quantity consumed for this specific reporting entry.150.0
report.savingsFLOATThe total amount saved through discounts, such as Committed Use Discounts (CUDs) or Spot pricing.3.20
report.savings_descriptionSTRINGA descriptive label explaining the type of discount or saving applied to the cost.Committed Use Discount
report.creditSTRINGThe name or ID of any promotional credit or rebate applied to this record.Free Trial Credit
report.ext_metricRECORDA nested structure containing supplemental data points not directly related to monetary cost.See nested fields below.
report.ext_metric.keySTRINGThe name or category of the supplemental metric being tracked.carbon_emissions
report.ext_metric.valueFLOATThe numerical value associated with the supplemental metric.0.42
report.ext_metric.typeSTRINGThe unit or classification of the metric value (e.g., kg, percentage, count).kg_CO2e
resource_idSTRINGThe specific identifier of the resource generating the cost (e.g., Warehouse ID, MongoDB Cluster Name).wh-prod-analytics-01
operationSTRINGThe specific action or API call that generated the usage.COMPUTE_QUERY_EXECUTION
row_idSTRING`A unique identifier for this specific billing record.0a1b2c3d-4e5f-6789-0abc-def123456789
descriptionSTRINGA more detailed text description of the usage event.Snowflake Virtual Warehouse X-Small consumption
customer_typeSTRINGThe categorization of the customer.Enterprise
is_marketplaceBOOLEANA flag indicating whether the product/service was acquired via a marketplace.false
customer_idSTRINGThe unique identifier for the customer associated with this usage.CUST-98765

DataHub​

The following table defines the data for the datahub_dci table.

Column NameColumn TypeDescriptionSample Values
event_idSTRINGA unique identifier for an individual event created when an event is generated.550e8400-e29b-41d4-a716-446655440000
cloudSTRINGThe name of the cloud provider or platform where the service was consumed.AWS
billing_account_idSTRINGThe unique financial account ID.AB-12345-CDEFG
service_descriptionSTRINGThe human-readable name of the specific service consumed.Compute Engine
service_idSTRINGThe unique technical identifier for the platform service.6F81-5844-456A
sku_descriptionSTRINGThe detailed, human-readable description of the billable unit.N1 Predefined Instance Core running in Americas
sku_idSTRINGThe unique identifier for the specific billable product or usage unit.B53E-74B8-60C9
operationSTRINGThe specific action or API call that generated the usage.COMPUTE_QUERY_EXECUTION
usage_date_timeDATETIMEThe primary date and time when the resource consumption took place.2025-12-01 10:00:00
export_timeTIMESTAMPThe time this specific billing record was exported or finalized by the platform.2025-12-02 05:30:00.000000 UTC
project_idSTRINGThe project's unique short identifier (e.g., Workspace ID).123456789012
project_nameSTRINGThe user-defined name of the project or account.Data Analytics
project_numberSTRINGThe user-defined label for a project.project-1234
project_labelsREPEATED RECORDCustom tags applied directly to the project or account.See nested values below.
project_labels.keySTRINGThe key of a custom label.Data-Tier
project_labels.valueSTRINGThe value of a custom label.Gold
labelsREPEATED RECORDCustom user-defined tags/labels applied to the resource or usage event.See nested fields below.
labels.keySTRINGThe key of a custom label.team
labels.valueSTRINGThe value of a custom label.frontend
system_labelsREPEATED RECORDLabels automatically applied by the platform for categorization.See nested fields below.
system_labels.keySTRINGThe key of a system label.cloud.region
system_labels.valueSTRINGThe value of a system label.us-east-1
countrySTRINGThe country where the resource is hosted.US
regionSTRINGThe specific regional location.us-central1
zoneSTRINGThe specific zone within the region.us-central1-a
resource_idSTRINGThe specific identifier of the resource itself (e.g., a VM instance ID).0123456789012345678
resource_global_idSTRINGA unique, globally scoped identifier for the resource.gcp-us-central1-12345
pricing_unitSTRINGThe physical unit of measurement for the consumed amount.token
cost_typeSTRINGThe nature of the financial transaction.Usage
is_marketplaceBOOLEANA flag indicating whether the Gen AI service was acquired via the marketplace.false
metricsRECORDThe numerical measurements of performance over a specific period.See nested fields below.
metrics.keySTRINGThe name of the metric.cpu_utilization
metrics.valueFLOATThe numeric measurement of the metric.45.2
sourceSTRINGThe origin of the data.aws.ec2.us-east-1a
batchSTRINGA collection of data points or events grouped together to be processed in a single transaction.batch_size: 500

Gen AI​

The following table defines the data for the genai_dci table.

Column NameColumn TypeDescriptionSample Values
providerSTRINGThe name of the cloud provider or platform where the Gen AI service was consumed.GCP
billing_account_idSTRINGThe unique ID of the financial account responsible for the Gen AI charges.A1B2C3-4D5E6F-7G8H9I
project_idSTRINGThe unique identifier for the cloud project or resource group where the usage occurred.gen-ai-experiments-001
project_internal_idSTRINGAn internal, unique identifier for the project, separate from the user-facing project_id.123456789012
project_numberSTRINGThe numerical identifier for the project or subscription.987654321012
project_nameSTRINGThe human-readable name of the project.AI Development Sandbox
project_ancestry_numbersSTRINGThe numerical path of the project within the organization's hierarchy (e.g., Organization/Folder/Project).12345/67890/98765
service_descriptionSTRINGThe human-readable name of the Gen AI service or API consumed.Generative Language API
service_idSTRINGThe technical identifier for the Gen AI service.aiplatform.googleapis.com
sku_idSTRINGThe unique identifier for the specific billable model or usage unit.sku_inference_model_x
sku_descriptionSTRINGThe detailed, human-readable description of the billable unit (e.g., tokens, model version).Code Generation - 1M tokens
usage_date_timeDATETIMEThe primary date and time when the resource consumption took place.2025-12-01 10:30:00
usage_start_timeTIMESTAMPThe start of the time interval for which the usage was recorded.2025-12-01 10:00:00.000000 UTC
usage_end_timeTIMESTAMPThe end of the time interval for which the usage was recorded.2025-12-01 11:00:00.000000 UTC
export_timeTIMESTAMPThe time this specific usage record was exported by the cloud provider.2025-12-02 05:00:00.000000 UTC
locationSTRINGThe high-level geographical location where the resource was hosted.us-central
countrySTRINGThe country where the resource is hosted.US
regionSTRINGThe specific regional location (e.g., us-central1).us-central1
zoneSTRINGThe specific availability zone within the region (if applicable).us-central1-a
costFLOATThe total cost of the usage line item before any applicable taxes or adjustments.0.0025
currencySTRINGThe currency in which the cost is reported.USD
currency_conversion_rateFLOATThe rate used to convert the billing currency to the reporting currency.1.0
cost_typeSTRINGThe nature of the financial transaction.Usage
usage_amountFLOATThe total numeric amount of usage consumed (e.g., the number of tokens).125000.0
usage_unitSTRINGThe physical unit of measurement for the consumed amount.token
usage_amount_in_pricing_unitsFLOATThe usage amount normalized to the pricing unit (e.g., dividing tokens by 1,000,000).0.125
usage_pricing_unitSTRINGThe unit used by the billing rate.1000000 token
invoice_monthSTRINGThe billing month the charge is associated with (e.g., YYYYMM).202512
resource_idSTRINGThe specific identifier of the resource associated with the usage (e.g., a specific endpoint ID).model-endpoint-abc
customer_typeSTRINGThe categorization of the customer.Direct
is_marketplaceBOOLEANA flag indicating whether the Gen AI service was acquired via the marketplace.false
labelsREPEATED RECORDCustom user-defined tags applied to the usage for cost allocation.See nested fields below.
labels.keySTRINGThe key of a custom label.environment
labels.valueSTRINGThe value of a custom label.staging
system_labelsREPEATED RECORDLabels automatically applied by the cloud provider for resource categorization.See nested fields below.
system_labels.keySTRINGThe key of a system label.ai.google.com/model_version
system_labels.valueSTRINGThe value of a system label.gemini-pro-v2