Compute Engine

GCP Compute Engine — commitment brief

Data sourced: March 2026. Verify current figures at the Google Cloud Pricing Calculatorarrow-up-right.

Coverage summary

Compute Engine supports two commitment instruments: resource-based CUDs (commit to a specific quantity of vCPUs, memory, GPUs, or Local SSD in a region) and compute flexible CUDs (spend-based commitments that apply across multiple VM families and regions). Resource-based CUDs deliver the highest per-resource discount and are best for stable, predictable workloads. Compute flexible CUDs offer a simpler, flat-rate discount that works even when the VM mix or region shifts. Both instruments are applied at the billing-account level; resource CUDs apply first, then flexible CUDs cover remaining eligible spend.

What is covered: vCPU and memory costs for eligible machine families, committed GPU quantities (A100, L4, and others on eligible families), and Local SSD capacity. OS license commitments are also available separately.

What is not covered: Shared-core machine types (f1-micro, g1-small, and similar), preemptible/Spot VMs, premium OS licenses (charged separately), networking, disk storage beyond Local SSD commitments, and machine families explicitly excluded from CUDs (see table below).

CUD types

Compute Engine supports both resource-based CUDs and compute flexible CUDs. Resource-based CUDs commit to a fixed quantity of vCPUs, memory, or GPUs in a specific region; 1-yr and 3-yr terms are available. Compute flexible CUDs commit to a minimum hourly spend across eligible VM families and regions within a billing account; discounts vary by family: 28% (1-yr) / 46% (3-yr) for most general-purpose and compute-optimized families; 17% (1-yr) / 38% (3-yr) for H3/H4D; no 1-yr discount / 63% (3-yr) for memory-optimized M-series. Note on Sustained Use Discounts (SUDs): SUDs do not apply to resources already covered by a CUD. On uncommitted resources, SUDs continue to accumulate automatically throughout the month.

Machine type / SKU coverage

Machine Type / SKU
Resource CUD
Flex (Spend) CUD
1-yr discount
3-yr discount
Notes

N1 (General Purpose)

✅ Yes

✅ Yes

~37% (resource) / ~28% (flex)

~55% (resource) / ~46% (flex)

Eligible for SUDs on uncommitted usage; shared-core N1 excluded from CUDs

N2 / N2D (General Purpose)

✅ Yes

✅ Yes

~37% / ~28%

~55% / ~46%

SUD max 20% on uncommitted usage

N4 / N4D / N4A (General Purpose)

✅ Yes

✅ Yes

~37% / ~28%

~55% / ~46%

E2 (General Purpose)

✅ Yes

✅ Yes

~37% / ~28%

~55% / ~46%

No SUD; cost-optimized family

T2D / T2A (General Purpose)

✅ Yes

✅ Yes

~37% / ~28%

~55% / ~46%

Arm-based T2A; no SUD

C2 / C2D (Compute Optimized)

✅ Yes

✅ Yes

~37% / ~28%

~55% / ~46%

SUD max 20% on uncommitted C2 usage

C3 / C3D / C4 / C4A / C4D (Compute Optimized)

✅ Yes

✅ Yes

~37% / ~28%

~55% / ~46%

H3 / H4D (HPC Optimized)

✅ Yes

✅ Yes

~37% (resource) / ~17% (flex)

~55% (resource) / ~38% (flex)

H3/H4D flex CUD 1-yr 17%, 3-yr 38%; resource CUD up to ~55%

M1 / M2 (Memory Optimized)

✅ Yes

✅ Yes

~37% (resource) / no flex discount

~70% (resource) / ~63% (flex)

Flex CUD: no 1-yr discount; 3-yr only. SUD max 30% on uncommitted usage

M3 / M4 (Memory Optimized)

✅ Yes

✅ Yes

~37% (resource) / no flex discount

~70% (resource) / ~63% (flex)

Flex CUD: no 1-yr discount; 3-yr only

Z3 (Storage Optimized)

✅ Yes

✅ Yes

~37% / ~28%

~55% / ~46%

A2 (Accelerator Optimized — A100)

✅ Yes

❌ No

~37%

~55%

Resource CUD only; not covered by flex CUD

A3 (Accelerator Optimized — H100)

✅ Yes

❌ No

~37%

~55%

Resource CUD only; not covered by flex CUD

G2 (Accelerator Optimized — L4)

✅ Yes

❌ No

~37%

~55%

Resource CUD only; not covered by flex CUD

Shared-core (f1-micro, g1-small, e2-micro, etc.)

❌ No

❌ No

Excluded from all CUDs

Preemptible / Spot VMs

❌ No

❌ No

Spot pricing model; no commitment instruments

Resource CUD vs. Compute Flexible CUD

Resource-based CUD: Commits to a specific quantity of vCPUs, memory, GPUs, or Local SSD in a single region. Provides the highest discount ceiling — up to ~55% for most families and ~70% for memory-optimized (M1/M2/M3/M4) on 3-yr terms. Best when your resource footprint (family, size, region) is stable and predictable. Commitment is locked to a region; unused commitment is still billed.

Compute Flexible CUD: Commits to a minimum hourly spend (in dollars) across eligible VM families and regions within a billing account — similar in concept to AWS Compute Savings Plans. Flat-rate discounts for general-purpose and compute-optimized families: 28% on 1-yr terms, 46% on 3-yr terms. H3/H4D families: 17% (1-yr), 38% (3-yr). Memory-optimized M-series: no discount on 1-yr terms; 63% on 3-yr terms only. Covers Compute Engine, GKE, and Cloud Run within the same billing account. Best when your VM family mix or region footprint shifts, or when you need a simpler purchasing experience.

Sustained Use Discounts

Sustained Use Discounts (SUDs) are automatic discounts applied to N1, N2, N2D, C2, M1, and M2 instances that run for a significant portion of the billing month. SUDs accumulate incrementally — N1/M1/M2 machines earn up to 30% maximum SUD; N2/N2D/C2 earn up to 20%. Resources covered by a CUD do not also earn SUDs. The optimal strategy is to use CUDs for your committed baseline (earning the higher CUD discount), and let uncommitted burst usage earn SUDs automatically. Machine families without SUD eligibility (E2, T2D, T2A, C3, N4, Z3, A2, A3, G2) should be covered by CUDs or flex CUDs for maximum savings.

Regional availability

Resource-based CUDs and compute flexible CUDs are available in all commercial GCP regions for broadly-deployed machine families (N1, N2, N2D, E2, T2D, C2, C2D, C3). However, several machine series have restricted regional availability that directly limits where CUDs for those machines can be purchased.

Notable restrictions: M3 and M4 memory-optimized machines are available only in select regions and zones — verify availability in your target region before planning M3/M4 commitments. A3 (H100 GPU) machines are restricted to a very small number of zones globally (approximately 5 zones) and may require contacting your account team for quota. A2 (A100) machines are available in roughly 15 zones. G2 (L4 GPU) machines are the most broadly available GPU option, deployed in 40+ zones. H3 and H4D HPC-optimized machines are also limited to select zones.

⚠️ CUD availability varies by machine type and region. Always verify at GCP regions and zonesarrow-up-right before purchasing.

Archera

Google Cloud Compute Engine is a core service within Archera's commitment management scope. Archera can automate resource-based CUD and compute flexible CUD purchase, monitor utilization by region and machine family, right-size commitments as your fleet evolves, and wrap commitments in a GRI/GSP — eliminating downside risk on over-commitment while preserving the full CUD discount.


Sources

⚠️ Discount percentages are approximate and region/machine-type-dependent. Always verify with the Google Cloud Pricing Calculatorarrow-up-right.

Last updated

Was this helpful?