# Compute Engine

## GCP Compute Engine — commitment brief

> **Data sourced**: March 2026. Verify current figures at the [Google Cloud Pricing Calculator](https://cloud.google.com/products/calculator).

### Coverage summary

Compute Engine supports two commitment instruments: resource-based CUDs (commit to a specific quantity of vCPUs, memory, GPUs, or Local SSD in a region) and compute flexible CUDs (spend-based commitments that apply across multiple VM families and regions). Resource-based CUDs deliver the highest per-resource discount and are best for stable, predictable workloads. Compute flexible CUDs offer a simpler, flat-rate discount that works even when the VM mix or region shifts. Both instruments are applied at the billing-account level; resource CUDs apply first, then flexible CUDs cover remaining eligible spend.

**What is covered**: vCPU and memory costs for eligible machine families, committed GPU quantities (A100, L4, and others on eligible families), and Local SSD capacity. OS license commitments are also available separately.

**What is not covered**: Shared-core machine types (f1-micro, g1-small, and similar), preemptible/Spot VMs, premium OS licenses (charged separately), networking, disk storage beyond Local SSD commitments, and machine families explicitly excluded from CUDs (see table below).

### CUD types

> Compute Engine supports both resource-based CUDs and compute flexible CUDs. Resource-based CUDs commit to a fixed quantity of vCPUs, memory, or GPUs in a specific region; 1-yr and 3-yr terms are available. Compute flexible CUDs commit to a minimum hourly spend across eligible VM families and regions within a billing account; discounts vary by family: 28% (1-yr) / 46% (3-yr) for most general-purpose and compute-optimized families; 17% (1-yr) / 38% (3-yr) for H3/H4D; no 1-yr discount / 63% (3-yr) for memory-optimized M-series. Note on Sustained Use Discounts (SUDs): SUDs do not apply to resources already covered by a CUD. On uncommitted resources, SUDs continue to accumulate automatically throughout the month.

### Machine type / SKU coverage

| Machine Type / SKU                               | Resource CUD | Flex (Spend) CUD | 1-yr discount                       | 3-yr discount                   | Notes                                                                     |
| ------------------------------------------------ | ------------ | ---------------- | ----------------------------------- | ------------------------------- | ------------------------------------------------------------------------- |
| N1 (General Purpose)                             | ✅ Yes        | ✅ Yes            | \~37% (resource) / \~28% (flex)     | \~55% (resource) / \~46% (flex) | Eligible for SUDs on uncommitted usage; shared-core N1 excluded from CUDs |
| N2 / N2D (General Purpose)                       | ✅ Yes        | ✅ Yes            | \~37% / \~28%                       | \~55% / \~46%                   | SUD max 20% on uncommitted usage                                          |
| N4 / N4D / N4A (General Purpose)                 | ✅ Yes        | ✅ Yes            | \~37% / \~28%                       | \~55% / \~46%                   |                                                                           |
| E2 (General Purpose)                             | ✅ Yes        | ✅ Yes            | \~37% / \~28%                       | \~55% / \~46%                   | No SUD; cost-optimized family                                             |
| T2D / T2A (General Purpose)                      | ✅ Yes        | ✅ Yes            | \~37% / \~28%                       | \~55% / \~46%                   | Arm-based T2A; no SUD                                                     |
| C2 / C2D (Compute Optimized)                     | ✅ Yes        | ✅ Yes            | \~37% / \~28%                       | \~55% / \~46%                   | SUD max 20% on uncommitted C2 usage                                       |
| C3 / C3D / C4 / C4A / C4D (Compute Optimized)    | ✅ Yes        | ✅ Yes            | \~37% / \~28%                       | \~55% / \~46%                   |                                                                           |
| H3 / H4D (HPC Optimized)                         | ✅ Yes        | ✅ Yes            | \~37% (resource) / \~17% (flex)     | \~55% (resource) / \~38% (flex) | H3/H4D flex CUD 1-yr 17%, 3-yr 38%; resource CUD up to \~55%              |
| M1 / M2 (Memory Optimized)                       | ✅ Yes        | ✅ Yes            | \~37% (resource) / no flex discount | \~70% (resource) / \~63% (flex) | Flex CUD: no 1-yr discount; 3-yr only. SUD max 30% on uncommitted usage   |
| M3 / M4 (Memory Optimized)                       | ✅ Yes        | ✅ Yes            | \~37% (resource) / no flex discount | \~70% (resource) / \~63% (flex) | Flex CUD: no 1-yr discount; 3-yr only                                     |
| Z3 (Storage Optimized)                           | ✅ Yes        | ✅ Yes            | \~37% / \~28%                       | \~55% / \~46%                   |                                                                           |
| A2 (Accelerator Optimized — A100)                | ✅ Yes        | ❌ No             | \~37%                               | \~55%                           | Resource CUD only; not covered by flex CUD                                |
| A3 (Accelerator Optimized — H100)                | ✅ Yes        | ❌ No             | \~37%                               | \~55%                           | Resource CUD only; not covered by flex CUD                                |
| G2 (Accelerator Optimized — L4)                  | ✅ Yes        | ❌ No             | \~37%                               | \~55%                           | Resource CUD only; not covered by flex CUD                                |
| Shared-core (f1-micro, g1-small, e2-micro, etc.) | ❌ No         | ❌ No             | —                                   | —                               | Excluded from all CUDs                                                    |
| Preemptible / Spot VMs                           | ❌ No         | ❌ No             | —                                   | —                               | Spot pricing model; no commitment instruments                             |

### Resource CUD vs. Compute Flexible CUD

**Resource-based CUD**: Commits to a specific quantity of vCPUs, memory, GPUs, or Local SSD in a single region. Provides the highest discount ceiling — up to \~55% for most families and \~70% for memory-optimized (M1/M2/M3/M4) on 3-yr terms. Best when your resource footprint (family, size, region) is stable and predictable. Commitment is locked to a region; unused commitment is still billed.

**Compute Flexible CUD**: Commits to a minimum hourly spend (in dollars) across eligible VM families and regions within a billing account — similar in concept to AWS Compute Savings Plans. Flat-rate discounts for general-purpose and compute-optimized families: 28% on 1-yr terms, 46% on 3-yr terms. H3/H4D families: 17% (1-yr), 38% (3-yr). Memory-optimized M-series: no discount on 1-yr terms; 63% on 3-yr terms only. Covers Compute Engine, GKE, and Cloud Run within the same billing account. Best when your VM family mix or region footprint shifts, or when you need a simpler purchasing experience.

### Sustained Use Discounts

> Sustained Use Discounts (SUDs) are automatic discounts applied to N1, N2, N2D, C2, M1, and M2 instances that run for a significant portion of the billing month. SUDs accumulate incrementally — N1/M1/M2 machines earn up to 30% maximum SUD; N2/N2D/C2 earn up to 20%. Resources covered by a CUD do not also earn SUDs. The optimal strategy is to use CUDs for your committed baseline (earning the higher CUD discount), and let uncommitted burst usage earn SUDs automatically. Machine families without SUD eligibility (E2, T2D, T2A, C3, N4, Z3, A2, A3, G2) should be covered by CUDs or flex CUDs for maximum savings.

## Regional availability

Resource-based CUDs and compute flexible CUDs are available in all commercial GCP regions for broadly-deployed machine families (N1, N2, N2D, E2, T2D, C2, C2D, C3). However, several machine series have restricted regional availability that directly limits where CUDs for those machines can be purchased.

**Notable restrictions**: M3 and M4 memory-optimized machines are available only in select regions and zones — verify availability in your target region before planning M3/M4 commitments. A3 (H100 GPU) machines are restricted to a very small number of zones globally (approximately 5 zones) and may require contacting your account team for quota. A2 (A100) machines are available in roughly 15 zones. G2 (L4 GPU) machines are the most broadly available GPU option, deployed in 40+ zones. H3 and H4D HPC-optimized machines are also limited to select zones.

⚠️ CUD availability varies by machine type and region. Always verify at [GCP regions and zones](https://cloud.google.com/compute/docs/regions-zones) before purchasing.

### Archera

Google Cloud Compute Engine is a core service within Archera's commitment management scope. Archera can automate resource-based CUD and compute flexible CUD purchase, monitor utilization by region and machine family, right-size commitments as your fleet evolves, and wrap commitments in a GRI/GSP — eliminating downside risk on over-commitment while preserving the full CUD discount.

***

**Sources**

* [Committed use discounts (CUDs) for Compute Engine](https://cloud.google.com/compute/docs/instances/committed-use-discounts-overview)
* [Resource-based committed use discounts](https://cloud.google.com/compute/docs/instances/signing-up-committed-use-discounts)
* [Compute flexible committed use discounts (blog)](https://cloud.google.com/blog/products/compute/save-money-with-the-new-compute-engine-flexible-cuds)
* [Expanded coverage for Compute Flex CUDs](https://cloud.google.com/blog/products/compute/expanded-coverage-for-compute-flex-cuds/)
* [Sustained use discounts](https://cloud.google.com/compute/docs/sustained-use-discounts)
* [VM instance pricing](https://cloud.google.com/compute/vm-instance-pricing)

⚠️ Discount percentages are approximate and region/machine-type-dependent. Always verify with the [Google Cloud Pricing Calculator](https://cloud.google.com/products/calculator).


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.archera.ai/help-center/cloud-service-intelligence/cloud-service-intelligence/gcp/compute/compute-engine.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
