All case studies

// FinOps / Cost Optimization

Cost optimization: from Kubernetes to Cloud Run

Stateless services ran on an always-on GKE cluster sized for peak — idle most of the day, billing all of it. Migrating the right workloads to Cloud Run with scale-to-zero cut GCP compute spend without hurting reliability.

Cost optimization: from Kubernetes to Cloud Run
SYS_BLOG_PREVIEW_SECURE
K8S-TO-CLOUD-RUN-COST

Impact Metrics

55%

GCP compute cost cut

→ 0

Idle cost, scale-to-zero

0

Downtime migrating

Stack Applied

GCPCloud RunGKECloudflareTerraformGitHub Actions

// Context & Background

A web3 product ran everything on a permanently-provisioned GKE cluster. Node pools were sized for peak traffic that arrived a few hours a day, so the cluster sat largely idle the rest of the time — and billed for every idle core.

// The Challenge

Cut the GCP bill meaningfully without re-architecting the whole application or adding cold-start latency to the endpoints that couldn't tolerate it.

// The Roadmap & Approach

01

Utilization audit of the GKE estate

Profiled every workload's real CPU/memory usage and request patterns to separate the genuinely always-on services from the bursty, stateless ones.

02

Migrate stateless services to Cloud Run

Moved stateless, request-driven services to Cloud Run with scale-to-zero — so they cost nothing when idle and scale out automatically under load.

03

Right-size what stays on Kubernetes

Kept stateful and long-running workloads on a much smaller, right-sized GKE node pool — paying only for what genuinely needs to stay warm.

04

Tune cold starts and keep CI/CD intact

Set minimum-instance counts on latency-sensitive services, fronted everything with Cloudflare, and ported deployments cleanly into the existing CI/CD.

// Validated Results

  • ~55% reduction in GCP compute spend
  • Non-critical services now scale to zero — idle cost effectively eliminated
  • Zero downtime during the migration
  • Far less Kubernetes upkeep for the services that moved off it

Facing similar architecture challenges?

Let's talk through your system architecture — a 30-minute review session usually surfaces the quickest reliability and FinOps improvements.