We publish the invoices, the trace logs, and the negotiation notes behind real cloud bills. Our team has paid down seven-figure AWS, GCP, and Azure spends across SaaS, fintech, and media workloads, and we write up what actually moved the needle, not what the marketing decks promise. If you are tired of FinOps content that stops at "turn on Compute Optimizer," this is the corner of the internet that keeps going.
AWS Data Transfer Math That Finance Will Believe
Egress is still the most misunderstood line item on a cloud bill. We break down inter-AZ, inter-region, NAT Gateway processing fees, and the often-ignored PrivateLink charges with worked examples and the exact pricing pages from the AWS On-Demand pricing and GCP network pricing references. A single misplaced NAT Gateway in front of an S3-heavy workload can quietly add five figures a month, and most cost dashboards bury it under "EC2-Other."
Our walkthroughs show how to use VPC Flow Logs and the AWS Cloud Financial Management blog methodology to attribute bytes back to services, then how to brief a CFO on the trade-off between gateway endpoints, Transit Gateway hub-and-spoke, and good old-fashioned regional consolidation.
FinOps for AI Workloads in 2026
GPU spend has rewritten the FinOps playbook. We track GB200 and H200 capacity pricing across the three hyperscalers, Lambda Labs, and CoreWeave, and we compare them against managed inference endpoints like Bedrock, Vertex AI, and Azure AI Foundry. The hidden cost is rarely the GPU hour itself; it is idle capacity reservations, tokenizer inefficiency, and shadow embeddings re-computed every deploy.
We follow the FinOps Foundation framework and its 2026 AI Working Group guidance, but translate it into engineering practice: per-request cost telemetry with OpenTelemetry, prompt-cache hit ratios, and chargeback models that survive a quarterly review. Expect concrete numbers on KV-cache reuse, speculative decoding ROI, and when a fine-tune actually pays back versus a longer system prompt.
Right-Sizing, Spot, and Savings Plans Without the Regret
Commitment-based discounts are powerful and easy to get wrong. We document a rolling 30/60/90-day right-sizing cadence that combines Kubernetes autoscaling primitives, Karpenter consolidation, and the newer Azure Cost Management anomaly detection. The goal is a coverage ratio you can defend, not a Savings Plan dashboard that looks green while utilization quietly collapses.
For Spot and Azure Spot VMs, we share interruption-rate data by instance family and region, the diversification patterns that survive a real capacity event, and the workloads where Spot is still a trap (long-running stateful jobs, anything with strict SLOs, and most CI runners once you price in retry storms). Where Reserved Instances still beat Savings Plans, we say so and show the math.
New posts go up several times a week and the latest are listed below. Browse the grid for deep dives, vendor comparisons, and the occasional unflattering benchmark, and follow along as we keep cutting bills in public.