banner
cloudnativeboy.bsky.social
@cloudnativeboy.bsky.social
Host of youtube.com/@cloudnativefm podcast, CNCF Ambassador
1/ 🧵 Big news for k8s networking: the Ingress-NGINX controller is being retired, maintainers will move it to end-of-life mode (maintenance only until March 2026). If you run production clusters on it, this matters.

Backstory: 👇
November 25, 2025 at 1:05 PM
Does Google hold the record of the largest known K8s cluster with 130,000 nodes?

Here's the story of driving demand for these kinds of mega-clusters

I would like to host a podcast, engineers managing/operating more than 50K nodes? Say 👋 if you're.
cloud.google.com/blog/product...
How we built a 130,000-node GKE cluster | Google Cloud Blog
Learn about the architectural innovations we used to build a 130,000-node Kubernetes cluster, and the trends driving demand for these environments.
cloud.google.com
November 24, 2025 at 5:42 PM
Either we make the move to the Gateway API, which gives us a standard, extensible way to handle app-layer routing and separation of concerns, or we’ll keep reworking a deprecated Ingress stack. I’ll pick the Gateway route.
faun.pub/migrating-fr...
Migrating from NGINX Ingress Controller to Kubernetes Gateway API Using ingress2gateway
A Practical Guide for Teams Preparing for the NGINX Ingress Retirement
faun.pub
November 24, 2025 at 10:02 AM
This guide provides specific prompt engineering techniques for Claude 4.x models, with specific guidance for Sonnet 4.5 and Haiku 4.5.

What do you want to add to these best practices list:
platform.claude.com/docs/en/buil...
Prompting best practices
Claude API Documentation
platform.claude.com
November 23, 2025 at 4:24 PM
Just published a short forensic episode with Richard Simon (CloudTherapist) on the AWS us-east-1 outage (Oct 20, 2025). Root cause: a DNS automation race condition affecting DynamoDB → cascading failures.

What to classify as critical, and how to think about multi-region HA!!
youtu.be/MCwt95TluAU
#cloudnativewisdom014: AWS US-EAST-1 Outage Explained
YouTube video by Cloud Native Podcast
youtu.be
November 20, 2025 at 6:52 PM
Which one challenging you most?
1. Ops overhead catches teams off guard
2. Security issues put clusters at risk
3. Talent acquisition: High talent costs & skill gaps in K8s expertise
4. Technical debt piling up faster than teams can manage
www.cncf.io/blog/2025/11...
Top 5 hard-earned lessons from the experts on managing Kubernetes
Kubernetes has transformed how modern organizations deploy and operate scalable infrastructure, and the hype around automated cloud native orchestration has made its adoption nearly ubiquitous over…
www.cncf.io
November 19, 2025 at 5:27 PM
Kgateway is an open source implementation of the Kubernetes Gateway API that unifies ingress, API gateway, service mesh, and AI gateway capabilities in a singular modular control plane.

This stuff is progressive very fast and most of it in a right direction:
www.cncf.io/blog/2025/11...
Kgateway v2.1 is released!
Kgateway is an open source implementation of the Kubernetes Gateway API that unifies ingress, API gateway, service mesh, and AI gateway capabilities in a singular modular control plane.
www.cncf.io
November 19, 2025 at 4:50 PM
AWS outage - 20 Oct 2025

Azure outage - 29 Oct 2025

Cloudflare outage - 18 Nov 2025

who’s next and when?
November 19, 2025 at 4:39 PM
OpenAI has a big, bigger, and now the biggest competition ahead, according to a news outlet. Enterprises are already pursuing Anthropic with Claude, which is now available on all three major cloud services. Revenue cycle can begin sooner 📈.
www.anthropic.com/news/microso...
Microsoft, NVIDIA and Anthropic announced new strategic partnerships.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
www.anthropic.com
November 18, 2025 at 4:43 PM
Today, I just learned Cloudflare is down with Spotify, ChatGPT, X, Zoom, Microsoft Teams.

Cloudflare said it had to temporarily disable some services for UK users in its attempts to fix the issue today. Gone are the days of DevOps with Canary Deployment
www.theguardian.com/technology/l...
Cloudflare says ‘incident now resolved’ after outage causes error messages across the internet – live
Company says it believes the outage has now been resolved and a fix has been implemented
www.theguardian.com
November 18, 2025 at 3:24 PM
This will hurt my search engine query for Prometheus as an open-source observability tool. There might be a key differentiator with & without "Jeff Prometheus," AI startup has already raised $6.2 billion in funding, let that sink in.
techcrunch.com/2025/11/17/j...
Jeff Bezos reportedly returns to the trenches as co-CEO of new AI startup, Project Prometheus | TechCrunch
Jeff Bezos is partly backing a new AI startup called Project Prometheus that has raised $6.2 billion in funding, and will take on duties as co-chief executive.
techcrunch.com
November 17, 2025 at 6:47 PM
We built the cloud to be simpler, more flexible, Yet somewhere along the way, we lost enterprise-grade inventory, a system of record that tells you what you own? where it runs? who owns it? and what policies apply?

InfraGraph promises to bring it back, A deep dive: medium.com/@saimsafder1...
InfraGraph and the Return of Infrastructure Inventory — Why Cloud Needs a Single Source of Truth
How a relationship-first knowledge graph could restore the system-of-record we lost moving from data centers to multi/hybrid cloud
medium.com
November 17, 2025 at 2:31 PM
Nvidia Grove, an open source K8s API designed for running AI inference workloads.

API includes autoscaling components designed to aid scaling resources as a single resource on K8s, applied to individual components or all the way up to entire service replicas.
www.sdxcentral.com/news/nvidia-...
Nvidia unveils Grove: An open source API to help orchestrate AI inference
Offering turns complex orchestration needs into simple Kubernetes pods
www.sdxcentral.com
November 15, 2025 at 7:21 PM
New on Claude API: Structured outputs

Define your schema once. Get perfectly formatted responses every time. No more complex parsing logic or wasted tokens on retries and the best part: no impact on model performance

Available - Sonnet 4.5 and Opus 4.1 in public beta
November 15, 2025 at 8:09 AM
Helm has officially released v4.0.0.

One of the biggest changes is the removal of Helm 2 migration code and old compatibility layers.

Helm v4 also improves security/maintainability by dropping unsupported APIs & removing older commands that were rarely used.
helm.sh/docs/overview/
Helm 4 Overview | Helm
Helm v4 represents a significant evolution from v3, introducing breaking changes, new architectural patterns, and enhanced functionality while maintaining backwards compatibility for charts.
helm.sh
November 15, 2025 at 6:54 AM
Anthropic confirmed the first documented case of a large-scale, AI-orchestrated cyber-espionage campaign, one where AI agents, not humans, executed 80–90% of the attack.
www.anthropic.com/news/disrupt...
Disrupting the first reported AI-orchestrated cyber espionage campaign
A report describing an a highly sophisticated AI-led cyberattack
www.anthropic.com
November 14, 2025 at 6:50 PM
Happy Friday 👋

He who knows not and knows not he knows not is a fool. "Shun him."

He who knows and knows not he knows is asleep, "Awakens him."

He who knows not and knows he not knows is hungry. "Feed him."

He who knows and knows he knows not is wise. "Follow him."
November 14, 2025 at 5:13 PM
If you’re still sending raw JSON into your LLMs, you’re burning tokens, latency, and budget!

Try TOON (Token-Oriented Object Notation).

Clear like YAML, compact like CSV:

• 30–60% fewer tokens
• Up to 50% lower costs
• Shines for tabular data.

Read more:
medium.com/medialesson/...
November 14, 2025 at 4:10 PM
What we captured: practical playbooks for reducing cloud costs, guardrails for secure supply chains, patterns for platform engineering, and repeatable automation standards that bridge R&D and enterprise ops.

Thanky'all for sharing hard-won lessons and shipping-ready guidance.
November 13, 2025 at 7:36 PM
Highlights I loved:
• @portainerio: : smart container management that simplifies ops, ship faster with less cognitive load.
@chainguard.dev: solid guardrails & supply-chain security for OSS + enterprise.
• InfrOS: forward-looking approaches to continuous governance, cost optimization...
Big thanks to Guy Brodetzki (InfrOS), Adrian Mouat (Chainguard), Neil Cresswell (@portainer.io), and Sarah Polan for joining our KubeCon Atlanta livestream. 🙏

Watch the highlights: www.youtube.com/watch?v=Mbpe...
November 13, 2025 at 7:16 PM
Big thanks to Guy Brodetzki (InfrOS), Adrian Mouat (Chainguard), Neil Cresswell (@portainer.io), and Sarah Polan for joining our KubeCon Atlanta livestream. 🙏

Watch the highlights: www.youtube.com/watch?v=Mbpe...
November 13, 2025 at 6:51 PM
One day in the afterNOON, you realize LLM prompts can be heavy on cost and tokens.

Meet TOON, a new file format sitting b/w JSON & CSV:

💸 Token-efficient: reduce token counts by 30–60%
🤿 LLM-friendly: built-in guardrails for safer outputs
🍱 Minimal syntax & more
medium.com/medialesson/...
JSON vs TOON — A new era of structured input?
Why structure matters more than ever
medium.com
November 13, 2025 at 12:07 PM
Watch all the highlights on this link 👇

#KubeCon Atlanta Day 02 Live Coverage: youtube.com/watch?v=Db9x...

Day 03 Live Coverage starts here 11 am EST | 4 pm GMT: youtube.com/watch?v=Mbpe...
November 12, 2025 at 6:35 PM
Thanks ALL, Rose leads efforts at "Code First Girls" to help underrepresented talent thrive in dev & engineering roles

Jon Brookes is creating Minimal Viable K8S, which'll be featured in future episodes #cloudnativefm

Gurdip Kalley Zivan from datafy.io to helps teams cut cloud block-storage costs
November 12, 2025 at 6:34 PM