CloudThrill
banner
cloudthrill.bsky.social
CloudThrill
@cloudthrill.bsky.social
Welcome to CloudThrill, a dynamic consulting firm specializing in cloud automation and DevOps services. Founded and led by our passionate
@clouddude.bsky.social , follow us in linkedin linkedin.com/company/cloudthrill
Nice example of a production #vLLM setup on 𝗡𝗲𝗯𝗶𝘂𝘀 with terraform, managed K8s, inference, and observability all in one place.

This can be a ref stack builders can use without reinventing the basics 💡.
👨🏻‍💻 full code on our repo.
github.com/CloudThrill/vllm-production-stack-terraform
📢 𝗡𝗲𝘄 𝘁𝗲𝗿𝗿𝗮𝗳𝗼𝗿𝗺 #vLLM 𝗣𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗦𝘁𝗮𝗰𝗸 𝗔𝗰𝗿𝗼𝘀𝘀 𝗖𝗹𝗼𝘂𝗱𝘀 🧑🏼‍🚀 | 𝗣𝗮𝗿𝘁 𝟰: 𝗡𝗲𝗯𝗶𝘂𝘀 𝗖𝗹𝗼𝘂𝗱 💚

🔎 𝗪𝗵𝗮𝘁 𝘆𝗼𝘂'𝗹𝗹 𝗱𝗲𝗽𝗹𝗼𝘆:
✅ Enterprise-grade GPU inference
✅ Secure vllm endpoints (LetsEncrypt)
✅ Full observability: Grafana + vLLM dashboards
✅ Lightning-fast deployment

👉 read the guide: tinyurl.com/Nebiusvllm
vLLM Production Stack on Nebius K8s with Terraform🧑🏼‍🚀 - Cloudthrill
This terraform stack delivers a production-ready vLLM serving environment On Nebius Cloud managed Kubernetes supporting Highly optimized GPU inference with operational best practices.
tinyurl.com
January 20, 2026 at 5:03 PM
🎄This week, we’re counted down CloudThrill’s 𝗧𝗼𝗽 𝟯 𝗺𝗼𝘀𝘁-𝗿𝗲𝗮𝗱 blog posts of 𝟮𝟬𝟮𝟱🏆.
Three posts. Three lessons. First reveal drops this Monday 👀

#CloudThrill #LLM #AIInfrastructure #LLM #OpenSourceAI #AIEngineering
January 1, 2026 at 11:08 PM
CloudThrill is a proud sponsor of Tech Beats Unplugged podcast🎙️. 🔥New episode out with Michael (WebScale) Webster- breaking down the VMware–Broadcom chaos, Nutanix , and real exit strategies. Listen now🎧👇🏼.
🎙𝗧𝗲𝗰𝗵𝗯𝗲𝗮𝘁𝘀 𝘂𝗻𝗽𝗹𝘂𝗴𝗴𝗲𝗱 is BACK #Episode 08 !!!👊🏻
"𝗡𝘂𝘁𝗮𝗻𝗶𝘅 𝗶𝗻 𝘁𝗵𝗲 𝗔𝗴𝗲 𝗼𝗳 𝘁𝗵𝗲 𝗩𝗶𝗿𝘁𝘂𝗮𝗹𝗶𝘇𝗮𝘁𝗶𝗼𝗻 𝗦𝗵𝗮𝗸𝗲-𝗨𝗽 " 🎧🔥
🔥New episode out with Michael Webster from Nutanix & I- breaking down the #VMware–Broadcom chaos, #Nutanix, and real exit strategies.
👉🏻 Listen here: sptfy.com/TBeats08 👈🏻
December 23, 2025 at 8:46 PM
This terraform stack delivers a production-ready vLLM serving environment On @awscloud.bsky.social #EKS, supporting both CPU/GPU inference with operational best practices embedded in AWS Integration and Automation (𝗮𝘄𝘀-𝗶𝗮). A One Click Deploy🔥Check the repo and Blog below 👇🏻
📢 𝗡𝗲𝘄 𝘁𝗲𝗿𝗿𝗮𝗳𝗼𝗿𝗺 vLLM 𝗣𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗦𝘁𝗮𝗰𝗸 𝗔𝗰𝗿𝗼𝘀𝘀 𝗖𝗹𝗼𝘂𝗱𝘀
𝗣𝗮𝗿𝘁 𝟭: 𝗔𝗺𝗮𝘇𝗼𝗻 𝗘𝗞𝗦
🔎 𝗪𝗵𝗮𝘁 𝘆𝗼𝘂'𝗹𝗹 𝗱𝗲𝗽𝗹𝗼𝘆:
✅ Enterprise-grade infra
✅ Switch between CPU/ GPU inference with a single flag
✅ Full observability: Grafana + vLLM dashboards
✅ OpenAI-compatible API

👉 Check it out: cloudthrill.ca/vllm-product...
vLLM Production Stack on Amazon EKS with Terraform🧑🏼‍🚀 - Cloudthrill
This terraform stack delivers a production-ready vLLM serving environment On Amazon EKS supporting both CPU/GPU inference with operational best practices embedded in AWS Integration and Automation (aws-ia).
cloudthrill.ca
December 18, 2025 at 3:40 PM
This terraform stack delivers a production-ready vLLM serving environment On 𝗚𝗼𝗼𝗴𝗹𝗲 𝗖𝗹𝗼𝘂𝗱 𝗚𝗞𝗘, supporting both 𝗖𝗣𝗨/𝗚𝗣𝗨 inference with operational best practices embedded in #Terraform #GKE Module. A One Click Deploy🔥Check the repo and blog below 👇🏻
📢 𝗡𝗲𝘄 𝘁𝗲𝗿𝗿𝗮𝗳𝗼𝗿𝗺 #vLLM 𝗣𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗦𝘁𝗮𝗰𝗸 𝗔𝗰𝗿𝗼𝘀𝘀 𝗖𝗹𝗼𝘂𝗱𝘀
𝗣𝗮𝗿𝘁 𝟭: GCP 𝗚𝗞𝗘 🔵🔴🟢
🔎 𝗪𝗵𝗮𝘁 𝘆𝗼𝘂'𝗹𝗹 𝗱𝗲𝗽𝗹𝗼𝘆:
✅ Enterprise-grade infra
✅ Switch between CPU/ GPU inference with a single flag
✅ Full observability: Grafana + vLLM dashboards
✅ OpenAI-compatible API

👉 read the guide: cloudthrill.ca/vllm-product...
vLLM Production Stack on GCP GKE with Terraform🧑🏼‍🚀 - Cloudthrill
This terraform stack delivers a production-ready vLLM serving environment On Google Cloud GKE supporting both CPU/GPU inference with operational best practices embedded in Terraform Kubernetes Engine Modules.
cloudthrill.ca
December 18, 2025 at 3:36 PM
🔥Check out what’s cooking in #vLLM for 𝟮𝟬𝟮𝟲 and beyond. From the project leader himself 𝗦𝗶𝗺𝗼𝗻 𝗠𝗼! #𝗢𝗽𝗲𝗻𝗦𝗼𝘂𝗿𝗰𝗲𝗔𝗜 #LeadingTheway 💪 #RaySummit2025 #Anyscale
🩵 Throwback Thursday from #𝗥𝗮𝘆𝗦𝘂𝗺𝗺𝗶𝘁 𝗦𝗙!
I Caught up with 𝗦𝗶𝗺𝗼𝗻 𝗠𝗼, lead on #vllm project, to chat about the 𝘁𝗼𝗽 𝟱 𝗻𝗲𝘄 𝘃𝗟𝗟𝗠 𝗳𝗲𝗮𝘁𝘂𝗿𝗲𝘀 and what’s cooking in their 𝟮𝟬𝟮𝟲 𝗿𝗼𝗮𝗱𝗺𝗮𝗽 straight from the source🔥and boy did he deliver!! #vLLM #RaySummit2025 #anyscale @cloudthrill.bsky.social #OpenSourceAI
🎙️Top 5 new VLLM features 2026! with Simon Mo @ 𝗥𝗮𝘆 𝗦𝘂𝗺𝗺𝗶𝘁
YouTube video by CloudDude
www.youtube.com
December 11, 2025 at 5:25 PM
Reposted by CloudThrill
Still thinking of hosting your on AI Backend?
our FREE vLLM POC is still live - but not forever.
📢𝗔𝗽𝗽𝗹𝘆 𝗻𝗼𝘄 → cloudthrill.ca/ai-poc

Run AI assistants, RAG, or open models privately in the cloud:
✅ No external APIs
✅ No vendor lock-in
✅ Total data control

Your Infra. Your Models. Your rules.🏆🏁
October 23, 2025 at 2:18 PM
💡 In this 5-min read you'll learn:
✅ How embeddings work – in the simplest way possible
🔁 Chunk sizes, overlaps, and text splitters
📦 Vector DBs, popular embedding models used today

💡Oh,& don’t forget, our Private 𝗔𝗜 𝗜𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 campaign is still running, with a 𝐋𝐈𝐌𝐈𝐓𝐄𝐃 FREE 𝐏𝐎𝐂 cloudthrill.ca/ai-poc
October 21, 2025 at 6:55 PM
🍁We’re excited to share that CloudThrill has been awarded a 𝐏𝐫𝐨𝐒𝐞𝐫𝐯𝐢𝐜𝐞𝐬 𝐩𝐫𝐞𝐪𝐮𝐚𝐥𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧👏🏼 with 𝐏𝐮𝐛𝐥𝐢𝐜 𝐒𝐞𝐫𝐯𝐢𝐜𝐞𝐬 𝐚𝐧𝐝 𝐏𝐫𝐨𝐜𝐮𝐫𝐞𝐦𝐞𝐧𝐭 𝐂𝐚𝐧𝐚𝐝𝐚 !
👋🏻 Work with a 𝐩𝐮𝐛𝐥𝐢𝐜 𝐚𝐠𝐞𝐧𝐜𝐲 ? Let’s talk about your challenges - we’d love to hear from you! cloudthrill.ca/contact-us
#CloudThrill #ProServices #GovernmentOfCanada
October 1, 2025 at 4:20 PM
Check out our new 𝐯𝐋𝐋𝐌 𝐏𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧 𝐒𝐭𝐚𝐜𝐤 blog👇🏼
💎We cover:
✅ What is 𝐯𝐋𝐋𝐌 production stack ?
✅ Request Flow & Architecture breakdown
✅ Serving Engine, Request Router & KV-Cache Netwrk
✅ Autoscaling & built-in fault-tolerance
✅ One-click Helm install

#LLMs #Kubernetes #Cloudthrill #vLLM
September 24, 2025 at 6:20 PM
Here’s a full recap from #CloudThrill team of the vLLM beginners series, broken in 3 parts 💫 share and enjoy!
📦#vLLM for 𝐁𝐞𝐠𝐢𝐧𝐧𝐞𝐫𝐬 𝐛𝐮𝐧𝐝𝐥𝐞: from basics to deployment! 👇Missed our vLLM series this summer? Here’s a full recap
Part1️⃣: 𝐅undamentals cloudthrill.ca/what-is-vllm
Part2️⃣: 𝐊ey 𝐅eatures cloudthrill.ca/what-is-vllm...
part3️⃣: 𝐃eployment 𝐎ptions cloudthrill.ca/vllm-deloyment
#vllm_project #lmcache #LLMs
September 2, 2025 at 7:21 PM
🔐 Learn the key to easy and production-grade secret management on K8s 👇🏼
𝐋𝐢𝐤𝐞 𝐭𝐡𝐢𝐬 𝐤𝐢𝐧𝐝 𝐨𝐟 𝐬𝐭𝐮𝐟𝐟? Subscribe here 👉 tinyurl.com/CloudThrillBlogs
August 19, 2025 at 5:26 PM
Get your teams to level up their CI/CD skills with this GithubActions cert guide 👇🏻
🚀 #NewBlog How to 𝐏𝐚𝐬𝐬 𝐭𝐡𝐞 𝐆𝐢𝐭𝐇𝐮𝐛 𝐀𝐜𝐭𝐢𝐨𝐧𝐬 𝐂𝐞𝐫𝐭𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧!
👉Complete guide here cloudthrill.ca/how-i-passed...
🎯 I included all you need to ace it:
✅Free exam practice
✅ Most exhaustive Cheat Sheet (all topics covered)
✅CliffsNotes, & exam expectations
#Cloudthrill #Certification #GitHubActions
How to pass the GitHub Actions certification (cheat-sheet) - Cloudthrill
🎯Everything you need to pass the GitHub Actions certification in one place. ✔️ Clear examples, ✔️Cheat Sheet ✔️CLI syntax, and what to expect in the test.
cloudthrill.ca
August 13, 2025 at 4:06 PM
#NewBlog: final part of our #VLLM blog series🔥
💎This, we shift from theory to practice, covering #vLLM installs across platforms? check our new blog, where we break it down in 5 sections😎#TherYouGo
🚀#BlogSeries #vllm🔥
𝐯𝐋𝐋𝐌 𝐟𝐨𝐫 𝐁𝐞𝐠𝐢𝐧𝐧𝐞𝐫𝐬 𝐏𝐚𝐫𝐭 𝟑:📖 𝐃𝐞𝐩𝐥𝐨𝐲𝐦𝐞𝐧𝐭 𝐎𝐩𝐭𝐢𝐨𝐧𝐬
Learn to deploy #vLLM everywhere! Even on CPU🤫

✅Platform & model Support Matrix
✅Install on GPU & CPU
✅Build Wheel from scratch | Python vLLM package
✅Docker/Kubernetes Deployment
✅Running vLLM server (Offline + Online inference)
vLLM for beginners: Deployment Options (PartIII) - Cloudthrill
In this final part, we’ll shift from theory to practice, covering how to deploy vLLM across different environments, from source builds to docker containers. In this series, we aim to provide a solid foundation of vLLM core concepts to help you understand how it works and why it’s emerging as a de facto choice for LLM deployment.
cloudthrill.ca
August 5, 2025 at 1:52 PM
New week, new blog! 👇🏼
July 29, 2025 at 2:21 PM
👋🏼See you next Thursday! #Livestream #LLMs #Quantization
🎙️ Join our 🔴𝐓𝐞𝐜𝐡 𝐁𝐞𝐚𝐭𝐬 𝐋𝐢𝐯𝐞 Show!
🗓️ Thursday 17th 11:30 AM EDT
🎯 A chill livestream unpacking LLM #Quantization: #vllm vs #ollama. Learn about the What & How.

🔥Dope guest stars:
#bartowski from arcee.ai & Eldar Kurtic from #RedHat

🔗Stream on YouTube & Linkedin:
www.youtube.com/watch?v=XTE0...
🔴TechBeats live : LLM Quantization "vLLM vs. Ollama"
YouTube video by CloudDude
www.youtube.com
July 10, 2025 at 2:34 PM
#NewBlog part 2 of our #VLLM blog series 🔥
💎What makes #VLLM the Rolls Royce of inference? 👇🏻check our new blog, where we break it down in 5 performance-packed layers😎#TherYouGo
July 2, 2025 at 3:29 PM
Reposted by CloudThrill
🚀#NewBlog 𝐆𝐢𝐭𝐇𝐮𝐛 actions Azure deploy with 𝐎𝐈𝐃𝐂!
💡Over 𝟐𝟑 𝐦𝐢𝐥𝐥𝐢𝐨𝐧𝐬 secrets were exposed in #GitHub last year💀 & 𝟓𝟎K+ #Huggingface tokens leaks every month!
🛡️Switch to 𝐬𝐞𝐜𝐫𝐞𝐭𝐥𝐞𝐬𝐬 with Pipeline identity now!
👉We show you how: cloudthrill.ca/github-actio...
#Azure #NHI #CICD #Terraform #ManagedIdentity
Terraform Pipelines for Dummies Part3: GitHub Actions Azure Deploy with OIDC - Cloudthrill
Struggling with Azure credential management in your CI/CD pipelines? Both Azure and GitHub Actions now support OpenID Connect (OIDC) for secure deployments by simplifying the process, and aligning with modern security practices. With GitHub’s OIDC provider, you can authenticate directly from your workflows using managed identities without the need for static access keys.
cloudthrill.ca
June 24, 2025 at 5:31 PM
Want to learn about @VLLm ? start here 👇🏻
🚀#NewBlogAlert "What is #vLLM ?"
We’re kicking off our 𝐯𝐋𝐋𝐌 𝐟𝐨𝐫 𝐁𝐞𝐠𝐢𝐧𝐧𝐞𝐫𝐬 𝐬𝐞𝐫𝐢𝐞𝐬 with
𝐏𝐚𝐫𝐭 𝟏:📖 𝐓𝐡𝐞 𝐅𝐮𝐧𝐝𝐚𝐦𝐞𝐧𝐭𝐚𝐥𝐬💫
New to vLLM ? This one's for you👇🏻: cloudthrill.ca/what-is-vllm

✅ What is vLLM ( vLLM vs Ollama)
✅ Core Architecture (Engine, Sched, Execution, Memory)
✅ Offline and Online inference
vLLM for beginners: The Fundamentals - Cloudthrill
In this series, we aim to provide a solid foundation of vLLM core concepts to help you understand how it works and why it’s emerging as a defacto choice for LLM deployment.
cloudthrill.ca
June 17, 2025 at 5:28 PM
🚨 As proud sponsors, we're excited to share the latest episode of #TechBeatsUnplugged! 🎧Tune in as Steve Giguere digs through every attack vector☢️on your GitHub workflows and how to protect you🛡️from them.
🎙𝐓𝐞𝐜𝐡 𝐁𝐞𝐚𝐭𝐬 𝐔𝐧𝐩𝐥𝐮𝐠𝐠𝐞𝐝 is BACK #Episode 06 !!!👊🏻
🎧🔥"𝐆𝐢𝐭𝐇𝐮𝐛 𝐒𝐞𝐜𝐮𝐫𝐢𝐭𝐲 𝐡𝐨𝐫𝐫𝐨𝐫 𝐬𝐭𝐨𝐫𝐢𝐞𝐬 with
#SteveGiguere "☢️...tons of 𝐚𝐭𝐭𝐚𝐜𝐤 𝐯𝐞𝐜𝐭𝐨𝐫𝐬, best practices, and a lot of laughs😅. You don't wanna miss this !
Thank you Steve🙏🏻
👉🏻 spoti.fi/4dYicES 👈🏻
Ep06: "GitHub Security horror stories " (with Steve Giguere)
Tech Beats Unplugged · Episode
spoti.fi
June 10, 2025 at 2:20 PM
New Blog drop!! 👋🏻
🧠Your AI workloads are nothing without securing credentials.
🚀 #NewBlogAlert 🛡️ #HashiCorpVault
I'm kicking off a 𝐕𝐚𝐮𝐥𝐭 𝐟𝐨𝐫 𝐃𝐮𝐦𝐦𝐢𝐞𝐬 𝐬𝐞𝐫𝐢𝐞𝐬 with
𝐏𝐚𝐫𝐭 𝟏:🔐 𝐇𝐨𝐰 𝐭𝐨 𝐒𝐞𝐭 𝐔𝐩 𝐇𝐚𝐬𝐡𝐢𝐂𝐨𝐫𝐩 𝐕𝐚𝐮𝐥𝐭 with 𝐑𝐚𝐟𝐭 & 𝐓𝐋𝐒
👉check it out: tinyurl.com/HashiVault-f...
@cloudthrill.bsky.social
HashiCorp Vault for Dummies: Setup your 1st Vault with TLS (WSL) - Cloudthrill
n this guide, you'll learn how to set up a local Vault server using Raft storage and TLS in a WSL (Windows Subsystem for Linux) environment. Whether you're just starting with secrets management, prepp...
tinyurl.com
May 20, 2025 at 5:03 PM
🚨@CloudThrill is excited to announce its membership in the NVIDIA Inception Program! 👏🏻👏🏻👏🏻
Read full statement: cloudthrill.ca/cloudthrill-...
#NVIDIAInception Program for Startups!
May 12, 2025 at 9:30 PM
🚨#AI & #CyberSec heads in #Toronto!
Join us on Wednesday, 𝐌𝐚𝐲 𝟕𝐭𝐡 from 5:30pm-8pm EST for another exciting #TAICO Meetup (Toronto AI and Cybersecurity Organization).
#Cloudthrill #ProudSponsor🔥
www.meetup.com/taico-toront...
TAICO May 2025 meetup!, Wed, May 7, 2025, 5:30 PM | Meetup
The TAICO team is proud to announce our next meetup at the Adaptavist office in Toronto. Much thanks to [Adaptavist](https://www.adaptavist.com/ "https://www.adaptavist.com
www.meetup.com
May 3, 2025 at 1:16 AM
Check out our team's new article how to Ace your #CNCF Certified Kubernetes Administrator exam🔥 #CKA
#NewBlog 𝐇𝐨𝐰 𝐭𝐨 𝐏𝐚𝐬𝐬 𝐭𝐡𝐞 𝐂𝐊𝐀 𝐂𝐞𝐫𝐭𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧 – 𝐍𝐨 𝐑𝐞𝐭𝐚𝐤𝐞𝐬!🚀
Time to refocus on your goals—like finally crushing that elusive 𝐂𝐊𝐀 𝐞𝐱𝐚𝐦 with my curated guide on:
✅ Best resources, hands-on labs, time investment tips
✅ D-day strategies, CLI tricks that save you time
🔥Just Do it💪
👉🏻 buff.ly/S60cXwN #CNCF
How to pass the CKA certification - Cloudthrill
CKA is 100% hands-on—no multiple-choice, just real-world challenges. In this post, I’ll break down:✅ Exam structure & key domains✅ How I prepared (resources, labs, and time investment)✅ Tips to ace it...
cloudthrill.ca
April 28, 2025 at 8:58 PM
🧠 𝐆𝐏𝐓 𝐌𝐨𝐝𝐞𝐥𝐬 𝐟𝐨𝐫 𝐝𝐮𝐦𝐦𝐢𝐞𝐬 #cheatsheet
🤔 If you’ve opened #ChatGPT lately and thought:
“𝐖𝐚𝐢𝐭… 𝐰𝐡𝐚𝐭’𝐬 𝐨𝟑? 𝐀𝐧𝐝 𝐰𝐡𝐲 𝐚𝐫𝐞 𝐭𝐡𝐞𝐫𝐞 𝐬𝐨 𝐦𝐚𝐧𝐲 𝐦𝐨𝐝𝐞𝐥𝐬 𝐧𝐨𝐰?” You’re not alone. Today #openAI finally answered🙋🏻‍♀️
👉🏻https://platform.openai.com/docs/models/compare
April 16, 2025 at 6:23 PM