Karl Weinmeister
@kweinmeister.bsky.social
500 followers 1.9K following 110 posts
Cloud Developer Advocacy @ Google. AI/ML/Data, Blue Devil & Longhorn, wanna-be at home improvement. Opinions are my own.
Posts Media Videos Starter Packs
kweinmeister.bsky.social
What API call are you making 10x a day? 🤔 Turn it into a simple /command.

With Gemini CLI extensions, you can build your own shortcuts to speed up your work:
/fetch_jira ABC-123
/deploy_staging

Learn how easy it is to get started:
geminicli.com/docs/extensi...
kweinmeister.bsky.social
Has Gemini ever felt like it's losing focus as your conversation goes on? Naturally, more context means more topics to cover.

Use the /compress command to keep the Gemini CLI on track. It prunes the history without a full reboot.

Get started today with the Gemini CLI: npx @google/gemini-cli
Reposted by Karl Weinmeister
rsamborski.bsky.social
Dublin 🇮🇪 we're coming!
Learn to build AI agents and deploy your MCP servers to production scale.

Register: goo.gle/accelerate-ai-dublin
Seats are limited!

Shir Meir Lador @kweinmeister.bsky.social @caseywest.bsky.social
#AI #AIagents #MCPServers #CloudRun #Workshop @GoogleCloudTech #DublinEvents
kweinmeister.bsky.social
Learn about Sparse Attention in DeepSeek-V3.2-Exp:
* O(L²) → O(L·k) with similar performance to V3.1 Terminus
* Lightning indexer scores previous tokens
* Top-k selector picks top 2k sparse tokens from 128k window

📄 Paper: github.com/deepseek-ai/...

🎬Video:
youtube.com/shorts/CLsju...
DeepSeek Sparse Attention Explained
YouTube video by Cloud with Karl
youtube.com
kweinmeister.bsky.social
All those pesky brackets making tokenization messy 😂
kweinmeister.bsky.social
When I have a choice, I’ve been picking it over JSON. The readability and comments are nice!
kweinmeister.bsky.social
Got it! Will pass on the feedback.
kweinmeister.bsky.social
What really makes agentic software unique from traditional software? And is it all-or-nothing?

The essential concept is already there in the name: agency. You can control the degree of agency with tool design and human-in-the-loop patterns.

youtube.com/shorts/JIHfn...
Agentic AI: what makes it unique?
YouTube video by Cloud with Karl
youtube.com
kweinmeister.bsky.social
The Accelerate AI with Cloud Run tour is headed to Europe!
👉 Register at goo.gle/accelerate-ai

Join me and the Google team in a hands-on workshop near you:
📍 Dublin: Oct 29 goo.gle/accelerate-ai-dublin
📍 Munich: Oct 31 goo.gle/accelerate-ai-munich
📍 Paris: Nov 4 goo.gle/accelerate-ai-paris
kweinmeister.bsky.social
Yes, you still need code quality checks in the era of AI. Different prompts, context, and models lead to different outcomes.

Ruff unifies a litany of Python tools to do the job. Pro tip: send any issues it can't fix automatically to the Gemini CLI.

Read more: medium.com/google-cloud...
How to Write Better Python with Ruff on Google Cloud
Ruff can unify your code quality toolchain, accelerate CI/CD, and integrate with AI tooling for seamless Python development.
medium.com
kweinmeister.bsky.social
Want to use models from Google's Vertex AI Model Garden with an OpenAI-compatible API?

My new video shows how to set up LiteLLM as a local proxy to do just that. Simplify your workflow and call models like Qwen, DeepSeek, and more through a unified interface.

www.youtube.com/shorts/ntI5A...
How to Connect to Vertex AI Models with a LiteLLM Proxy
Ready to explore the vast ecosystem of AI models in Google's Vertex AI Model Garden? This video shows you how to use LiteLLM as a unified bridge to access and experiment with a diverse range of…
www.youtube.com
Reposted by Karl Weinmeister
kweinmeister.bsky.social
The all-new Gemini 2.5 Flash Image model is built different. I had a blast mixing together multiple images and text!

👉 Try it out in AI studio: ai.dev
📖 Read all about it: developers.googleblog.com/en/introduci...
kweinmeister.bsky.social
"Relying on a single model forces an unnecessary compromise. Use a top-tier model for everything, and you pay a premium for power you don’t always need. Use a smaller model for everything, and you sacrifice quality on complex queries... Model routing is designed to solve this optimization problem."
A Developer’s Guide to Model Routing
Not long ago, building with LLMs meant picking one general-purpose model and sticking with it. Today, the landscape is flooded with…
medium.com
kweinmeister.bsky.social
With this getting started content, you'll learn how to:

✅ Scaffold a new Hono project using the official CLI.
✅ Build a complete CRUD API
✅ Connect to a serverless Firestore database
✅ Add production-grade middleware
✅ Implement type-safe input validation
✅ Deploy your application to Cloud Run