In love with minds, souls, Rock 🤟🏻 and Photography 📷
My religion is LARMATS.
Many asked, "How do I verify Gemini SFT works?" and "How do I prepare DPO data?"
With Vertex AI ENG team we published tutorials on preparing data and using custom metrics for evaluating tuning with Gemini 2.5 models.
Code in 🧵
Many asked, "How do I verify Gemini SFT works?" and "How do I prepare DPO data?"
With Vertex AI ENG team we published tutorials on preparing data and using custom metrics for evaluating tuning with Gemini 2.5 models.
Code in 🧵
We've launched an engineering blog to document our Vertex AI research. Our first post, in collaboration withSGLang, shares insights on implementing EAGLE-3 at scale.
Blog and code in the 🧵
We've launched an engineering blog to document our Vertex AI research. Our first post, in collaboration withSGLang, shares insights on implementing EAGLE-3 at scale.
Blog and code in the 🧵
Docs in the 🧵
Docs in the 🧵
Deploy source code directly via API for:
> Git-native CI/CD
> No GCS buckets
> Auditable builds
Code and blog in the 🧵
Deploy source code directly via API for:
> Git-native CI/CD
> No GCS buckets
> Auditable builds
Code and blog in the 🧵
We often stick to familiar tools for comfort, rather than exploring alternatives. The JAX AI stack and its components look exciting to me.
There's a lot to learn here: docs.jaxstack.ai/en/latest/
We often stick to familiar tools for comfort, rather than exploring alternatives. The JAX AI stack and its components look exciting to me.
There's a lot to learn here: docs.jaxstack.ai/en/latest/
Vertex AI now supports preference tuning (DPO) for Gemini 2.5 Flash and Flash-Lite, allowing you to use response pairs to adjust user preferences.
Code and docs in the 🧵
Vertex AI now supports preference tuning (DPO) for Gemini 2.5 Flash and Flash-Lite, allowing you to use response pairs to adjust user preferences.
Code and docs in the 🧵
We just released a whitepaper, "Context Engineering: Sessions and Memory," detailing how memory evolves from raw conversations to curated agent knowledge.
You can find the paper here: lnkd.in/euud4BUB
We just released a whitepaper, "Context Engineering: Sessions and Memory," detailing how memory evolves from raw conversations to curated agent knowledge.
You can find the paper here: lnkd.in/euud4BUB
Last week, in the latest ADK release (v1.18.0), the team introduced a low-code Visual Agent Builder, along with new observability and testing features.
Release notes and a blog from Thomas Chong about the visual builder in the 🧵
Last week, in the latest ADK release (v1.18.0), the team introduced a low-code Visual Agent Builder, along with new observability and testing features.
Release notes and a blog from Thomas Chong about the visual builder in the 🧵
Trusting an agent's memory is tricky. Is chat info verified? Without history, it's guess. Memory Revisions (preview) helps with version control through snapshots for each change.
Code & doc in 🧵
Trusting an agent's memory is tricky. Is chat info verified? Without history, it's guess. Memory Revisions (preview) helps with version control through snapshots for each change.
Code & doc in 🧵
Vertex AI launched major updates to Vertex AI Agent Builder for easier deployment and scaling agents in production.
I'm working with the Agent Engine team to release content on our new features, starting today with Memory Bank. Stay tuned!
Vertex AI launched major updates to Vertex AI Agent Builder for easier deployment and scaling agents in production.
I'm working with the Agent Engine team to release content on our new features, starting today with Memory Bank. Stay tuned!
It's a walkthrough that compares Llama 4 (baseline vs. EAGLE) on 8x H100s and includes the code patch needed to make vLLM work.
Hope it helps! 👇
It's a walkthrough that compares Llama 4 (baseline vs. EAGLE) on 8x H100s and includes the code patch needed to make vLLM work.
Hope it helps! 👇
Can't wait to share!
#VertexAI #LLMs #Benchmarking #Optimization #ModelGarden #LLMServing
Can't wait to share!
#VertexAI #LLMs #Benchmarking #Optimization #ModelGarden #LLMServing
This time, Amit and I welcomed a special guest: Ravin Kumar from Google DeepMind. He shared insights on building open models with agentic capabilities.
Stay tuned! The episode will soon be on the Google Cloud Tech YouTube channel👇
This time, Amit and I welcomed a special guest: Ravin Kumar from Google DeepMind. He shared insights on building open models with agentic capabilities.
Stay tuned! The episode will soon be on the Google Cloud Tech YouTube channel👇
Check out the course and blog in the 🧵 And stay tuned...more is coming!
Check out the course and blog in the 🧵 And stay tuned...more is coming!
The team rolled out features focused on secure code execution, stateful debugging, and better DevX.
Check out the full release notes in 🧵. And keep an eye on the repo for the next ADK community call!
The team rolled out features focused on secure code execution, stateful debugging, and better DevX.
Check out the full release notes in 🧵. And keep an eye on the repo for the next ADK community call!
RVSP and tutorial in the 🧵
RVSP and tutorial in the 🧵
We launched a Terraform module for agent deployment on Vertex AI, initially needing a local Python script and GCS uploads.
The new blueprint automates agent serialization and packaging during the Terraform apply cycle.
Link in the 🧵
We launched a Terraform module for agent deployment on Vertex AI, initially needing a local Python script and GCS uploads.
The new blueprint automates agent serialization and packaging during the Terraform apply cycle.
Link in the 🧵
Building multi-agent systems using various models and frameworks can be challenging. So today I spent some time on what you can build and deploy with ADK, MCP, A2A, Agent Engine, and Vertex AI.
Full tutorial dropping soon!
Building multi-agent systems using various models and frameworks can be challenging. So today I spent some time on what you can build and deploy with ADK, MCP, A2A, Agent Engine, and Vertex AI.
Full tutorial dropping soon!
Google and vLLM announced a new backend uses tpu-inference for efficient PyTorch and JAX models on TPUs.
Check out the full blog and try it on Vertex AI with the new vLLM TPU container!
Google and vLLM announced a new backend uses tpu-inference for efficient PyTorch and JAX models on TPUs.
Check out the full blog and try it on Vertex AI with the new vLLM TPU container!
Model as a Service (MaaS) gives access to large open models via a managed, serverless API, removing the need for your infrastructure.
Check the new documentation in 🧵 to learn more
Model as a Service (MaaS) gives access to large open models via a managed, serverless API, removing the need for your infrastructure.
Check the new documentation in 🧵 to learn more
Check out the two new tutorials in 🧵 on building AI agents using the Agent Development Kit (ADK) + Vertex AI Agent Engine for Sessions & Memory.
Check out the two new tutorials in 🧵 on building AI agents using the Agent Development Kit (ADK) + Vertex AI Agent Engine for Sessions & Memory.
Vertex AI launched a Terraform resource to deploy agents using custom classes or agentic frameworks like ADK.
Check out the notebook and blog post for the full code and a step-by-step guide in the 🧵
Vertex AI launched a Terraform resource to deploy agents using custom classes or agentic frameworks like ADK.
Check out the notebook and blog post for the full code and a step-by-step guide in the 🧵
Join us next week for the 1st ADK community call! In this 1-hour session, we'll share the technical roadmap, address technical questions and discuss contributions.
🗓️ Date: October 15, 2025
⏰ Time: 9:30 AM - 10:30 AM PST
🔗 Virtual (links in 🧵)
Join us next week for the 1st ADK community call! In this 1-hour session, we'll share the technical roadmap, address technical questions and discuss contributions.
🗓️ Date: October 15, 2025
⏰ Time: 9:30 AM - 10:30 AM PST
🔗 Virtual (links in 🧵)
Vertex AI Model Garden just launched the google_vertex_ai_endpoint_with_model_garden_deployment Terraform resource to manage your open model deployment (Hugging Face or Model Garden) with one unique main.tf.
Docs and code in the 🧵
Vertex AI Model Garden just launched the google_vertex_ai_endpoint_with_model_garden_deployment Terraform resource to manage your open model deployment (Hugging Face or Model Garden) with one unique main.tf.
Docs and code in the 🧵