leonie
banner
iamleonie.bsky.social
leonie
@iamleonie.bsky.social
I do Machine Learning at Weaviate and write about it on the internet.
Make RAG results more trustworthy with citations.

In his latest recipe, @danman966.bsky.social shows you how you can build a RAG pipeline with citations, using:
- a @weaviate.bsky.social vector database and
- @anthropic.com's Claude 3.5 Sonnet

📌 Code: github.com/weaviate/rec...
February 11, 2025 at 3:48 PM
I’m trying to wrap my head around multi-agent system architectures.

Here are some patterns I’m seeing so far:

1. Type of collaboration:
Network vs. hierarchical

2. Type of information flow:
Sequential vs. parallel vs. loop

3. Type of functionality:
Routing vs. aggregating

What else?
January 28, 2025 at 5:00 PM
#1 Rule of RAG Club: Look at your data.

With the new explorer tool, looking at your data got a lot easier in Weaviate Cloud.

The explorer tool provides a graphical interface to easily:
• Browse collections
• Inspect objects, metadata, and vectors

Check it out now: https://buff.ly/3KWivSF
January 22, 2025 at 4:01 PM
You can be GPU poor like me and still fine-tune an LLM.

Here’s how you can fine-tune Gemma 2 in a Kaggle notebook on a single T4 GPU:
@kaggle.com offers 30 hours/week of GPUs for free
@unsloth.bsky.social uses 60% less memory to fit it on a T4 GPU

🔗Code: https://buff.ly/4apUUG2
January 21, 2025 at 4:00 PM
I talk about RAG so much, I could fill a book.

So, we did - and you can download it for free.

Together with my colleagues Mary & Prajjwal, we curated an e-book of the most effective advanced RAG techniques.

Which ones did we miss?

Get it now: weaviate.io/ebooks/advan...
January 16, 2025 at 12:46 PM
Over the holidays, I learned how to fine-tune an LLM.

Here’s my entry for the latest @kaggle.com comp.

This tutorial shows you:
• Fine-tune Gemma 2
• LoRA fine-tuning with @unsloth.bsky.social on T4 GPU
• Experiment tracking with @weightsbiases.bsky.social

🔗Code: www.kaggle.com/code/iamleon...
January 15, 2025 at 12:50 PM
Got myself a little early Christmas present.

Although this book is from 2017, I heard so many good things about it this year.

Can't wait to dig into this over the holidays.

And with that being said, I hope you have some nice and relaxing holidays yourself!

See you in the new year!
December 19, 2024 at 3:55 PM
It’s time to review the AI space in 2024!

Here’s what I got right (and what I missed) in my 2024 predictions:

✅ Evaluation
❌ Multimodal foundation models
❌ Fine-tuning open-weight models and quantization
❌ AI agents
✅ RAG lives on
❌ Knowledge graphs

medium.com/towards-data...
December 17, 2024 at 6:31 PM
日本語テキスト向けのハイブリッド検索には日本語テキス用のトークナイザーが必要です。

@weaviate.bsky.socialでは3つのトークナイザーを使用することができます。

一つずつのメリットとデメリットはこちら
weaviate.io/blog/hybrid-...
December 17, 2024 at 2:41 PM
ハイブリッド検索とは何?

ハイブリッド検索は、デンスベクトルとスパースベクトルを統合して、それぞれの検索手法の利点を活かします。

この記事では、Weaviateの日本語テキスト向けのハイブリッド検索の説明をします。

- 日本語テキス用のトークナイザーを使用するキーワード検索
- ベクトル検索
- 融合アルゴリズム

詳しくはこちら
https://buff.ly/49yMR9K
December 10, 2024 at 11:02 PM
Look what came in the mail today!

This is already the 2nd edition of “Developing apps with GPT-4” by Olivier and Marie-Alice I had the pleasure to review.

This edition covers the latest advancements in GPT-4, especially regarding its visual capabilities to build multimodal applications.
December 4, 2024 at 5:03 PM
Struggling with RAG over PDF files?

You might want to give Docling a try.

𝗪𝗵𝗮𝘁'𝘀 𝗗𝗼𝗰𝗹𝗶𝗻𝗴?
• Python package by IBM
• OS (MIT license)
• PDF, DOCX, PPTX → Markdown, JSON

𝗪𝗵𝘆 𝘂𝘀𝗲 𝗗𝗼𝗰𝗹𝗶𝗻𝗴?
• Doesn’t require fancy gear, lots of memory, or cloud services
• Works on regular computers or Google Colab Pro
November 28, 2024 at 1:34 PM
Yes, you don’t need a vector database to do vector search.

@victorialslocum.bsky.social shows you how - using just numpy.

This article covers:
• How does vector search work?
• How to do vector search from scratch in Python
• and more

Learn more: weaviate.io/blog/vector-...
November 26, 2024 at 4:13 PM
Struggling with slow filtered vector search?

Here's how Weaviate's researchers 10x'ed query speeds with ACORN:
November 20, 2024 at 10:00 AM
Hi, I am Leonie!
(Yes, that's the handle)

I do machine learning at Weaviate and write about it on the Internet.
medium.com/@iamleonie

You might know me for my monochrome technical visuals.
November 20, 2024 at 9:10 AM