Ian Johnson 🔬🤖
enjalot.bsky.social
Data Visualization and Machine Learning
Building Latent Scope to visualize unstructured data through the lens of ML
github.com/enjalot/latent-scope
mean pooling
March 18, 2025 at 1:59 PM
implemented a new rendering component for latent scope's scatter plot. had to replace regl-scatterplot with d3-zoom + regl shaders so we could support mobile
January 23, 2025 at 12:37 AM
Reposted by Ian Johnson 🔬🤖
I'll be at @unireps.bsky.social this Saturday presenting a new experimental pipeline to visually explore structured neural network representations. The core idea is to take thousands of prompts that activate a concept, and then cluster and draw them using MultiDiffusion. 🧵👇
December 11, 2024 at 11:18 PM
am i missing something for handling image data in parquet files?

I can load a dataset from HF like:
from datasets import load_dataset
import pandas as pd
dataset = load_dataset("Marqo/marqo-ge-sample", split='google_shopping')
df = pd.DataFrame(dataset)
but i need to convert the images to bytes if I want to do:
df.to_parquet("sample.parquet")
December 10, 2024 at 7:51 PM
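One way to do the bytes conversion described in the post above is to serialize each image to PNG before writing. This is only a sketch under assumptions, not a confirmed fix for this dataset: the helper names and the `image` column name are hypothetical, and it assumes the cells hold PIL-style objects exposing `.save(fp, format=...)`.

```python
import io


def image_to_png_bytes(image):
    """Serialize a PIL-style image (anything with .save(fp, format=...)) to PNG bytes."""
    buffer = io.BytesIO()
    image.save(buffer, format="PNG")
    return buffer.getvalue()


def write_parquet_with_image_bytes(df, path, image_column="image"):
    """Copy df, replace the image column with PNG bytes, then write Parquet.

    `image_column` is a hypothetical name; check df.columns for the real one.
    """
    encoded = df.copy()  # leave the caller's DataFrame untouched
    encoded[image_column] = encoded[image_column].apply(image_to_png_bytes)
    encoded.to_parquet(path)
    return encoded
```

With the `df` from the post, the call would look like `write_parquet_with_image_bytes(df, "sample.parquet")`. Some image modes may need converting (for example with `image.convert("RGB")`) before they can be saved as PNG.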
Reposted by Ian Johnson 🔬🤖
“They said it could not be done”. We’re releasing Pleias 1.0, the first suite of models trained on open data (either permissibly licensed or uncopyrighted): Pleias-3b, Pleias-1b and Pleias-350m, all based on the two trillion tokens set from Common Corpus.
December 5, 2024 at 4:39 PM
the algorithm is not some deity but a landscape, the feed is an uber ride across the manifold, only the windows are blacked out. what if you had a map of the algorithm? what if the UX of the feed let you look out of the window?

musing with @infowetrust.com
image from distill.pub/2017/aia/
December 5, 2024 at 1:33 AM
Reposted by Ian Johnson 🔬🤖
Spent the day playing with this. I'm absolutely blown away @enjalot.bsky.social!

- Choose any embedding from HF
- Project with UMAP, cluster with HDBSCAN
- Use Ollama to label the clusters (Works incredibly well!)
Latent Scope
enjalot.github.io
December 3, 2024 at 4:03 PM
😙👌📊📈📚
Visionary Press holiday deals are live. 🎁 Biggest discounts of the year + free 🇺🇸 shipping code: holidayshipping

The next week can make or break a small business like visionarypress.com — if you love our work I appreciate your sharing.
Visionary Press
Celebrate information graphics!
visionarypress.com
December 3, 2024 at 2:22 AM
what's crazy to me is that so many of these can be run very efficiently on an M1 MacBook Pro, and just fine on a VM with only CPU.
crazy how much value you can pull out of text without billions of parameters
If you're interested in embedding models for retrieval (search), clustering, classification, paraphrase mining, etc., then there's now 10,000 fully free and open source options on @hf.co via Sentence Transformers.

Check out the most popular ones here: huggingface.co/models?libra...
November 29, 2024 at 5:08 PM
Reposted by Ian Johnson 🔬🤖
If you're interested in embedding models for retrieval (search), clustering, classification, paraphrase mining, etc., then there's now 10,000 fully free and open source options on @hf.co via Sentence Transformers.

Check out the most popular ones here: huggingface.co/models?libra...
November 29, 2024 at 4:40 PM
Hidden States is happening next week in SF!

It's a one-day unconference gathering researchers, designers, prototypers and engineers interested in pushing the boundaries of AI interfaces, going below the API and working with the hidden states.

hiddenstates.org
November 26, 2024 at 6:23 PM
One way I've been thinking about ML models for some time is as a lens.

The weights are crystalized patterns whose structure emerges from the crushing pressures of backpropagation.

By shining a piece of data through this lens you see the patterns diffracted in the hidden states.
November 21, 2024 at 7:23 PM
allow me to reintroduce myself!
I'm a prototyper and Data Alchemist interested in using machine learning for data visualization.

I'm building github.com/enjalot/late... using the lessons learned from co-authoring these 4 distill.pub papers
November 15, 2024 at 4:56 PM
I'm here now! excited to meet up with folks in SF on Dec 3rd
Posting for Ian Johnson from other place - unconf Dec 3 in San Fran on embeddings/SAE etc

hiddenstates.org
November 13, 2024 at 1:19 PM
Reposted by Ian Johnson 🔬🤖
Posting for Ian Johnson from other place - unconf Dec 3 in San Fran on embeddings/SAE etc

hiddenstates.org
November 7, 2024 at 7:13 AM
👋
November 13, 2024 at 11:55 AM