Gemini: "Here is F.L.O.O.R. (First-person Lino Observation & Ornamental Review)."
Pretty good!
Gemini: "Here is F.L.O.O.R. (First-person Lino Observation & Ornamental Review)."
Pretty good!
- SAM 3 enables detecting, segmenting and tracking of objects across images and videos, now with short text phrases and exemplar prompts.
ai.meta.com/blog/segment...
- SAM 3 enables detecting, segmenting and tracking of objects across images and videos, now with short text phrases and exemplar prompts.
ai.meta.com/blog/segment...
JiTs are simple large-patch Transformers that operate on raw pixels, no tokenizer, pre-training, or extra losses needed. JiT excels in high-dimensional spaces where traditional noise-predicting models can fail.
JiTs are simple large-patch Transformers that operate on raw pixels, no tokenizer, pre-training, or extra losses needed. JiT excels in high-dimensional spaces where traditional noise-predicting models can fail.
AgentEvolver integrates three synergistic mechanisms: Self-Questioning, Self-Navigating, and Self-Attributing; to systematically address critical bottlenecks in Agent RL training, including task scarcity,
AgentEvolver integrates three synergistic mechanisms: Self-Questioning, Self-Navigating, and Self-Attributing; to systematically address critical bottlenecks in Agent RL training, including task scarcity,
Karpathy
An agentic ML Engineer that trains state-of-the-art ML models using Claude Code SDK and Google ADK. This is a very simple implemenation demoing the power of Claude skills for ML.
Karpathy
An agentic ML Engineer that trains state-of-the-art ML models using Claude Code SDK and Google ADK. This is a very simple implemenation demoing the power of Claude skills for ML.
For more than one reason...
For more than one reason...
Video models, world models... They all aim to give genAI better understanding of our world, where text is still limited.
ifm.mbzuai.ac.ae/pan/
Video models, world models... They all aim to give genAI better understanding of our world, where text is still limited.
ifm.mbzuai.ac.ae/pan/
“Big tech has made their choice”
www.404media.co/google-has-c...
“Big tech has made their choice”
www.404media.co/google-has-c...
And rather than driving the rich away, IPS researchers found that the number of millionaires has *increased.*
Tax the rich. Greg Ryan in @bloomberg.com:
ByteDance unveils China’s most affordable AI coding agent at just US$1.30 a month
www.scmp.com/tech/big-tec...
ByteDance unveils China’s most affordable AI coding agent at just US$1.30 a month
www.scmp.com/tech/big-tec...
Multi-Vector Retrieval via Fixed Dimensional Encodings is an interesting approach by Google Research. It transforms multi-vector representations into single fixed-size vectors (fixed dimensional encodings).
Multi-Vector Retrieval via Fixed Dimensional Encodings is an interesting approach by Google Research. It transforms multi-vector representations into single fixed-size vectors (fixed dimensional encodings).
Godot games really improved in quality and variety !
www.youtube.com/watch?v=7ZwE...
Godot games really improved in quality and variety !
www.youtube.com/watch?v=7ZwE...
I don't know..... this doesn’t exactly sound like a good deal for Perplexity.
I don't know..... this doesn’t exactly sound like a good deal for Perplexity.
A gentle and comprehensive introduction to the DeltaNet
Part 1: sustcsonglin.github.io/blog/2024/de...
Part 2: sustcsonglin.github.io/blog/2024/de...
Part 3: sustcsonglin.github.io/blog/2024/de...
A gentle and comprehensive introduction to the DeltaNet
Part 1: sustcsonglin.github.io/blog/2024/de...
Part 2: sustcsonglin.github.io/blog/2024/de...
Part 3: sustcsonglin.github.io/blog/2024/de...