The sort of small wins that accumulate into a real career in AI.
When I started grad school, AI professors didn't have space for me in their groups, and when I finished I had no papers at NeurIPS/ICLR/ICML, yet the process can still work.
www.interconnects.ai/p/my-path-in...
An in-depth analysis of the DeepSeek-V3/R1 model architecture and its AI infrastructure, highlighting key innovations such as Multi-head Latent Attention (MLA) for enhanced memory efficiency
Have not tried it, but per them - For building complex full-stack projects - Code, prompt, branch, pull request
Nvidia releases CUTLASS 4.0, which ships with CuTe DSL, a programming language that is fully consistent with CuTe C++ in its programming model, APIs, abstraction level, and performance.
v6e-1 (Trillium) TPUs on Google Colab!! Twice the high-bandwidth memory of v5e-1 (32GB vs. 16GB) and a whopping peak rating of 918 BF16 TFLOPS (nearly 3x A100)!
Cognition Labs trained a model using RL for writing CUDA kernels. It outperforms top reasoning models (o3 & o4-mini) in writing CUDA code.
Blogpost: cognition.ai/blog/kevin-32b
HuggingFace: huggingface.co/cognition-ai...
The NIH BioArt Source is an awesome library of *free* professionally drawn illustrations for scientific presentations or figures. Downloadable in HD. Thank you NIH for this invaluable tool 🙏!
Check it out 👇
bioart.niaid.nih.gov
Tech Report: pub.sakana.ai/ai-scientist...
GitHub: github.com/SakanaAI/AI-...
This work is a proud collaboration between Sakana AI, UBC, and Oxford University.
books.apple.com/jp/book/%E7%9...
To reflect on my recent progress in #AI #upskilling, I built a #DataScience notebook on #Kaggle to analyze learning achievements across Microsoft Learn and Google Cloud Skills Boost using #Python and #MachineLearning.
See below
👇
Data: www.kaggle.com/datasets/ash...
Tools: R (tidytext, ggplot) and Affinity Designer.
Code: github.com/araffin/ardu...
Blog post:
medium.com/@araffin/sim...
www.manning.com/books/build-...
Find it on Leanpub!