Samrat Man Singh
@samrat.me
Attending this later today. Anyone else I know going?
August 19, 2025 at 12:54 PM
Attending this later today. Anyone else I know going?
Great presentations and conversation at the Elixir Berlin Meetup today
1/ Lukas talked about his big bag of Postgres tricks and techniques
#elixirlang
1/ Lukas talked about his big bag of Postgres tricks and techniques
#elixirlang
June 12, 2025 at 8:39 PM
Great presentations and conversation at the Elixir Berlin Meetup today
1/ Lukas talked about his big bag of Postgres tricks and techniques
#elixirlang
1/ Lukas talked about his big bag of Postgres tricks and techniques
#elixirlang
Reposted by Samrat Man Singh
I've just released Handoff, a new Elixir library for distributed function graph execution.
hexdocs.pm/handoff
Among other things, the library enables distributed Nx computations (anyone said distributed inference?)
A huge part of this was sponsored by tvlabs.ai!
hexdocs.pm/handoff
Among other things, the library enables distributed Nx computations (anyone said distributed inference?)
A huge part of this was sponsored by tvlabs.ai!
Handoff v0.1.0 — Documentation
hexdocs.pm
May 11, 2025 at 5:47 AM
I've just released Handoff, a new Elixir library for distributed function graph execution.
hexdocs.pm/handoff
Among other things, the library enables distributed Nx computations (anyone said distributed inference?)
A huge part of this was sponsored by tvlabs.ai!
hexdocs.pm/handoff
Among other things, the library enables distributed Nx computations (anyone said distributed inference?)
A huge part of this was sponsored by tvlabs.ai!
Reposted by Samrat Man Singh
Introducing Tidewave: tidewave.ai
While working on our web apps, we run code, query the database, read logs, search docs… but our AI tools are limited to reading code.
Tidewave closes this gap by introducing Runtime Intelligence, for Phoenix and Rails. More coming soon, check out the website!
While working on our web apps, we run code, query the database, read logs, search docs… but our AI tools are limited to reading code.
Tidewave closes this gap by introducing Runtime Intelligence, for Phoenix and Rails. More coming soon, check out the website!
Tidewave
tidewave.ai
April 29, 2025 at 8:20 PM
Introducing Tidewave: tidewave.ai
While working on our web apps, we run code, query the database, read logs, search docs… but our AI tools are limited to reading code.
Tidewave closes this gap by introducing Runtime Intelligence, for Phoenix and Rails. More coming soon, check out the website!
While working on our web apps, we run code, query the database, read logs, search docs… but our AI tools are limited to reading code.
Tidewave closes this gap by introducing Runtime Intelligence, for Phoenix and Rails. More coming soon, check out the website!
Demo: Making phone calls from Elixir
I'm working on a Twilio plugin for the Membrane multimedia framework. The demo shows an outbound call, but receiving calls works too.
#ElixirLang
www.youtube.com/watch?v=4dhl...
I'm working on a Twilio plugin for the Membrane multimedia framework. The demo shows an outbound call, but receiving calls works too.
#ElixirLang
www.youtube.com/watch?v=4dhl...
Making phone calls from Elixir using Twilio
YouTube video by Samrat Man Singh
www.youtube.com
March 27, 2025 at 5:13 PM
Demo: Making phone calls from Elixir
I'm working on a Twilio plugin for the Membrane multimedia framework. The demo shows an outbound call, but receiving calls works too.
#ElixirLang
www.youtube.com/watch?v=4dhl...
I'm working on a Twilio plugin for the Membrane multimedia framework. The demo shows an outbound call, but receiving calls works too.
#ElixirLang
www.youtube.com/watch?v=4dhl...
Testing Voice AI flows can become tedious fast
But... you can just simulate conversations using another voice agent. Then, evaluate how the conversation went.
But... you can just simulate conversations using another voice agent. Then, evaluate how the conversation went.
March 26, 2025 at 11:17 AM
Testing Voice AI flows can become tedious fast
But... you can just simulate conversations using another voice agent. Then, evaluate how the conversation went.
But... you can just simulate conversations using another voice agent. Then, evaluate how the conversation went.
Got a small, standalone Python script working to use the new OpenAI transcription models(`gpt-4o-transcribe` and `gpt-4o-mini-transcribe`) using the Realtime WebSocket connection:
samrat.me/standalone-o...
samrat.me/standalone-o...
Standalone OpenAI transcription Python script
OpenAI announced some new audio models yesterday, including new transcription models(gpt-4o-transcribe and gpt-4o-mini-transcribe). This model performs better than Whisper, though because they are bui...
samrat.me
March 21, 2025 at 1:46 PM
Got a small, standalone Python script working to use the new OpenAI transcription models(`gpt-4o-transcribe` and `gpt-4o-mini-transcribe`) using the Realtime WebSocket connection:
samrat.me/standalone-o...
samrat.me/standalone-o...
OpenAI's new TTS looks(and sounds) pretty great for the price.
Also, I hope this pushes other providers to just price API usage by minute. Every other TTS provider(ElevenLabs, Cartesia, etc) currently have monthly credits pricing.
Also, I hope this pushes other providers to just price API usage by minute. Every other TTS provider(ElevenLabs, Cartesia, etc) currently have monthly credits pricing.
March 20, 2025 at 5:37 PM
OpenAI's new TTS looks(and sounds) pretty great for the price.
Also, I hope this pushes other providers to just price API usage by minute. Every other TTS provider(ElevenLabs, Cartesia, etc) currently have monthly credits pricing.
Also, I hope this pushes other providers to just price API usage by minute. Every other TTS provider(ElevenLabs, Cartesia, etc) currently have monthly credits pricing.
New #ElixirLang tool
LivebookTools lets you work on Livebook in your own editor and opens MCP access to cell outputs(so Cursor, etc) can automatically iterate on the code if the cell has errors
github.com/thmsmlr/live...
LivebookTools lets you work on Livebook in your own editor and opens MCP access to cell outputs(so Cursor, etc) can automatically iterate on the code if the cell has errors
github.com/thmsmlr/live...
GitHub - thmsmlr/livebook_tools: Powertools for livebook.dev — AI Code Editing, MCP Servers, and Running Livebooks from the CLI
Powertools for livebook.dev — AI Code Editing, MCP Servers, and Running Livebooks from the CLI - thmsmlr/livebook_tools
github.com
March 9, 2025 at 6:37 AM
New #ElixirLang tool
LivebookTools lets you work on Livebook in your own editor and opens MCP access to cell outputs(so Cursor, etc) can automatically iterate on the code if the cell has errors
github.com/thmsmlr/live...
LivebookTools lets you work on Livebook in your own editor and opens MCP access to cell outputs(so Cursor, etc) can automatically iterate on the code if the cell has errors
github.com/thmsmlr/live...
Reposted by Samrat Man Singh
What does talk-prep look like? Bug-fixin' mostly
@samrat.me darling, you have a PR :D
github.com/samrat/kokor...
@samrat.me darling, you have a PR :D
github.com/samrat/kokor...
Fix concatenation issue for tensors by lawik · Pull Request #5 · samrat/kokoro
Nx.concatenate requires the same shape and very frequently a random chunk of text would not be the right shape. So it would blow up.
The concatenate was avoidable so instead of a complex padding sc...
github.com
March 4, 2025 at 5:51 PM
What does talk-prep look like? Bug-fixin' mostly
@samrat.me darling, you have a PR :D
github.com/samrat/kokor...
@samrat.me darling, you have a PR :D
github.com/samrat/kokor...
Pushed my first @replicate.com model, SmolVLM2 by @hf.co (@pcuenq.hf.co et al.)
Will push the 500M and 2.2B models soon as well.
Will push the 500M and 2.2B models soon as well.
March 4, 2025 at 5:13 AM
Pushed my first @replicate.com model, SmolVLM2 by @hf.co (@pcuenq.hf.co et al.)
Will push the 500M and 2.2B models soon as well.
Will push the 500M and 2.2B models soon as well.
I wrote up a post on how you can run AI models on Elixir that aren't yet supported by Bumblebee
samrat.me/running-ml-m...
#MyElixirStatus #ElixirLang
samrat.me/running-ml-m...
#MyElixirStatus #ElixirLang
Running ML models in Elixir using Pythonx
I just discovered Pythonx, which runs a Python interpreter in the same OS process as Elixir.
Looks like it also wraps uv so it sets up a virtual environment with all the Python dependencies your code...
samrat.me
February 20, 2025 at 11:24 PM
I wrote up a post on how you can run AI models on Elixir that aren't yet supported by Bumblebee
samrat.me/running-ml-m...
#MyElixirStatus #ElixirLang
samrat.me/running-ml-m...
#MyElixirStatus #ElixirLang
Wow, this looks super useful for running models that aren't yet supported in Bumblebee(among other things)
#ElixirLang
github.com/livebook-dev...
#ElixirLang
github.com/livebook-dev...
GitHub - livebook-dev/pythonx: Python interpreter embedded in Elixir
Python interpreter embedded in Elixir. Contribute to livebook-dev/pythonx development by creating an account on GitHub.
github.com
February 20, 2025 at 8:58 PM
Wow, this looks super useful for running models that aren't yet supported in Bumblebee(among other things)
#ElixirLang
github.com/livebook-dev...
#ElixirLang
github.com/livebook-dev...
Reposted by Samrat Man Singh
Holy smokes this is awesome, a new debugger for Phoenix LiveView. Think React devtools, but not a chrome extension.
github.com/software-man...
#ElixirLang
github.com/software-man...
#ElixirLang
February 7, 2025 at 2:37 PM
Holy smokes this is awesome, a new debugger for Phoenix LiveView. Think React devtools, but not a chrome extension.
github.com/software-man...
#ElixirLang
github.com/software-man...
#ElixirLang
Reposted by Samrat Man Singh
We did it! Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️
Generate 10 seconds of speech in ~1 second for $0.
What will you build? 🔥
Generate 10 seconds of speech in ~1 second for $0.
What will you build? 🔥
February 7, 2025 at 5:03 PM
We did it! Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️
Generate 10 seconds of speech in ~1 second for $0.
What will you build? 🔥
Generate 10 seconds of speech in ~1 second for $0.
What will you build? 🔥
Reposted by Samrat Man Singh
Interested in how to build voice AI apps in Elixir?
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Building a Membrane Pipeline to Talk to Google Gemini
Elixir, together with the Membrane framework, is great for building pipelines that enable seamless communication between users and LLMs.
blog.swmansion.com
February 5, 2025 at 5:08 AM
Interested in how to build voice AI apps in Elixir?
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Interested in how to build voice AI apps in Elixir?
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Building a Membrane Pipeline to Talk to Google Gemini
Elixir, together with the Membrane framework, is great for building pipelines that enable seamless communication between users and LLMs.
blog.swmansion.com
February 5, 2025 at 5:08 AM
Interested in how to build voice AI apps in Elixir?
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini
#ElixirLang
blog.swmansion.com/building-a-m...
Reposted by Samrat Man Singh
Had some fun generating speech using the Elixir Kokoro implementation by @samrat.me.
Made a script for typing stuff and getting the talky-talk.
github.com/samrat/kokor...
So a bit of ONNX, some Erlang Ports, Membrane :D
#elixirlang
Made a script for typing stuff and getting the talky-talk.
github.com/samrat/kokor...
So a bit of ONNX, some Erlang Ports, Membrane :D
#elixirlang
Mix.install-based example beyond raw audio · Issue #4 · samrat/kokoro
This will start prompting your terminal for input and speak it to your default audio output. I have this other example unrelated to TTS that includes working with files and MP3s. #!/usr/bin/env eli...
github.com
February 4, 2025 at 7:42 AM
Had some fun generating speech using the Elixir Kokoro implementation by @samrat.me.
Made a script for typing stuff and getting the talky-talk.
github.com/samrat/kokor...
So a bit of ONNX, some Erlang Ports, Membrane :D
#elixirlang
Made a script for typing stuff and getting the talky-talk.
github.com/samrat/kokor...
So a bit of ONNX, some Erlang Ports, Membrane :D
#elixirlang
Reworking the Livebook for Kokoro(🔊 text-to-speech model).
Now, you get a text input to input text you want read out loud(or whispered) to you.
#MyElixirStatus #ElixirLang
Now, you get a text input to input text you want read out loud(or whispered) to you.
#MyElixirStatus #ElixirLang
January 31, 2025 at 4:15 PM
Reworking the Livebook for Kokoro(🔊 text-to-speech model).
Now, you get a text input to input text you want read out loud(or whispered) to you.
#MyElixirStatus #ElixirLang
Now, you get a text input to input text you want read out loud(or whispered) to you.
#MyElixirStatus #ElixirLang
Writing up a blog post about this. Not sure how much of it should be an introduction to Membrane.
#ElixirLang
#ElixirLang
Got Gemini Multimodal Live API working with Membrane WebRTC.
Here's me having a short chat with it
#ElixirLang
youtu.be/JxWE3pxyGqA
Here's me having a short chat with it
#ElixirLang
youtu.be/JxWE3pxyGqA
Talking to Gemini Multimodal Live with Membrane WebRTC
YouTube video by Samrat Man Singh
youtu.be
January 26, 2025 at 5:02 PM
Writing up a blog post about this. Not sure how much of it should be an introduction to Membrane.
#ElixirLang
#ElixirLang
January 25, 2025 at 11:35 AM
Oops, forgot to tag #ElixirLang
More voice AI exploration:
Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex
Demo 👇
youtu.be/VFKX6Af9gs4
Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex
Demo 👇
youtu.be/VFKX6Af9gs4
Elixir text-to-speech with Kokoro
YouTube video by Samrat Man Singh
youtu.be
January 7, 2025 at 10:22 PM
Oops, forgot to tag #ElixirLang
More voice AI exploration:
Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex
Demo 👇
youtu.be/VFKX6Af9gs4
Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex
Demo 👇
youtu.be/VFKX6Af9gs4
Elixir text-to-speech with Kokoro
YouTube video by Samrat Man Singh
youtu.be
January 7, 2025 at 10:19 PM
More voice AI exploration:
Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex
Demo 👇
youtu.be/VFKX6Af9gs4
Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex
Demo 👇
youtu.be/VFKX6Af9gs4