Samrat Man Singh
samrat.me
Samrat Man Singh
@samrat.me
Software engineer; climber; #ElixirLang

Website: https://samrat.me

Building https://octocut.com
Attending this later today. Anyone else I know going?
August 19, 2025 at 12:54 PM
Great presentations and conversation at the Elixir Berlin Meetup today

1/ Lukas talked about his big bag of Postgres tricks and techniques

#elixirlang
June 12, 2025 at 8:39 PM
Reposted by Samrat Man Singh
I've just released Handoff, a new Elixir library for distributed function graph execution.

hexdocs.pm/handoff

Among other things, the library enables distributed Nx computations (anyone said distributed inference?)

A huge part of this was sponsored by tvlabs.ai!
Handoff v0.1.0 — Documentation
hexdocs.pm
May 11, 2025 at 5:47 AM
Reposted by Samrat Man Singh
Introducing Tidewave: tidewave.ai

While working on our web apps, we run code, query the database, read logs, search docs… but our AI tools are limited to reading code.

Tidewave closes this gap by introducing Runtime Intelligence, for Phoenix and Rails. More coming soon, check out the website!
Tidewave
tidewave.ai
April 29, 2025 at 8:20 PM
Demo: Making phone calls from Elixir

I'm working on a Twilio plugin for the Membrane multimedia framework. The demo shows an outbound call, but receiving calls works too.

#ElixirLang

www.youtube.com/watch?v=4dhl...
Making phone calls from Elixir using Twilio
YouTube video by Samrat Man Singh
www.youtube.com
March 27, 2025 at 5:13 PM
Testing Voice AI flows can become tedious fast

But... you can just simulate conversations using another voice agent. Then, evaluate how the conversation went.
March 26, 2025 at 11:17 AM
Got a small, standalone Python script working to use the new OpenAI transcription models(`gpt-4o-transcribe` and `gpt-4o-mini-transcribe`) using the Realtime WebSocket connection:

samrat.me/standalone-o...
Standalone OpenAI transcription Python script
OpenAI announced some new audio models yesterday, including new transcription models(gpt-4o-transcribe and gpt-4o-mini-transcribe). This model performs better than Whisper, though because they are bui...
samrat.me
March 21, 2025 at 1:46 PM
OpenAI's new TTS looks(and sounds) pretty great for the price.

Also, I hope this pushes other providers to just price API usage by minute. Every other TTS provider(ElevenLabs, Cartesia, etc) currently have monthly credits pricing.
March 20, 2025 at 5:37 PM
Phoenix now comes with DaisyUI. Looks like a great addition for non-designers(like me)

#ElixirLang
March 20, 2025 at 3:07 PM
New #ElixirLang tool

LivebookTools lets you work on Livebook in your own editor and opens MCP access to cell outputs(so Cursor, etc) can automatically iterate on the code if the cell has errors

github.com/thmsmlr/live...
GitHub - thmsmlr/livebook_tools: Powertools for livebook.dev — AI Code Editing, MCP Servers, and Running Livebooks from the CLI
Powertools for livebook.dev — AI Code Editing, MCP Servers, and Running Livebooks from the CLI - thmsmlr/livebook_tools
github.com
March 9, 2025 at 6:37 AM
Reposted by Samrat Man Singh
Pushed my first @replicate.com model, SmolVLM2 by @hf.co (@pcuenq.hf.co et al.)

Will push the 500M and 2.2B models soon as well.
March 4, 2025 at 5:13 AM
Voice models are getting scary good. This one feels very human:

www.sesame.com/research/cro...
Crossing the uncanny valley of conversational voice
At Sesame, our goal is to achieve “voice presence”—the magical quality that makes spoken interactions feel real, understood, and valued.
www.sesame.com
March 1, 2025 at 4:26 PM
I wrote up a post on how you can run AI models on Elixir that aren't yet supported by Bumblebee

samrat.me/running-ml-m...

#MyElixirStatus #ElixirLang
Running ML models in Elixir using Pythonx
I just discovered Pythonx, which runs a Python interpreter in the same OS process as Elixir. Looks like it also wraps uv so it sets up a virtual environment with all the Python dependencies your code...
samrat.me
February 20, 2025 at 11:24 PM
Wow, this looks super useful for running models that aren't yet supported in Bumblebee(among other things)

#ElixirLang

github.com/livebook-dev...
GitHub - livebook-dev/pythonx: Python interpreter embedded in Elixir
Python interpreter embedded in Elixir. Contribute to livebook-dev/pythonx development by creating an account on GitHub.
github.com
February 20, 2025 at 8:58 PM
Reposted by Samrat Man Singh
Holy smokes this is awesome, a new debugger for Phoenix LiveView. Think React devtools, but not a chrome extension.

github.com/software-man...

#ElixirLang
February 7, 2025 at 2:37 PM
Reposted by Samrat Man Singh
We did it! Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️

Generate 10 seconds of speech in ~1 second for $0.

What will you build? 🔥
February 7, 2025 at 5:03 PM
Reposted by Samrat Man Singh
Interested in how to build voice AI apps in Elixir?

Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini

#ElixirLang

blog.swmansion.com/building-a-m...
Building a Membrane Pipeline to Talk to Google Gemini
Elixir, together with the Membrane framework, is great for building pipelines that enable seamless communication between users and LLMs.
blog.swmansion.com
February 5, 2025 at 5:08 AM
Interested in how to build voice AI apps in Elixir?

Here's a blog post I wrote about building a Membrane pipeline that orchestrates audio/video communication between a browser and Google Gemini

#ElixirLang

blog.swmansion.com/building-a-m...
Building a Membrane Pipeline to Talk to Google Gemini
Elixir, together with the Membrane framework, is great for building pipelines that enable seamless communication between users and LLMs.
blog.swmansion.com
February 5, 2025 at 5:08 AM
Reposted by Samrat Man Singh
Had some fun generating speech using the Elixir Kokoro implementation by @samrat.me.

Made a script for typing stuff and getting the talky-talk.

github.com/samrat/kokor...

So a bit of ONNX, some Erlang Ports, Membrane :D

#elixirlang
Mix.install-based example beyond raw audio · Issue #4 · samrat/kokoro
This will start prompting your terminal for input and speak it to your default audio output. I have this other example unrelated to TTS that includes working with files and MP3s. #!/usr/bin/env eli...
github.com
February 4, 2025 at 7:42 AM
Reworking the Livebook for Kokoro(🔊 text-to-speech model).

Now, you get a text input to input text you want read out loud(or whispered) to you.

#MyElixirStatus #ElixirLang
January 31, 2025 at 4:15 PM
Writing up a blog post about this. Not sure how much of it should be an introduction to Membrane.

#ElixirLang
Got Gemini Multimodal Live API working with Membrane WebRTC.

Here's me having a short chat with it

#ElixirLang

youtu.be/JxWE3pxyGqA
Talking to Gemini Multimodal Live with Membrane WebRTC
YouTube video by Samrat Man Singh
youtu.be
January 26, 2025 at 5:02 PM
Elixir bindings for Kokoro TTS github.com/samrat/kokoro

#ElixirLang
January 25, 2025 at 11:35 AM
Oops, forgot to tag #ElixirLang
More voice AI exploration:

Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex

Demo 👇

youtu.be/VFKX6Af9gs4
Elixir text-to-speech with Kokoro
YouTube video by Samrat Man Singh
youtu.be
January 7, 2025 at 10:22 PM
More voice AI exploration:

Bumblebee doesn't currently support text-to-speech models, but managed to get Kokoro TTS working with Ortex

Demo 👇

youtu.be/VFKX6Af9gs4
Elixir text-to-speech with Kokoro
YouTube video by Samrat Man Singh
youtu.be
January 7, 2025 at 10:19 PM