Ganesh
ganesh3.bsky.social
Machine Learning with Patents granted & filed | Current focus on LLM/GenAI | Graph Analytics
#Memento - new framework lets #LLM #agents learn from experience, no fine-tuning required.
The system has 3 main components: a #planner and a #tool-enabled executor, which work in an alternating loop to complete tasks, and a growing "case bank" that stores #past #experiences
venturebeat.com/ai/this-new-...
venturebeat.com
September 15, 2025 at 11:04 PM
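A minimal sketch of the loop described above: a planner and an executor alternate, and each finished task is written to a case bank that later tasks can draw on. Everything here is a hypothetical stand-in and not Memento's actual code. The word-overlap retrieval, the `TOOLS` table, the default plan, and the assumption that every run succeeds are all illustrative.

```python
from dataclasses import dataclass, field

@dataclass
class Case:
    task: str
    plan: list[str]
    success: bool

@dataclass
class CaseBank:
    cases: list[Case] = field(default_factory=list)

    def retrieve(self, task: str, k: int = 2) -> list[Case]:
        # Toy similarity (assumption): count of shared lowercase words.
        words = set(task.lower().split())
        ranked = sorted(
            self.cases,
            key=lambda c: len(words & set(c.task.lower().split())),
            reverse=True,
        )
        return ranked[:k]

    def add(self, case: Case) -> None:
        self.cases.append(case)

# Hypothetical tools the executor can call.
TOOLS = {
    "search": lambda task: f"notes on {task}",
    "answer": lambda task: f"answer to {task}",
}

def plan_next(history, similar):
    # Planner: follow the plan of the most similar successful case,
    # or a default plan when the case bank has nothing useful.
    template = next((c.plan for c in similar if c.success), ["search", "answer"])
    return template[len(history)] if len(history) < len(template) else None

def run_task(task: str, bank: CaseBank):
    similar = bank.retrieve(task)
    history = []
    # Alternate: the planner picks a step, the executor runs it with a tool.
    while (step := plan_next(history, similar)) is not None:
        history.append((step, TOOLS[step](task)))
    # Store the experience; success=True is assumed here for simplicity.
    bank.add(Case(task, [s for s, _ in history], success=True))
    return history
```

No model weights change anywhere in this loop; the only thing that grows is the case bank, which is the point of the "no fine-tuning" claim.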
Making #Kernel Development more accessible with #KernelLLM
We introduce KernelLLM, a large language model #LLM based on #Llama 3.1 Instruct, which has been trained specifically for the task of authoring #GPU kernels using #Triton
huggingface.co/facebook/Ker...
facebook/KernelLLM · Hugging Face
huggingface.co
May 20, 2025 at 10:50 PM
#miniCOIL: on the Road to Usable #Sparse #Neural Retrieval. Each word gets a vector that captures its contextual meaning, derived from a dense model; if a word was not seen during training, scoring falls back on #BM25, which relies on TF-IDF-style term statistics.
qdrant.tech/articles/min...
miniCOIL: on the Road to Usable Sparse Neural Retrieval - Qdrant
Introducing miniCOIL, a lightweight sparse neural retriever capable of generalization.
qdrant.tech
May 19, 2025 at 4:20 AM
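A toy illustration of the fallback idea in the post above, not Qdrant's implementation. The assumption here is that a term's BM25 contribution is scaled by the similarity of its meaning vectors when the term has them, and is left as plain BM25 otherwise. The two-dimensional vectors and the function names are made up for the example.

```python
import math

def idf(term, docs):
    # Standard BM25-style inverse document frequency.
    n = sum(1 for d in docs if term in d)
    return math.log(1 + (len(docs) - n + 0.5) / (n + 0.5))

def bm25_term(term, doc, docs, k1=1.2, b=0.75):
    tf = doc.count(term)
    if tf == 0:
        return 0.0
    avg_len = sum(len(d) for d in docs) / len(docs)
    norm = tf * (k1 + 1) / (tf + k1 * (1 - b + b * len(doc) / avg_len))
    return idf(term, docs) * norm

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def score(query, doc, docs, query_vecs, doc_vecs):
    total = 0.0
    for term in query:
        base = bm25_term(term, doc, docs)
        if term in query_vecs and term in doc_vecs:
            # Known word: weight the match by how close the meanings are.
            base *= max(0.0, cosine(query_vecs[term], doc_vecs[term]))
        # Unknown word: keep the plain BM25 contribution (the fallback).
        total += base
    return total
```

With this scoring, "bat" in a query about animals can rank a document about nocturnal bats above one about baseball, while an out-of-vocabulary word still matches on exact terms.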
#AlphaEvolve: A #Gemini-powered #coding agent for designing #advanced #algorithms. New #AI agent evolves algorithms for #math and practical applications in computing by combining the creativity of large language models #LLMs with automated #evaluators
deepmind.google/discover/blo...
AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms
New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators
deepmind.google
May 18, 2025 at 8:15 AM
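A rough sketch of the propose-and-evaluate loop the post describes. Here a random perturbation stands in for the LLM that proposes new candidates, and a simple numeric function stands in for the automated evaluator; both are assumptions for illustration and say nothing about how AlphaEvolve is actually built.

```python
import random

def evaluate(candidate: float) -> float:
    # Stand-in automated evaluator: higher is better, best at 3.0.
    return -(candidate - 3.0) ** 2

def propose(parent: float, rng: random.Random) -> float:
    # Stand-in for the LLM: a small random variation of the parent.
    return parent + rng.uniform(-1.0, 1.0)

def evolve(start: float, generations: int = 200, population: int = 4, seed: int = 0) -> float:
    rng = random.Random(seed)
    best = start
    for _ in range(generations):
        children = [propose(best, rng) for _ in range(population)]
        # Keep whichever candidate the evaluator scores highest,
        # including the current best, so the score never gets worse.
        best = max(children + [best], key=evaluate)
    return best
```

The evaluator is what keeps the creative proposals honest: a candidate only survives if it measurably scores at least as well as the current best.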
#Orbital converts trained #scikit-#learn #pipelines into pure #SQL, enabling #machinelearning model execution directly within #databases—no #Python runtime needed.
posit-dev.github.io/orbital/
Orbital
posit-dev.github.io
May 13, 2025 at 10:31 PM
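To make the idea concrete, here is a hand-rolled sketch of turning a fitted linear model into a SQL expression and running it inside a database. This is not Orbital's API. The helper names, the `features` table, and the coefficients are hypothetical, and SQLite is used only because it ships with Python.

```python
import sqlite3

def linear_model_to_sql(table, coefs, intercept):
    # Each coefficient becomes a "weight * column" term in plain SQL.
    terms = [f'{weight!r} * "{column}"' for column, weight in coefs.items()]
    expression = " + ".join([repr(intercept)] + terms)
    return f'SELECT {expression} AS prediction FROM "{table}"'

def predict_in_sqlite(rows, coefs, intercept):
    columns = list(coefs)
    conn = sqlite3.connect(":memory:")
    column_defs = ", ".join('"' + c + '" REAL' for c in columns)
    conn.execute("CREATE TABLE features (" + column_defs + ")")
    placeholders = ", ".join("?" for _ in columns)
    conn.executemany("INSERT INTO features VALUES (" + placeholders + ")", rows)
    # The prediction is computed by the database engine, not by Python.
    query = linear_model_to_sql("features", coefs, intercept)
    predictions = [row[0] for row in conn.execute(query)]
    conn.close()
    return predictions
```

Once the model is a SQL string, scoring new rows needs nothing but the database itself, which is the appeal of the approach for data that already lives there.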
Flexible, lightweight #opensource #framework for #orchestrating #multiple #AI #agents to handle #complex conversations. Features include 🧠 Intelligent #intent #classification, 🔤 Dual #language support, 🌊 Flexible agent responses, 📚 #Context management
github.com/awslabs/mult...
GitHub - awslabs/multi-agent-orchestrator: Flexible and powerful framework for managing multiple AI agents and handling complex conversations
github.com
May 4, 2025 at 10:33 PM
Introducing #HallOumi:
A State-of-the-Art Claim-Verification Model - a family of #opensource claim verification (hallucination detection) models, outperforming #DeepSeek R1, #OpenAI o1, #Google Gemini 1.5 Pro, #Llama 3.1 405B, and #Claude Sonnet 3.5 at only 8B parameters!
oumi.ai/blog/posts/i...
Oumi - Introducing HallOumi, A State-of-the-Art Claim-Verification Model
We're excited to introduce HallOumi-8B and HallOumi-8B-Classifier, a family of open-source claim verification (hallucination detection) models, outperforming DeepSeek R1, OpenAI o1, Google Gemini 1.5 ...
oumi.ai
April 7, 2025 at 8:03 AM
A #Visual Guide to Mixture of Experts (#MoE). Demystifying the role of MoE in Large Language Models #LLM
newsletter.maartengrootendorst.com/p/a-visual-g...
A Visual Guide to Mixture of Experts (MoE)
Demystifying the role of MoE in Large Language Models
newsletter.maartengrootendorst.com
April 7, 2025 at 6:54 AM
Do we really have to employ #complex autonomous #software #agents? #Agentless takes an agentless approach: a simple three-phase process of #localization, #repair, and #patch validation, without letting the #LLM decide future actions. #opensource
arxiv.org/abs/2407.01489
Agentless: Demystifying LLM-based Software Engineering Agents
Recent advancements in large language models (LLMs) have significantly advanced the automation of software development tasks, including code synthesis, program repair, and test generation. More recent...
arxiv.org
January 14, 2025 at 10:06 PM
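The three phases can be sketched as a fixed pipeline in which nothing decides what to do next. This is a toy under stated assumptions, not the paper's implementation: keyword overlap stands in for LLM-based localization, two hard-coded rewrites stand in for sampled patches, and a single callable stands in for the test suite.

```python
def localize(issue: str, files: dict[str, str]) -> str:
    # Phase 1: pick the file whose text mentions the most issue words.
    words = set(issue.lower().split())
    return max(files, key=lambda name: sum(w in files[name].lower() for w in words))

def repair(source: str) -> list[str]:
    # Phase 2: propose candidate patches (hard-coded stand-ins for LLM samples).
    return [source.replace("a - b", "a * b"), source.replace("a - b", "a + b")]

def validate(candidate: str, test) -> bool:
    # Phase 3: load the patched code and run the test against it.
    namespace = {}
    try:
        exec(candidate, namespace)
        return bool(test(namespace))
    except Exception:
        return False

def agentless_fix(issue, files, test):
    target = localize(issue, files)
    for candidate in repair(files[target]):
        if validate(candidate, test):
            return target, candidate
    return None  # no candidate passed validation
```

The control flow is ordinary code from start to finish, so every run follows the same three steps and a bad candidate is simply filtered out by validation.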