Nate Sesti
@natesesti.bsky.social
14 followers 1 following 10 posts
Coding @continue.dev, Publicly Thinking @ http://natesesti.com, (no longer) Studying Physics @ MIT ('23) 📌 Continue is hiring! https://continue.dev/about-us
Posts Media Videos Starter Packs
Reposted by Nate Sesti
alanmnichol.bsky.social
So 🔥
I love when startups do research and just ship in product. Skip the benchmarking theatre.
natesesti.bsky.social
Introducing Instinct 🐺, a new state-of-the-art open Next Edit model trained by @continue.dev

Runnable with @ollamabot.bsky.social and available today in VS Code
natesesti.bsky.social
This work was possible thanks to a generous compute grant from the Nvidia DGX Cloud Innovation Lab, allowing us access to a dedicated 8xH100 GPU instance for distributed training and inference
natesesti.bsky.social
Instinct was the product of an incredible summer internship project by Adarsh, with guidance from the rest of the Continue team

We're just getting started with this first release and are particularly excited to push forward Next Edit by:
- post-training with KTO
- scaling the dataset by 10x
natesesti.bsky.social
Instinct's weights, dataset, and training code are all available for use under the Apache-2.0 license

If you are interested in furthering the state of the art, either as part of the community or the Continue team, please reach out!

huggingface.co/continuedev/...
continuedev/instinct · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
natesesti.bsky.social
To evaluate Instinct, we adopted an LLM-as-judge strategy, similar to Zeta from Zed, asking Claude to rank edits on a scale from 1-5 based on functional equivalence to the ground truth edit

Instinct’s average score of 3.877 outperforms Zeta’s score of 3.735
natesesti.bsky.social
Our dataset of 4,000+ high-quality real-world edits was built by collecting keystroke-level data from the Continue team and chunking it into coherent edits

For multilingual support we used Qwen3-Coder-30B to "translate" these into synthetic data for Java, C++, Python, and Rust
natesesti.bsky.social
While traditional autocomplete inserts text by using a fill-in-the-middle prompt format (FIM), Next Edit accepts a prefix, suffix, and "range to replace", allowing it to make complex changes all at once by rewriting the range

To users, this appears as an inline diff
natesesti.bsky.social
Introducing Instinct 🐺, a new state-of-the-art open Next Edit model trained by @continue.dev

Runnable with @ollamabot.bsky.social and available today in VS Code
Reposted by Nate Sesti
continue.dev
Continue 1.0 is here! Combining our open-source IDE extensions with hub.continue.dev makes it frictionless to use custom AI code assistants. Discover the models, rules, prompts, docs, and other building blocks you need to become an amplified developer ✨
Continue Hub + VS Code