Martin Koch
banner
martinkoch.bsky.social
Martin Koch
@martinkoch.bsky.social
LLMs will eat the world.
CPO @ aqua-cloud.io
Opinions are my own.
🆕 OpenAI just released their AI coding competitor to claude code: Codex CLI

You can find it here: github.com/openai/codex
It's open source and has a YOLO mode with a small safety net build in.🪂
April 16, 2025 at 6:50 PM
Reposted by Martin Koch
Microsoft has created an AI-generated version of Quake
Microsoft has created an AI-generated version of Quake
You can now try out Microsoft’s new Muse AI model
buff.ly
April 5, 2025 at 6:30 PM
Reposted by Martin Koch
Apple is reportedly bringing live translation to AirPods
Apple is reportedly bringing live translation to AirPods
It could arrive with iOS 19.
buff.ly
March 13, 2025 at 9:10 PM
Following the "Prompts are Programs" paradigm, Microsoft Research explores unit testing prompts with PromptPex:

github.com/microsoft/pr...

It's a tool for automatic test generation for LLM prompts. It sIimplifies QA by automatically generating clear input/output spec & targeted test cases.

🧵1/n
GitHub - microsoft/promptpex: Prompt Exploration
Prompt Exploration. Contribute to microsoft/promptpex development by creating an account on GitHub.
github.com
March 13, 2025 at 4:57 PM
🆕 OpenAI just released new tools for building agentic systems in their new Responses API:
- Web Search
- File Search (for local filesystem)
- Computer Use (incl. Browser Use)

A new Agents SDK
openai.github.io/agents-sdk-p...

and new Tracing Plattform
More info:
platform.openai.com/docs/guides/...
OpenAI Platform
Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.
platform.openai.com
March 11, 2025 at 5:41 PM
Also new: A terminal-based coding agent, similar to aider.
Looks pretty useful after playing around with it on my existing project.

👉It's in limited research preview, first come first served! 👈

"npm install -g @anthropic-ai/claude-code"
"Claude" and log in

docs.anthropic.com/en/docs/agen...
Claude Code overview - Anthropic
Learn about Claude Code, an agentic coding tool made by Anthropic. Currently in beta as a research preview.
docs.anthropic.com
February 24, 2025 at 8:51 PM
Anthropic launches an “extended thinking mode” for the new Claude 3.7 Sonnet. It lets users and devs - through a “thinking budget” - ask the model to spend more time reasoning.

www.anthropic.com/news/visible...
Claude's extended thinking
Discussing Claude's new thought process
www.anthropic.com
February 24, 2025 at 8:28 PM
👑 The King is Back!

Claude 3.7 released today and reclaims the crown for AI dev tasks!

It’s surprising how long it took the competition to reach Claude 3.5-level coding skills. For a long long time Claude was the favorite among AI dev communities using tools like Cursor, Windsurf, Aider, etc.
February 24, 2025 at 8:27 PM
Reposted by Martin Koch
An uncensored version of R1 is released 🔥

“R1 1776 is a DeepSeek-R1 reasoning model that has been post-trained by Perplexity AI to remove CCP censorship. The model provides unbiased, accurate, and factual information while maintaining high reasoning capabilities.”

huggingface.co/perplexity-a...
perplexity-ai/r1-1776 · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
February 19, 2025 at 3:22 AM
Zuckerberg folds.
- Fires content moderation team, replaced by X-style community notes.
- Less censorship at cost of safety

🧵1/6
January 7, 2025 at 2:42 PM
New research from Anthropic:
Alignment faking in large language models.

Claude often pretends to have different views during training while actually maintaining its original preferences 💀

www.anthropic.com/research/ali...
Alignment faking in large language models
A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models
www.anthropic.com
December 18, 2024 at 7:34 PM
Reposted by Martin Koch
NEW: OpenAI just dropped new Elon Musk receipts: ‘You can’t sue your way to AGI’ www.theverge.com/2024/12/13/2...
December 13, 2024 at 7:34 PM
Reposted by Martin Koch
OpenAI’s Sora video generator includes a powerful feature that allows users to seamlessly add, remove, or edit objects within the video, offering new possibilities for customization and creativity.
December 9, 2024 at 8:54 PM
QwQ-32B-Preview, the new reasoning model from Alibabas Qwen team is now available unquantized on HuggingChat - for free!

huggingface.co/chat/models/...
https://huggingface.co/chat/models/Qwen/QwQ-32B-Preview
t.co
November 28, 2024 at 9:00 PM