CPO @ aqua-cloud.io
Opinions are my own.
You can find it here: github.com/openai/codex
It's open source and has a YOLO mode with a small safety net build in.🪂
You can find it here: github.com/openai/codex
It's open source and has a YOLO mode with a small safety net build in.🪂
ai.meta.com/blog/llama-4...
Get it here: www.llama.com/llama-downlo...
Still coming: Lama 4 2T Parameter "Behemoth"
ai.meta.com/blog/llama-4...
Get it here: www.llama.com/llama-downlo...
Still coming: Lama 4 2T Parameter "Behemoth"
github.com/microsoft/pr...
It's a tool for automatic test generation for LLM prompts. It sIimplifies QA by automatically generating clear input/output spec & targeted test cases.
🧵1/n
github.com/microsoft/pr...
It's a tool for automatic test generation for LLM prompts. It sIimplifies QA by automatically generating clear input/output spec & targeted test cases.
🧵1/n
- Web Search
- File Search (for local filesystem)
- Computer Use (incl. Browser Use)
A new Agents SDK
openai.github.io/agents-sdk-p...
and new Tracing Plattform
More info:
platform.openai.com/docs/guides/...
- Web Search
- File Search (for local filesystem)
- Computer Use (incl. Browser Use)
A new Agents SDK
openai.github.io/agents-sdk-p...
and new Tracing Plattform
More info:
platform.openai.com/docs/guides/...
Looks pretty useful after playing around with it on my existing project.
👉It's in limited research preview, first come first served! 👈
"npm install -g @anthropic-ai/claude-code"
"Claude" and log in
docs.anthropic.com/en/docs/agen...
Looks pretty useful after playing around with it on my existing project.
👉It's in limited research preview, first come first served! 👈
"npm install -g @anthropic-ai/claude-code"
"Claude" and log in
docs.anthropic.com/en/docs/agen...
www.anthropic.com/news/visible...
www.anthropic.com/news/visible...
Claude 3.7 released today and reclaims the crown for AI dev tasks!
It’s surprising how long it took the competition to reach Claude 3.5-level coding skills. For a long long time Claude was the favorite among AI dev communities using tools like Cursor, Windsurf, Aider, etc.
Claude 3.7 released today and reclaims the crown for AI dev tasks!
It’s surprising how long it took the competition to reach Claude 3.5-level coding skills. For a long long time Claude was the favorite among AI dev communities using tools like Cursor, Windsurf, Aider, etc.
“R1 1776 is a DeepSeek-R1 reasoning model that has been post-trained by Perplexity AI to remove CCP censorship. The model provides unbiased, accurate, and factual information while maintaining high reasoning capabilities.”
huggingface.co/perplexity-a...
“R1 1776 is a DeepSeek-R1 reasoning model that has been post-trained by Perplexity AI to remove CCP censorship. The model provides unbiased, accurate, and factual information while maintaining high reasoning capabilities.”
huggingface.co/perplexity-a...
- Fires content moderation team, replaced by X-style community notes.
- Less censorship at cost of safety
🧵1/6
- Fires content moderation team, replaced by X-style community notes.
- Less censorship at cost of safety
🧵1/6
Alignment faking in large language models.
Claude often pretends to have different views during training while actually maintaining its original preferences 💀
www.anthropic.com/research/ali...
Alignment faking in large language models.
Claude often pretends to have different views during training while actually maintaining its original preferences 💀
www.anthropic.com/research/ali...
Best one so far in the OSS space.
Podcastfy.ai
Best one so far in the OSS space.
Podcastfy.ai
huggingface.co/chat/models/...
huggingface.co/chat/models/...