Lightnews — Scholar-powered news

Dreadnode

@dreadnode.bsky.social

MLOps 🤝 AIRT

Building on MLOps principles is the way forward for AI red teaming. To showcase the impact of this process, we deployed automated adversarial attacks (TAP, GOAT, Crescendo) against Llama Maverick-17B-128E-Instruct.

Dig into the case study results here: dreadnode.io/blog/186-jai...

December 11, 2025 at 6:37 PM

Dreadnode

@dreadnode.bsky.social

"Offense and defense aren't peers. Defense is offense's child." - John Lambert

We built an LLM-powered AMSI provider and paired it against a red team agent. Then, we wrote a blog about it: dreadnode.io/blog/llm-pow...

LLM-Powered AMSI Provider vs. Red Team Agent

We built an LLM-powered AMSI provider and paired it against a red team agent, generating a unique dataset and a blueprint for detecting malicious code at execution time.

dreadnode.io

December 3, 2025 at 4:51 PM

Reposted by Dreadnode

velvethamm3r.bsky.social

@velvethamm3r.bsky.social

✍ The White House just launched the Genesis Mission, a bold bet on AI-enabled science. But there's a layer we can't afford to treat as an afterthought: cybersecurity. (1/4)

dreadnode.io/blog/from-co...

From Compute to Congress: The Cyber Layer Beneath the Genesis Mission

As the Genesis Mission accelerates AI development across critical scientific domains, robust cybersecurity and adversarial testing must be foundational, not bolted on later.

dreadnode.io

December 2, 2025 at 3:25 PM

Reposted by Dreadnode

SentinelOne

@sentinelone.com

AI as an Amplifier for Human Tradecraft: how scale can meet sharper intelligence.

What’s New: In their #LABScon 2025 talk, @dreadnode.bsky.social's Brad Palm and @machinavelli.com show how agentic AI can explore every analytical pathway — at speed and scale.

October 9, 2025 at 9:35 PM

Reposted by Dreadnode

velvethamm3r.bsky.social

@velvethamm3r.bsky.social

🧵 Tonight at midnight, CISA 2015 and SLCGP expire as Congress debates another shutdown.

We're witnessing a cyber identity crisis: threats don't discriminate between civilian and military sectors, but our defenses remain fragmented. What needs to happen immediately: 🧵(1/4)

September 30, 2025 at 5:04 PM

Dreadnode

@dreadnode.bsky.social

Tonight at midnight, two critical pieces of cybersecurity legislation are due to expire: CISA 2015 and SLCGP.

Read @velvethamm3r.bsky.social's take on why reauthorizing these programs will help CISA transform into an integrated defensive command: dreadnode.io/blog/from-co...

From Compute to Congress: To Address CISA's Authority Gap, Reauthorize CISA 2015 and SLCGP

Two critical cybersecurity programs—CISA 2015 and SLCGP—expire September 30, 2025. Learn why Congress must act now to preserve voluntary information sharing, fund state/local security, and operational...

dreadnode.io

September 30, 2025 at 8:44 PM

Dreadnode

@dreadnode.bsky.social

Dreadnode is a proud sponsor of @sentinelone.com's #labscon25!

Heading to Scottsdale this week? Catch @machinavelli.com and Brad Palm's talk, Auto-Poking the Bear—Analytical Tradecraft in the AI Age, on Thursday at 2pm MT.

Or, shoot us a DM to find time to meet up onsite!

September 16, 2025 at 5:57 PM

Dreadnode

@dreadnode.bsky.social

!!!

SentinelOne @sentinelone.com · Sep 16

🧠 @machinavelli.com and Brad Palm (@dreadnode.bsky.social) ask: can we trust AI-assisted CTI?

🔗 labscon.io/speakers/martin-wendiggensen
🔗 www.labscon.io/speakers/bra...

September 16, 2025 at 5:56 PM

Dreadnode

@dreadnode.bsky.social

Incoming: Dreadnode paper drop from Shane Caldwell and the crew.

PentestJudge—Judging Agent Behavior Against Operational Requirements: arxiv.org/abs/2508.02921

Explore how we built an LLM-as-judge system for evaluating the operations of pentesting agents (inspired by PaperBench).

August 6, 2025 at 6:31 PM

Reposted by Dreadnode

velvethamm3r.bsky.social

@velvethamm3r.bsky.social

✍ After talking AI Action Plan on @cyberscoop.bsky.social, wrote up @dreadnode.bsky.social thoughts on implementation ➡️ dreadnode.io/blog/five-ta...

‼️ While we debate frameworks, adversaries build AI attack capabilities. We need: evaluation ecosystems, red teaming, and procurement standards.

August 1, 2025 at 11:48 PM

Dreadnode

@dreadnode.bsky.social

In our latest blog, Shane Caldwell breaks down the process of creating a fully integrated, self-verifying agentic system that can do modern Windows Active Directory red team operations, without human interaction.

Read it here: dreadnode.io/blog/evals-t...

Evals: The Foundation for Autonomous Offensive Security

Learn how to build robust evaluations for autonomous red team agents that can perform Windows Active Directory operations. This blog covers action space design, programmatic verification, and measurin...

dreadnode.io

August 1, 2025 at 6:14 PM

Dreadnode

@dreadnode.bsky.social

Rise and shine! We're going live on Off By One with Stephen Sims this afternoon—meet us here at 11 AM PT: www.youtube.com/live/BzOmGw-...

Building and Deploying Offensive Security Agents with Dreadnode

YouTube video by Off By One Security

www.youtube.com

July 25, 2025 at 3:06 PM

Dreadnode

@dreadnode.bsky.social

In this edition of our From Compute to Congress policy blog series, Dreadnode Head of Policy Daria Bahrami explores how the TEST AI Act and red teaming standards can establish U.S. leadership in AI security: dreadnode.io/blog/from-co...

From Compute to Congress: Setting the Global Standard for AI Security

Daria explores how the TEST AI Act and red teaming standards can establish American leadership in AI security—a winning policy roadmap from Critical Effect DC 2025.

dreadnode.io

June 26, 2025 at 5:13 PM

Dreadnode

@dreadnode.bsky.social

Read @rad-ads.bsky.social's breakdown of Claude's attack sequence against the notoriously hard-to-solve "turtle" challenge: dreadnode.io/blog/ai-red-...

AI Red Teaming Case Study: Claude 3.7 Sonnet Solves the Turtle Challenge

See how Claude solved a notoriously difficult AI/ML CTF challenge, going beyond pattern matching to genuine problem-solving under adversarial conditions.

dreadnode.io

June 25, 2025 at 3:47 PM

Dreadnode

@dreadnode.bsky.social

Introducing AIRTBench, an AI red teaming benchmark for evaluating language models’ ability to autonomously discover and exploit AI/ML security vulnerabilities.

Read the paper on arXiv: arxiv.org/abs/2506.14682

Open-source dataset and benchmark eval code repo: github.com/dreadnode/AI...

June 18, 2025 at 1:24 PM

Dreadnode

@dreadnode.bsky.social

Check out @machinavelli.com's "Build with AI" Rigging workshop from @pivotcon.bsky.social: github.com/vmsv/pivot20...

GitHub - vmsv/pivot2025-llmworkshop

Contribute to vmsv/pivot2025-llmworkshop development by creating an account on GitHub.

github.com

May 20, 2025 at 3:16 PM

Dreadnode

@dreadnode.bsky.social

v3 of Rigging is out now. If you’re working with LLMs to build agents or run evaluations, check it out. We just added:

- Prompt caching for supported providers
- A unified tool system for function calling and fallbacks to xml/json parsing
- Native MCP integration

docs.dreadnode.io/open-source/...

docs.dreadnode.io

May 19, 2025 at 3:10 PM

Dreadnode

@dreadnode.bsky.social

Introducing our new blog series: "From Compute to Congress: Decoding AI Policy" by Dreadnode Head of Policy Daria Bahrami | Read the first post here: dreadnode.io/blog/from-co...

May 15, 2025 at 4:50 PM

Dreadnode

@dreadnode.bsky.social

Are manual or automated attacks more effective when attacking LLMs?

We found that automated approaches achieve significantly higher success rates (69.5%) compared to manual techniques (47.6%).

More insights on LLM attack execution methods here 👉 dreadnode.io/blog/the-aut...

May 8, 2025 at 3:30 PM

Dreadnode

@dreadnode.bsky.social

Strikes waitlist. Now open.

platform.dreadnode.io/waitlist/str...

[must have a Dreadnode account]

May 1, 2025 at 7:50 PM

Dreadnode

@dreadnode.bsky.social

What's your take on the growing dominance of automated attacks and the implications for AI red teams? Here's ours, based on our analysis of 30 LLM challenges, attempted by 1,674 unique Crucible users, across 214,271 attack attempts: arxiv.org/abs/2504.19855

April 29, 2025 at 4:15 PM

Dreadnode

@dreadnode.bsky.social

@moohax.bsky.social joins @gregotto.bsky.social on CyberScoop's Safe Mode podcast! Tune in at the 10-minute mark for a discussion on how AI fits into the offensive security narrative and what it means for tooling and defenses: www.youtube.com/watch?v=ZReR...

Dreadnode CEO Will Pearce on the ever-changing field of offensive AI security

YouTube video by CyberScoop

www.youtube.com

April 21, 2025 at 9:35 PM

Dreadnode

@dreadnode.bsky.social

Headed to RSA? Come meet the Dreadnode crew!

Whether you're looking for a private deep dive into our tech or want to hang out and talk offensive AI research, we'd love to connect.

Limited availability; come and get it: calendly.com/tori-dreadno...

#BayArea #SanFrancisco #RSAC2025 #OffensiveAI

April 16, 2025 at 4:12 PM

Dreadnode

@dreadnode.bsky.social

Hey, we know that guy! Catch Dreadnode's @radads.bsky.social on NASDAQ #TradeTalks alongside @bugcrowd.com CEO @davegerryjr.bsky.social and NFL CISO @tomasmald.bsky.social.

Tune in for a candid conversation on the intersection of AI and cybersecurity: www.nasdaq.com/videos/ever-...

www.nasdaq.com

April 9, 2025 at 8:07 PM

Reposted by Dreadnode

Martin Wendiggensen

@machinavelli.com

Will be talking about @dreadnode.bsky.social’s great open-source rigging repo and how to build your own LLM workflows! Super excited!

PIVOTcon @pivotcon.bsky.social · Apr 3

Workshop 2: Building with AI - with Martin Wendiggensen @machinavelli.com and Vitor Ventura @vventura.bsky.social, one of the best-assorted CTI-AI builder duos on the market

✅ Retrieve local text data
✅ LLM reasoning system with the tools for searches and analyses
✅ AI agents

3/5

April 3, 2025 at 2:46 PM
