^^ includes @skgabrie.bsky.social who is just starting up her lab at UCLA!
^^ includes @skgabrie.bsky.social who is just starting up her lab at UCLA!
1. Tools Fail: Detecting Silent Errors in Faulty Tools
Are you using tools with your LLMs? Are you assuming your tools are perfect? Assuming the LLM can just handle any errors for you? 😬
Danger… 🚨 Models trust tools over their own “knowledge” even for simple and well trained cases.
1. Tools Fail: Detecting Silent Errors in Faulty Tools
Are you using tools with your LLMs? Are you assuming your tools are perfect? Assuming the LLM can just handle any errors for you? 😬
Danger… 🚨 Models trust tools over their own “knowledge” even for simple and well trained cases.