💡 With smarter training, we maintain SSMs’ efficiencies while dramatically enhancing their capabilities.
💡 With smarter training, we maintain SSMs’ efficiencies while dramatically enhancing their capabilities.
• Multi-Phone Number Retrieval: Birdie SSMs achieve 100% accuracy on single lookups; outperform standard SSMs even more as tasks become more complex.
• SQuAD V2: We match a Transformer's performance curve across sequence lengths, while standard SSMs fall behind.
• Multi-Phone Number Retrieval: Birdie SSMs achieve 100% accuracy on single lookups; outperform standard SSMs even more as tasks become more complex.
• SQuAD V2: We match a Transformer's performance curve across sequence lengths, while standard SSMs fall behind.
Our EMNLP 2024 paper boosts SSMs like Mamba and Hawk on long-range, context-heavy tasks, closing the gap with Transformers.
Proud to work with @jimmysmith1919.bsky.social, @antonisa.bsky.social, & Amarda Shehu.
📄 Paper: arxiv.org/abs/2411.01030
💻 Code: github.com/samblouir/bi...
Our EMNLP 2024 paper boosts SSMs like Mamba and Hawk on long-range, context-heavy tasks, closing the gap with Transformers.
Proud to work with @jimmysmith1919.bsky.social, @antonisa.bsky.social, & Amarda Shehu.
📄 Paper: arxiv.org/abs/2411.01030
💻 Code: github.com/samblouir/bi...
• Multi-Phone Number Retrieval: Birdie SSMs achieve 100% accuracy on single lookups; outperform standard SSMs even more as tasks become more complex.
• SQuAD V2: We match a Transformer's performance curve across sequence lengths, while standard SSMs fall behind.
• Multi-Phone Number Retrieval: Birdie SSMs achieve 100% accuracy on single lookups; outperform standard SSMs even more as tasks become more complex.
• SQuAD V2: We match a Transformer's performance curve across sequence lengths, while standard SSMs fall behind.
• Multi-Phone Number Retrieval: Birdie SSMs achieve 100% accuracy on single lookups; outperform standard SSMs even more as tasks become more complex.
• SQuAD V2: We match a Transformer's performance curve across sequence lengths, while standard SSMs fall behind.
• Multi-Phone Number Retrieval: Birdie SSMs achieve 100% accuracy on single lookups; outperform standard SSMs even more as tasks become more complex.
• SQuAD V2: We match a Transformer's performance curve across sequence lengths, while standard SSMs fall behind.