Gasser Elbanna
banner
gelbanna.bsky.social
Gasser Elbanna
@gelbanna.bsky.social
PhD student in Speech and Hearing at Harvard/MIT. Building ANNs to study how humans perceive/produce speech and voice.

Working with @joshhmcdermott.bsky.social

https://gasserelbanna.github.io/

MSc. at EPFL
BSc. at Cairo University
ex Logitech and IDIAP
We evaluated three classes of speech models against a set of neural signatures. If you’re curious how they performed, stop by our poster to learn more!
December 6, 2025 at 7:42 PM
Prior work has shown that speech models can predict brain responses to natural speech. However, it remains unclear whether these models also reproduce well-documented signatures of the auditory cortex.
December 6, 2025 at 7:42 PM
3. We manipulated the model’s access to past and future speech cues, revealing the importance of the acoustic context and its directionality in human speech recognition.

Come to our poster to learn more!

🧵4/4
June 5, 2025 at 12:02 AM
2. These tasks allowed us to compute the first full phoneme confusion matrix in humans at scale. This enabled the first systematic comparison of human–model phoneme confusions, revealing that humans and models share not only similar response patterns but also similar patterns of confusions.

🧵3/4
June 5, 2025 at 12:02 AM
This work has 3 main contributions:

1. We developed new models of continuous speech recognition alongside novel behavioral tasks to compare both models and humans on speech perception without conflating speech and language.

🧵2/4
June 5, 2025 at 12:02 AM