Multimodality and multilinguality
prev. predoc Google Deepmind
📂 Codebase (part of ESPnet): github.com/espnet/espnet
📖 README & User Guide: github.com/espnet/espne...
🎥 Demo Video: www.youtube.com/watch?v=kI_D...
📂 Codebase (part of ESPnet): github.com/espnet/espnet
📖 README & User Guide: github.com/espnet/espne...
🎥 Demo Video: www.youtube.com/watch?v=kI_D...
Language as the pivoting modality instead of images. Different training dataset.
Language as the pivoting modality instead of images. Different training dataset.