🌍 Mathematical Institute, Oxford
📈 Researching Neural Differential Equations & Rough Path Theory
📧 Email: [email protected]
🌐 GitHub: Benjamin-Walker
Was too proud of this one so had to post it somewhere!
#NeurIPS2024 #MachineLearning #DeepLearning #StateSpaceModels
🧵6/6
In contrast, using a dense state-transition matrix (IDS4/Linear CDE) or a non-linear state-transition (RNN) allows for state-tracking with only 1 layer.
🧵5/6
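A minimal sketch of the single-layer claim for the dense case, under stated assumptions: each input token is a permutation, and the input-dependent state-transition matrix A(x_t) is simply that permutation's 0/1 matrix, which is dense in the sense of being non-diagonal. The helper names (`perm_matrix`, `track_with_linear_recurrence`, `compose_directly`) are hypothetical and are not taken from the paper's code; this only illustrates the idea, not the benchmark itself.

```python
from itertools import permutations
import random


def perm_matrix(p):
    """Return the 0/1 matrix M with M[p[i]][i] = 1, so that M e_i = e_{p[i]}."""
    n = len(p)
    return [[1 if p[col] == row else 0 for col in range(n)] for row in range(n)]


def matvec(m, v):
    """Plain matrix-vector product on nested lists."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def track_with_linear_recurrence(seq, n):
    """One linear layer: h_t = A(x_t) h_{t-1}, run from every basis vector."""
    # Start from the identity, stored as a list of columns e_0, ..., e_{n-1}.
    cols = [[1 if r == c else 0 for r in range(n)] for c in range(n)]
    for p in seq:
        a = perm_matrix(p)  # assumed input-dependent, non-diagonal transition
        cols = [matvec(a, col) for col in cols]
    # Read out where each starting position ended up.
    return tuple(col.index(1) for col in cols)


def compose_directly(seq, n):
    """Reference answer: apply the permutations one after another."""
    state = list(range(n))
    for p in seq:
        state = [p[s] for s in state]
    return tuple(state)


if __name__ == "__main__":
    random.seed(0)
    all_perms = list(permutations(range(5)))
    seq = [random.choice(all_perms) for _ in range(50)]
    # The single recurrence reproduces the composed permutation exactly.
    print(track_with_linear_recurrence(seq, 5) == compose_directly(seq, 5))
```

Because permutation matrices generally do not commute, the final hidden state depends on the order of the inputs, which is exactly what state-tracking requires.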
The benchmark tests state-tracking, a crucial ability for tasks that involve composing permutations, such as chess.
The results? 👇
🧵4/6
However, we also show that using a diagonal state-transition matrix, while drastically reducing computational costs, significantly limits the model's capacity.
🧵3/6
🧵2/6