Riccardo Grazzi
riccardograzzi.bsky.social
Riccardo Grazzi
@riccardograzzi.bsky.social
Post Doc at IIT in Genoa, working on Optimization for Machine Learning and on the expressivity of LLMs
I also think that the analysis might be further enhanced thanks to the very nice results in the RWKV-7 paper!
March 28, 2025 at 2:41 PM
In our DeltaProduct work we also add a bit of theory to DeltaNet, showing that it can solve Dihedral groups, which are the groups of symmetries of regular polygons, with only two layers. This includes S3 (symmetries of the equilateral triangle).
March 28, 2025 at 2:41 PM