Posts
Media
Videos
Starter Packs
Pinned
Baran Hashemi
@rythian47.bsky.social
· Sep 8
Baran Hashemi
@rythian47.bsky.social
· Aug 1
Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms
Dynamic programming (DP) algorithms for combinatorial optimization problems work with taking maximization, minimization, and classical addition in their recursion algorithms. The associated value functions correspond to convex polyhedra in the max plus semiring. Existing Neural Algorithmic Reasoning models, however, rely on softmax-normalized dot-product attention where the smooth exponential weighting blurs these sharp polyhedral structures and collapses when evaluated on out-of-distribution (OOD) settings. We introduce Tropical attention, a novel attention function that operates natively in the max-plus semiring of tropical geometry. We prove that Tropical attention can approximate tropical circuits of DP-type combinatorial algorithms. We then propose that using Tropical transformers enhances empirical OOD performance in both length generalization and value generalization, on algorithmic reasoning tasks, surpassing softmax baselines while remaining stable under adversarial attacks. We also present adversarial-attack generalization as a third axis for Neural Algorithmic Reasoning benchmarking. Our results demonstrate that Tropical attention restores the sharp, scale-invariant reasoning absent from softmax.
arxiv.org
Baran Hashemi
@rythian47.bsky.social
· May 27
Baran Hashemi
@rythian47.bsky.social
· May 27
Baran Hashemi
@rythian47.bsky.social
· May 27
Baran Hashemi
@rythian47.bsky.social
· May 27
Baran Hashemi
@rythian47.bsky.social
· May 26
Baran Hashemi
@rythian47.bsky.social
· May 26
Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms
Dynamic programming (DP) algorithms for combinatorial optimization problems work with taking maximization, minimization, and classical addition in their recursion algorithms. The associated value func...
arxiv.org
Baran Hashemi
@rythian47.bsky.social
· Apr 3
Baran Hashemi
@rythian47.bsky.social
· Mar 25
Reposted by Baran Hashemi
Sean Carroll
@seanmcarroll.bsky.social
· Mar 19
Dark Energy Survey: implications for cosmological expansion models from the final DES Baryon Acoustic Oscillation and Supernova data
The Dark Energy Survey (DES) recently released the final results of its two principal probes of the expansion history: Type Ia Supernovae (SNe) and Baryonic Acoustic Oscillations (BAO). In this paper,...
arxiv.org
Baran Hashemi
@rythian47.bsky.social
· Mar 13
Baran Hashemi
@rythian47.bsky.social
· Feb 8
Baran Hashemi
@rythian47.bsky.social
· Jan 23
Can Transformers Do Enumerative Geometry?
How can Transformers model and learn enumerative geometry? What is a robust procedure for using Transformers in abductive knowledge discovery within a mathematician-machine collaboration? In this work...
arxiv.org