Lightnews — Scholar-powered news

1 3

Baran Hashemi @rythian47.bsky.social · May 27

Cool. Will definitely do 👍

Baran Hashemi @rythian47.bsky.social · May 27

Interesting. I was not aware of aware if the challenges in the video subfield. But that makes sense given the context. We will definitely explore those benchmarks in the future. Thanks for the suggestions.

Baran Hashemi @rythian47.bsky.social · May 27

Tnx. We did not test yet on any other benshmarks. You mean algorithmic or language type benchmarks?

Baran Hashemi @rythian47.bsky.social · May 27

Interesting. I was not aware of this study. However, we did not just used tropical operations, we tried to simulate a concrete tropical circuit and do the message passing in the tropical space with the Generalized Hilbert metric as the kernel.

Baran Hashemi @rythian47.bsky.social · May 26

7/ Our message ✍️
Better reasoning might come not from bigger models, but from choosing the right algebra/geometry 🌴.
@petar-v.bsky.social @jalonso.bsky.social
#TropicalGeometry #NeuralAlgorithmicReasoning #AI4Math

2 3

Baran Hashemi @rythian47.bsky.social · May 26

6/ We also show that each Tropical attention head can function as a tropical gate in a tropical circuit, simulating any max-plus circuit.

Baran Hashemi @rythian47.bsky.social · May 26

5/ We benchmarked on 11 canonical combinatorial tasks. Tropical attention beat vanilla & adaptive softmax attention on all three OOD axes, Length, value and Adversarial attack generalization:

Baran Hashemi @rythian47.bsky.social · May 26

4/ Tropical Attention runs each head natively in max-plus. Result:
Strong OOD length generalization with sharp attention maps even in several algorithmic tasks, including the notorious Quickselect algorithm (Another settlement for the challenge identified by @mgalkin.bsky.social )

Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms

Baran Hashemi @rythian47.bsky.social · May 26

3/ In the Tropical (max + ) geometry, “addition” is max, “multiplication” is +. Many algorithms already live here, carving exact polyhedral decision boundaries --> so why force them through exponential probabilities?
Let's ditch softmax, embrace the tropical semiring 🤯🍹.

Image by Cowdery and Challas, featured in June 2009 Mathematics Magazine

1 2

Baran Hashemi @rythian47.bsky.social · May 26

2/ We introduce Tropical Attention -- the first Neural Algorithmic reasoner that operates in the Tropical semiring, achieving SOTA OOD performance on executing several combinatorial algorithms
arxiv.org/abs/2505.17190

Dynamic programming (DP) algorithms for combinatorial optimization problems work with taking maximization, minimization, and classical addition in their recursion algorithms. The associated value func...

Paper Submission Entrance The workshop uses OpenReview as the review platform. For detailed submission guidelines, please see below.

Baran Hashemi @rythian47.bsky.social · May 26

🧵 Tropical Attention --> Softmax is out, Tropical max-plus is in 🦾
1/ 🔥Ever experinced softmax attention fade as sequences grow?
That blur is why many attention mechanisms stumble on algorithmic and reasoning tasks. Well, we have a Algebraic Geometric Tropical solution 🌴

1 4 10

Baran Hashemi @rythian47.bsky.social · Apr 7

I'm speaking about AI for enumerative geometry at the CMSA New Technologies in Mathematics seminar, on Wednesday.

Baran Hashemi @rythian47.bsky.social · Apr 3

If you think of DyT as an Activation function, it will be exactly a sub-family of our learnable Dynamic Range Activator (DRA) activation function, when (a,c)=0:

openreview.net/forum?id=4X9...

Baran Hashemi @rythian47.bsky.social · Mar 31

🔥Big News! The 2nd AI for Math Workshop is coming back to #ICML2025 and we’re back with the theme of exploring the frontiers of AI for mathematical reasoning, problem solving, discovery!

🫵 Calling all pioneers in AI4Math:
📜 Submit your exciting work:
sites.google.com/view/ai4math...

Call

sites.google.com

Baran Hashemi @rythian47.bsky.social · Mar 25

Beautiful indeed!

Bahram Shakerin @bahramshakerin.bsky.social · Mar 25

Why the DESI Results Should Not Be A Surprise

Robert Brandenberge

arxiv.org/abs/2503.17659

Why the DESI Results Should Not Be A Surprise

The recent DESI results provide increasing evidence that the density of dark energy is time-dependent. I will recall why, from the point of view of fundamental theory,, this result should not be surpr...

Dark Energy Survey: implications for cosmological expansion models from the final DES Baryon Acoustic Oscillation and Supernova data

Reposted by Baran Hashemi

Sean Carroll @seanmcarroll.bsky.social · Mar 19

The DESI survey @desisurvey.bsky.social suggests the universe is *not* maximally boring! Statistical significance is not quite there yet, but a new result is a bit stronger than their previous indication that dark energy might be varying with time. (cont.)

arxiv.org/abs/2503.06712

The Dark Energy Survey (DES) recently released the final results of its two principal probes of the expansion history: Type Ia Supernovae (SNe) and Baryonic Acoustic Oscillations (BAO). In this paper,...

Can Transformers Do Enumerative Geometry?

10 15 100

Baran Hashemi @rythian47.bsky.social · Mar 13

For the ICLR Camera-ready version:

openreview.net/forum?id=4X9...

1 2

Baran Hashemi @rythian47.bsky.social · Feb 8

Tnx. The probing methods were both linear and non-linear over the conjectural form of the large-genus asymptotic form of the intersections. If the model actually learned the underlying math, it must have internalized the parameters of the asymptotic formula. We found that this was the case.

Baran Hashemi @rythian47.bsky.social · Feb 8

🚀 Curious how Transformers understand Enumerative Geometry or model recursive functions with factorial blow-up?
I'll be presenting our results, openreview.net/forum?id=4X9..., at the Math4AI/AI4Math Workshop @mpiMathSci! 🔥
📅 Registration is open until Feb 28
🔗 www.mis.mpg.de/events/serie...
#AI4Math

We introduce a Transformer-based approach to computational enumerative geometry, specifically targeting the computation of $\psi$-class intersection numbers on the moduli space of curves....

openreview.net

Can Transformers Do Enumerative Geometry?

Baran Hashemi @rythian47.bsky.social · Jan 23

I am extremely happy to announce that our paper
Can Transformers Do Enumerative Geometry? (arxiv.org/abs/2408.14915) has been accepted to the
@iclr-conf.bsky.social!!
Congrats to my collaborators Alessandro Giacchetto at ETH Züruch and Roderic G. Corominas at Harvard.
#ICLR2025 #AI4Math #ORIGINS

How can Transformers model and learn enumerative geometry? What is a robust procedure for using Transformers in abductive knowledge discovery within a mathematician-machine collaboration? In this work...