#MustardSauce
construct and formalize mathematical proofs. Our results demonstrate significant performance improvements across multiple datasets, with using knowledge graphs, achieving up to a 34% success rate on the MUSTARDSAUCE dataset on o1-mini and consistently [3/4 of https://arxiv.org/abs/2503.11657v1]
March 18, 2025 at 5:54 AM Everybody can reply
September 18, 2025 at 2:54 PM Everybody can reply
2 likes