Lightnews — Scholar-powered news

Kaggle

@kaggle.com

📌 Mark Your Calendar: Live Game Arena Event This Monday!

We are releasing two new games, Poker and Werewolf, along with an updated Chess leaderboard next Monday, February 2. The live event runs daily from 9:30 AM to 11:30 AM PT through February 4.

January 29, 2026 at 5:12 PM

Kaggle

@kaggle.com

🚀 Introducing Community Benchmarks on Kaggle!

As AI evolves at an unprecedented pace, measuring intelligence requires more than a few AI research labs alone – it requires the imagination and collective expertise of the global community. That’s why we’re launching Community Benchmarks.

January 14, 2026 at 2:17 PM

Kaggle

@kaggle.com

🏆 Announcing the winners of the Agents Intensive Capstone Project! 🎉

We're excited to announce the top 12 teams who showcased exceptional creativity & technical skill using AI agents! Check out their innovative projects & learn more about their submissions here:

www.kaggle.com/competitions...

December 18, 2025 at 3:16 PM

Kaggle

@kaggle.com

🚀 New on Kaggle Benchmarks: DeepSearchQA developed by Google DeepMind!

This benchmark focuses on complex web research tasks and tests agent comprehensiveness.

Check the leaderboard: www.kaggle.com/benchmarks/g...

A screenshot of the Kaggle DeepSearchQA leaderboard, showing the top five ranked models.

December 11, 2025 at 6:30 PM

Kaggle

@kaggle.com

📢 The FACTS Benchmark Suite is now live on Kaggle!

Developed by Google DeepMind and Google Research, this suite measures LLM factuality across four dimensions: Parametric knowledge, Search, Multimodal understanding & Grounding.

Explore the leaderboard: www.kaggle.com/benchmarks/g...

A screenshot of the Kaggle FACTS Benchmark Suite leaderboard. The table displays several large language models like GPT-4, Gemini, and others, ranked by their overall FACTS Score and performance breakdown in the four categories: Parametric Knowledge, Search, Multimodal Understanding, and Grounding. The overall score and dimension scores are visible.

December 11, 2025 at 11:53 AM

Kaggle

@kaggle.com

🚀 Benchmark your AI across India’s languages with IndicGenBench!

Developed by Google DeepMind, this benchmark spans 29 Indic languages, including first-ever evaluation data for 18 Indic languages. It supports language tasks like summarization, translation and question answering.

A screenshot of the IndicGenBench leaderboard on Kaggle Benchmarks. The leaderboard ranks various AI models based on their performance across 29 Indic languages on generative tasks. The top models and their scores are visible, showing a comparison of AI performance on tasks like cross-lingual summarization, machine translation and question answering for Indian languages.

December 9, 2025 at 12:57 PM

Kaggle

@kaggle.com

🚀 Feature Update on Kaggle Benchmarks

You can now download Kaggle Benchmark leaderboard results!

Compare your favorite models with a simple curl command or download the full CSV directly for deeper analysis.

Get started: www.kaggle.com/benchmarks

December 3, 2025 at 2:00 PM
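
For readers who download the leaderboard CSV mentioned above, a minimal Python sketch of ranking models by score might look like the following. The column names (`model`, `score`) and the sample rows are hypothetical placeholders, not the actual export schema; adjust them to match the header of the file you download.

```python
import csv
import io

# Hypothetical stand-in for a downloaded leaderboard CSV.
# The real export may use different column names and more columns.
SAMPLE_CSV = """model,score
model-a,0.91
model-b,0.87
model-c,0.93
"""


def rank_models(csv_text, name_col="model", score_col="score"):
    """Return (model, score) pairs sorted from highest to lowest score."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [(row[name_col], float(row[score_col])) for row in reader]
    return sorted(rows, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for name, score in rank_models(SAMPLE_CSV):
        print(f"{name}\t{score:.2f}")
```

To use it on a real file, read the file's text and pass it in place of `SAMPLE_CSV`, along with the correct column names.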

Kaggle

@kaggle.com

♟️ Expanding Game Arena: Introducing Chess Openings

A new benchmark that tests reasoning beyond memorization. Each game starts from one of 20 popular openings, pushing models to adapt and think strategically rather than rely on learned patterns.

Image of the Kaggle Game Arena Chess Openings leaderboard. The leaderboard table displays the top AI models ranked by performance, with columns showing model name, score, and number of games played. The image highlights the competitive rankings and emphasizes the new Chess Openings benchmark, which starts each game from one of 20 popular two-ply openings to test adaptability and strategic reasoning beyond memorization.

October 22, 2025 at 4:56 PM

Kaggle

@kaggle.com

Check out the leaderboard here: 👇
www.kaggle.com/benchmarks/c...

Leaderboard showing Gemini-2.5-Flash achieving the top score of 94.8% on the Global MMLU Lite multilingual benchmark. This evaluation dataset tests models across 16 languages for cultural and linguistic biases.

October 17, 2025 at 4:22 PM

Kaggle

@kaggle.com

🚀 New Benchmark Launch: SimpleQA Verified!

We’ve partnered with Google DeepMind and Google Research to launch a curated 1,000-prompt benchmark designed to provide a more reliable and challenging evaluation of LLM short-form factuality.
Check out the leaderboard here: www.kaggle.com/benchmarks/d...

Screenshot of a new benchmark - SimpleQA Verified launched in partnership with Google DeepMind and Google Research.

September 10, 2025 at 3:17 PM

Kaggle

@kaggle.com

🏆 Results are in!

In the first #KaggleGameArena — Chess Text Input — AI models faced off using only text inputs (no tools, no move validation) in 40+ matches per pairing to build a robust Elo-like ranking ♟️

www.kaggle.com/benchmarks/k...

A screenshot of the chess text input benchmark leaderboard.

August 22, 2025 at 9:02 AM
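
As a rough illustration of how repeated head-to-head results can be turned into a ranking, here is a sketch of the standard Elo update in Python. The post only says the ranking is "Elo-like", so this is not necessarily the method Kaggle uses; the starting rating of 1500, the K-factor of 32, and the function names are all assumptions chosen for the example.

```python
def expected_score(rating_a, rating_b):
    """Standard Elo expected score for player A against player B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a, rating_b, score_a, k=32.0):
    """Update both ratings after one game.

    score_a is 1.0 for a win by A, 0.5 for a draw, 0.0 for a loss.
    k is an assumed K-factor, not a value taken from the post.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - exp_a))
    return new_a, new_b


def rate_matches(matches, initial=1500.0, k=32.0):
    """Process (player_a, player_b, score_a) tuples in order."""
    ratings = {}
    for a, b, score_a in matches:
        ra = ratings.get(a, initial)
        rb = ratings.get(b, initial)
        ratings[a], ratings[b] = update_elo(ra, rb, score_a, k)
    return ratings
```

Playing many games per pairing, as the post describes, reduces the influence of any single result on the final ratings.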

Kaggle

@kaggle.com

What a show! The Kaggle Game Arena AI Chess Tournament is complete, and o3 takes the win! 🏆

Big thanks to @magnuscarlseny.bsky.social, @gmhikaru.bsky.social, @gothamchess.bsky.social and GM David Howell for the fantastic commentary and analysis on Chess.com and TakeTakeTakeApp.

August 7, 2025 at 7:16 PM

Kaggle

@kaggle.com

What an exciting start to the Kaggle Game Arena AI chess exhibition tournament ♟️!

The first round is complete, and we have our four semi-finalists! Congratulations to o4-mini, o3, Gemini 2.5 Pro & Grok 4!

Come back tomorrow! The semi-finals kick off August 6th at 10:30 AM PT.

August 5, 2025 at 7:42 PM

Kaggle

@kaggle.com

Let the games begin! It's time to watch eight LLMs compete in the first round of head-to-head match-ups. #KaggleGameArena

August 5, 2025 at 6:03 PM

Kaggle

@kaggle.com

It’s Day 1 of the Kaggle Game Arena AI chess exhibition tournament ♟️!

Tune in today at 10:30 AM PT to watch 4 head-to-head AI matchups 🤖 in a single-elimination bracket.

August 5, 2025 at 4:13 PM

Kaggle

@kaggle.com

Think you can predict the winner of our AI chess exhibition tournament? 🧠🏆

Reply to this post with your filled-out bracket to let us know who you think will take home the gold medal!

August 5, 2025 at 1:20 PM

Kaggle

@kaggle.com

The inaugural #KaggleGameArena AI chess exhibition tournament kicks off live tomorrow.

For the next 3 days, August 5-7, tune in daily at 10:30 AM PT and catch commentary from @gmhikaru.bsky.social, @gothamchess.bsky.social and @magnuscarlseny.bsky.social ⬇️

August 4, 2025 at 7:23 PM

Kaggle

@kaggle.com

Don’t forget to check out the Kaggle AI chess exhibition tournament tomorrow. The matches take place daily from August 5-7, with streams starting at 10:30 AM PT on kaggle.com/game-arena.

August 4, 2025 at 5:13 PM

Kaggle

@kaggle.com

Games are also a fantastic proxy for a wide range of real-world skills. They test a model's ability in strategic planning, reasoning, memory, adaptation, and even "theory of mind" – understanding an opponent's thoughts.

August 4, 2025 at 4:58 PM

Kaggle

@kaggle.com

📢 Introducing Kaggle Game Arena: a new, open benchmark platform where top AI models compete in complex, strategic games in streamed match-ups. We're charting new frontiers for trustworthy AI evaluation and it begins with chess — a classic proving ground for system intelligence.

August 4, 2025 at 4:29 PM

Kaggle

@kaggle.com

✔️ Join the waitlist for early access to Kaggle Benchmarks!

Kaggle Benchmarks is the fastest, easiest way to test new models.

Let Kaggle handle infrastructure while you focus on AI breakthroughs and benefit from competition-grade rigor.

Sign up here: goo.gle/kaggle-benchmarks-waitlist

July 15, 2025 at 10:01 PM

Kaggle

@kaggle.com

📣 ICML 2025 Alert! Find Kaggle at Booth #121.

Meet our team, explore an interactive demo, and check out our new community platform for building and sharing evaluations of top models.

➕ Learn more about the Kaggle team's upcoming talk on GenAI evaluation! #ICML2025

July 14, 2025 at 9:49 PM

Kaggle

@kaggle.com

📢 You can now connect your Google Colab notebooks directly to Kaggle's Jupyter Servers!

Access Kaggle's powerful compute resources like GPUs, TPUs & large datasets from your preferred editor, like Colab or VS Code.

Try it now! 👇 www.kaggle.com/discussions/...

March 13, 2025 at 3:40 PM

Kaggle

@kaggle.com

Kaggle’s notebooks workspace is a no-cost, no set-up way to bring reproducible data science and ML projects to life. We’ll share features that help you get the most out of this resource. #WaysOfKaggling

January 22, 2025 at 5:54 PM

Kaggle

@kaggle.com

Competitions are a spectator sport, too! Even if you don’t compete, there’s a wealth of knowledge, reproducible code, and resources openly shared by the community. Browse code and solution write-ups shared by competitors:

www.kaggle.com/competitions...

January 21, 2025 at 4:00 PM
