Lightnews — Scholar-powered news

@luyenchou.bsky.social

34 followers 57 following 27 posts

Edugeek. Techno-realist. Former recovering entrepreneur.

luyenchou.bsky.social

@luyenchou.bsky.social

Something Altman and OpenAI seem to recognize

February 22, 2025 at 3:34 PM

luyenchou.bsky.social

@luyenchou.bsky.social

Whassup @knickfilmskool.bsky.social fam?? Let’s go Knicks!!!!

January 2, 2025 at 12:51 AM

luyenchou.bsky.social

@luyenchou.bsky.social

Interesting context from @fchollet.bsky.social, the creator of ARC-AGI, regarding the “saturation” of the current ARC benchmark. It will be very revealing to see how o3 performs on the new ARC2 benchmark in 2025.

December 22, 2024 at 3:16 PM

luyenchou.bsky.social

@luyenchou.bsky.social

TRS-80 Model I. With the max 16K RAM option 😂. It was a year before I upgraded to 48K and from cassette tape to a floppy drive. It was the bomb!

December 22, 2024 at 2:20 PM

luyenchou.bsky.social

@luyenchou.bsky.social

March 2023 comparison of GPT-4 and GPT-3.5 performance on academic exams:

openai.com/index/gpt-4-...

December 21, 2024 at 12:42 PM

luyenchou.bsky.social

@luyenchou.bsky.social

While we’ll have to wait to test real-world performance, to put the o3 performance announced by OpenAI today in perspective: GPT-4 (March 2023) got ~40% of AP calc right (GPT-3.5 = 0%). Now o3 performs better than humans on PhD-level science. Just incredible progress.

openai.com/12-days/

December 21, 2024 at 2:47 AM

luyenchou.bsky.social

@luyenchou.bsky.social

So, I was expecting great things from 2.0 Flash, the newest consumer model. Here’s what I got from the same prompt. This highlights the challenge foundation model providers have in balancing usefulness and “risk”, especially as the public becomes more familiar with LLMs’ true capabilities.

December 12, 2024 at 4:01 PM

luyenchou.bsky.social

@luyenchou.bsky.social

Here’s the prompt I gave Gemini Advanced 1.5 and its response. You can see it breaks down the complex task into meaningful and manageable chunks. And, amazingly, it goes off and happily does the work, mining the web and social media sites and synthesizing its findings. 2/3

December 12, 2024 at 4:01 PM

luyenchou.bsky.social

@luyenchou.bsky.social

So many choices, but I’d have to go with…

December 1, 2024 at 11:21 AM
