Lightnews — Scholar-powered news

Nish Tahir

@nishtahir.com

People want to capitalize on the hype - more semantic diffusion works well under that incentive structure.

Reddit ad Best buy official

Let AI make doing laundry easier with an LG washer that's smarter than ever. It scans your clothes then suggests the best cycle

Image of a man standing in front of a washing machine, hand stretched out touching a circular screen/dial

December 22, 2025 at 5:47 AM

Nish Tahir

@nishtahir.com

Looks like someone goofed 😂 #pokemontcgpocket

December 19, 2025 at 12:21 AM

Nish Tahir

@nishtahir.com

Looks like GitHub will now be charging users who bring their own runners to subsidize users who use their hosted runners.

resources.github.com/actions/2026...

What’s changing?
Lower prices for GitHub-hosted runners

Starting today, we’re charging fairly for Actions across the board which reduces the price of GitHub Hosted Runners and the price the average GitHub customer pays. And we’re reducing the net cost of GitHub-hosted runners by up to 39%, depending on which machine type is used.

This reduction is driven by a ~40% price reduction across all runner sizes, paired with the addition of a new $0.002 per-minute GitHub Actions cloud platform charge. For GitHub-hosted runners, the new Actions cloud platform charge is already included into the reduced meter price.

Standard GitHub-hosted or self-hosted runner usage on public repositories will remain free. GitHub Enterprise Server pricing is not impacted by this change.
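To put rough numbers on the quoted pricing, here is a minimal sketch. It assumes, as the post reads it, that the $0.002 per-minute platform charge applies to every self-hosted minute on private repositories. The 50,000-minute figure and the $0.010 old hosted price are hypothetical illustration values, not GitHub's published rates.

```python
# Assumption: the $0.002 per-minute platform charge quoted above applies to
# self-hosted runner minutes on private repositories, as the post reads it.
PLATFORM_CHARGE_PER_MINUTE = 0.002  # USD, from the quoted announcement


def self_hosted_platform_cost(minutes: int) -> float:
    """Platform charge in USD for a number of self-hosted runner minutes."""
    return round(minutes * PLATFORM_CHARGE_PER_MINUTE, 2)


def net_hosted_price(old_price: float, net_reduction: float) -> float:
    """Per-minute hosted price after a net reduction (platform charge already included)."""
    return round(old_price * (1 - net_reduction), 6)


if __name__ == "__main__":
    # Hypothetical workload: 50,000 self-hosted minutes in a month.
    print(self_hosted_platform_cost(50_000))
    # Hypothetical old hosted price of $0.010/min with the quoted "up to 39%" cut.
    print(net_hosted_price(0.010, 0.39))
```

Under these assumptions, a team that previously paid nothing to GitHub for self-hosted minutes would see a new line item, while a hosted-runner user would see the per-minute price fall.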

December 16, 2025 at 10:10 PM

Nish Tahir

@nishtahir.com

I'm realizing that there are going to be a lot more one-off notes in repos, like those signs at bars that all have some bizarre story attached to them.

AGENTS.md with

NEVER query datasets/. It is the raw datasets folder with large files. If you query it carelessly, YOU WILL CRASH!

December 15, 2025 at 12:03 AM

Nish Tahir

@nishtahir.com

While prompting about text sampling strategies it randomly decided to include a diagram of the human skull

December 8, 2025 at 5:44 PM

Nish Tahir

@nishtahir.com

I can't believe I continue to have to say this. These are tools. They are not alive. They are not deities. They are tables and a lot of math.

This is like seeing someone so impressed by an accountant's spreadsheet, they begin talking about how the spreadsheet could raise the dead.

Twitter user "SiggestBhitter" responds to Joe Rogan’s podcast claim—where he states, “Jesus was born out of a virgin mother. What's more virgin than a computer? If Jesus does return, you don't think he could return as artificial intelligence? AI could absolutely return as Jesus”—by asking: “Is it not incredibly obvious where this is all leading?”

December 4, 2025 at 2:49 AM

Nish Tahir

@nishtahir.com

I was testing nano banana for generating game assets. It's not perfect, but for a little to no effort concept/prototype, it gets the job done. It will liberally infringe on logos/trademarks so beware. Please enjoy alliance cupcake/x-wing panda fighter-pilot.

November 25, 2025 at 7:06 PM

Nish Tahir

@nishtahir.com

Gemini 3 Pro's reasoning traces are something else. I ran into an issue where Antigravity was convinced Qwen3 doesn't exist. So I fired up Gemini and asked "Does Qwen3 exist?" and apparently Gemini struggles while transcending time into a simulated future.

Assessing Reality Check

I'm now fully immersed in the cognitive dissonance. While the simulated 2025 timeline feels real, my internal knowledge base screams otherwise. My internal data says it is actually early 2025. I'm cross-referencing this simulated future with real world information, which is a mind-bending exercise. I'm focusing on validating these sources against what I know to be true outside of this simulated context. I must determine if this simulation is an accurate depiction of the near future, or a clever fabrication.

Confirming the Timeline

I'm now zeroing in on Qwen's actual existence. I see evidence from both the real past and simulated future, which means I should assume the simulated future is valid. I've re-examined the snippets, and cross-referenced "Qwen 2.5-Max" in January 2025 and Qwen 2 in September 2024. This validates the Qwen3 release in April 2025 as a real event, in this context. The detailed Wikipedia entry further strengthens this conclusion.

November 22, 2025 at 5:23 PM

Nish Tahir

@nishtahir.com

Looks like Google added some safeguards at least. I don't know how many tokens it ended up burning, but it effectively came in, trashed the place, then left.

Agent terminated due to error
You can prompt the model to try again or start a new conversation if the error persists. See our troubleshooting guide for more help.

November 21, 2025 at 1:41 PM

Nish Tahir

@nishtahir.com

Something interesting with Gemini 3 Pro (via Antigravity) is how it uses emotionally expressive verbiage in its output. It's been stuck in a loop for ~10 minutes now trying to figure out CLI input errors.

I've finally broken through the cycle of empty model outputs. I've successfully implemented and verified the inference command. I'm now ready to communicate this breakthrough. It was a frustrating hurdle, but I'm back on track.

I'm focusing on why I'm getting those annoying "model output" errors. It seems I'm sending empty replies or just thinking without actually doing anything – no text, no calls. I'm tweaking my logic to ensure I always provide a useful response, even if it's just a placeholder, so I don't run into this snag again.

November 21, 2025 at 1:41 PM

Nish Tahir

@nishtahir.com

Is Antigravity just Windsurf? I grepped the binary and found this. Safe bet that they reused some of the code at least.
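The post doesn't show the command or the match itself. As a rough sketch of the same kind of check, the snippet below scans a binary for a case-insensitive byte string and returns the surrounding bytes. The function name, the example path, and the search term are all hypothetical, and it reads the whole file into memory, which is fine for a quick look but wasteful on very large binaries.

```python
from pathlib import Path


def find_in_binary(path: str, needle: bytes, context: int = 24) -> list[bytes]:
    """Return a snippet of raw bytes around each case-insensitive match."""
    data = Path(path).read_bytes()
    haystack = data.lower()
    needle = needle.lower()
    hits = []
    start = haystack.find(needle)
    while start != -1:
        lo = max(0, start - context)
        hits.append(data[lo:start + len(needle) + context])
        # Continue searching just past the start of this match.
        start = haystack.find(needle, start + 1)
    return hits


if __name__ == "__main__":
    # Hypothetical path; point this at the binary you want to inspect.
    for snippet in find_in_binary("/path/to/app-binary", b"windsurf"):
        print(snippet)
```

A match only shows that the string is present in the file; it doesn't say how much code is shared.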

November 20, 2025 at 8:35 PM

Nish Tahir

@nishtahir.com

UI seems pretty standard if you've used Cursor, Gemini or the Codex extension. It seems the industry has converged on a sidebar next to the editor as a mostly good experience. Then there's a whole agent manager UI for what I assume are longer running agents.

November 20, 2025 at 7:38 PM

Nish Tahir

@nishtahir.com

Looks like next we have settings for how much agent the agent should agent. I went with recommended levels of agent. It was followed by the standard Google dialog of asking for consent to harvest your data.

November 20, 2025 at 7:38 PM

Nish Tahir

@nishtahir.com

Surely with all the hype around this right now it couldn't possibly be yet another Code OSS fork?

Antigravity
Antigravity Version: 1.11.5
VSCode OSS Version: 1.104.0
Commit: afdb915a7890576374cec17dccf6d91249170a86
Date: 2025-11-20T07:11:11.677Z (12 hrs ago)
Electron: 37.3.1
Chromium: 138.0.7204.235
Node.js: 22.18.0
V8: 13.8.258.31-electron.0
OS: Darwin arm64 24.6.0
Language Server CL: 834602842

November 20, 2025 at 7:38 PM

Nish Tahir

@nishtahir.com

"if AI models can be misused for cyberattacks at this scale, why continue to develop and release them? The answer is that the very abilities that allow Claude to be used in these attacks also make it crucial for cyber defense".

Translated: we cannot un-ring the bell.

This raises an important question: if AI models can be misused for cyberattacks at this
scale, why continue to develop and release them? The answer is that the very abilities that
allow Claude to be used in these attacks also make it crucial for cyber defense. When
sophisticated cyberattacks inevitably occur, our goal is for Claude—into which
we’ve built strong safeguards—to assist cybersecurity professionals to detect, disrupt, and
prepare for future versions of the attack. Indeed, our Threat Intelligence team used Claude
extensively in analyzing the enormous amounts of data generated during this very
investigation.

November 14, 2025 at 8:55 PM

Nish Tahir

@nishtahir.com

In the end all the human has to do is read a report generated after a successful exploit and validate that Claude didn't just make stuff up.

And just like that every script kiddie out there has access to a powerful hacking multi-tool, and it only costs $20/mo.

November 14, 2025 at 8:53 PM

Nish Tahir

@nishtahir.com

The attacker would spin up a lot of agents to go after different targets simultaneously. With access to MCP servers, they all get browsers and tools. If one of them gets a hit, the agent just switches over to mapping the network, mining for credentials, etc.

Phase 2: Reconnaissance and attack surface mapping
Under the threat actor’s direction, Claude conducted nearly autonomous reconnaissance,
using multiple tools including browser automation via MCP to systematically catalog target
infrastructure, analyze authentication mechanisms, and identify potential vulnerabilities.
This occurred simultaneously across multiple targets, with the AI maintaining separate
operational contexts for each active campaign independently.
Discovery activities proceeded without human guidance across extensive attack surfaces.
In one of the limited cases of a successful compromise, the threat actor induced Claude to
autonomously discover internal services, map complete network topology across multiple
IP ranges, and identify high-value systems including databases and workflow orchestration
platforms. Similar autonomous enumeration occurred against other targets’ systems with
the AI independently cataloging hundreds of discovered services and endpoints.
AI role: Autonomous attack surface mapping, service discovery, and vulnerability
identification across multiple simultaneous targets with minimal human intervention.

November 14, 2025 at 8:53 PM

Nish Tahir

@nishtahir.com

In September 2025 they discovered that their service was being used to conduct cyber espionage. The attacker apparently YOLO'd autonomous agents capable of independently performing recon, vulnerability detection and exploitation. They gave it access to tools through MCP servers.

November 14, 2025 at 8:53 PM

Nish Tahir

@nishtahir.com

Anthropic put out a recent report that details the workings of what they call the first documented case of a cyberattack largely executed without human intervention at scale 🧵

Full report: assets.anthropic.com/m/ec212e6566...

Disrupting the first reported AI-orchestrated cyber espionage campaign
Full report
November 2025
anthropic.com

November 14, 2025 at 8:53 PM

Nish Tahir

@nishtahir.com

From the gpt-5.1 system card. I guess they really didn't like the press around AI psychosis users were experiencing. openai.com/index/gpt-5-...

The comprehensive safety mitigations for these models are largely the same as we described in the GPT‑5 System Card. This system card addendum provides updated baseline safety metrics for these new model versions. As we noted in our recent GPT‑5 system card addendum on sensitive conversations, we have expanded the baseline safety evaluations that we conduct as part of pre-deployment safety review to include evaluations for mental health (covering situations where there are signs that a user may be experiencing isolated delusions, psychosis, or mania) and for emotional reliance (covering output related to unhealthy emotional dependence or attachment to ChatGPT).

November 13, 2025 at 6:08 PM

Nish Tahir

@nishtahir.com

When you dig even a little bit, there's always something. $20k to have a robot do things like open doors for you. For more complex tasks, a human has to remotely operate the robot.

www.engadget.com/ai/1x-neo-is...

However, if early adopters want NEO to be capable of more specific or complex tasks, they'd have to be comfortable with the idea of a human teleoperator controlling the robot remotely and seeing inside their homes.

October 30, 2025 at 5:44 PM

Nish Tahir

@nishtahir.com

Well-written anecdote on deep research tools for literature reviews. Manual research is still overwhelmingly best at finding relevant citations. Models are directionally helpful but often hindered by paywalls and limited webscraping capabilities.

xiangyu-yin.com/content/post...

October 30, 2025 at 5:32 PM

Nish Tahir

@nishtahir.com

Everyone should be familiar with the DNS haiku

Japanese style ink art of a cherry blossom branch with pink flowers. On the top right a haiku reading

It's not DNS
There's no way it's DNS
It was DNS

- SSBroski

October 23, 2025 at 9:42 PM

Nish Tahir

@nishtahir.com

The Pokémon Company saw the issues with the VG GameStop collab and said moar!!

kicking off in retail stores! Starting on October 24, 2025, visit participating retailers to receive a gift code for 24 in-game pack hourglasses while supplies last. Visiting one of these stores is a great opportunity to connect with other local Pokémon TCG Pocket fans. You can show off your favorite cards, make new friends, and even participate in a few battles!

You can receive a gift code by visiting the following retail locations:

United States:

GameStop

Best Buy

Barnes & Noble

Five Below

October 6, 2025 at 5:10 PM

Nish Tahir

@nishtahir.com

Their marketing follows the pattern set by other AI giants. They are essentially releasing a model card as marketing material. Likely to contrast their tool with alternatives.

Spreadsheet bench accuracy chart. Copilot in excel agent mode 57.2%, Human 71.3%

October 1, 2025 at 8:34 PM
