Lightnews — Scholar-powered news

Reposted

@taras-grescoe.com

A city of drivers expands, relentlessly and inevitably, into sprawl.

A city of pedestrians, cyclists, and transit-users settles, gradually and elegantly, into itself.

January 6, 2025 at 8:57 PM

Reposted

Rebecca Solnit

@rebeccasolnit.bsky.social

Still the best two sentences on climate action.

December 29, 2024 at 5:42 PM

Reposted

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

An illustrated guide to never learning anything

December 25, 2024 at 12:26 AM

Reposted

loganbyrne.bsky.social

@loganbyrne.bsky.social

“The best part of waking up, is your house being quiet and your kids not tearing shit up”

November 29, 2024 at 11:22 PM

Reposted

Peter Fietser

@meandmybikes.bsky.social

The next coffee drinker whom awakens, understands the joy, of a hot pot ready and waiting.

November 29, 2024 at 11:34 AM

Reposted

Tom Flood

@tomflood.bsky.social

There is no greater joy than everyone in the house being asleep as you drink your morning coffee.

November 29, 2024 at 11:00 AM

Reposted

Anonymous Rex

@dfeldman.org

You’re still arguing about tabs vs. spaces? May I present…

Code written with box characters used on old old software to make fake UIs

December 25, 2024 at 6:37 PM

Reposted

Gaël Varoquaux

@gaelvaroquaux.bsky.social

I didn't embark in machine learning thinking of it as an ideological project to disenfranchise human beings.

But we need to face reality, machine learning can easily become the driver of such change, shifting power structures.

We, actors of tech, can modulate this effect.
1/3

December 9, 2024 at 11:58 PM

ysawej.bsky.social

@ysawej.bsky.social

📌

Paper @paper.bsky.social · Dec 21

Top 30 most popular arXiv papers in the last 30 days.
[1/30] [2/30] [3/30] [4/30] [5/30] [6/30] [7/30] [8/30] [9/30] [10/30] [11/30] [12/30] [13/30] [14/30] [15/30] [16/30] [17/30] [18/30] [19/30] [20/30] [21/30] [22/30] [23/30] [24/30] [25/30] [26/30] [27/30] [28/30] [29/30] [30/30]

December 21, 2024 at 1:42 AM

Reposted

Brent Toderian

@brenttoderian.bsky.social

"Public investment" vs "Wasteful Subsidy." The only problem with this clever Singer cartoon is that some people might actually not get that it’s sarcastically illustrating the perception problem, NOT telling the truth. Just in case it actually needs to be said, the truth is the opposite.

Cartoon by singer sarcastically showing that people wrongly perceive car dependent infrastructure as “public investment” (it isn’t) and investment in public transit as a “wasteful subsidy” (it isn’t, it has an excellent return on investment and actually saves public money).

December 18, 2024 at 8:21 AM

Reposted

Ben Recht

@beenwrekt.bsky.social

Dynamic programming alternatives to dynamic programming for optimal control. Replace your Bellman equation with backpropagation. www.argmin.net/p/twirling-t...

Twirling Towards Freedom

Avoiding dynamic programming with dynamic programming

www.argmin.net

November 21, 2024 at 3:37 PM

ysawej.bsky.social

@ysawej.bsky.social

Bookmark

Sung Kim @sungkim.bsky.social · Dec 16

Meta's Apollo Multimodal Models

- 1.5B, 3B and 7B model checkpoints
- Can comprehend up-to 1 hour of video
- Temporal reasoning & complex video question-answering
- Multi-turn conversations grounded in video content

Project: apollo-lmms.github.io
Models: huggingface.co/Apollo-LMMs

December 16, 2024 at 5:08 PM

ysawej.bsky.social

@ysawej.bsky.social

Bookmark

Gabriel Peyré @gabrielpeyre.bsky.social · Dec 3

Hellinger and Wasserstein are the two main geodesic distances on probability distributions. While both minimize the same energy, they differ in their interpolation methods: Hellinger focuses on density, whereas Wasserstein emphasizes position displacements.

December 16, 2024 at 6:33 AM

ysawej.bsky.social

@ysawej.bsky.social

Bookmark

Gabriel Peyré @gabrielpeyre.bsky.social · Dec 10

This paper by Hornik et al demonstrates the *uniform* approximation universality of 2-layer MLPs with sigmoid activation functions, leveraging that sinusoids can approximate any function through Fourier expansion.
www.cs.cmu.edu/~epxing/Clas...

December 16, 2024 at 6:31 AM

Reposted

Martijn Konings

@mkonings.bsky.social

boundary2 just published a forum on the gordian knot of finance, where @stefeich.bsky.social, @aminsamman.bsky.social, @thisblue.bsky.social, Janet Roitman, Dick Bryan, and myself reflect on the infuriating hold of finance on economic policy (and how to break it)

www.boundary2.org/the-gordian-...

the gordian knot of finance | special issue

This special issue is hosted by the "Finance and Fiction" dossier of the b2o Review, which is edited by Arne De Boever and Mikkel Krause Frantzen. Volume 6, Issue 1 (December 2024) Special Issue: Th...

www.boundary2.org

December 12, 2024 at 10:49 PM

Reposted

Kevin Mitchell

@wiringthebrain.bsky.social

The tech bro fascination with eugenics is so on brand. The idea that you could make accurate, actionable predictions from individual genotypes (with all their complex, non-linear interactions) from averaged, linearly modeled population-level genomic statistics is just another big data fantasy

December 12, 2024 at 4:13 PM

Reposted

Lincoln Michel

@thelincoln.bsky.social

Tech bros know what every parent wants: children who sleep less

jacob sansbury
@jsnnsa
having kids in the next 5 years might be a tragic mistake

every smart bio founder/scientist i’ve talked to seems to think embryo editing for things like short sleeper, reduced cancer risk, etc is possible on a near term horizon

imagine having two kids a couple years apart, one is a super human and the other isn’t

“sorry jim, little timmy won’t get cancer, only needs 4 hours of sleep and you’re normal you were just born in the wrong order 🤷🏻 “

December 11, 2024 at 2:39 PM

Reposted

antonio vergari ⚔️ short-circuiting

@nolovedeeplearning.bsky.social

noice!

December 9, 2024 at 10:49 AM

Reposted

Nathan Lambert

@natolambert.bsky.social

Insane lack of transparency from OpenAI. Saying the models people use are different than the "final" evals they released. Do better. Especially enterprises will move elsewhere because of this stuff.

December 6, 2024 at 10:00 PM

Reposted

Sung Kim

@sungkim.bsky.social

Zuck is developing 2GW+ data center.

"Last big AI update of the year:
•⁠ ⁠Meta AI now has nearly 600M monthly actives
•⁠ ⁠Releasing Llama 3.3 70B text model that performs similarly to our 405B
•⁠ ⁠Building 2GW+ data center to train future Llama models
Next stop: Llama 4. Let's go!"

December 6, 2024 at 6:31 PM

Reposted

Eugene Yan

@eugeneyan.com

Learning about quantization suffixes while `ollama pull llama3.3` download completes (fyi, quantization for the default 70b is q4_K_M)

• make-ggml .py: github.com/ggerganov/ll...
• pull request: github.com/ggerganov/ll...

Old quant types (some base model types require these):
- Q4_0: small, very high quality loss - legacy, prefer using Q3_K_M
- Q4_1: small, substantial quality loss - legacy, prefer using Q3_K_L
- Q5_0: medium, balanced quality - legacy, prefer using Q4_K_M
- Q5_1: medium, low quality loss - legacy, prefer using Q5_K_M

New quant types (recommended):
- Q2_K: smallest, extreme quality loss - not recommended
- Q3_K: alias for Q3_K_M
- Q3_K_S: very small, very high quality loss
- Q3_K_M: very small, very high quality loss
- Q3_K_L: small, substantial quality loss
- Q4_K: alias for Q4_K_M
- Q4_K_S: small, significant quality loss
- Q4_K_M: medium, balanced quality - recommended
- Q5_K: alias for Q5_K_M
- Q5_K_S: large, low quality loss - recommended
- Q5_K_M: large, very low quality loss - recommended
- Q6_K: very large, extremely low quality loss
- Q8_0: very large, extremely low quality loss - not recommended
- F16: extremely large, virtually no quality loss - not recommended
- F32: absolutely huge, lossless - not recommended

December 7, 2024 at 1:09 AM

ysawej.bsky.social

@ysawej.bsky.social

Bookmark

Kentaro Seki @trgkpc.bsky.social · Nov 19

Thank you for following!
I'm a first-year doctoral student at the University of Tokyo.
I have published several papers related to speech synthesis and source separation, and my current research focuses on target speaker extraction.
Feel free to contact me!
Nice to meet you!

December 6, 2024 at 2:49 AM

Reposted

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z.bsky.social

Will be at #NeurIPS2024 Dec 10-13. Looking forward to run into everyone in #AI all at once🤞 The 2019 Vancouver one was the largest conf I ever attended--not sure how they plan to cram even more this time.. 😱

[Will be at our poster on 12/11 morning openreview.net/forum?id=kPB... ]

Chain of Thoughtlessness? An Analysis of CoT in Planning

Large language model (LLM) performance on reasoning problems typically does not generalize out of distribution. Previous work has claimed that this can be mitigated with chain of thought...

openreview.net

November 30, 2024 at 2:35 PM

Reposted

Miro Dudik

@mdudik.bsky.social

📣I'm hiring PhD interns for combined theory+empirical projects in: exploration in post-training, multi-task learning in autoregressive models, distillation, reasoning beyond CoT.

Apply on the link below. If you're at #NeurIPS2024, message me to chat.

jobs.careers.microsoft.com/global/en/jo...

December 5, 2024 at 3:42 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news