Lightnews — Scholar-powered news

scott b. weingart

@scottbot.bsky.social

Wait wait wait msnbc/msnow hasn't been associated with microsoft since 2005???

November 25, 2025 at 11:30 PM

Reposted by scott b. weingart

Sarah Bull

@sarahebull.bsky.social

It was a remarkable feeling, working on a recent project, to have access to software that transcribed the marginalia as well as the printed text.

scott b. weingart @scottbot.bsky.social · 14h

Nearly-perfect printed and handwritten text recognition is the most consequential technical contribution to the study of human culture of the last fifteen years, and it's not even close.

It fundamentally changes our (both lay and expert) relationship with the written past.

Dan Cohen @dancohen.org · 15h

New issue of my newsletter: "The Writing Is on the Wall for Handwriting Recognition" — One of the hardest problems in digital humanities has finally been solved, and it's a good use of AI newsletter.dancohen.org/archive/the-...

November 25, 2025 at 7:19 PM

Reposted by scott b. weingart

Oana Sorescu-Iudean

@oana-sorsk-id.bsky.social

Now this is an overview worth reading #skystorians. I’ve been running table models with pre-trained German language models (with pretty high CER) and it still took my total data entry time down at least 60-70%.

scott b. weingart @scottbot.bsky.social · 14h

Nearly-perfect printed and handwritten text recognition is the most consequential technical contribution to the study of human culture of the last fifteen years, and it's not even close.

It fundamentally changes our (both lay and expert) relationship with the written past.

Dan Cohen @dancohen.org · 15h

New issue of my newsletter: "The Writing Is on the Wall for Handwriting Recognition" — One of the hardest problems in digital humanities has finally been solved, and it's a good use of AI newsletter.dancohen.org/archive/the-...

November 25, 2025 at 7:06 PM

scott b. weingart

@scottbot.bsky.social

Nearly-perfect printed and handwritten text recognition is the most consequential technical contribution to the study of human culture of the last fifteen years, and it's not even close.

It fundamentally changes our (both lay and expert) relationship with the written past.

Dan Cohen @dancohen.org · 15h

New issue of my newsletter: "The Writing Is on the Wall for Handwriting Recognition" — One of the hardest problems in digital humanities has finally been solved, and it's a good use of AI newsletter.dancohen.org/archive/the-...

The Writing Is on the Wall for Handwriting Recognition

One of the hardest problems in digital humanities has finally been solved

newsletter.dancohen.org

November 25, 2025 at 6:14 PM

Reposted by scott b. weingart

Ted Underwood

@tedunderwood.com

MajinBook is a badly-needed catalog for shadow libraries. It provides metadata (e.g., date of first publication, popularity on Goodreads) for over half a million English-language books. arxiv.org/abs/2511.11412 +

MajinBook: An open catalogue of digital world literature with likes

This data paper introduces MajinBook, an open catalogue designed to facilitate the use of shadow libraries--such as Library Genesis and Z-Library--for computational social science and cultural analyti...

arxiv.org

November 21, 2025 at 2:24 PM

Reposted by scott b. weingart

suse anderson

@shineslike.bsky.social

“Blogs are one of the great literary inventions of our time. Coming somewhere between an essay and a diary entry, they are a form of personal journalism that is intimate and immediate... but they also have… a rough-edged informality that breaks down barriers. They are engaging.”

Luke McKernan @lukemckernan.bsky.social · 13d

A post on keeping old blog posts alive, following the news that the British Library is having to come up with a solution for all of its past blogs, following the collapse of the Typepad platform lukemckernan.com/2025/11/11/k...

Keeping posts going

Pity the poor British Library, an institution which simultaneously does such great things, yet at the same time seems to have utterly lost its way. The cyberattack of 2023 has created damage not on…

lukemckernan.com

November 12, 2025 at 11:22 AM

Reposted by scott b. weingart

Katie McDonough

@kmcdono.bsky.social

We are so proud of this work. Not only is it the first effort to publish & analyze **open-access data** derived from the entire text contents of digitized @britishlibrary.bsky.social newspapers, it presents a metadata-driven approach to understanding bias in big historical data. #dh #skystorians

Daniel Wilson @danielwilson.bsky.social · 14d

!Stop Press! Article on bias in digitised newspaper collections: ’Whose News’, in the new journal of @comphumresearch.bsky.social by Kaspar Beelen, @jonhistorian61.bsky.social, @kmcdono.bsky.social and me. See blog for summary & 🧵 1/7

Article doi.org/10.1017/chr....

Blog is.gd/2IFc30

#dh #c19 🗃️

Whose news? Critical methods for assessing bias in large historical datasets | Computational Humanities Research | Cambridge Core

Whose news? Critical methods for assessing bias in large historical datasets - Volume 1

doi.org

November 11, 2025 at 5:07 PM

Reposted by scott b. weingart

Daniel Wilson

@danielwilson.bsky.social

!Stop Press! Article on bias in digitised newspaper collections: ’Whose News’, in the new journal of @comphumresearch.bsky.social by Kaspar Beelen, @jonhistorian61.bsky.social, @kmcdono.bsky.social and me. See blog for summary & 🧵 1/7

Article doi.org/10.1017/chr....

Blog is.gd/2IFc30

#dh #c19 🗃️

Whose news? Critical methods for assessing bias in large historical datasets | Computational Humanities Research | Cambridge Core

Whose news? Critical methods for assessing bias in large historical datasets - Volume 1

doi.org

November 11, 2025 at 4:05 PM

Reposted by scott b. weingart

Mikko Tolonen

@tolonen.bsky.social

Great news! This is out: Opening the black box of EEBO academic.oup.com/dsh/advance-...

Opening the black box of EEBO

Abstract. Digital archives that cover extended historical periods can create a misleading impression of comprehensiveness while in truth providing access t

academic.oup.com

November 9, 2025 at 10:30 AM

Reposted by scott b. weingart

Shannon Mattern

@shannonmattern.bsky.social

“And then you have librarians who are experiencing a real existential crisis because they are getting asked by their jobs to promote [AI] tools that produce more misinformation. It's the most, like, emperor-has-no-clothes-type situation that I have ever witnessed.” - Alison Macrina

AI Is Supercharging the War on Libraries, Education, and Human Knowledge

"Fascism and AI, whether or not they have the same goals, they sure are working to accelerate one another."

www.404media.co

November 7, 2025 at 7:15 AM

Reposted by scott b. weingart

Melissa Terras

@melissaterras.bsky.social

How do we navigate gaps and challenges in quantifying and understanding innovation and R&D in the creative sector? Excellent roundup from @suzannerblack.bsky.social on how difficult it is to count and map innovation, and "return on funder investment", in the creative industries.

Dr Suzanne Black @suzannerblack.bsky.social · 18d

I have a new essay out on the CRAIC site as part of a larger piece of work around data-led approaches in the creative industries. I hope you enjoy it! craic.lboro.ac.uk/essays/captu... @designinf.bsky.social @melissaterras.bsky.social @lborouniversity.bsky.social

Capturing the value of creative R&D: data-collection for the Creative Industries Clusters Programme

Dr Suzanne R Black, CRAIC, Loughborough University London, and CoSTAR Foresight Lab Introduction This essay addresses gaps and challenges in quantifying and understanding innovation through ...

craic.lboro.ac.uk

November 7, 2025 at 12:03 PM

Reposted by scott b. weingart

Roger Freedman

@rogerfreedman.bsky.social

The first (1955) Danish edition of Ray Bradbury’s FAHRENHEIT 451. Later editions did not convert the title, so this is the only SI-compatible edition! 🎢

November 1, 2025 at 7:55 PM

Reposted by scott b. weingart

Gretchen McCulloch

@gretchenmcc.bsky.social

Haven't seen any linguistics research on character limits, but perhaps someone who follows me might know of some!

Lingüista Aburrido @ernestowg.bsky.social · 24d

Hey, @gretchenmcc.bsky.social, sorry to at you like this out of the blue, but couldn't think of anyone better to ask.

Are you aware of any research looking at how character limits influence word choice on social media?

November 1, 2025 at 11:10 PM

Reposted by scott b. weingart

Anna Alexandrova

@annaalexandrova.bsky.social

A permanent post in my department. Closing date Dec 14th 2025, interviews in March. Please spread #histsci

Assistant Professor in History of Knowledge Pre-1400

Applications are invited for the position of Assistant Professor in History of Knowledge Pre-1400, in the Department of History and Philosophy of Science at the University of Cambridge. Please note

www.cam.ac.uk

October 30, 2025 at 10:07 AM

Reposted by scott b. weingart

Melanie Walsh

@mellymeldubs.bsky.social

As DH grows, it’s increasingly important to publish conference papers, but there hasn’t been a clear venue for that.

So I’m thrilled to share this new home for DH proceedings, which will include CHR papers & more.

Thanks to @taylor-arnold.bsky.social for leading this effort!

bit.ly/ach-anthology

Screenshot that reads:

Introducing the Anthology for Computers and the Humanities

Taylor Arnold, Maria Antoniak, Miguel Escobar Varela, Marie Puren, Mila Oiva , Amanda Regan, Lauren Tilton, and Melanie Walsh

1 Data Science and Statistics, University of Richmond, U.S.A.
2 Computer Science, University of Colorado Boulder, U.S.A.
3 Faculty of Arts and Social Sciences, National University of Singapore
4 Laboratoire de Recherche de l'EPITA, Paris, France
5 History and Archaeology, University of Turku, Finland
6 History and Geography, Clemson University, U.S.A.
7 Rhetoric and Communication Studies, University of Richmond, U.S.A.
8 Information School, University of Washington, U.S.A.

Permanent Link: https://doi.org/10.63744/HHsQG7hNWyxG

Published: 25 September 2025

October 29, 2025 at 3:39 PM

Reposted by scott b. weingart

Grumpy Philosopher

@stevecooke.org

Every time discover a new piece on the dangers of LLMs, particularly for research and teaching, I add it to a Zotero library. I figure that I might as well share it, so here's my library of Cautionary AI Tales: www.zotero.org/groups/62758...

October 28, 2025 at 10:11 AM

Reposted by scott b. weingart

Karen (B)

@bookishkb.bsky.social

Online! Free!

Eliot Benbow @ebenbow.bsky.social · Oct 25

@londonmedievalsoc.bsky.social London Medieval Society will be hosting its first colloquium of the academic year on the 22nd November 2025 on the subject of Women and Knowledge in the Middle Ages. 😊

You can find the Zoom link here: us02web.zoom.us/j/81823681150

This poster provides the details for the London Medieval Society Colloquium on Women & Knowledge.

Saturday 22nd November 10:00-15:00 (UK time) via Zoom

In the centre of the poster is an image from a Medieval manuscript of a group of standing and seated women reading a number of books

Image credit: British Library, Harley 4431, f. 107

October 27, 2025 at 1:38 AM

Reposted by scott b. weingart

mia ridge

@miaout.bsky.social

'Cost-Effective Machine Learning for Automatically Processing Bibliographic Metadata' is a very readable account of using DistilBERT for specific DH tasks www.euppublishing.com/doi/full/10.... #AI4LAM

www.euppublishing.com

October 25, 2025 at 12:17 PM

Reposted by scott b. weingart

scott b. weingart

@scottbot.bsky.social

(I've occasionally heard the argument that even rejections add "value" to the general "market." For example, research applications help scholars gather thoughts, and can be reused for articles. Which is true, but may not outweigh the sunk costs & resource rebalancing from poor to rich institutions.)

October 24, 2025 at 7:00 PM

Reposted by scott b. weingart

scott b. weingart

@scottbot.bsky.social

Weirdly, this means in some cases that adding a (competitive, onerous, and popular) grant program where one didn't exist can actually cost a community more than if it didn't exist at all. So, that's cursed knowledge for you!

October 24, 2025 at 4:31 PM

Reposted by scott b. weingart

scott b. weingart

@scottbot.bsky.social

PSA: Grant programs with sufficiently low acceptance rates and payouts can cost more for applicants than they distribute to awardees.

Say it costs $5k in salaried hours to complete a proposal. 1k people apply. $1m is distributed overall. For every $1 spent, 20¢ of funding goes out.

This happens.

October 24, 2025 at 4:25 PM

Reposted by scott b. weingart

Yvonne Seale

@yvonneseale.bsky.social

A neat tool I just came across: Viabundus, a digital road map of northern Europe 1350-1650, that lets you calculate contemporary travel routes/times. In 1500, going Amiens → Köln by horse took almost 7 days and 13 toll payments.

#medievalsky

www.landesgeschichte.uni-goettingen.de/handelsstras...

October 24, 2025 at 10:58 PM

scott b. weingart

@scottbot.bsky.social

PSA: Grant programs with sufficiently low acceptance rates and payouts can cost more for applicants than they distribute to awardees.

Say it costs $5k in salaried hours to complete a proposal. 1k people apply. $1m is distributed overall. For every $1 spent, 20¢ of funding goes out.

This happens.

October 24, 2025 at 4:25 PM

Reposted by scott b. weingart

Carl T. Bergstrom

@carlbergstrom.com

1. We ( @jbakcoleman.bsky.social, @cailinmeister.bsky.social, @jevinwest.bsky.social, and I) have a new preprint up on the arXiv.

There we explore how social media companies and other online information technology firms are able to manipulate scientific research about the effects of their products.

Three schematic diagrams. The first illustrates selective publishing of internal resection, the second selective causal focus, and the third selective access and funding for researchers.

October 24, 2025 at 12:47 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news