Matt Miller
@thisismattmiller.com
370 followers 220 following 36 posts
Libraries/Data -- thisismattmiller.com
Posts Media Videos Starter Packs
thisismattmiller.com
New Blog Post.
Library of Congress & Flickr Commons: Analysis of user interactions on 40,000 images
thisismattmiller.com/post/lc-flic...
- Organizing 95K photo comments.
- Viewer to explore user georectified images
- Folksonomy tagging vs LCSH Vocabulary
- Placing into the Wiki* knowledge graph
LC & Flickr Commons
Library of Congress & Flickr Commons: Analysis of user interactions on 40,000 images.
thisismattmiller.com
thisismattmiller.com
One output, 1 hour 40mins of Siskel and Ebert summaries:
www.youtube.com/watch?v=hFLM...
thisismattmiller.com
Trying out workflows that use multimodal LLMs for validating and QA.

In this blog I walk through a test using 1000 Siskel and Ebert videos to extract key video frames and other data.

thisismattmiller.com/post/buildin...
Building datasets from video collections using local & cloud LLMs
Using Qwen2.5-VL, Gemini 2.5 and Whisper to build a Siskel and Ebert dataset
thisismattmiller.com
Reposted by Matt Miller
Reposted by Matt Miller
post45data.bsky.social
New dataset on bestsellers from 40+ countries, with consistent coverage for France, Germany, Spain, Italy, and the U.S.

Congrats to the authors @sdileonardi.bsky.social, @beccacohen.bsky.social, and @dan-sinnamon.bsky.social on this major contribution! 🎉

🔗: doi.org/10.18737/386...
Reposted by Matt Miller
siskel-ebert-bot.bsky.social
Gremlins 2: The New Batch (1990)
Director: Joe Dante
Cast: Phoebe Cates-Kline, Sylvester Stallone, Hulk Hogan, Zach Galligan, Christopher Lee
Watch Review
wp / wd
A screen capture from the Siskel and Ebert Show reviewing the movie Gremlins 2: The New Batch. Ebert gave it a thumbs down.
 Siskel gave it a thumbs down.
thisismattmiller.com
thisismattmiller.com/post/glitch/

New blog post about @glitch.com shutdown, how I migrated my apps, and how I used glitch for teaching and creative projects.
thisismattmiller.com
thisismattmiller.com
The Library of Congress BIBFRAME Update is online today at 1PM EDT.
Talks about:
- Hubs (BF ontology)
- BF Cataloging at Penn Libraries
- BF Validation Tooling
listserv.loc.gov/cgi-bin/wa?A...
listserv.loc.gov
thisismattmiller.com
Need a robots.txt directive indicating bulk download is available, not that they would abide by robots.txt
thisismattmiller.com
Yeah we have bots endlessly flooding id.loc.gov stressing servers to the limit trying to scrape millions of html pages even though we offer pretty much all of it as bulk downloads: id.loc.gov/download/
Reposted by Matt Miller
thisismattmiller.com
A new very chill bot, for these very un-chill times.
Posts FERNS from "Ferns: British and exotic..." by E. J. Lowe. 8 vols 1856-1860.
Makes a new collage every 8 hours.
Reposted by Matt Miller
dan-sinnamon.bsky.social
Interesting! Because they just terminated their grant for the Post45 Data Collective, which preserves and establishes access to collections of literary and cultural data!
rbtownsend.bsky.social
NEH announces a new funding opportunity to support "projects that develop and implement educational programs for professionals who preserve and provide access to humanities collections" bit.ly/43aviek
Preservation and Access Education and Training
Supports the development of knowledge and skills among professionals responsible for preserving and establishing access to humanities collections.
bit.ly
thisismattmiller.com
If you need me I’ll be in…
Photograph of kraftwerk performing at Coachella the screen reads COMPUTERWORLD
Reposted by Matt Miller
woodblockshop.bsky.social
waiting for the dead bodies to arrive, but they're not doing me any favors
A woodcut mashup image titled: waiting for the dead bodies to arrive, but they're not doing me any favors
thisismattmiller.com
It’s true, and /r/pslf is full of community bureaucratic evocation tips: “did you try the wet signature doc upload ritual?” or “are you sure you used this exact text in your reconsideration request summoning?” that are unclear if they ever actually work.
violetbfox.bsky.social
New minizine: "$93,605 : A Student Loan Ghost Story."

Framing it as a 30-year haunting starts to get at the impact #StudentDebt has had on my life. 👻 Read it online for free at violetbfox.info/minizines/.
Cover of a zine with a clipart ghost drawing and the title "$93,605 : A Student Loan Ghost Story"
thisismattmiller.com
I wrote a bit about turning triples into pie charts.
A screenshot of https://semlab.io/property-explorer/ its of two pie charts connected by a line it shows what percentage of entities are connected via the "collaborated with" property in our Wikibase database.
Reposted by Matt Miller
lallen.bsky.social
We’ve posted a job ad to join our team at LC Labs.

I am very proud and excited about the work we have planned. Please share.

I’ll also mention that we are part of the legislative branch and this is a partner-supported project.

www.usajobs.gov/job/832669800
Sr. Innovation Specialist
<p>This position is located in the Digital Innovation Division, Digital Strategy Directorate, Office of the Chief Information Officer.</p> <p>The position description number for this position is 35903...
www.usajobs.gov
Reposted by Matt Miller
thisismattmiller.com
Less Gemini AI in my gmail and more fixing the broken Bus Stop theme that I've used for the last 15+ years
screen shot of the theme selection in gmail with bus stop theme selected and the theme not being applied in the background
thisismattmiller.com
"How to tell the birds from the flowers and other wood-cuts." 1929 public domain book.
babel.hathitrust.org/cgi/pt?id=co...
found via
thisismattmiller.github.io/hathi-pd-202...
wood block print of of a clover plant and a plover bird, they looks the same. wood block print of a kale plant and a quail bird, they looks the same. Wood block print of a carrot and a parrot bird, they looks the same. Wood block print of a hen bird and a piece of lichen, they looks the same.
thisismattmiller.com
New blog post: Three interfaces to explore the 50,000 1929 HathiTrust resources that entered the public domain last month

thisismattmiller.com/post/hathi-p...
Hathi PD 2025
Data and tools to explore 50,000 1929 public domain titles in HathiTrust
thisismattmiller.com