Mat Kelly
@machawk1.bsky.social
140 followers 110 following 100 posts
Assistant Professor at Drexel CCI WS-DL & ODU CS alum Creator of #webarchiving software: WARCreate WAIL Mink ipwb
Posts Media Videos Starter Packs
Pinned
machawk1.bsky.social
Another Web Archiving & Digital Libraries (WADL) workshop is in the books as we completed the 2025 iteration at ACM Hypertext 2025! #ACMHT2025

Did you miss it? See the papers and soon slides and recorded video:
🔗 wadlworkshop.github.io

Photo credit: Tarannum Zaki
Reposted by Mat Kelly
cdrom.ca
Folks in digital preservation looking for a chill place to hang out off here - I run a digipres mastodon instance, digipres.club. I try to keep it running well and low stress. All are welcome.
digipres.club
Hometown is adapted from Mastodon, a decentralized social network with no ads, no corporate surveillance, and ethical design.
digipres.club
Reposted by Mat Kelly
sharonmcmeekin.bsky.social
Highly recommend joining digipres.club! Misty and the moderators that help her do a great job of running a chill and interesting place free from all the annoying ads and algorithms of the commercial social media options.
cdrom.ca
Folks in digital preservation looking for a chill place to hang out off here - I run a digipres mastodon instance, digipres.club. I try to keep it running well and low stress. All are welcome.
digipres.club
Hometown is adapted from Mastodon, a decentralized social network with no ads, no corporate surveillance, and ethical design.
digipres.club
machawk1.bsky.social
That's a wrap on ACM Hypertext 2025! #ACMHT2025 #HT2025 #HT25

Next year's Hypertext 2026 will be in London at School of Advanced Study at University of London and the British Library. #HT2026

September 8-11, 2026

The theme will be Hypertext as Method.
machawk1.bsky.social
At #ACMHT2025 day 4, Yavuz Selim Kartal (from @gesis.org) discusses supplementing social media posts with AI-generated summaries, which are perceived as most understandable form of enrichment as compared to metadata enrichment or pull quotes. @acmht.bsky.social #HT2025

📄 dl.acm.org/doi/10.1145/...
machawk1.bsky.social
Almost all participants in their interviews mentioned fairness in musicRecSys came down to paying the artists. Secondarily was alg. bias &popularity bias as interview insights.

Not everyone agreed that imbalance in the sys meant it was unfair. Many also wanted transparency & control.

#ACMHT2025
machawk1.bsky.social
At Day 4 of #ACMHT2025, Karlijn Dinnissen describes the role of fairness & diversity in user choices and perception of music playlists.

* What do end users think makes a fair MusicRec sys?
* How does that impact their choices?
* How should they be informed?

#HT2025

📄 dl.acm.org/doi/10.1145/...
machawk1.bsky.social
Ágnes Horvát highlights in her @acmht.bsky.social keynote that the frequency of certain words in PubMed abstracts has drastically increased due to authors massaging their texts with LLMs. #HT2025 #ACMHT2025
machawk1.bsky.social
At the #HT2025 Day 3 keynote, Ágnes Horvát asks, "How is science effectively promoted in online social spaces? Does self-publicity online lead to a tangible increase in citations?"

When we account for venue, popularity of authors, etc., there still seems to be a net positive.
machawk1.bsky.social
In his #nht25 prez @ #HT25, David Millard (@hoosfoos.bsky.social) ids the parallels between hypertext & those that will come as a result of gen AI. Who controls meaning? What are the boundaries of text? We typically think of AI as an assistant, the human being's still the wordsmith.
machawk1.bsky.social
@eastgate.bsky.social describes resurrecting Information Cities at #NHT25 as a narrative, a journey toward understanding, a central theme in open world games like Elder Scrolls or Cyberpunk 2077 #ACMHT2025 #HT2025 @acmht.bsky.social
machawk1.bsky.social
@drchargood.bsky.social kicks off Hypertext '25 (@acmht.bsky.social) Day 3 w/ the Narrative & Hypertext Workshop #NHT25, a venue for both those new to HT and established projects. Scheduled are 3 paper presentations, a panel, and a debate. #ACMHT2025

nht.ecs.soton.ac.uk/2025/program...
nht.ecs.soton.ac.uk
Reposted by Mat Kelly
webrecorder.net
Our browser extension, ArchiveWeb.page has recently surpassed 20,000 users on the Chrome Web store:

chromewebstore.google.com/detail/webre...

That's 20k+ users creating their own high-fidelity, interactive web archives, right in their own browsers!

If you use ArchiveWeb.page, leave us a review!
ArchiveWeb.page
ArchiveWeb.page
machawk1.bsky.social
They utilized the timestamp of a tweet to estimate the time that a tweet has been archived. They then established left & right time boundaries to limit search space to make resolving captured tweets from the web archive more efficient.
@tarannum44.bsky.social #ACMHT2025

📄 doi.org/10.1145/3720...
machawk1.bsky.social
1. Reconstruct a tweet URL based on the embedded parameters in a tweet 2. Search @archive.org. Web archives are most useful when the Tweet (or account) is no longer on the live web. - @tarannum44.bsky.social #ACMHT2025
machawk1.bsky.social
Another reason fake tweets are shared is for satire. 100k likes and no comments is suspicious. How can you tell it's real? Web archives serve as a good src of evidence that it's real. Their goal is to build an automated method for evidence that a tweet is authentic @tarannum44.bsky.social #ACMHT2025
machawk1.bsky.social
@tarannum44.bsky.social from @webscidl.bsky.social is up next at #ACMHT2025 describing web archives usage for verifying attribution of Twitter screenshots with a compelling use case of one's love for pineapple pizza. 🍍🍕 @phonedudemln.bsky.social
machawk1.bsky.social
Dominik Soós from @webscidl.bsky.social asks "Can LLMs Beat Humans on Discerning Human-written and LLM-generated Science News?" Their contribs inc. benchmarks & findings, new dataset w/ 362 article triplets, & a novel method to improve perf on open-weight LLMs #ACMHT2025

dl.acm.org/doi/full/10....
machawk1.bsky.social
In his #ACMHT2025 talk, Mariusz Pisarski asks the questions about Google's AI integration into their search UI stating that research shows that users largely don't click on results at all, but instead rely on the provided summaries.

Paper 🔗 dl.acm.org/doi/full/10....
machawk1.bsky.social
Hashing the first few letters and having a linear search is a perfectly reasonable data structure and works well for a collection of thousands, probably better than a full sort.

- @eastgate.bsky.social @ @human-ht.bsky.social #ACMHT2025
machawk1.bsky.social
@eastgate.bsky.social @ @human-ht.bsky.social:The problem w/ dealing w/ lots of stuff: there's always more to read & more questions to pursue. Even in antiquity, we have people that used thousands of srcs. Ancient collections assume readers want multiple tellings & read against them. #ACMHT2025
machawk1.bsky.social
At @human-ht.bsky.social at #ACMHT2025: @eastgate.bsky.social describes historical compression: We want to compress messages, because every bit is expensive. When your bits are inscribed in stone, they are really expensive.

One tablet had very bad handwriting then tooth marks on the back side.
machawk1.bsky.social
Words per pope: Andrezej Adamski discusses the inter-pope sentiment of popes in time at the HUMAN workshop (@human-ht.bsky.social ) at #ACMHT2025. Takeaway: phrases track the media environment of each era.
machawk1.bsky.social
How would hypertext help for census date resolution? Cathy Marshall discusses in her @human-ht.bsky.social keynote at #ACMHT2025, "It's important to be able to back out records when Ruth isn't Paul's bio mother & that all her records should be deprecated. The cost for link-making is high and noisy."
machawk1.bsky.social
At the HUMAN workshop ( @human-ht.bsky.social ) at Day 2 of ACM Hypertext, Cathy Marshall gives the keynote with a story about John Tukey looking data at large while she was close reading the data (i.e., observing microwaved Ivory soap) #ACMHT2025
machawk1.bsky.social
Enjoyed Alexander Petros proposing to bake in some features of HTMX into the HTML spec, namely to extend it to support more hypermedia controls. #ACMHT2025

Check out, "The Missing Mechanic: Behavioral Affordances as the Limiting Factor in Generalizing HTML Controls":

doi.org/10.1145/3720...