Greg Leppert
@leppert.me
Working on AI and access to knowledge at Harvard. Executive Director of the Institutional Data Initiative; Chief Technologist of the Berkman Klein Center.
leppert.me
Data is everything.
leppert.me
This starts in an hour.
leppert.me
This Monday, @institutionaldatainitiative.org will host Petr Knoth to share his experience leading CORE ("The world’s largest collection of open access research papers") as the rise of AI brings new meaning, and challenges, to stewarding knowledge repositories. Join us virtually via the link below.
leppert.me
Tomorrow, it's our pleasure to host @ayahbdeir.bsky.social to talk about the power of data in building an AI ecosystem that's open, transparent, and fair. 11am ET on June 17th. Register at the link below to attend virtually.
leppert.me
The @institutionaldatainitiative.org is proud to support The New Commons challenge. $100k grants along with mentorship. Let's get impactful data into the AI ecosystem.
thegovlab.org
(1/4) CALL FOR APPLICATIONS FOR DATA COMMONS FOR AI

🏆 Today, The Open Data Policy Lab (a collaboration btwn The GovLab & @microsoft.com) launched The New Commons Challenge, an innovation challenge to foster the creation of data commons that can support generative AI developed in the public interest.
Reposted by Greg Leppert
free.law
To start the weekend, we've got a brand new experience for case law on CourtListener. It has better typography, more features and metadata, five million scanned decisions from @harvardlil.bsky.social, and a lot more. Read all about it and let us know what you think: free.law/2025/03/21/c...
A Faster, Smarter, Unified Case Law Experience
A redesigned case law modernizes the reading experience with enhanced layout and typography, more advanced features, better speed, and more.
free.law
leppert.me
The @institutionaldatainitiative.org at Harvard works with knowledge institutions to increase the availability, diversity, and responsible use of training data for AI. Reach out and join us.
leppert.me
Our goal is to develop methods and tools that can support expert staff at libraries everywhere, increasing the breadth of materials that can be digitized and the speed at which they’re made accessible to the public. Learn more at BPL: www.bpl.org/news/boston-...
Boston Public Library Expands Access to Collections Through AI-Enhanced Digitization
BOSTON, MA – March 12, 2025 - The Boston Public Library (BPL) is launching a large-scale digitization project to unlock hundreds of thousands…
www.bpl.org
leppert.me
Together, we’ll research opportunities to generate machine-readable representations of items, add searchable metadata, and begin the structuring of entire collections—all at the moment each item leaves the imaging station.
leppert.me
IDI and BPL are working to change this by collaborating at the outset of a large digitization project, exploring how AI might complement human expertise and strengthen the process in its earliest stages.
leppert.me
BPL is embarking on a new initiative to digitize hundreds of thousands of historic items. Conventional approaches to this scale lead to an impossible choice: sacrifice depth for breadth or drastically limit what gets digitized. AI tools can help, but they’re relegated to the end of the process.
leppert.me
With our digitization at Harvard Law School Library, we'll work to increase access to unique collections, such as the Supreme Court Records and Briefs that are critical to understanding decision-making at the highest U.S. court yet remain largely inaccessible.
leppert.me
If you're part of a library, university, or other knowledge institution and interested in working with a team of data scientists to refine and publish your data, we'd love to chat. And if you're a data scientist or community builder interested in working with institutions, we're hiring.
leppert.me
IDI is building a collection of large, impactful, and widely available datasets to increase AI’s accessibility and diversity while reaffirming institutions as stewards of knowledge.
leppert.me
The key is involving the institutions themselves in the conversation. Their missions are as much a reflection of the cultures they help to preserve as the data itself, and integrating them is critical as we look for new models to foster and interact with knowledge.