Lightnews — Scholar-powered news

David Steinberg @david4096.bsky.social · 2d

Catch you at the next one I hope ;)

David Steinberg @david4096.bsky.social · 3d

Hello from @ga4gh.org Plenary in Uppsala!

1 1 7

Reposted by David Steinberg

Global Alliance for Genomics and Health @ga4gh.org · Jul 22

Announcing GIF Project: Cloud-based BRCA Exchange variant analysis environment using GA4GH standards in Camber. The project aims to adapt and extend community-driven standards to support interoperable workflows, variant annotation, and metadata description. Learn more: www.ga4gh.org/what-we-do/g...

Cloud-Based BRCA Exchange Variant Analysis Environment Using GA4GH Standards in Camber

By integrating BRCA Exchange variant data with GA4GH standards, this GA4GH Implementation Forum (GIF) project creates open, platform-agnostic workflows and tools that can be used by anyone for scalabl...

www.ga4gh.org

2 2

David Steinberg @david4096.bsky.social · Apr 4

Nice to meet you too!

1

David Steinberg @david4096.bsky.social · Apr 3

Collected together @ga4gh.org Bluesky accounts here, lmk if you want to be added! go.bsky.app/8BDDMqM

1 2

David Steinberg @david4096.bsky.social · Apr 1

Calling all @ga4gh.org Connect 2025 attendees online and in-person, let's connect here on bluesky! #ga4ghconnect2025 #ga4gh #bioinformatics #genomics

1 1 6

David Steinberg @david4096.bsky.social · Mar 26

Grace Hopper could really get people laughing about information sciences and the struggles of working under strict hierarchies www.youtube.com/watch?v=si9i...

Capt. Grace Hopper on Future Possibilities: Data, Hardware, Software, and People (Part One, 1982)

YouTube video by National Security Agency

www.youtube.com

2

David Steinberg @david4096.bsky.social · Mar 12

If you haven't caught up with the amazing new demos from @dynamicland.org now is your chance www.youtube.com/watch?v=Osn3...

Summary of "Improvising to cellular playgrounds in Realtalk", Aug 2023

YouTube video by Dynamicland

www.youtube.com

3

David Steinberg @david4096.bsky.social · Mar 11

At the @mlcommons.org Croissant community meeting with, you guessed it

1

David Steinberg @david4096.bsky.social · Feb 26

The photo we saw reminded me immediately of some of the goals of @dynamicland.org as seen here dynamicland.org/2023/Improvi...

Improvising cellular playgrounds in Realtalk

dynamicland.org

David Steinberg @david4096.bsky.social · Feb 26

Another important direction is making immersive visual experiences that make data models accessible in a visual and humane way. I hope to experience this in person at a museum github.com/dbcls/dive

GitHub - dbcls/dive: Data Integration Visual Exploration (DIVE)

Data Integration Visual Exploration (DIVE). Contribute to dbcls/dive development by creating an account on GitHub.

github.com

1

David Steinberg @david4096.bsky.social · Feb 26

Toshiyaki Katayama, original author of the wildly popular KEGG database rounding out the keynotes @swat4hcls.bsky.social by showing us the past, present, and future of linked data in the life sciences — lots of excitement for the possibilities of #graphgenome!!

1 2 2

David Steinberg @david4096.bsky.social · Feb 25

Nice to see this one making the rounds @dockstore.org @ucscgenomics.bsky.social

BF Francis Ouellette @bffo.bsky.social · Feb 25

From NPG's Scientific Data | Applying the FAIR Principles to computational workflows |
#FAIRPrinciples #Workflows #FAIR #FAIRData #FairSoftware 🧬 🖥️ 🧪🔓
⬇️
www.nature.com/articles/s41...

Applying the FAIR Principles to computational workflows - Scientific Data

Recent trends within computational and data sciences show an increasing recognition and adoption of computational workflows as tools for productivity and reproducibility that also democratize access t...

www.nature.com

1 2

David Steinberg @david4096.bsky.social · Feb 25

Starter pack for #swat4hcls2025 conference go.bsky.app/PiZd2qR 🗣️ @swat4hcls.bsky.social

David Steinberg @david4096.bsky.social · Feb 25

Slide from a course at @maastrichtu.bsky.social that’s up on GitHub github.com/MaastrichtU-...

GitHub - MaastrichtU-IDS/UM_KEN4256_KnowledgeGraphs: Resources for the KG course at IDS, Maastricht University

Resources for the KG course at IDS, Maastricht University - MaastrichtU-IDS/UM_KEN4256_KnowledgeGraphs

github.com

David Steinberg @david4096.bsky.social · Feb 25

Embedding knowledge graphs in order to compare ontologies using learned features from Shervin Mehryar’s keynote

1

David Steinberg @david4096.bsky.social · Feb 25

From Prof Anna Fensel’s keynote a roundup of some of the connections between AI and semantic

1

David Steinberg @david4096.bsky.social · Feb 25

One of the common themes of the conversations at #swat4hcls so far is that knowledge graphs are proving to be critical for reliability and interpretability of AI and LLMs in specific

1 1

David Steinberg @david4096.bsky.social · Feb 22

Excited to attend #SWAT4HCLS in Barcelona next week, representing @cambercloud.bsky.social ! 🎉

At the hackathon, we’ll explore #CroissantML for seamless dataset & model access via @hf.co and @kaggle.com 🤓

3

David Steinberg @david4096.bsky.social · Feb 17

Check out our first preprint from #biohacakathon Fukushima 2024 and expect more on this work 🤓 files.osf.io/v1/resources...

files.osf.io

David Steinberg @david4096.bsky.social · Feb 17

We found some low hanging fruit for improvement and tested out bringing a bio dataset into Croissant. We think that continually increasing the use of ontologies and controlled vocabularies will be crucial for data harmonization and the new era of multimodal models!

1 1 1

David Steinberg @david4096.bsky.social · Feb 17

We made a simple tool for converting CroissantML to #RDF so it could be analyzed using #SPARQL and looked for differences between its usage between Kaggle and Hugging Face github.com/david4096/cr...

GitHub - david4096/croissant-rdf: Tools for working with RDF from Croissant JSON-LD resources

Tools for working with RDF from Croissant JSON-LD resources - GitHub - david4096/croissant-rdf: Tools for working with RDF from Croissant JSON-LD resources

github.com

1 1

David Steinberg @david4096.bsky.social · Feb 17

It works by providing a controlled vocabulary for high level dataset metadata as well as specific metadata for columnar data, which might seem like a small thing but is a huge step forward for bringing tools to data

1

David Steinberg @david4096.bsky.social · Feb 17

@hf.co , @kaggle.com , OpenML, DataVerse and others are all implementing some or part of the CroissantML spec that interoperates with tooling like Tensorflow so you can load datasets directly into your AI training code

1 1

David Steinberg @david4096.bsky.social · Feb 17

Biology datasets tend to be messy, require domain knowledge to parse, and not immediately usable for training AI models. That’s part of why I became interested in @mlcommons.org CroissantML as a way to bring ML tools to biology data — we’re presenting a poster on this effort at #swat4hcls next week!

1 2