Mingxun Wang
@mingxunwang.bsky.social
830 followers 87 following 30 posts
Assistant Professor @ UCR Computational Mass Spectrometry, Bioinformatics. #massspec #molecularnetworking #GNPS #MassQL https://www.cs.ucr.edu/~mingxunw/
Pinned
mingxunwang.bsky.social
I am thrilled to share, after years of work/procrastination, that the MassQL manuscript is finally published in @natmethods.nature.com - "A universal language for finding mass spectrometry data patterns". This was a team effort from all co-authors who helped shape MassQL and how it could be used.
mingxunwang.bsky.social
To help make more mass spec data accessible - we've just rolled out a change to enable universal spectrum identifier resolution and plotting directly from mzML files in Zenodo. We're growing support from more sources in GNPS2 for public data reanalysis!

metabolomics-usi.gnps2.org/dashinterfac...
Reposted by Mingxun Wang
yelabiead.bsky.social
Interested in a co-authorship?
We’re building a tool for repository-scale untargeted #metabolomics and #exposomics of #environmental data. To make it the best it can be, we’re looking for people willing to share high-resolution LC-MS/MS (DDA) data from #water, #soil, #sediment, and related samples.
mingxunwang.bsky.social
GNPS2 and associated services will be down for power maintenance tonight and into tomorrow.
Reposted by Mingxun Wang
yelabiead.bsky.social
We just crossed the 800,000 files mark in Pan-ReDU. That's 800,000 public #metabolomics raw data files with harmonized metadata that can be re-analyzed to learn about new molecules and bio-distributions. 🎉 redu.gnps2.org
mingxunwang.bsky.social
still in development
mingxunwang.bsky.social
Yes, don't use that for the moment in classical networking
mingxunwang.bsky.social
Yes, you can do that, that's the default intensity threshold. It is relative to the base peak in the MS2.
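As a rough illustration of what a threshold "relative to the base peak" means, here is a minimal sketch in plain Python. It is not GNPS2 code: the function name `filter_by_base_peak`, the example peak list, and the 1% cutoff are all hypothetical, and the actual default value is not stated in this thread.

```python
def filter_by_base_peak(peaks, min_fraction):
    """Keep (mz, intensity) peaks whose intensity is at least
    min_fraction of the base peak (the most intense peak)."""
    if not peaks:
        return []
    base_intensity = max(intensity for _, intensity in peaks)
    cutoff = min_fraction * base_intensity
    return [(mz, intensity) for mz, intensity in peaks if intensity >= cutoff]


# Hypothetical MS2 spectrum as (m/z, intensity) pairs.
ms2 = [(85.029, 120.0), (150.1, 8.0), (393.2283, 1000.0)]

# With an assumed 1% cutoff, the base peak is 1000.0, so peaks below 10.0 are dropped.
print(filter_by_base_peak(ms2, 0.01))
```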
mingxunwang.bsky.social
So now the 85 peak will need to be within 10% of the intensity of the 393 peak. You can put any expression on the 85 peak to modulate it up or down for what you want.
mingxunwang.bsky.social
QUERY scaninfo(MS2DATA) WHERE MS2PREC=393.2283:TOLERANCEMZ=0.1:INTENSITYMATCH=Y:INTENSITYMATCHREFERENCE AND MS2PROD=85.029:TOLERANCEMZ=0.1:INTENSITYMATCH=Y:INTENSITYPERCENT=10
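One way to read the intensity-match condition in this query, sketched in plain Python rather than taken from MassQL's implementation: the reference peak defines an intensity Y, and the second peak passes if its intensity falls within the given percent of a target derived from Y. The function name `intensity_matches` and the `scale` parameter (standing in for an expression applied to Y) are hypothetical, and measuring the tolerance relative to the target is an assumption.

```python
def intensity_matches(intensity, reference, percent, scale=1.0):
    """Return True if `intensity` is within `percent` percent of
    `scale * reference` (assumed reading of the intensity match)."""
    target = scale * reference
    return abs(intensity - target) <= target * percent / 100.0


# Hypothetical intensities: reference peak at 1000 counts, 10% tolerance.
print(intensity_matches(950.0, 1000.0, 10))             # 50 away from 1000, allowed 100
print(intensity_matches(700.0, 1000.0, 10))             # 300 away from 1000, allowed 100
print(intensity_matches(520.0, 1000.0, 10, scale=0.5))  # 20 away from 500, allowed 50
```

The `scale` argument shows how an expression on the second peak shifts the target up or down before the tolerance is applied.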
mingxunwang.bsky.social
Hi @galanojeanmarie.bsky.social Yes, you're just missing one thing with the variable Y to determine the peak intensity of the second one.
mingxunwang.bsky.social
LOL - fun problem to have. I think this might be possible. For the main graphml, we'll just need to get the actual task and display title.
mingxunwang.bsky.social
Thanks for the feedback - let me see if we can integrate. We've already added direct links for modifinder - so we can easily push it out to the resolver as well with the mirror plots.
Reposted by Mingxun Wang
natmethods.nature.com
The Mass Spectrometry Query Language (MassQL) is an open-source language for instrument-independent searching across mass spectrometry data for complex patterns of interest via concise and expressive queries without the need for programming skills.

www.nature.com/articles/s41...
Reposted by Mingxun Wang
gnps2.bsky.social
We are back online!
gnps2.bsky.social
GNPS2 is planning on being down for server maintenance tomorrow at 12PM PST. We expect 5 hours of downtime to move servers, bring online new storage, and increase networking performance.
mingxunwang.bsky.social
Big thanks to Xianghu Wang for all the work as lead author and all coauthors who made this possible and the funding from @corteva.bsky.social
mingxunwang.bsky.social
While most of the clustering innovation in mass spectrometry has focused on proteomics - we hypothesize due to the ability to assess performance there - I hope that tools like MS-RT can accelerate computational innovation in metabolomics.
mingxunwang.bsky.social
After validation, we used MS-RT to evaluate several MS/MS clustering tools commonly used in proteomics, specifically MS-Cluster and Falcon. We found that Falcon made generally favorable tradeoffs between purity and clustering completeness (how much was actually clustered).
mingxunwang.bsky.social
We validated this MS-RT approach using a proteomics MS/MS dataset, comparing the purity estimates from MS-RT with estimates from state-of-the-art proteomics database search approaches. We found that, while not exactly identical, the relative order across clustering tools is maintained.
mingxunwang.bsky.social
To address this, we introduce MS-RT, which uses the retention time dimension within individual LC/MS/MS datasets to estimate clustering purity (how often different molecules make it into the same MS/MS cluster).