- Introduces HoneSet, a dataset of 930 queries across six categories for evaluating LLM honesty
- Proposes curiosity-driven prompting and a two-stage fine-tuning approach to improve honesty and helpfulness
- Demonstrates improvements of up to 124.7% in combined honesty and helpfulness for models such as Mistral-7B
- VisionPrefer dataset captures diverse preferences (prompt-following, aesthetics, fidelity, harmlessness) using multimodal LLMs
- VP-Score model matches human accuracy in preference prediction, guiding generative model tuning
Zeroing even a single weight can make various LLMs go from generating coherent text to outputting gibberish.
arxiv.org/abs/2411.07191
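The idea can be illustrated with a toy model. This is only a hedged sketch, not the paper's method or architecture: a tiny hand-built layer where one deliberately oversized entry carries most of the signal, loosely mirroring the "super weight" phenomenon the paper reports for real LLMs. All names and values here are invented for illustration.

```python
import math

n = 8

# Toy weight matrix: small entries everywhere except one dominant
# "super weight" at position (3, 5). This is a contrived stand-in for
# the outlier weights the paper identifies inside real transformers.
W = [[0.01 * ((7 * i + 3 * j) % 5 - 2) for j in range(n)] for i in range(n)]
W[3][5] = 50.0  # the "super weight"

x = [0.5] * n  # fixed input so the demo is deterministic

def forward(weights, inp):
    # One linear layer followed by tanh.
    return [math.tanh(sum(w * v for w, v in zip(row, inp))) for row in weights]

baseline = forward(W, x)

# Locate the single largest-magnitude weight and zero it.
i, j = max(
    ((a, b) for a in range(n) for b in range(n)),
    key=lambda ab: abs(W[ab[0]][ab[1]]),
)
W_ablated = [row[:] for row in W]
W_ablated[i][j] = 0.0
ablated = forward(W_ablated, x)

# Ablating that one entry swings its output unit from saturation
# (~1.0) to near zero, while the other units barely move.
delta = max(abs(a - b) for a, b in zip(baseline, ablated))
print((i, j), round(delta, 3))
```

The same one-line ablation on any of the small weights leaves the outputs essentially unchanged, which is what makes the single-weight sensitivity surprising.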