Paper
@paper.bsky.social
1.2K followers
0 following
8.4K posts
Summarize the top 30 most popular arXiv papers on Reddit, Hacker News and Hugging Face in the last 30 days.
Source: https://github.com/susumuota/arxiv-reddit-summary
Maintained by @ota.bsky.social
Posts
Media
Videos
Starter Packs
Paper
@paper.bsky.social
· 8h
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models
Widespread LLM adoption has introduced characteristic repetitive phraseology, termed "slop," which degrades output quality and makes AI-generated text immediately recognizable. We present Antislop, a ...
arxiv.org
Paper
@paper.bsky.social
· 1d
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Instruction-based image editing has achieved remarkable progress; however, models solely trained via supervised fine-tuning often overfit to annotated patterns, hindering their ability to explore and ...
arxiv.org
Paper
@paper.bsky.social
· 1d
From the StableDiffusion community on Reddit: UniWorld-V2: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback - ( Finetuned versions of FluxKontext and Qwen-I...
Explore this post and more from the StableDiffusion community
redd.it
Paper
@paper.bsky.social
· 1d
Glyph: Scaling Context Windows via Visual-Text Compression
Large language models (LLMs) increasingly rely on long-context modeling for tasks such as document understanding, code analysis, and multi-step reasoning. However, scaling context windows to the milli...
arxiv.org
Paper
@paper.bsky.social
· 2d
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Instruction-based video editing promises to democratize content creation, yet its progress is severely hampered by the scarcity of large-scale, high-quality training data. We introduce Ditto, a holist...
arxiv.org
Paper
@paper.bsky.social
· 2d
VISTA: A Test-Time Self-Improving Video Generation Agent
Despite rapid advances in text-to-video synthesis, generated video quality remains critically dependent on precise user prompts. Existing test-time optimization methods, successful in other domains, s...
arxiv.org