arxiv.org/abs/2412.13303
🚀Our answer is Yes -- Excited to introduce our latest work: World-consistent Video Diffusion (WVD) with Explicit 3D Modeling!
arxiv.org/abs/2412.01821
Delighted to share AIMv2, a family of strong, scalable, and open vision encoders that excel at multimodal understanding, recognition, and grounding 🧵
paper: arxiv.org/abs/2411.14402
code: github.com/apple/ml-aim
HF: huggingface.co/collections/...
With PLUM, a pipeline for teaching LLMs to remember prior user conversations, we aim to enable your future personalization research! Joint work with @maartjeterhoeve.bsky.social, Katherine Metcalf and Yizhe Zhang from my internship at Apple.
🧵