Caroline Lemieux
@cestlemieux.bsky.social
370 followers 99 following 9 posts
now: Assistant Professing™ in Software Practices Lab at UBC. was: postdoc MSR NYC, phd UC Berkeley. also at https://mastodon.acm.org/@cestlemieux. she/her.
cestlemieux.bsky.social
The UBC Software Practices Lab is heading to #icfpsplash25! 4 ICFP/OOPSLA talks, 1 SPLASH-E, 5 talks at associated workshops... check it out: www.cs.ubc.ca/news/2025/10...
UBC Computer Science makes waves at programming language conference ICFP/SPLASH
cestlemieux.bsky.social
CodaMOSA was built on Pynguin version 0.18... since then, Pynguin has improved a great deal. Pynguin version 0.41.0 includes the LLMMOSA algorithm, which integrates core parts of CodaMOSA into modern Pynguin! Many kudos to the Pynguin contributors for this. github.com/se2p/pynguin...
Release Pynguin 0.41.0 · se2p/pynguin
Fix subject_properties aren't registered when running Pynguin on an imported module; Update documentation (Codestyle, Code Overview); Add LLM-Agent guidelines; Add PynguinML mode: Parsing and test gen...
cestlemieux.bsky.social
A common follow-up whenever we bring up parametric generators is: "but can't those byte-level mutations result in a totally different generated input?"

...they can, but our experiments found that those more destructive mutations generally lead to higher coverage. Our students fully explored this in:
rohan.padhye.org
Just Accepted to ACM TOSEM!

The "Havoc Paradox" is about the relationship between byte-level fuzzer mutations and their effect on the inputs produced by generators for structured strings (e.g. XML/SQL). Can disruptive mutations be controlled? Should they be? Find out.

📄 dl.acm.org/doi/pdf/10.1...
cestlemieux.bsky.social
after years of fuzzing libxml2 I am happy to announce I have now actually used xmllint
cestlemieux.bsky.social
I was today days old when I learned that the IEEEtran format has special figure captions that get overwritten by \usepackage{subcaption}; instead one should use \usepackage[caption=false]{subfig}. tex.stackexchange.com/questions/30...
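A minimal sketch of what that preamble change might look like in practice. The conference class option, the panel captions, the labels, and the placeholder images example-image-a and example-image-b (shipped with the mwe package in most TeX distributions) are assumptions for illustration, not taken from the linked answer:

```latex
\documentclass[conference]{IEEEtran}
\usepackage{graphicx}
% caption=false stops subfig from loading the caption package,
% so the class's own caption formatting is left alone.
\usepackage[caption=false]{subfig}

\begin{document}
\begin{figure}[t]
  \centering
  % \subfloat is subfig's counterpart to subcaption's subfigure environment.
  \subfloat[First panel]{%
    \includegraphics[width=0.45\linewidth]{example-image-a}\label{fig:panel-a}}
  \hfil
  \subfloat[Second panel]{%
    \includegraphics[width=0.45\linewidth]{example-image-b}\label{fig:panel-b}}
  \caption{Two subfigures with the class's caption style intact.}
  \label{fig:both}
\end{figure}
\end{document}
```

Swapping the subfig line for \usepackage{subcaption} in the same file should make the caption style change visible, which is a quick way to check which package a template is actually using.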
cestlemieux.bsky.social
Back from #ICSE25! I'm looking forward to reading "No Harness, No Problem: Oracle-guided Harnessing for Auto-generating C API Fuzzing Harnesses", by @gabriel-sherman.bsky.social and @snagycs.bsky.social (users.cs.utah.edu/~snagy/paper...). Nice progress in fuzz driver generation!
cestlemieux.bsky.social
Finally added a little logo for QuAC (our attribute-based Python type inference tool, paper: doi.org/10.1145/3689... + repo: github.com/jifengwu2k/q...) to my website :).
under the header "projects", a logo consisting of a black shadow of a duck on the left hand side of the words ": QuAC". Where the duck and the text ": QuAC" intersect, the shadow is whited-out.
cestlemieux.bsky.social
Was already on my reading list, gives me more incentive to read it :P
Reposted by Caroline Lemieux
clegoues.bsky.social
🧠 Older models memorize more: Models like CodeGen and CodeLlama show significantly higher leakage on Defects4J than newer models (e.g., Llama 3.1). They often reproduce patches verbatim, to the point that it’s weird (including comments!!) 🔥 3/