jrenau.bsky.social
@jrenau.bsky.social
Just created a new video using Avatars to demo Gemini and HAgent to extend a RISC-V Dino core with the B extension.

youtu.be/DMT0Xz_-U5g
RISC-V demo for Gemini with HAgent MCP
YouTube video by Jose Renau
youtu.be
September 30, 2025 at 12:18 AM
🎉 Proud to be one of 70 Amazon Research Award recipients this year! Great news for my students' funding too.

Thanks Amazon for supporting academic research!

www.amazon.science/research-awa...
70 Amazon Research Award recipients announced
Awardees, who represent 44 universities in 10 countries, have access to Amazon public datasets, along with AWS AI/ML services and tools.
www.amazon.science
June 3, 2025 at 10:05 PM
arxiv 📄 μRL: Discovering Transient Execution Vulnerabilities Using Reinforcement Learning

http://arxiv.org/abs/2502.14307v1


We propose using reinforcement learning to address the challenges of discovering microarchitectural vulnerabilities, such as Spectre and Meltdown, which exploit subtle int...
March 21, 2025 at 10:20 PM
arxiv 📄 VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework

http://arxiv.org/abs/2501.13411v1

Penetration testing is a vital practice for identifying and mitigating vulnerabilities in cybersecurity systems, but its manual execution is labor-intensive and time-consu...
January 27, 2025 at 6:32 PM
arxiv 📄 MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems

http://arxiv.org/abs/2501.14451v1

Autonomous Driving Systems (ADSs) are safety-critical, as real-world safety violations can result in significant losses. Rigorous ...
January 27, 2025 at 6:32 PM
arxiv 📄 Evaluating Agent-based Program Repair at Google

http://arxiv.org/abs/2501.07531v1

Agent-based program repair offers to automatically resolve complex bugs end-to-end by combining the planning, tool use, and code generation abilities of modern LLMs. Recent work has explored the use of age...
January 15, 2025 at 12:28 AM
arxiv 📄 Enabling New HDLs with Agents

http://arxiv.org/abs/2501.00642v1

Large Language Models (LLMs) based agents are transforming the programming language landscape by facilitating learning for beginners, enabling code generation, and optimizing documentation workflows. Hardware Description La...
January 7, 2025 at 6:11 PM
The new O3 ARC results are a new "oh shit moment" like the first time that I tried BERT or GPT-3.
www.youtube.com/watch?v=duQu...
going for a walk to think....
OpenAI's O3 and O3-Mini in 12 Minutes
YouTube video by Developers Digest
www.youtube.com
December 20, 2024 at 8:35 PM
arxiv 📄 Design choices made by LLM-based test generators prevent them from finding bugs

http://arxiv.org/abs/2412.14137v1

There is an increasing amount of research and commercial tools for automated test case generation using Large Language Models (LLMs). This paper critically examines whether ...
December 19, 2024 at 10:17 PM
arxiv 📄 GHIssuemarket: A Sandbox Environment for SWE-Agents Economic Experimentation

http://arxiv.org/abs/2412.11722v2

Software engineering agents (swe-agents), as key innovations in intelligent software engineering, are poised in the industry's end-of-programming debate to transcend from assis...
December 18, 2024 at 4:32 AM
arxiv 📄 Generating Move Smart Contracts based on Concepts

http://arxiv.org/abs/2412.12513v1

The growing adoption of formal verification for smart contracts has spurred the development of new verifiable languages like Move. However, the limited availability of training data for these languages h...
December 18, 2024 at 4:32 AM
arxiv 📄 PromptV: Leveraging LLM-powered Multi-Agent Prompting for High-quality Verilog Generation

http://arxiv.org/abs/2412.11014v1

Recent advances in agentic LLMs have demonstrated remarkable automated Verilog code generation capabilities. However, existing approaches either demand substanti...
December 17, 2024 at 10:16 PM
Reposted
Applications for our GenAI faculty position in the CSE department at UCSC close on Friday. Come and join our amazing team in Silicon Valley
> recruit.ucsc.edu/JPF01825

Ensure your applications are submitted by Friday as they will be reviewed over the Holiday Break!
Computer Science & Engineering: Assistant or Associate Professor, Generative Artificial Intelligence (initial review Dec. 20, 2024)
University of California, Santa Cruz is hiring. Apply now!
recruit.ucsc.edu
December 16, 2024 at 4:01 PM
arxiv 📄 DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production

http://arxiv.org/abs/2412.08069v1

Large Language Models (LLMs) have become increasingly integral to enhancing developer productivity, particularly in code generation, comprehension, and repair tasks. How...
December 16, 2024 at 8:23 PM
arxiv 📄 Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead

http://arxiv.org/abs/2412.08581v1

Exploratory testing (ET) harnesses tester's knowledge, creativity, and experience to create varying tests that uncover unexpected bugs from t...
December 16, 2024 at 8:23 PM
arxiv 📄 You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects

http://arxiv.org/abs/2412.10133v1

The ability to execute the test suite of a project is essential in many scenarios, e.g., to assess code quality and code coverage, to validate code changes made by developers o...
December 16, 2024 at 8:23 PM
arxiv 📄 MAGE: A Multi-Agent Engine for Automated RTL Code Generation

http://arxiv.org/abs/2412.07822v1

The automatic generation of RTL code (e.g., Verilog) through natural language instructions has emerged as a promising direction with the advancement of large language models (LLMs). However, p...
December 16, 2024 at 8:23 PM
arxiv 📄 AiEDA: Agentic AI Design Framework for Digital ASIC System Design

http://arxiv.org/abs/2412.09745v1

The paper addresses advancements in Generative Artificial Intelligence (GenAI) and digital chip design, highlighting the integration of Large Language Models (LLMs) in automating hardware...
December 16, 2024 at 8:23 PM
arxiv 📄 ExploraCoder: Advancing code generation for multiple unseen APIs via
planning and chained exploration


http://arxiv.org/abs/2412.05366v1

Through training on publicly available source code libraries, large language
models (LLMs) can invoke multiple encapsulated APIs to solve complex
pro...
December 11, 2024 at 12:31 AM
arxiv 📄 The BrowserGym Ecosystem for Web Agent Research


http://arxiv.org/abs/2412.05467v1

The BrowserGym ecosystem addresses the growing need for efficient evaluation
and benchmarking of web agents, particularly those leveraging automation and
Large Language Models (LLMs) for web interaction ta...
December 11, 2024 at 12:31 AM
arxiv 📄 GEE-OPs: An Operator Knowledge Base for Geospatial Code Generation on
the Google Earth Engine Platform Powered by Large Language Models


http://arxiv.org/abs/2412.05587v1

As the scale and complexity of spatiotemporal data continue to grow rapidly,
the use of geospatial modeling on the ...
December 11, 2024 at 12:31 AM
arxiv 📄 Applications and Implications of Large Language Models in Qualitative
Analysis: A New Frontier for Empirical Software Engineering


http://arxiv.org/abs/2412.06564v1

The use of large language models (LLMs) for qualitative analysis is gaining
attention in various fields, including softwa...
December 11, 2024 at 12:31 AM
arxiv 📄 Examining the Use and Impact of an AI Code Assistant on Developer
Productivity and Experience in the Enterprise


http://arxiv.org/abs/2412.06603v1

AI assistants are being created to help software engineers conduct a variety
of coding-related tasks, such as writing, documenting, and tes...
December 11, 2024 at 12:31 AM
arxiv 📄 Why Do Developers Engage with ChatGPT in Issue-Tracker? Investigating
Usage and Reliance on ChatGPT-Generated Code


http://arxiv.org/abs/2412.06757v1

Large language models (LLMs) like ChatGPT have shown the potential to assist
developers with coding and debugging tasks. However, their ...
December 11, 2024 at 12:31 AM