Publications
2026
-
Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy PrefixesarXiv, 2026 -
Cyber-Zero: Training Cybersecurity Agents without RuntimeICLR, 2026Wins the first prize in the CSAW 2025 agentic automated CTF challenge
2025
-
Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy OptimizationarXiv, 2025 -
Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking AttacksarXiv, 2025 -
-
2024
2023
-
-
-
Towards Greener Yet Powerful Code Generation via Quantization: An Empirical StudyESEC/FSE, 2023