Publications

2025

  1. Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization
    Yifeng Ding, Hung Le, Songyang Han, Kangrui Ruan, Zhenghui Jin, Varun Kumar, Zijian Wang, and Anoop Deoras
    arXiv, 2025
  2. BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
    The BigCodeArena Team
    arXiv, 2025
  3. Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks
    arXiv, 2025
  4. Training Language Model Agents to Find Vulnerabilities with CTF-Dojo
    arXiv, 2025
  5. Cyber-Zero: Training Cybersecurity Agents without Runtime
    arXiv, 2025
    Won first prize in the CSAW 2025 agentic automated CTF challenge
  6. BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
    The BigCodeBench Team
    ICLR (Oral), 2025
  7. Planning-Aware Code Infilling via Horizon-Length Prediction
    EMNLP, 2025
  8. LibEvolutionEval: A Benchmark and Study for Version-Specific Code Generation
    NAACL (Oral), 2025

2024

  1. Learning Code Preference via Synthetic Evolution
    arXiv, 2024
  2. Fewer Truncations Improve Language Modeling
    ICML, 2024
    Adopted by leading models such as DeepSeek-V3 and GLM-4.5; reported in 机器之心 (Synced)
  3. CodeFort: Robust Training for Code Generation Models
    Findings of EMNLP, 2024
  4. Token Alignment via Character Matching for Subword Completion
    Findings of ACL, 2024
  5. StarCoder 2 and The Stack v2: The Next Generation
    The StarCoder Team
    arXiv, 2024
  6. CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
    LREC-COLING, 2024

2023

  1. CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion
    NeurIPS Datasets and Benchmarks, 2023
    Adopted by DeepSeek-Coder, Qwen2.5-Coder, StarCoder, and Augment Code
  2. Multi-lingual Evaluation of Code Generation Models
    ICLR, 2023
  3. ReCode: Robustness Evaluation of Code Generation Models
    ACL, 2023
  4. ContraCLM: Effective Contrastive Learning For Causal Language Model
    ACL, 2023
  5. A Static Evaluation of Code Completion by Large Language Models
    ACL Industry Track, 2023
  6. Towards Greener Yet Powerful Code Generation via Quantization: An Empirical Study
    ESEC/FSE, 2023

2022

  1. Debiasing Neural Retrieval via In-batch Balancing Regularization
    GeBNLP Workshop, 2022
  2. Towards Differential Relational Privacy and its use in Question Answering
    Simone Bombari, Alessandro Achille, Zijian Wang, Yu-Xiang Wang, Yusheng Xie, Kunwar Yashraj Singh, Srikar Appalaraju, Vijay Mahadevan, and Stefano Soatto
    arXiv, 2022
  3. DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization
    ACL, 2022

2021

  1. NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
    The NL-Augmenter Team
    arXiv, 2021

2020

  1. Modeling Subjective Assessments of Guilt in Newspaper Crime Narratives
    Elisa Kreiss*, Zijian Wang*, and Christopher Potts
    CoNLL, 2020

2019

  1. TalkDown: A Corpus for Condescension Detection in Context
    Zijian Wang and Christopher Potts
    EMNLP-IJCNLP, 2019
  2. Answering Complex Open-Domain Questions Through Iterative Query Generation
    Peng Qi, Xianwen Lin*, Leo Mehr*, Zijian Wang*, and Christopher D. Manning
    EMNLP-IJCNLP, 2019
  3. Demographic Inference and Representative Population Estimates from Social Media Data
    Zijian Wang, Scott A. Hale, David Adelani, Przemyslaw A. Grabowicz, Timo Hartmann, Fabian Flöck, and David Jurgens
    WWW, 2019
    Best Poster Award

2018

  1. It’s going to be okay: Measuring Access to Support in Online Communities
    Zijian Wang and David Jurgens
    EMNLP, 2018

2017

  1. Social work in the classroom? A tool to evaluate topical relevance in student writing
    Heeryung Choi, Zijian Wang, Christopher Brooks, Kevyn Collins-Thompson, Beth Glover Reed, and Dale Fitch
    EDM, 2017