# denotes correponding author
* denotes work from interns under my supervision
arXiv 2025
SWE-bench Goes Live!
Linghao Zhang, Shilin He#, Chaoyun Zhang, Yu Kang, Bowen Li, Chengxing Xie, Junhao Wang, Maoquan Wang, Yufan Huang, Shengyu Fu, et al.
ACL 2025
DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale
Linghao Zhang, Junhao Wang, Shilin He#, Chaoyun Zhang, Yu Kang, Bowen Li, Jiaheng Wen, Chengxing Xie, Maoquan Wang, Yufan Huang, et al.
arXiv 2025
Ufo2: The desktop agentos
Chaoyun Zhang, He Huang, Chiming Ni, Jian Mu, Si Qin, Shilin He, Lu Wang, Fangkai Yang, Pu Zhao, Chao Du, et al.
ICLR 2025
OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
Junjielong Xu, Qinan Zhang, Zhiqing Zhong, Shilin He#, Chaoyun Zhang, Qingwei Lin, Dan Pei, Pinjia He, Dongmei Zhang, Qi Zhang
NAACL 2025
UFO: A UI-focused Agent for Windows OS Interaction
Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, Si Qin, Minghua Ma, Yu Kang, Qingwei Lin, Saravan Rajmohan, et al.
ICSE 2025
An Empirical Study on Package-Level Deprecation in Python Ecosystem
Zhiqing Zhong, Shilin He#, Haoxuan Wang, Boxi Yu, Haowen Yang, Pinjia He
ICDE 2025
AllHands: Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models
Chaoyun Zhang, Zicheng Ma, Yuhao Wu, Shilin He*, Si Qin, Minghua Ma, Xiaoting Qin, Yu Kang, Yuyi Liang, Xiaoyu Gou, Yajie Xue, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang
arXiv 2024
Large Action Models: From Inception to Implementation
Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu, Jiaxu Qian, Shilin He, Pu Zhao, Bo Qiao, Ray Huang, Si Qin, et al.
TMLR 2024
Large language model-brained gui agents: A survey
Chaoyun Zhang, Shilin He#, Jiaxu Qian, Bowen Li, Liqun Li, Si Qin, Yu Kang, Minghua Ma, Guyue Liu, Qingwei Lin, et al.
ICSE 2024
UniLog: Automatic Logging via LLM and In-Context Learning
Junjielong Xu, Ziang Cui, Yuan Zhao, Xu Zhang, Shilin He*, Pinjia He, Liqun Li, Yu Kang, Qingwei Lin, Yingnong Dang, et al.
ICSE 2024
Xpert: Empowering Incident Management with Query Recommendations via Large Language Models
Yuxuan Jiang, Chaoyun Zhang, Shilin He*, Zhihao Yang, Minghua Ma, Si Qin, Yu Kang, Yingnong Dang, Saravan Rajmohan, Qingwei Lin, et al.
Technical Report 2023
Taskweaver: A Code-First Agent Framework
Bo Qiao, Liqun Li, Xu Zhang, Shilin He, Yu Kang, Chaoyun Zhang, Fangkai Yang, Hang Dong, Jue Zhang, Lu Wang, et al.
ISSTA 2023
ROME: Testing Image Captioning Systems via Recursive Object Melting
Boxi Yu, Zhiqing Zhong, Jiaqi Li, Yixing Yang, Shilin He, Pinjia He
ICSE 2023
Incident-aware Duplicate Ticket Aggregation for Cloud Systems
Jinyang Liu, Shilin He*, Zhuangbin Chen, Liqun Li, Yu Kang, Xu Zhang, Pinjia He, Hongyu Zhang, Qingwei Lin, Zhangwei Xu, et al.
ICSE 2023
Did We Miss Something Important? Studying and Exploring Variable-Aware Log Abstraction
Zhenhao Li, Chuan Luo, Tse-Hsun Peter Chen, Weiyi Shang, Shilin He, Qingwei Lin, Dongmei Zhang
ICSE 2023
Conan: Diagnosing batch failures for cloud systems
Liqun Li, Xu Zhang, Shilin He, Yu Kang, Hongyu Zhang, Minghua Ma, Yingnong Dang, Zhangwei Xu, Saravan Rajmohan, Qingwei Lin, et al.
ESEC/FSE 2023
STEAM: Observability-Preserving Trace Sampling
Shilin He, Botao Feng, Liqun Li, Xu Zhang, Yu Kang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang
ICLR 2023
Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection
Xu Zhang, Yuan Zhao, Ziang Cui, Liqun Li, Shilin He, Qingwei Lin, Yingnong Dang, Saravan Rajmohan, Dongmei Zhang
ESEC/FSE 2022
SPINE: A Scalable Log Parser with Feedback Guidance
Xuheng Wang, Xu Zhang, Liqun Li, Shilin He, Hongyu Zhang, Yudong Liu, Lingling Zheng, Yu Kang, Qingwei Lin, Yingnong Dang, et al.
ESEC/FSE 2022
An Empirical Study of Log Analysis at Microsoft
Shilin He, Xu Zhang, Pinjia He, Yong Xu, Liqun Li, Yu Kang, Minghua Ma, Yining Wei, Yingnong Dang, Saravanakumar Rajmohan, et al.
TASLP 2022
Exploiting Inactive Examples for Natural Language Generation with Data Rejuvenation
Wenxiang Jiao, Xing Wang, Shilin He, Zhaopeng Tu, Irwin King, Michael R Lyu
WWW 2022
UniParser: A Unified Log Parser for Heterogeneous Log Data
Yudong Liu, Xu Zhang, Shilin He, Hongyu Zhang, Liqun Li, Yu Kang, Yong Xu, Minghua Ma, Qingwei Lin, Yingnong Dang, et al.
ACM SIGOPS 2022
An intelligent framework for timely, accurate, and comprehensive cloud incident detection
Yichen Li, Xu Zhang, Shilin He*, Zhuangbin Chen, Yu Kang, Jinyang Liu, Liqun Li, Yingnong Dang, Feng Gao, Zhangwei Xu, et al.
ESEC/FSE 2021
Onion: identifying incident-indicating logs for cloud systems
Xu Zhang, Yong Xu, Si Qin, Shilin He, Bo Qiao, Ze Li, Hongyu Zhang, Xukun Li, Yingnong Dang, Qingwei Lin, et al.
ACM Computing Surveys 2021
A Survey on Automated Log Analysis for Reliability Engineering
Shilin He, Pinjia He, Zhuangbin Chen, Tianyi Yang, Yuxin Su, Michael R Lyu
USENIX ATC 2021
Fighting the Fog of War: Automated Incident Detection for Cloud Systems
Liqun Li, Xu Zhang, Xin Zhao, Pu Zhao, Bo Qiao, Shilin He, Pochian Lee, Jeffrey Sun, Feng Gao, Li Yang, et al.
NAACL 2021
Multi-Task Learning with Shared Encoder for Non-Autoregressive Machine Translation
Yongchang Hao, Shilin He, Wenxiang Jiao, Zhaopeng Tu, Michael Lyu, Xing Wang
EMNLP 2020
Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation
Wenxiang Jiao, Xing Wang, Shilin He, Irwin King, Michael R Lyu, Zhaopeng Tu
ASE 2019
Logzip: Extracting Hidden Structures via Iterative Clustering for Log Compression
Jinyang Liu, Jieming Zhu, Shilin He, Pinjia He, Zibin Zheng, Michael R Lyu
ICSE 2019
Tools and benchmarks for automated log parsing
Jieming Zhu, Shilin He, Jinyang Liu, Pinjia He, Qi Xie, Zibin Zheng, Michael R Lyu
EMNLP 2019
Towards Understanding Neural Machine Translation with Word Importance
Shilin He, Zhaopeng Tu, Xing Wang, Longyue Wang, Michael R Lyu, Shuming Shi
ESEC/FSE 2018
Identifying impactful service system problems via log analysis
Shilin He, Qingwei Lin, Jian-Guang Lou, Hongyu Zhang, Michael R Lyu, Dongmei Zhang
IEEE TDSC 2017
Towards automated log parsing for large-scale log data analysis
Pinjia He, Jieming Zhu, Shilin He, Jian Li, Michael R Lyu
ISSRE 2016
Experience report: system log analysis for anomaly detection
Shilin He, Jieming Zhu, Pinjia He, Michael R Lyu