Senior Researcher, Microsoft Research
Contact Info:
Email: shilin.he@microsoft.com
Github
Google Scholar
Introduction:
I am now a Senior researcher in the DKI group at Microsoft. My recent focuses are LLM and its applications, including Agents, RAG, etc. Before that, I worked on AIOps and machine translation.
I joined Microsoft Research Asia in 2020. Prior to that, I received my Ph.D. degree from the Chinese University of Hong Kong (CUHK) in 2020, under the supervision of Prof. Michael R. Lyu.
I am the maintainer of TaskWeaver, a code-first agent framework designed for reliable planning and code execution. I am also the maintainer of LogPAI project, one of the most comprehensive open-source projects in the AIOps area.
Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection
[PDF] [BibTex]
ICLR 2023
CONAN: Diagnosing Batch Failures for Cloud Systems
[PDF] [BibTex]
ICSE-SEIP 2023
Did We Miss Something Important? Studying and Exploring Variable-Aware Log Abstraction
[PDF] [BibTex]
ICSE 2023
Incident-aware Duplicate Ticket Aggregation for Cloud Systems
[PDF] [BibTex]
ICSE 2023
An Empirical Study of Log Analysis at Microsoft
[PDF] [BibTex]
ESEC/FSE 2022
An Empirical Investigation of Missing Data Handling in Cloud Node Failure Prediction
[PDF] [BibTex]
ESEC/FSE 2022
SPINE: A Scalable Log Parser with Feedback Guidance
[PDF] [BibTex]
ESEC/FSE 2022
Distinguished Paper Award
XLM-D: Decorate Cross-lingual Pre-training Model as Non-Autoregressive Neural Machine Translation
[PDF] [BibTex]
EMNLP 2022
UniParser: A Unified Log Parser for Heterogeneous Log Data
[PDF] [BibTex]
The Web Conference (WWW 2022)
Exploiting Inactive Examples for Natural Language Generation with Data Rejuvenation
[PDF] [BibTex]
IEEE/ACM Transactions on Audio, Speech, and Language Processing 2022
Fighting the Fog of War: Automated Incident Detection for Cloud Systems
[PDF] [BibTex]
2021 USENIX Annual Technical Conference (USENIX ATC 2021)
Onion: Identifying Incident-indicating Logs for Cloud Systems
[PDF] [BibTex]
ESEC/FSE 2021 Industry Track
Multi-Task Learning with Shared Encoder for Non-Autoregressive Machine Translation
[arXiv] [BibTex]
NAACL 2021
A Survey on Automated Log Analysis for Reliability Engineering
[arXiv] [BibTex]
ACM Computing Survey 2021
Loghub: A Large Collection of System Log Datasets towards Automated Log Analytics
[arXiv] [BibTex]
Arxiv 2020
Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation
[arXiv] [BibTex]
EMNLP 2020
Assessing the Bilingual Knowledge Learned by Neural Machine Translation Models
[arXiv] [BibTex]
Arxiv 2020
Towards Understanding Neural Machine Translation with Word Importance
[arXiv] [BibTex]
In Proceedings of the Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP 2019), Hong Kong, Nov 3 - 7, 2019
Logzip: Extracting Hidden Structures via Iterative Clustering for Execution Log Compression
[arXiv] [Code] [BibTex]
In Proceedings of the The 34th IEEE/ACM International Conference on Automated Software Engineering (ASE 2019), San Diego, United States, Nov 11 - 15, 2019
Tools and Benchmarks for Automated Log Parsing
[arXiv] [Code] [BibTex]
In Proceedings of the 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE 2019), Montréal, QC, Canada, May 25 - 31, 2019
Identifying Impactful Service System Problems via Log Analysis
[PDF] [Code] [BibTex]
In Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering(ESEC/FSE 2018), Lake Buena Vista, FL, USA, Nov 4 - 9, 2018
Characterizing the natural language descriptions in software logging statements
[PDF] [Code] [BibTex]
In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering (ASE 2018), Montpellier, France, Sept 03 - 07, 2018
Towards Automated Log Parsing for Large-scale Log Data Analysis [Journal]
[PDF] [BibTex]
IEEE Transactions on Dependable and Secure Computing (TDSC) (Volume: 15, Issue: 6, Nov.-Dec. 1, 2018)
Experience Report: System Log Analysis for Anomaly Detection
[PDF] [Code] [BibTex]
In Proceedings of 2016 IEEE 27th International Symposium on Software Reliability Engineering (ISSRE 2016), Ottawa, Canada, Oct 23-27, 2016
Selected as Most Influential Papers of 30 Years of ISSRE [Link]
An Evaluation Study on Log Parsing and its Use in Log Mining
[PDF] [Code] [BibTex]
In Proceedings of 2016 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2016), Toulouse, France, Jun 28 - Jul 1, 2016