📖 Publications

Google Scholar
DBLP
  1. Efficient and Green Large Language Models for Software Engineering: Vision and the Road Ahead
    Jieke Shi, Zhou Yang, and David Lo
    ACM Transactions on Software Engineering and Methodology (TOSEM, 21 Pages)
  2. Prioritizing Speech Test Cases
    Zhou Yang, Jieke Shi, Muhammad Hilmi Asyrofi, Bowen Xu, Xin Zhou, DongGyun Han, and David Lo
    ACM Transactions on Software Engineering and Methodology (TOSEM, 28 Pages) [Code]
  3. Gotcha! This Model Uses My Code! Evaluating Membership Leakage Risks in Code Models
    Zhou Yang, Zhipeng Zhao, Chenyu Wang, Jieke Shi, Dongsun Kim, DongGyun Han, and David Lo
    IEEE Transactions on Software Engineering (TSE, 16 Pages) [Code]
  4. BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
    Chen Gong, Zhou Yang, Yunpeng Bai, Junda He, Jieke Shi, Kecen Li, Arunesh Sinha, Bowen Xu, Xinwen Hou, David Lo, and Tianhao Wang
    2024 45th IEEE Symposium on Security and Privacy (S&P 2024, Main Track, 16 Pages) [Code]
  5. Stealthy Backdoor Attack for Code Models
    Zhou Yang, Bowen Xu, Jie M. Zhang, Hong Jin Kang, Jieke Shi, Junda He, and David Lo
    IEEE Transactions on Software Engineering (TSE, 18 Pages) [Code]
  6. Greening Large Language Models of Code
    Jieke Shi, Zhou Yang, Hong Jin Kang, Bowen Xu, Junda He, and David Lo
    2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE 2024, SEIS Track, 12 Pages) [Code]
  7. Curiosity-Driven Testing for Sequential Decision-Making Process
    Junda He, Zhou Yang, Jieke Shi, Chengran Yang, Kisub Kim, Bowen Xu, Xin Zhou, and David Lo
    2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE 2024, Research Track, 13 Pages) [Code]
  8. Unveiling Memorization in Code Models
    Zhou Yang, Zhipeng Zhao, Chenyu Wang, Jieke Shi, Dongsun Kim, DongGyun Han, and David Lo
    2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE 2024, Research Track, 13 Pages) [Code]
  9. What Do Users Ask in Open-Source AI Repositories? An Empirical Study of GitHub Issues
    Zhou Yang, Chenyu Wang, Jieke Shi, Thong Hoang, Pavneet Kochhar, Qinghua Lu, Zhenchang Xing, David Lo
    2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR 2023, Technical Track, 12 Pages) [Code]
  10. Curiosity-Driven and Victim-Aware Adversarial Policies
    Chen Gong, Zhou Yang, Yunpeng Bai, Jieke Shi, Arunesh Sinha, Bowen Xu, David Lo, Xinwen Hou, Guoliang Fan
    2022 39th Annual Computer Security Applications Conference (ACSAC 2022, Technical Paper, Honorable Mention Award, 15 Pages) [Code]
  11. Compressing Pre-trained Models of Code into 3 MB
    Jieke Shi, Zhou Yang, Bowen Xu, Hong Jin Kang, and David Lo
    2022 IEEE/ACM 37th International Conference on Automated Software Engineering (ASE 2022, Research Paper, Nominated for ACM SIGSOFT Distinguished Paper Award, 12 Pages) [Poster][Code]
  12. Answer Summarization for Technical Queries: Benchmark and New Approach
    Chengran Yang, Bowen Xu, Ferdian Thung, Yucen Shi, Ting Zhang, Zhou Yang, Xin Zhou, Jieke Shi, Junda He, DongGyun Han, and David Lo
    2022 IEEE/ACM 37th International Conference on Automated Software Engineering (ASE 2022, Research Paper, 12 Pages) [Code]
  13. Can Identifier Splitting Improve Open-Vocabulary Language Model of Code?
    Jieke Shi, Zhou Yang, Junda He, Bowen Xu, and David Lo
    2022 IEEE 29th International Conference on Software Analysis, Evolution and Reengineering (SANER 2022, ERA Track, 5 Pages) [Poster][Code][Video]
  14. Revisiting Neuron Coverage Metrics and Quality of Deep Neural Networks
    Zhou Yang, Jieke Shi, Muhammad Hilmi Asyrofi, and David Lo
    2022 IEEE 29th International Conference on Software Analysis, Evolution and Reengineering (SANER 2022, RENE Track, 12 Pages) [Code]
  15. On the Influence of Biases in Bug Localization: Evaluation and Benchmark
    Ratnadira Widyasari, Stefanus Agus Haryono, Ferdian Thung, Jieke Shi, Constance Tan, Fiona Wee, Jack Phan, and David Lo
    2022 IEEE 29th International Conference on Software Analysis, Evolution and Reengineering (SANER 2022, RENE Track, 11 Pages) [Code] [Dataset][Video]
  16. Natural Attack for Pre-trained Models of Code
    Zhou Yang, Jieke Shi, Junda He, and David Lo
    2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE 2022, Technical Track, 12 Pages) [Code]
  17. BiasHeal: On-the-Fly Black-Box Healing of Bias in Sentiment Analysis Systems
    Zhou Yang, Harshit Jain, Jieke Shi, Muhammad Hilmi Asyrofi, and David Lo
    2021 IEEE 37th International Conference on Software Maintenance and Evolution (ICSME 2021, NIER Track, 5 Pages) [Code]
  18. Can Differential Testing Improve Automatic Speech Recognition Systems?
    Muhammad Hilmi Asyrofi, Zhou Yang, Jieke Shi, Chu Wei Quan, and David Lo
    2021 IEEE 37th International Conference on Software Maintenance and Evolution (ICSME 2021, NIER Track, 5 Pages) [Code]
  19. IncBL: Incremental Bug Localization
    Zhou Yang*, Jieke Shi*, Shaowei Wang, and David Lo
    2021 IEEE/ACM 36th International Conference on Automated Software Engineering (ASE 2021, Tool Demonstrations, *Equal contributions, 4 Pages) [Poster][Code][Video]