Publications

My advisees*. Equal contribution^.

2024

  1. Preprint
    A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
    Daking Rai*, Yilun Zhou, Shi Feng, Abulhair Saparov, and Ziyu Yao
    arXiv preprint arXiv:2407.02646, 2024
  2. Preprint
    IntelliExplain: Enhancing Interactive Code Generation through Natural Language Explanations for Non-Professional Programmers
    Hao Yan*, Thomas D. Latoza, and Ziyu Yao
    arXiv Preprint, 2024
  3. Preprint
    MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
    (Invited Presentation at Wolfram Research LLM Agent Colloquium)
    Murong Yue*, Wijdane Mifdal*, Yixuan Zhang, Jennifer Suh, and Ziyu Yao
    arXiv Preprint, 2024
  4. Preprint
    Lens: A Foundation Model for Network Traffic
    Qineng Wang, Chen Qian, Xiaochang Li, Ziyu Yao, and Huajie Shao
    arXiv Preprint, 2024
  5. CASE’24
    Look Further Ahead: Testing the Limits of GPT-4 in Path Planning
    Mohamed Aghzal*, Erion Plaku, and Ziyu Yao
    IEEE CASE 2024, 2024
  6. ACL’24
    An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs
    Daking Rai*, and Ziyu Yao
    ACL, 2024
  7. ACL’24
    Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance
    Saurabh Srivastava*, Chengyue Huang, Weiguo Fan, and Ziyu Yao
    ACL 2024 Findings, 2024
  8. ICLR’24W
    Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning
    Mohamed Aghzal*, Erion Plaku, and Ziyu Yao
    ICLR Workshop on LLM Agents, 2024
  9. ICLR’24
    Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning
    Murong Yue*, Jie Zhao, Min Zhang, Liang Du, and Ziyu Yao
    The Twelfth International Conference on Learning Representations (also at ICLR Workshop on Reliable and Responsible Foundation Models), 2024

2023

  1. EMNLP’23 Demo
    Gentopia: A Collaborative Platform for Tool-Augmented LLMs
    (An open-source planform for creating, evaluating, and community-sharing Augmented Language Model (ALM)-based Agents)
    Binfeng Xu, Xukun Liu, Hua Shen, Zeyu Han, Yuhan Li, Murong Yue*, and 4 more authors
    arXiv preprint arXiv:2308.04030 (to appear at EMNLP’23), 2023
  2. EMNLP’23
    MAILEX: Email Event and Argument Extraction
    Saurabh Srivastava*, Gaurav Singh*, Shou Matsumoto, Ali Raz, Paulo Costa, Joshua Poore, and 1 more author
    arXiv preprint arXiv:2305.13469 (to appear at EMNLP’23), 2023
  3. ACL’23
    Improving Generalization in Language Model-based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-based Techniques
    (Rank#8 on Spider leaderboard as of Aug 2023)
    Daking Rai*, Bailin Wang, Yilun Zhou, and Ziyu Yao
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
  4. ACL’23
    Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
    Hao Yan*, Saurabh Srivastava*, Yintao Tai*, Sida I. Wang, Wen-tau Yih, and Ziyu Yao
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
  5. AAAI’23 SA
    Explaining Large Language Model-Based Neural Semantic Parsers (Student Abstract)
    Daking Rai*, Yilun Zhou, Bailin Wang, and Ziyu Yao
    AAAI Student Abstract, 2023
  6. JBP’23
    A paradigm shift from “human writing” to “machine generation” in personality test development: An application of state-of-the-art natural language processing
    (Editor Commendation, one of 13 out of 1,000+ submissions in 2022)
    Philseok Lee, Shea Fyffe, Mina Son, Zihao Jia, and Ziyu Yao
    Journal of Business and Psychology, 2023

2022

  1. EMNLP’22
    UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
    Tianbao Xie^, Chen Henry Wu^, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, and 17 more authors
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
  2. ICLR’22 DL4Code
    Code Editing from Few Exemplars by Adaptive Multi-Extent Composition
    Peizhao Li, Xuchao Zhang, Ziyu Yao, Wei Cheng, Haifeng Chen, and Hongfu Liu
    In Deep Learning for Code Workshop at International Conference on Learning Representations, 2022
  3. ACL’22
    Synthetic Question Value Estimation for Domain Adaptation of Question Answering
    Xiang Yue, Ziyu Yao, and Huan Sun
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

  1. Dissertation
    On Advancing Natural Language Interfaces: Data Collection, Model Development, and User Interaction
    Ziyu Yao
    2021
  2. ICLR’21
    Learning Structural Edits via Incremental Tree Transformations
    Ziyu Yao, Frank F. Xu, Pengcheng Yin, Huan Sun, and Graham Neubig
    In International Conference on Learning Representations, 2021
  3. BIBM’21
    Cliniqg4qa: Generating diverse questions for domain adaptation of clinical question answering
    (Best Paper Award)
    Xiang Yue, Xinliang Frederick Zhang, Ziyu Yao, Simon Lin, and Huan Sun
    In 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2021

2020

  1. EMNLP’20
    An Imitation Game for Learning Semantic Parsers from User Interaction
    Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, and Yu Su
    In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

2019

  1. EMNLP-IJCNLP’19
    Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study
    Ziyu Yao, Yu Su, Huan Sun, and Wen-tau Yih
    In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019
  2. ACL’19
    Reinforced Dynamic Reasoning for Conversational Question Generation
    Boyuan Pan, Hao Li, Ziyu Yao, Deng Cai, and Huan Sun
    In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019
  3. WWW’19
    CoaCor: Code annotation for code retrieval with reinforcement learning
    Ziyu Yao, Jayavardhan Reddy Peddamail, and Huan Sun
    In The World Wide Web Conference, 2019
  4. AAAI’19
    Interactive semantic parsing for if-then recipes via hierarchical reinforcement learning
    Ziyu Yao, Xiujun Li, Jianfeng Gao, Brian Sadler, and Huan Sun
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2019

2018

  1. KDD’18 DL Day
    A comprehensive study of staqc for deep code summarization
    Jayavardhan Reddy Peddamail, Ziyu Yao, Zhen Wang, and Huan Sun
    In Deep Learning Day at KDD, 2018
  2. WWW’18
    Staqc: A systematically mined question-code dataset from stack overflow
    Ziyu Yao, Daniel S Weld, Wei-Peng Chen, and Huan Sun
    In Proceedings of the 2018 World Wide Web Conference, 2018

2016

  1. AAAI’16
    Semi-supervised multinomial naive bayes for text classification by leveraging word-level statistical constraint
    Li Zhao, Minlie Huang, Ziyu Yao, Rongwei Su, Yingying Jiang, and Xiaoyan Zhu
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2016