About me

Hi! 👋🏻 I’m Yuxuan (Leo) Lu, a Ph.D. student at Northeastern University. I’m currently working as an intern applied scientist at Amazon. Before that, I got my B.E. in Computer Science and Technology and Graduated with honor at Beijing University of Technology. I’m advised by Prof. Dakuo Wang. My research interest includes Human Computer Interaction and Natural Language Processing , especially in training, running and utilizing Large Language Models (LLMs) effiently and effectively. In the past, I’ve worked as Machine Learning Researcher at a joint program between LinkedIn and Microsoft Research Asia. I’ve also worked as an intern research assistant at THUNLP lab, supervised by Prof. Zhiyuan Liu(刘知远).

Picture of me, taken in The Sayram Lake (赛里木湖)

Education

I’m currently persuing my Ph.D. in Computer Science at Khoury College of Computer Sciences, Northeastern University, advised by Prof. Dakuo Wang.

I got my B.E. in Computer Science and Technology and Graduated with honor at Beijing University of Technology. Before that, I’ve finished my junior and senior high at Beijing National Day School (北京市十一学校).

Preprints

2024

  1. In Submission
    RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative GI Cancer Care
    Ziqi Yang*Yuxuan Lu*, Jennifer Bagdasarian, Vedant Das Swain, Ritu Agarwal, Collin Campbell, Waddah Al-Refaire, Dr Jehan El-Bayoumi, Guodong (Gordon) Gao,  shara, Dakuo Wang, and Bingsheng Yao
    In Submission to CHI 2025, 2024
  2. In Submission
    UXAgent: An Large Language Model-based Agent System for Usability Testing of Web Design
    Yuxuan Lu, Arthur Yao, Hansu Gu, Jing Huang, Jessie Wang, Laurence Li, Haiyang Zhang, Qi He, Toby Jia-Jun Li, and Dakuo Wang
    In Submission to CHI 2025, 2024
  3. In Submission
    Characterizing LLM-Empowered Personalized Story Reading and Interaction for Children: Insights From Multi-Stakeholders’ Perspective
    Jiaju Chen, Minglong Tang, Yuxuan LuBingsheng Yao, Elissa Fan, Xiaojuan Ma, Ying Xu, Dakuo WangYuling Sun, and Liang He
    In Submission to CHI 2025, 2024
  4. In Submission
    From Dark Data to Open Data: Challenges and Practices for Data Integrators of Data-Driven Open Science Projects in Geoscience
    Shao ZhangShihan Fu, Bin Lu, Yuxuan LuToby Jia-Jun LiDakuo Wang, Ying Wen, Xinbing Wang, and Chenghu Zhou
    In Submission to CSCW 2025, 2024
  5. In Submission
    Exploring Domain Adaptation with LLMs for Real-World Augmented Question Answer Generation (RA-QAG) in Children Storytelling
    Jiaju Chen, Yuxuan LuShao ZhangBingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, and Yuling Sun
    In Submission to EMNLP 2024, 2024
  6. In Submission
    ALERTS: Active Learning and Ensemble LLM Real-Time Switch for Real-World Data Drift Challenges
    Yuxuan LuBingsheng YaoShao Zhang, Yisi Sang, Yun Wang, Hansu Gu, Peng Zhang, Tun Lu, Toby Jia-Jun Li, and Dakuo Wang
    In Submission to EMNLP 2024, 2024

2023

  1. Human Still Wins over LLM: An Empirical Study of Active Learning on Domain-Specific Annotation Tasks
    Yuxuan LuBingsheng YaoShao Zhang, Yun Wang, Peng Zhang, Tun Lu, Toby Jia-Jun Li, and Dakuo Wang
    arXiv preprint arXiv:2311.09825, 2023

Publications

2024

  1. More Samples or More Prompt Inputs? Exploring Effective In-Context Sampling for LLM Few-Shot Prompt Engineering
    In Findings of the Association for Computational Linguistics: NAACL 2024, 2024
  2. Professional Network Matters: Connections Empower Person-Job Fit
    Hao ChenLun DuYuxuan Lu, Qiang Fu, Xu Chen, Shi Han, Yanbin Kang, Guangming Lu, and Zi Li
    In Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024
  3. Exploring Parent’s Needs for Children-Centered AI to Support Preschoolers’ Storytelling and Reading Activities
    Yuling Sun, Jiali Liu, Bingsheng Yao, Jiaju Chen, Dakuo Wang, Xiaojuan Ma, Yuxuan Lu, Ying Xu, and Liang He
    Proc. ACM Hum.-Comput. Interact., 2024
  4. Rethinking Human-AI Collaboration in Complex Medical Decision Making: A Case Study in Sepsis Diagnosis
    Shao Zhang, Jianing Yu, Xuhai Xu, Changchang Yin, Yuxuan LuBingsheng Yao, Melanie Tory, Lace M. Padilla, Jeffrey Caterino, Ping Zhang, and Dakuo Wang
    In Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024
  5. StorySpark: Expert-Annotated QA Pairs with Real-World Knowledge for Children Storytelling
    Jiaju Chen, Yuxuan LuShao ZhangBingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, and Yuling Sun
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

  1. Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture
    Bingsheng YaoIshan JindalLucian PopaYannis Katsis, Sayan Ghosh, Lihong He, Yuxuan Lu, Shashank Srivastava, Yunyao Li, James Hendler, and Dakuo Wang
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023
  2. TCBB 2023
    Improving Biomedical Question Answering by Data Augmentation and Model Weighting
    Yongping Du, Jingya Yan, Yuxuan Lu, Yiliang Zhao, and Xingnan Jin
    IEEE/ACM Transactions on Computational Biology and Bioinformatics, Dec 2023

2022

  1. Contextual Embedding and Model Weighting by Fusing Domain Knowledge on Biomedical Question Answering
    Yuxuan Lu, Jingya Yan, Zhixuan Qi, Zhongzheng Ge, and Yongping Du
    In Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Dec 2022

2021

  1. BIBM 2021
    Dual Model Weighting Strategy and Data Augmentation in Biomedical Question Answering
    Yongping Du, Jingya Yan, Yiliang Zhao, Yuxuan Lu, and Xingnan Jin
    In 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Dec 2021

Research Experience

My current research fields includes human-ai collaboration and interaction, especially in the area of Large Language Models (LLMs).

I’m currently working as an intern applied scientist at Amazon.

Before that, I’ve worked as Machine Learning Researcher at a joint program between LinkedIn and Microsoft Research Asia where I do study about LinkedIn’s social network data. I’ve also worked as an intern research assistant at THUNLP lab, supervised by Prof. Zhiyuan Liu(刘知远). My research area there includes Knowledge Embedding.

Open source communities

I’ve participated in many open-source communities. I’m the maintainer of the VSCode extension LaTeX-Utilities, and I’m the founder and maintainer of the EduOJ project. Furthermore, I’ve contributed to many open-source projects, like GitLab, UniversalOJ, OI-Wiki, nix and others.

I’ve participated as mentor and community leader in the Open Source Promotion Plan 2021. All my 3 students successfully finished their projects. I’ve participated as a student in the OSPP 2020 in the UniversalOJ community, and successfully finished my project.

Learn more about my open-source experience at here.