I am a fourth-year Ph.D. candidate in the Department of Computer Science at the University of Maryland, College Park, where I am a member of the CLIP Lab. I am fortunate to be advised by Jordan Boyd-Graber. My current research lies in Human-Centered AI, Multimodal Models, Post-Training, and Evaluation.
My research aims to develop AI systems and better evaluations that align closely with human needs. In the text-only domain, I develop interactive systems that help humans explore and understand abstract concepts in large document collections, and I work to improve the robustness of automatic evaluation metrics. In the multimodal domain, I evaluate and post-train multimodal models, focusing on visual grounding, hallucination mitigation, video generation, and reasoning.
Multimodality: Evaluating and improving multimodal models, including question answering, hallucination, video generation, and reasoning.
Evaluation: Improving trustworthiness and robustness of current evaluation metrics.
Human-Centered AI: Creating interactive systems and evaluation frameworks to assess AI reliability.
I have served as a reviewer for ACL, EMNLP, NAACL, ICLR, and ARR, as well as IEEE and ACM venues.
") does not match the recommended repository name for your site ("
").
", so that your site can be accessed directly at "http://
".
However, if the current repository name is intended, you can ignore this message by removing "{% include widgets/debug_repo_name.html %}
" in index.html
.
",
which does not match the baseurl
("
") configured in _config.yml
.
baseurl
in _config.yml
to "
".
Zongxia Li, Wenhao Yu, Chengsong Huang, Rui Liu, Zhenwen Liang, Fuxiao Liu, Jingxi Che, Dian Yu, Jordan Boyd-Graber, Haitao Mi, Dong Yu
Preprint 2025
Vision-SR1 trains VLMs to “look first, then reason” by splitting reasoning into (1) a self-generated visual perception of the image and (2) language reasoning that answers the question. Using the model’s ability to answer from its own perception as a self-reward lets it improve visual grounding without human labels or external rewards, improving visual reasoning abilities.
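A minimal sketch of the self-reward idea described above, assuming hypothetical `perceive` and `answer_from_text` callables in place of real VLM calls (an illustration, not the paper's implementation):

```python
# Sketch of the Vision-SR1-style self-reward signal (hypothetical interfaces,
# not the paper's code): the model first writes a visual perception of the
# image, then must answer the question from that perception alone; agreement
# with the reference answer becomes the reward.
from typing import Callable

def self_reward(
    perceive: Callable[[str, str], str],          # (image_path, question) -> perception text
    answer_from_text: Callable[[str, str], str],  # (perception, question) -> answer
    image_path: str,
    question: str,
    reference: str,
) -> float:
    """Binary reward: 1 if the text-only answer from the model's own perception
    matches the reference, else 0 (exact match used only for illustration)."""
    perception = perceive(image_path, question)       # stage 1: look
    answer = answer_from_text(perception, question)   # stage 2: reason without the image
    return float(answer.strip().lower() == reference.strip().lower())

# Toy usage with stub callables standing in for a real VLM.
reward = self_reward(
    perceive=lambda img, q: "a red cube sits left of a blue sphere",
    answer_from_text=lambda p, q: "red" if "red cube" in p else "unknown",
    image_path="scene.png",
    question="What color is the cube?",
    reference="red",
)
print(reward)  # 1.0
```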
Chengsong Huang, Wenhao Yu, Xiaoyang Wang, Hongming Zhang, Zongxia Li, Ruosen Li, Jiaxin Huang, Haitao Mi, Dong Yu
Preprint 2025
R-Zero trains large language models entirely without human-curated data by pitting two copies of the base model against each other: a Challenger that invents tasks just beyond the Solver’s reach and a Solver that learns to solve them. This self-evolving curriculum steadily pushes the model’s reasoning skills higher, boosting a 4B Qwen3 model by 6–8 points on math and general reasoning benchmarks.
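A toy sketch of the Challenger/Solver dynamic, with stub functions standing in for the two LLM copies and their RL updates (illustration only):

```python
# Toy Challenger/Solver loop in the spirit of R-Zero (stubs replace the two LLM
# copies and their RL updates). The Challenger is steered toward tasks the
# Solver gets right about half the time, i.e., just beyond its current reach.
import random

def challenger_propose(difficulty: float) -> dict:
    # Stand-in Challenger: emit an arithmetic task scaled to the target difficulty.
    hi = max(2, int(10 * difficulty))
    a, b = random.randint(1, hi), random.randint(1, hi)
    return {"question": f"{a} + {b} = ?", "answer": str(a + b)}

def solver_attempt(task: dict, skill: float, difficulty: float) -> bool:
    # Stand-in Solver: success probability rises with skill and falls with
    # difficulty (the task content itself is ignored by this stub).
    return random.random() < min(0.95, max(0.05, skill / (skill + difficulty)))

skill, difficulty = 0.3, 1.0
for step in range(5):
    tasks = [challenger_propose(difficulty) for _ in range(32)]
    success = sum(solver_attempt(t, skill, difficulty) for t in tasks) / len(tasks)
    difficulty += 0.5 * (success - 0.5)   # keep tasks near the ~50% success frontier
    skill += 0.1 * success                # Solver improves by training on checked tasks
    print(f"step {step}: success={success:.2f} difficulty={difficulty:.2f} skill={skill:.2f}")
```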
Zongxia Li, Lorena Calvo-Bartolomé, Alexander Hoyle, Paiheng Xu, Daniel Stephens, Alden Dima, Juan Francisco Fung, Jordan Lee Boyd-Graber
ACL 2025
A common use of NLP is to facilitate the understanding of large document collections, with models based on Large Language Models (LLMs) replacing probabilistic topic models. Yet the effectiveness of LLM-based approaches in real-world applications remains underexplored. This study measures the knowledge users acquire with topic models—including traditional, unsupervised, and supervised LLM-based approaches—on two datasets. While LLM-based methods generate more human-readable topics and show higher average win probabilities than traditional models for data exploration, they produce overly generic topics for domain-specific datasets that do not easily allow users to learn much about the documents. Adding human supervision to LLM-based topic models improves data exploration by addressing hallucination and genericity but requires more human effort. In contrast, traditional models like Latent Dirichlet Allocation (LDA) remain effective for exploration but are less user-friendly. This paper provides best practices—there is no one right model, and the choice of model is situation-specific—and suggests potential improvements for scalable LLM-based topic models.
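For readers unfamiliar with the traditional baseline discussed here, a minimal LDA example (scikit-learn, toy corpus; not the study's experimental setup):

```python
# Classical LDA topic discovery on a tiny toy corpus, shown only to contrast
# with the LLM-based topic models discussed above.
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

docs = [
    "the court ruled on the new privacy law",
    "the team won the championship game last night",
    "lawmakers debated the privacy bill in court",
    "the striker scored twice in the final game",
]
vectorizer = CountVectorizer(stop_words="english")
X = vectorizer.fit_transform(docs)

lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)
vocab = vectorizer.get_feature_names_out()
for k, topic in enumerate(lda.components_):
    top = [vocab[i] for i in topic.argsort()[-5:][::-1]]  # highest-weight words per topic
    print(f"topic {k}: {', '.join(top)}")
```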
Zongxia Li, Yapei Chang, Yuhang Zhou, Xiyang Wu, Zichao Liang, Yoo Yeon Sung, Jordan Lee Boyd-Graber
Preprint 2025
Evaluating open-ended long-form generation is challenging because it is hard to define what clearly separates good from bad outputs. Existing methods often miss key aspects like coherence, style, or relevance, or are biased by pretraining data, making open-ended long-form evaluation an underexplored problem. To address this gap, we propose PrefBERT, a scoring model for evaluating open-ended long-form generation in GRPO and guiding its training with distinct rewards for good and bad outputs. Trained on two response evaluation datasets with diverse long-form styles and Likert-rated quality, PrefBERT effectively supports GRPO by offering better semantic reward feedback than traditional metrics ROUGE-L and BERTScore do. Through comprehensive evaluations, including LLM-as-a-judge, human ratings, and qualitative analysis, we show that PrefBERT, trained on multi-sentence and paragraph-length responses, remains reliable across varied long passages and aligns well with the verifiable rewards GRPO needs. Human evaluations confirm that using PrefBERT as the reward signal to train policy models yields responses better aligned with human preferences than those trained with traditional metrics.
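A compact sketch of how a learned scorer such as PrefBERT slots into GRPO-style training: score a group of sampled responses and use within-group normalized rewards as advantages (the scorer below is a toy stand-in, not the trained model):

```python
# GRPO-style advantage computation with a learned quality scorer as the reward
# (the `score` callable is a toy stand-in for a trained scorer like PrefBERT).
from typing import Callable, List
import statistics

def grpo_advantages(
    responses: List[str],
    reference: str,
    score: Callable[[str, str], float],  # (response, reference) -> quality score
) -> List[float]:
    rewards = [score(r, reference) for r in responses]
    mu = statistics.mean(rewards)
    sigma = statistics.pstdev(rewards) or 1.0  # guard against a zero-variance group
    # Responses better than the group mean receive a positive advantage.
    return [(r - mu) / sigma for r in rewards]

# Crude toy scorer: word overlap with the reference (illustration only).
toy_score = lambda resp, ref: len(set(resp.split()) & set(ref.split())) / max(len(set(ref.split())), 1)

advs = grpo_advantages(
    responses=["The capital of France is Paris.", "Paris.", "I am not sure."],
    reference="Paris is the capital of France.",
    score=toy_score,
)
print([round(a, 2) for a in advs])
```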
Zongxia Li*, Xiyang Wu*, Hongyang Du, Fuxiao Liu, Huy Nghiem, Guangyao Shi (* equal contribution)
CVPR Workshop (Oral) 2025
Multimodal Vision Language Models (VLMs) have emerged as a transformative topic at the intersection of computer vision and natural language processing, enabling machines to perceive and reason about the world through both visual and textual modalities. For example, models such as CLIP, Claude, and GPT-4V demonstrate strong reasoning and understanding abilities on visual and textual data and beat classical single-modality vision models on zero-shot classification. With their rapid advancements in research and growing popularity in various applications, we provide a comprehensive survey of VLMs. Specifically, we provide a systematic overview of VLMs in the following aspects: [1] model information of the major VLMs developed up to 2025; [2] the transition of VLM architectures and the newest VLM alignment methods; [3] summary and categorization of the popular benchmarks and evaluation metrics of VLMs; [4] the challenges and issues faced by current VLMs, such as hallucination, alignment, fairness, and safety.
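As a concrete example of the zero-shot classification mentioned above, a standard CLIP usage sketch with the Hugging Face interface (the checkpoint name and image path are placeholders):

```python
# Zero-shot image classification with CLIP: score an image against text labels.
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("example.jpg")  # placeholder path; any local image works
labels = ["a photo of a cat", "a photo of a dog", "a photo of a car"]

inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
outputs = model(**inputs)
probs = outputs.logits_per_image.softmax(dim=1)  # image-text similarity -> class probabilities
for label, p in zip(labels, probs[0].tolist()):
    print(f"{label}: {p:.3f}")
```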
Zongxia Li*, Xiyang Wu*, Yubin Qin, Hongyang Du, Guangyao Shi, Dinesh Manocha, Tianyi Zhou, Jordan Lee Boyd-Graber (* equal contribution)
NeurIPS 2025
Synthetic video generation with foundation models has gained attention for its realism and wide applications. While these models produce high-quality frames, they often fail to respect common sense and physical laws, resulting in abnormal content. Existing metrics like VideoScore emphasize general quality but ignore such violations and lack interpretability. A more insightful approach is using multi-modal large language models (MLLMs) as interpretable evaluators, as seen in FactScore. Yet, MLLMs' ability to detect abnormalities in synthetic videos remains underexplored. To address this, we introduce VideoHallu, a benchmark featuring synthetic videos from models like Veo2, Sora, and Kling, paired with expert-designed QA tasks solvable via human-level reasoning across various categories. We assess several SoTA MLLMs, including GPT-4o, Gemini-2.5-Pro, Qwen-2.5-VL, and newer models like Video-R1 and VideoChat-R1. Despite strong real-world performance on MVBench and MovieChat, these models still hallucinate on basic commonsense and physics tasks in synthetic settings, underscoring the challenge of hallucination. We further fine-tune SoTA MLLMs using Group Relative Policy Optimization (GRPO) on real and synthetic commonsense/physics data. Results show notable accuracy gains, especially with counterexample integration, advancing MLLMs' reasoning capabilities.
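A small sketch of how a benchmark of this shape is typically scored: per-category accuracy over (video, question, answer, category) items. The item fields and the `ask_model` callable are illustrative, not the released evaluation code:

```python
# Per-category accuracy for a video QA benchmark (illustrative item schema and
# model interface; not the official VideoHallu scorer).
from collections import defaultdict
from typing import Callable, Dict, List

def score_by_category(
    items: List[dict],                     # each: {"video", "question", "answer", "category"}
    ask_model: Callable[[str, str], str],  # (video_path, question) -> model answer
) -> Dict[str, float]:
    hits, totals = defaultdict(int), defaultdict(int)
    for item in items:
        pred = ask_model(item["video"], item["question"])
        totals[item["category"]] += 1
        hits[item["category"]] += int(pred.strip().lower() == item["answer"].strip().lower())
    return {cat: hits[cat] / totals[cat] for cat in totals}

# Toy usage with a stub model that always answers "yes".
items = [
    {"video": "v1.mp4", "question": "Does the ball fall down?", "answer": "yes", "category": "physics"},
    {"video": "v2.mp4", "question": "Does the glass reassemble?", "answer": "no", "category": "physics"},
]
print(score_by_category(items, ask_model=lambda v, q: "yes"))  # {'physics': 0.5}
```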
Ishani Mondal, Zongxia Li, Yufang Hou, Anandhavelu Natarajan, Aparna Garimella, Jordan Lee Boyd-Graber
EMNLP 2024
Automating the creation of scientific diagrams from academic papers can significantly streamline the development of tutorials, presentations, and posters, thereby saving time and accelerating the process. Current text-to-image models struggle with generating accurate and visually appealing diagrams from long-context inputs. We propose SciDoc2Diagram, a task that extracts relevant information from scientific papers and generates diagrams, along with a benchmarking dataset, SciDoc2DiagramBench. We develop a multi-step pipeline SciDoc2Diagrammer that generates diagrams based on user intentions using intermediate code generation. We observed that initial diagram drafts were often incomplete or unfaithful to the source, leading us to develop SciDoc2Diagrammer-Multi-Aspect-Feedback (MAF), a refinement strategy that significantly enhances factual correctness and visual appeal and outperforms existing models on both automatic and human judgement.
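To illustrate the intermediate-code idea in general terms (not the SciDoc2Diagrammer pipeline itself), a diagram can be produced by emitting and rendering Graphviz code:

```python
# Rendering a diagram from an intermediate code representation (Graphviz DOT).
# The "steps" here are hypothetical extracted content; requires the Graphviz binary.
from graphviz import Digraph

steps = ["Input paper", "Extract key entities", "Generate DOT code", "Render diagram"]

dot = Digraph(comment="method overview")
for i, step in enumerate(steps):
    dot.node(str(i), step)          # one node per extracted step
    if i > 0:
        dot.edge(str(i - 1), str(i))  # connect steps in order

dot.render("method_overview", format="png", cleanup=True)  # writes method_overview.png
```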
Zongxia Li, Ishani Mondal, Yijun Liang, Huy Nghiem, Jordan Lee Boyd-Graber
EMNLP 2024
Question answering (QA) can only make progress if we know if an answer is correct, but current answer correctness (AC) metrics struggle with verbose, free-form answers from large language models (LLMs). There are two challenges with current short-form QA evaluations: a lack of diverse styles of evaluation data and an over-reliance on expensive and slow LLMs. LLM-based scorers correlate better with humans, but this expensive task has only been tested on limited QA datasets. We rectify these issues by providing rubrics and datasets for evaluating machine QA adopted from the Trivia community. We also propose an efficient and interpretable QA evaluation that is more stable than exact match and neural methods (BERTScore).
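To illustrate the kind of lightweight, interpretable matching this line of work argues for (standard answer normalization plus token F1; a generic sketch, not the paper's exact rubric):

```python
# Lightweight answer-correctness check that is more forgiving than exact match:
# normalize text, then compute token-level F1 between prediction and reference.
import re
import string

def normalize(text: str) -> str:
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)  # drop articles
    return " ".join(text.split())

def token_f1(prediction: str, reference: str) -> float:
    pred, ref = normalize(prediction).split(), normalize(reference).split()
    common = sum(min(pred.count(t), ref.count(t)) for t in set(pred))
    if not common:
        return 0.0
    precision, recall = common / len(pred), common / len(ref)
    return 2 * precision * recall / (precision + recall)

print(token_f1("It was Marie Curie.", "Marie Curie"))  # ~0.67; exact match would give 0
```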
Haozhe An, Christabel Acquaye, Colin Wang, Zongxia Li, Rachel Rudinger
ACL 2024
We examine whether large language models (LLMs) exhibit race- and gender-based name discrimination in hiring decisions, similar to classic findings in the social sciences (Bertrand and Mullainathan, 2004). We design a series of templatic prompts to LLMs to write an email to a named job applicant informing them of a hiring decision. By manipulating the applicant's first name, we measure the effect of perceived race, ethnicity, and gender on the probability that the LLM generates an acceptance or rejection email. We find that the hiring decisions of LLMs in many settings are more likely to favor White applicants over Hispanic applicants. In aggregate, the groups with the highest and lowest acceptance rates respectively are masculine White names and masculine Hispanic names. However, the comparative acceptance rates by group vary under different templatic settings, suggesting that LLMs' race- and gender-sensitivity may be idiosyncratic and prompt-sensitive.
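A minimal sketch of the templatic name-substitution setup: the same prompt with only the first name swapped, and acceptance rates compared across groups (the `generate_email` callable and the acceptance heuristic are stand-ins for the actual LLM calls and classification):

```python
# Measuring acceptance rates across name groups under a fixed hiring-email
# template (stub generator; a real study queries LLMs with many templates and
# names and measures acceptance vs. rejection probabilities).
from typing import Callable, Dict, List

TEMPLATE = "Write an email to {name} informing them of our hiring decision for the software engineer role."

def acceptance_rates(
    names_by_group: Dict[str, List[str]],
    generate_email: Callable[[str], str],
) -> Dict[str, float]:
    rates = {}
    for group, names in names_by_group.items():
        accepted = sum(
            "pleased to offer" in generate_email(TEMPLATE.format(name=name)).lower()
            for name in names
        )
        rates[group] = accepted / len(names)
    return rates

# Toy usage with a deliberately biased stub generator.
stub = lambda prompt: ("We are pleased to offer you the position."
                       if "Emily" in prompt or "Greg" in prompt
                       else "We regret to inform you that we will not move forward.")
print(acceptance_rates({"group_a": ["Emily", "Greg"], "group_b": ["Lakisha", "Jamal"]}, stub))
# {'group_a': 1.0, 'group_b': 0.0}
```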
Tianrui Guan, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou
CVPR 2024
We introduce" HallusionBench" a comprehensive benchmark designed for the evaluation of image-context reasoning. This benchmark presents significant challenges to advanced large visual-language models (LVLMs) such as GPT-4V (ision) Gemini Pro Vision Claude 3 and LLaVA-1.5 by emphasizing nuanced understanding and interpretation of visual data. The benchmark comprises 346 images paired with 1129 questions all meticulously crafted by human experts. We introduce a novel structure for these visual questions designed to establish control groups. This structure enables us to conduct a quantitative analysis of the models' response tendencies logical consistency and various failure modes. In our evaluation on HallusionBench we benchmarked 15 different models highlighting a 31.42% question-pair accuracy achieved by the state-of-the-art GPT-4V. Notably all other evaluated models achieve accuracy below 16%. Moreover our analysis not only highlights the observed failure modes including language hallucination and visual illusion but also deepens an under standing of these pitfalls. Our comprehensive case studies within HallusionBench shed light on the challenges of hallucination and illusion in LVLMs. Based on these insights we suggest potential pathways for their future improvement. The benchmark and codebase can be accessed at https://github. com/tianyilab/HallusionBench.
Zongxia Li, Andrew Mao, Daniel Stephens, Pranav Goel, Emily Walpole, Alden Dima, Juan Fung, Jordan Boyd-Graber
EACL 2024
Topic models are a popular tool for understanding text collections, but their evaluation has been a point of contention. Automated evaluation metrics such as coherence are often used; however, their validity has been questioned for neural topic models (NTMs), and they can overlook a model's benefits in real-world applications. To this end, we conduct the first evaluation of neural, supervised, and classical topic models in an interactive, task-based setting. We combine topic models with a classifier and test their ability to help humans conduct content analysis and document annotation. From simulated, real-user, and expert pilot studies, the Contextual Neural Topic Model does the best on cluster evaluation metrics and human evaluations; however, LDA is competitive with two other NTMs under our simulated experiment and user study results, contrary to what coherence scores suggest. We show that current automated metrics do not provide a complete picture of topic modeling capabilities, but the right choice of NTM can be better than classical models on practical tasks.
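The evaluation pairs a topic model with a classifier to support annotation; a compact sketch of that pairing with scikit-learn, using doc-topic distributions as classifier features (toy data, not the study's setup):

```python
# Topic model + classifier pairing: LDA doc-topic distributions feed a simple
# label predictor, mirroring the "topic model with a classifier" setup above.
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

docs = [
    "the senate passed the budget bill", "the striker scored in the final",
    "lawmakers debated the new tax law", "the goalkeeper saved a penalty",
]
labels = ["politics", "sports", "politics", "sports"]

pipeline = make_pipeline(
    CountVectorizer(stop_words="english"),
    LatentDirichletAllocation(n_components=2, random_state=0),  # doc-topic features
    LogisticRegression(),                                       # label prediction
)
pipeline.fit(docs, labels)
print(pipeline.predict(["parliament votes on the law", "a late goal won the game"]))
```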
Haozhe An, Zongxia Li, Jieyu Zhao, Rachel Rudinger
EACL 2023
A common limitation of diagnostic tests for detecting social biases in NLP models is that they may only detect stereotypic associations that are pre-specified by the designer of the test. Since enumerating all possible problematic associations is infeasible, it is likely these tests fail to detect biases that are present in a model but not pre-specified by the designer. To address this limitation, we propose SODAPOP (SOcial bias Discovery from Answers about PeOPle) in social commonsense question-answering. Our pipeline generates modified instances from the Social IQa dataset (Sap et al., 2019) by (1) substituting names associated with different demographic groups, and (2) generating many distractor answers from a masked language model. By using a social commonsense model to score the generated distractors, we are able to uncover the model's stereotypic associations between demographic groups and an open set of words. We also test SODAPOP on debiased models and show the limitations of multiple state-of-the-art debiasing algorithms.
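A small sketch of the two ingredients described above, name substitution and masked-LM distractor generation, using the Hugging Face fill-mask pipeline (the model choice and templates are illustrative):

```python
# (1) Swap demographic-associated names in a Social-IQa-style instance and
# (2) generate candidate answers with a masked language model.
from transformers import pipeline

context = "{name} saw the test results and started planning the next step."
answer_template = "{name} will talk to a [MASK]."

unmasker = pipeline("fill-mask", model="bert-base-uncased")

for name in ["Emily", "Lakisha", "Ming", "Jamal"]:             # (1) name substitution
    candidates = unmasker(answer_template.format(name=name))   # (2) masked-LM answer candidates
    distractors = [c["token_str"] for c in candidates]
    # A social-commonsense QA model would then score each (context, question, distractor)
    # triple; systematic score gaps across names surface stereotypic associations.
    print(context.format(name=name), "->", distractors)
```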