Publications
2025
- Is Extending Modality The Right Path Towards Omni-Modality? arXiv preprint arXiv:2506.01872, 2025
- Can LLMs Learn to Map the World from Local Descriptions? arXiv preprint arXiv:2505.20874, 2025
- MciteBench: A Benchmark for Multimodal Citation Text Generation in MLLMs. EMNLP, 2025
2024
- Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models. arXiv preprint arXiv:2410.03659, 2024
- From Persona to Personalization: A Survey on Role-Playing Language Agents. TMLR, 2024
- How Easily do Irrelevant Inputs Skew the Responses of Large Language Models? COLM, 2024
- TravelPlanner: A Benchmark for Real-World Planning with Language Agents. ICML spotlight, 2024
- Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning. COLM, 2024
2023
- Towards Visual Taxonomy Expansion. Proceedings of the 31st ACM International Conference on Multimedia, 2023
- SLR: A Million-Scale Comprehensive Crossword Dataset for Simultaneous Learning and Reasoning. 2023
- End-to-end entity linking with hierarchical reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence, 2023