Multimodal Large Language Model
Multimodal LLMs are pivotal in evolving AI from linguistic intelligence to cross-modal AGI. Our lab targets bottlenecks in heterogeneous data fusion, generalization, and explainable reasoning. We develop human-centric paradigms through integrated research on representation, architecture, and data to build systemic capabilities.






