Tu, H. (CSE) – From Evaluation to Adaptation: Building Reliable Multimodal Intelligence
Multimodal large language models (MLLMs) are rapidly becoming general-purpose AI systems, yet their capabilities are advancing faster than our ability to evaluate, improve, and validate their reliability in realistic use. […]