May 6 CSE Colloquium – Evaluating AI Agents in the Real World: Lessons from Two Benchmarks 11:00 am Event Details Get Directions