Yang, S. (CSE) – Beyond Image Editing: Building Generalized Image Customization Systems
Current generative vision models struggle with image customization that requires multi-step reasoning or real-world knowledge. This proposal introduces generalized image customization, in which systems carry out complex, inferential modifications rather than only simple edits. The research focuses on the foundational framework required for this generalization, specifically high-quality training data, scalable evaluation benchmarks, self-improving training paradigms that […]