SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models Paper • 2603.19028 • Published 13 days ago • 18
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation Paper • 2603.19039 • Published 13 days ago • 50
ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published 13 days ago • 41
Towards Localized Fine-Grained Control for Facial Expression Generation Paper • 2407.20175 • Published Jul 25, 2024 • 1
Towards Localized Fine-Grained Control for Facial Expression Generation Paper • 2407.20175 • Published Jul 25, 2024 • 1