CAST: Modeling Visual State Transitions for Consistent Video Retrieval Paper • 2603.08648 • Published 22 days ago • 4