Structured Distillation of Web Agent Capabilities Enables Generalization Paper • 2604.07776 • Published 6 days ago • 20
ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 6 days ago • 253
Less Detail, Better Answers: Degradation-Driven Prompting for VQA Paper • 2604.04838 • Published 9 days ago • 13
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 20 days ago • 183
GenMask: Adapting DiT for Segmentation via Direct Mask Paper • 2603.23906 • Published 21 days ago • 10
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning Paper • 2603.04597 • Published Mar 4 • 210
Believe Your Model: Distribution-Guided Confidence Calibration Paper • 2603.03872 • Published Mar 4 • 40
UniG2U-Bench: Do Unified Models Advance Multimodal Understanding? Paper • 2603.03241 • Published Mar 3 • 87
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models Paper • 2602.22859 • Published Feb 26 • 151