Benjamingr25's picture

Benjamingr25

benjamingr25

·

AI & ML interests

None yet

Recent Activity

liked a model about 5 hours ago

tencent/HY-Embodied-0.5

liked a dataset 2 days ago

alkaidone/psse-agent-traces-flat

upvoted a paper 3 days ago

Structured Distillation of Web Agent Capabilities Enables Generalization

View all activity

Organizations

None yet

upvoted 2 papers 3 days ago

Structured Distillation of Web Agent Capabilities Enables Generalization

Paper • 2604.07776 • Published 6 days ago • 20

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 6 days ago • 253

upvoted a paper 6 days ago

Less Detail, Better Answers: Degradation-Driven Prompting for VQA

Paper • 2604.04838 • Published 9 days ago • 13

upvoted a paper 13 days ago

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published 20 days ago • 183

upvoted a paper 14 days ago

GenMask: Adapting DiT for Segmentation via Direct Mask

Paper • 2603.23906 • Published 21 days ago • 10

upvoted 3 papers about 1 month ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published Mar 4 • 40

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Paper • 2603.03241 • Published Mar 3 • 87

upvoted 2 papers about 2 months ago

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519