InfiniteScienceGym: An Unbounded, Procedurally-Generated Benchmark for Scientific Analysis Paper • 2604.13201 • Published 3 days ago • 1
MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments Paper • 2604.13418 • Published 2 days ago • 5
SemaClaw: A Step Towards General-Purpose Personal AI Agents through Harness Engineering Paper • 2604.11548 • Published 4 days ago • 15
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space Paper • 2604.14142 • Published 2 days ago • 23
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models Paper • 2604.10866 • Published 4 days ago • 46
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments Paper • 2604.14144 • Published 2 days ago • 60
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 4 days ago • 95
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 2 days ago • 110
VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization Paper • 2604.12887 • Published 3 days ago • 4
Accelerating Speculative Decoding with Block Diffusion Draft Trees Paper • 2604.12989 • Published 3 days ago • 5
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation Paper • 2604.13010 • Published 3 days ago • 8
Rethinking the Diffusion Model from a Langevin Perspective Paper • 2604.10465 • Published 5 days ago • 12
Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective Paper • 2604.14025 • Published 2 days ago • 11
Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting Paper • 2604.12626 • Published 3 days ago • 13