Dmitri’s papers
updated
ReLearn: Unlearning via Learning for Large Language Models
Paper
• 2502.11190
• Published
• 30
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse
Attention
Paper
• 2502.11089
• Published
• 168
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for
Multimodal Web Agents
Paper
• 2502.11357
• Published
• 11
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs
for Knowledge-Intensive Visual Grounding
Paper
• 2503.12797
• Published
• 32
Towards Self-Improving Systematic Cognition for Next-Generation
Foundation MLLMs
Paper
• 2503.12303
• Published
• 7
R1-VL: Learning to Reason with Multimodal Large Language Models via
Step-wise Group Relative Policy Optimization
Paper
• 2503.12937
• Published
• 30
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based
Scientific Research
Paper
• 2503.13399
• Published
• 22
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated
Agent Intelligence
Paper
• 2506.15677
• Published
• 23
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science
Reasoning
Paper
• 2507.16812
• Published
• 64
MeshLLM: Empowering Large Language Models to Progressively Understand
and Generate 3D Mesh
Paper
• 2508.01242
• Published
• 11
UI-Venus Technical Report: Building High-performance UI Agents with RFT
Paper
• 2508.10833
• Published
• 45
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task
Arithmetic
Paper
• 2509.01363
• Published
• 59