3 21 4

Xiangdong Zhang

aHapBean

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

upvoted a paper 8 days ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

commentedon a paper 20 days ago

DreamWorld: Unified World Modeling in Video Generation

View all activity

Organizations

None yet

upvoted a paper 6 days ago

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Paper • 2603.19228 • Published 7 days ago • 64

upvoted a paper 8 days ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published 14 days ago • 13

upvoted a paper 21 days ago

DreamWorld: Unified World Modeling in Video Generation

Paper • 2603.00466 • Published 27 days ago • 16

upvoted a paper about 2 months ago

On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

Paper • 2602.00130 • Published Jan 28 • 3

upvoted 2 papers 3 months ago

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published Dec 30, 2025 • 52

Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

Paper • 2512.11792 • Published Dec 12, 2025 • 10

upvoted 3 papers 5 months ago

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published Oct 23, 2025 • 50

ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning

Paper • 2510.18250 • Published Oct 21, 2025 • 14

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15, 2025 • 58

upvoted a paper 6 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 110

upvoted a paper 7 months ago

Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views

Paper • 2509.01250 • Published Sep 1, 2025 • 2

upvoted a paper 9 months ago

IntFold: A Controllable Foundation Model for General and Specialized Biomolecular Structure Prediction

Paper • 2507.02025 • Published Jul 2, 2025 • 35

upvoted 6 papers 10 months ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10, 2025 • 108

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Paper • 2506.04633 • Published Jun 5, 2025 • 20

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Paper • 2505.23656 • Published May 29, 2025 • 25

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 132

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Paper • 2505.17399 • Published May 23, 2025 • 14

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13, 2025 • 42

upvoted a paper 11 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

upvoted a paper 12 months ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110

Xiangdong Zhang

AI & ML interests

Recent Activity

Organizations

aHapBean's activity