Mano: Restriking Manifold Optimization for LLM Training
Paper • 2601.23000 • Published • 3
None defined yet.
Optimizing Few-Step Generation with Adaptive Matching Distillation
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers