TiDAR: Think in Diffusion, Talk in Autoregression Paper • 2511.08923 • Published Nov 12, 2025 • 128
Diffusion In Diffusion: Reclaiming Global Coherence in Semi-Autoregressive Diffusion Paper • 2601.13599 • Published Jan 20 • 7
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 3.03k
The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLM on large GPU Clusters • 3.72k
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • Updated Apr 13, 2025 • 5.92k • 2.06k