Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 21 days ago • 66
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 501
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models Paper • 2602.04649 • Published Feb 4 • 12
Jamba2 Collection Jamba2 is a highly-efficient open source family of language models built for maximum reliability and steerability in the enterprise. • 3 items • Updated Jan 8 • 5
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 30 items • Updated 1 day ago • 123
i3-Series Collection Note: The models are listed in the default order set by Hugging Face, so the latest model appears at the botSeries • 6 items • Updated Mar 2 • 2
MemMamba: Rethinking Memory Patterns in State Space Model Paper • 2510.03279 • Published Sep 28, 2025 • 74
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 513
Hugging Face Changelog Repositories total file size is now displayed Sep 18, 2025 • 175
Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models Paper • 2507.17702 • Published Jul 23, 2025 • 6
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated 7 days ago • 216