view article Article How We Built a Semantic Highlight Model To Save Token Cost for RAG zilliz • Jan 15 • 67
Running on CPU Upgrade Featured 3.17k The Smol Training Playbook 📚 3.17k The secrets to building world-class LLMs
view article Article Sentence Transformers is joining Hugging Face! tomaarsen • Oct 22, 2025 • 88
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models nvidia • Jul 18, 2025 • 51
Hf-native ColVision Models Collection Models that can be used with the native transformers 🤗 implementation instead of colpali-engine. • 4 items • Updated Jan 9 • 8
view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
view article Article Training and Finetuning Reranker Models with Sentence Transformers tomaarsen • Mar 26, 2025 • 193
Running 3.84k The Ultra-Scale Playbook 🌌 3.84k The ultimate guide to training LLM on large GPU Clusters
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 258
view article Article Self-Hosting LLaMA 3.1 70B (or any ~70B LLM) Affordably abhinand • Aug 20, 2024 • 27