Article TRL v1.0: Post-Training Library Built to Move with the Field +2 18 days ago • 49
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 16 days ago • 861
On the Optimal Reasoning Length for RL-Trained Language Models Paper • 2602.09591 • Published Feb 10 • 6
Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 309
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 186
OpenMathReasoning Collection Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 3 days ago • 47
Qwen2.5 Collection Qwen2.5 language models, both pretrained and instruction-tuned, in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 713
Asagi-VLM Collection Asagi is a Japanese Vision & Language model, trained on a large-scale synthetic dataset. • 4 items • Updated Nov 27, 2025 • 7