Instruction & Reasoning Collection Datasets for instruction following, code, and reasoning. • 13 items • Updated 4 days ago • 7
Typhoon 2: A Family of Open Text and Multimodal Thai Large Language Models Paper • 2412.13702 • Published Dec 18, 2024 • 2
Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models Paper • 2601.18129 • Published Jan 26 • 11
Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition Paper • 2601.13044 • Published Jan 19 • 12
Typhoon OCR: Open Vision-Language Model For Thai Document Extraction Paper • 2601.14722 • Published Jan 21 • 15
OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value Paper • 2512.14051 • Published Dec 16, 2025 • 46
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 93
FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning Paper • 2506.16123 • Published Jun 19, 2025 • 8
Prior Prompt Engineering for Reinforcement Fine-Tuning Paper • 2505.14157 • Published May 20, 2025 • 7
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 19
European LLMs Collection Large language models for European languages (multilingual and monolingual) • 13 items • Updated May 26, 2024 • 3
Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging -- An Open Recipe Paper • 2502.09056 • Published Feb 13, 2025 • 31
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 100
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 288
Typhoon 2 Multimodal Collection Latest Official Multimodal ThaiLLM release by SCB 10X. • 3 items • Updated Jan 28 • 4