MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published 3 days ago • 74
LLM Training Datasets Collection A collection of datasets for training LLMs. • 126 items • Updated Jan 26 • 30
view article Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo Dec 23, 2024 • 51
On the Multi-turn Instruction Following for Conversational Web Agents Paper • 2402.15057 • Published Feb 23, 2024 • 1
Ask-before-Plan: Proactive Language Agents for Real-World Planning Paper • 2406.12639 • Published Jun 18, 2024 • 1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Paper • 2503.19950 • Published Mar 25, 2025 • 12
Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models Paper • 2503.17811 • Published Mar 22, 2025 • 13