Submitted by akhaliq 134 MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases · 12 authors 1.41k 13
Submitted by akhaliq 20 ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition · 4 authors 85 6
Submitted by akhaliq 19 Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models · 3 authors 56 6
Submitted by akhaliq 17 AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning · 16 authors 3
Submitted by akhaliq 15 API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs · 10 authors 22 3
Submitted by akhaliq 14 Seamless Human Motion Composition with Blended Positional Encodings · 3 authors 271 1