LLMs
updated
Ziya2: Data-centric Learning is All LLMs Need
Paper
• 2311.03301
• Published
• 20
Co-training and Co-distillation for Quality Improvement and Compression
of Language Models
Paper
• 2311.02849
• Published
• 8
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning
Paper
• 2311.02303
• Published
• 12
ADaPT: As-Needed Decomposition and Planning with Language Models
Paper
• 2311.05772
• Published
• 12
Prompt Engineering a Prompt Engineer
Paper
• 2311.05661
• Published
• 23
FinGPT: Large Generative Models for a Small Language
Paper
• 2311.05640
• Published
• 30
Language Models can be Logical Solvers
Paper
• 2311.06158
• Published
• 20
Lumos: Learning Agents with Unified Data, Modular Design, and
Open-Source LLMs
Paper
• 2311.05657
• Published
• 30
Exponentially Faster Language Modelling
Paper
• 2311.10770
• Published
• 119
SparQ Attention: Bandwidth-Efficient LLM Inference
Paper
• 2312.04985
• Published
• 40
PathFinder: Guided Search over Multi-Step Reasoning Paths
Paper
• 2312.05180
• Published
• 10
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language
Models with 3D Parallelism
Paper
• 2312.04916
• Published
• 7