LLM-Papers
updated
PDFTriage: Question Answering over Long, Structured Documents
Paper
• 2309.08872
• Published
• 55
Adapting Large Language Models via Reading Comprehension
Paper
• 2309.09530
• Published
• 82
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper
• 2310.09263
• Published
• 40
Context-Aware Meta-Learning
Paper
• 2310.10971
• Published
• 17
Data-Centric Financial Large Language Models
Paper
• 2310.17784
• Published
• 15
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language
Modeling Likewise
Paper
• 2310.19019
• Published
• 9
Contrastive Chain-of-Thought Prompting
Paper
• 2311.09277
• Published
• 35
Orca 2: Teaching Small Language Models How to Reason
Paper
• 2311.11045
• Published
• 77
Context Tuning for Retrieval Augmented Generation
Paper
• 2312.05708
• Published
• 16
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
• 2312.15166
• Published
• 61
Improving Text Embeddings with Large Language Models
Paper
• 2401.00368
• Published
• 82
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
• 2401.00908
• Published
• 189
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table
Understanding
Paper
• 2401.04398
• Published
• 25
ReFT: Reasoning with Reinforced Fine-Tuning
Paper
• 2401.08967
• Published
• 31
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper
• 2401.15024
• Published
• 73
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper
• 2402.05140
• Published
• 23
AutoMathText: Autonomous Data Selection with Language Models for
Mathematical Texts
Paper
• 2402.07625
• Published
• 16
How to Train Data-Efficient LLMs
Paper
• 2402.09668
• Published
• 43
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
• 2402.10986
• Published
• 81
Knowledge Fusion of Large Language Models
Paper
• 2401.10491
• Published
• 5
SaulLM-7B: A pioneering Large Language Model for Law
Paper
• 2403.03883
• Published
• 90
RAFT: Adapting Language Model to Domain Specific RAG
Paper
• 2403.10131
• Published
• 72
TnT-LLM: Text Mining at Scale with Large Language Models
Paper
• 2403.12173
• Published
• 20
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Paper
• 2403.13372
• Published
• 179
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small
Reference Models
Paper
• 2405.20541
• Published
• 24
Towards a Personal Health Large Language Model
Paper
• 2406.06474
• Published
• 22
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Paper
• 2406.09170
• Published
• 27
Instruction Pre-Training: Language Models are Supervised Multitask
Learners
Paper
• 2406.14491
• Published
• 96
The FineWeb Datasets: Decanting the Web for the Finest Text Data at
Scale
Paper
• 2406.17557
• Published
• 100
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
Generation
Paper
• 2406.19215
• Published
• 32
Show Less, Instruct More: Enriching Prompts with Definitions and
Guidelines for Zero-Shot NER
Paper
• 2407.01272
• Published
• 8
LETS-C: Leveraging Language Embedding for Time Series Classification
Paper
• 2407.06533
• Published
• 2
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Paper
• 2407.09413
• Published
• 11
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Paper
• 2407.12854
• Published
• 31
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Paper
• 2407.18961
• Published
• 40
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal
Domain
Paper
• 2407.19584
• Published
• 66
Visual Riddles: a Commonsense and World Knowledge Challenge for Large
Vision and Language Models
Paper
• 2407.19474
• Published
• 23
Self-Training with Direct Preference Optimization Improves
Chain-of-Thought Reasoning
Paper
• 2407.18248
• Published
• 33
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Paper
• 2407.20183
• Published
• 43
LAMBDA: A Large Model Based Data Agent
Paper
• 2407.17535
• Published
• 37
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Paper
• 2407.15017
• Published
• 34
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
• 2407.03502
• Published
• 51
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Paper
• 2408.14717
• Published
• 26
Foundation Models for Music: A Survey
Paper
• 2408.14340
• Published
• 44
Efficient Detection of Toxic Prompts in Large Language Models
Paper
• 2408.11727
• Published
• 13
Sapiens: Foundation for Human Vision Models
Paper
• 2408.12569
• Published
• 94
Controllable Text Generation for Large Language Models: A Survey
Paper
• 2408.12599
• Published
• 65
TableBench: A Comprehensive and Complex Benchmark for Table Question
Answering
Paper
• 2408.09174
• Published
• 52
OLMoE: Open Mixture-of-Experts Language Models
Paper
• 2409.02060
• Published
• 80
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Paper
• 2409.05840
• Published
• 49
Towards a Unified View of Preference Learning for Large Language Models:
A Survey
Paper
• 2409.02795
• Published
• 72
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like
Language Models
Paper
• 2409.11136
• Published
• 22
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published
• 140
A Controlled Study on Long Context Extension and Generalization in LLMs
Paper
• 2409.12181
• Published
• 45
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic
reasoning
Paper
• 2409.12183
• Published
• 39
A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language
Models: An Experimental Analysis up to 405B
Paper
• 2409.11055
• Published
• 17
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector
Retrieval
Paper
• 2409.10516
• Published
• 43
Guiding Vision-Language Model Selection for Visual Question-Answering
Across Tasks, Domains, and Knowledge Types
Paper
• 2409.09269
• Published
• 8
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
Paper
• 2409.15277
• Published
• 38
HelloBench: Evaluating Long Text Generation Capabilities of Large
Language Models
Paper
• 2409.16191
• Published
• 41
Boosting Healthcare LLMs Through Retrieved Context
Paper
• 2409.15127
• Published
• 19
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Paper
• 2409.17481
• Published
• 47
Instruction Following without Instruction Tuning
Paper
• 2409.14254
• Published
• 29
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Paper
• 2409.20566
• Published
• 55
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper
• 2410.00531
• Published
• 33
Law of the Weakest Link: Cross Capabilities of Large Language Models
Paper
• 2409.19951
• Published
• 54
Embodied-RAG: General non-parametric Embodied Memory for Retrieval and
Generation
Paper
• 2409.18313
• Published
• 3
Not All LLM Reasoners Are Created Equal
Paper
• 2410.01748
• Published
• 29
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal
Foundation Models
Paper
• 2410.02740
• Published
• 54
Video Instruction Tuning With Synthetic Data
Paper
• 2410.02713
• Published
• 41
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Paper
• 2410.03017
• Published
• 29
LLMs Know More Than They Show: On the Intrinsic Representation of LLM
Hallucinations
Paper
• 2410.02707
• Published
• 47
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for
Data-Driven Scientific Discovery
Paper
• 2410.05080
• Published
• 21
Personalized Visual Instruction Tuning
Paper
• 2410.07113
• Published
• 70
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for
Enhanced Following of Instructions with Multiple Constraints
Paper
• 2410.06458
• Published
• 8
MLE-bench: Evaluating Machine Learning Agents on Machine Learning
Engineering
Paper
• 2410.07095
• Published
• 8
MathCoder2: Better Math Reasoning from Continued Pretraining on
Model-translated Mathematical Code
Paper
• 2410.08196
• Published
• 48
MLLM as Retriever: Interactively Learning Multimodal Retrieval for
Embodied Agents
Paper
• 2410.03450
• Published
• 36
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge
with Curriculum Preference Learning
Paper
• 2410.06508
• Published
• 11
Vector-ICL: In-context Learning with Continuous Vector Representations
Paper
• 2410.05629
• Published
• 4
Baichuan-Omni Technical Report
Paper
• 2410.08565
• Published
• 87
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via
Inference-time Hybrid Information Structurization
Paper
• 2410.08815
• Published
• 47
From Generalist to Specialist: Adapting Vision Language Models via
Task-Specific Visual Instruction Tuning
Paper
• 2410.06456
• Published
• 37
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Paper
• 2410.10563
• Published
• 37
Thinking LLMs: General Instruction Following with Thought Generation
Paper
• 2410.10630
• Published
• 20
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Paper
• 2410.11096
• Published
• 13
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language
Models
Paper
• 2410.13085
• Published
• 24
WorldCuisines: A Massive-Scale Benchmark for Multilingual and
Multicultural Visual Question Answering on Global Cuisines
Paper
• 2410.12705
• Published
• 32
TransAgent: Transfer Vision-Language Foundation Models with
Heterogeneous Agent Collaboration
Paper
• 2410.12183
• Published
• 4
UCFE: A User-Centric Financial Expertise Benchmark for Large Language
Models
Paper
• 2410.14059
• Published
• 63
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial
Samples
Paper
• 2410.14669
• Published
• 39
Improve Vision Language Model Chain-of-thought Reasoning
Paper
• 2410.16198
• Published
• 26
MiniPLM: Knowledge Distillation for Pre-Training Language Models
Paper
• 2410.17215
• Published
• 16
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Paper
• 2410.19168
• Published
• 24
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum
Reinforcement Learning
Paper
• 2411.02337
• Published
• 36
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge
in RAG Systems
Paper
• 2411.02959
• Published
• 71
Self-Consistency Preference Optimization
Paper
• 2411.04109
• Published
• 19
From Generation to Judgment: Opportunities and Challenges of
LLM-as-a-judge
Paper
• 2411.16594
• Published
• 39
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A
Comprehensive Multimodal Dataset Towards General Medical AI
Paper
• 2411.14522
• Published
• 38
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented
LMs
Paper
• 2411.14199
• Published
• 34
RedPajama: an Open Dataset for Training Large Language Models
Paper
• 2411.12372
• Published
• 57
Stronger Models are NOT Stronger Teachers for Instruction Tuning
Paper
• 2411.07133
• Published
• 38
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context
Learning via MCTS
Paper
• 2411.18478
• Published
• 37
NVILA: Efficient Frontier Visual Language Models
Paper
• 2412.04468
• Published
• 60
Note LLM Efficient
Efficiently Serving LLM Reasoning Programs with Certaindex
Paper
• 2412.20993
• Published
• 36
Note LLM Efficient
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction
System
Paper
• 2412.20005
• Published
• 17
Note Knowledge Base
MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation
Models
Paper
• 2501.00316
• Published
• 23
Note Geospatial
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta
Chain-of-Though
Paper
• 2501.04682
• Published
• 99
Note Reasoning
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper
• 2501.05366
• Published
• 102
A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction
Following
Paper
• 2501.08187
• Published
• 27
A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning
Instructions
Paper
• 2412.08864
• Published
• 2
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Paper
• 2501.09686
• Published
• 41
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and
Understanding
Paper
• 2501.18362
• Published
• 23
Large Language Models Think Too Fast To Explore Effectively
Paper
• 2501.18009
• Published
• 23
Teaching Language Models to Critique via Reinforcement Learning
Paper
• 2502.03492
• Published
• 24
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
Paper
• 2502.06772
• Published
• 21
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of
Language Models
Paper
• 2502.04404
• Published
• 25
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking
Paper
• 2502.02339
• Published
• 23
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance
Paper
• 2502.08127
• Published
• 59
Distillation Scaling Laws
Paper
• 2502.08606
• Published
• 47
Selective Self-to-Supervised Fine-Tuning for Generalization in Large
Language Models
Paper
• 2502.08130
• Published
• 9
InductionBench: LLMs Fail in the Simplest Complexity Class
Paper
• 2502.15823
• Published
• 7
Large-Scale Data Selection for Instruction Tuning
Paper
• 2503.01807
• Published
• 14
DeepSolution: Boosting Complex Engineering Solution Design via
Tree-based Exploration and Bi-point Thinking
Paper
• 2502.20730
• Published
• 38
TeleRAG: Efficient Retrieval-Augmented Generation Inference with
Lookahead Retrieval
Paper
• 2502.20969
• Published
• 11
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic
Iterative Reasoning Agents
Paper
• 2502.18017
• Published
• 21
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive
Cognitive-Inspired Sketching
Paper
• 2503.05179
• Published
• 46
BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation
for Everyday Household Activities
Paper
• 2503.05652
• Published
• 11
Genius: A Generalizable and Purely Unsupervised Self-Training Framework
For Advanced Reasoning
Paper
• 2504.08672
• Published
• 55
DataLab: A Unifed Platform for LLM-Powered Business Intelligence
Paper
• 2412.02205
• Published