How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities Paper • 2603.02578 • Published 8 days ago • 22
Article Making LLMs Truly Remember You | LightMem: Lightweight and Efficient Memory-Augmented Generation 10 days ago • 3
Article Create, Evaluate, and Connect AI Skills | SkillNet: A Large-Scale Agentic "Skill Graph" Knowledge Base 10 days ago • 12
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem Paper • 2602.14367 • Published 23 days ago • 17
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces Paper • 2602.11683 • Published 26 days ago • 8
From Data to Behavior: Predicting Unintended Model Behaviors Before Training Paper • 2602.04735 • Published Feb 4 • 15
Aligning Agentic World Models via Knowledgeable Experience Learning Paper • 2601.13247 • Published Jan 19 • 15
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Paper • 2601.05905 • Published Jan 9 • 20