arxiv:2511.15248
Kai Yang
yangkaiSIGS
AI & ML interests
None yet
Recent Activity
upvoted a paper about 14 hours ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models upvoted a paper about 14 hours ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper 27 days ago
ProAct: Agentic Lookahead in Interactive Environments