LaSeR - a Keven16 Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Keven16 's Collections

LaSeR

updated Oct 17, 2025

Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding"

Keven16/ORZ-7B-LaSeR

8B • Updated Oct 15, 2025 • 4 • 1
Keven16/Qwen2.5-7B-LaSeR

8B • Updated Oct 15, 2025 • 2
Keven16/OctoThinker-3B-Short-LaSeR

4B • Updated Oct 15, 2025 • 2
Keven16/LaSeR_training_data

Viewer • Updated Oct 16, 2025 • 104k • 9 • 2
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs