Resources for hybrid preferences research, where we learn how to route preference instances to human vs. AI feedback
- allenai/multipref (dataset)
- ljvmiranda921/multipref (dataset)
- Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback (paper, arXiv:2410.19133)
- allenai/Llama-3-8B-Instruct-Analyzer (text generation model)