Preference Datasets for DPO Collection This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 48
Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare +1 aaditya, pminervini, clefourrier • Apr 19, 2024 • 198
Arabic Aya DPO Datasets Collection Our synthetic DPO datasets for Arabic Aya. • 5 items • Updated Jun 4, 2024 • 4
Tokenization Falling Short: The Curse of Tokenization Paper • 2406.11687 • Published Jun 17, 2024 • 16
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26
Article 🥐CroissantLLM: A Truly Bilingual French-English Language Model manu • Feb 5, 2024 • 15
FrenchBench Evaluation datasets Collection These datasets are used to evaluate models' French performance with https://github.com/EleutherAI/lm-evaluation-harness (from the CroissantLLM paper) • 11 items • Updated Jun 7, 2024 • 8
Article Introducing the Open Arabic LLM Leaderboard +3 alielfilali01, Hamza-Alobeidli, rcojocaru, basma-b, clefourrier • May 14, 2024 • 103
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 43
Article Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B +3 asoria, tdoehmen, senwu, lorr, vpm238 • Apr 4, 2024 • 29