Marwa El Kamil's picture

Open to Collab

Marwa El Kamil

maghwa

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

Qwen/Qwen3.6-35B-A3B

liked a Space about 1 month ago

abdeljalilELmajjodi/moroccan_darija_asr_leaderboard

liked a dataset 3 months ago

abdeljalilELmajjodi/Lhadith_dataset

View all activity

Organizations

upvoted a collection about 1 year ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 736

upvoted 2 collections over 1 year ago

PaliGemma FT Models

108 items • Updated Mar 12 • 35

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 48

upvoted 2 articles over 1 year ago

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

omarkamali

•

Dec 8, 2024

• 23

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

+1

aaditya, pminervini, clefourrier

•

Apr 19, 2024

• 198

upvoted a collection over 1 year ago

Arabic Aya DPO Datasets

Our synthetic DPO datasets for Arabic Aya. • 5 items • Updated Jun 4, 2024 • 4

upvoted a paper almost 2 years ago

101 Billion Arabic Words Dataset

Paper • 2405.01590 • Published Apr 29, 2024 • 6

upvoted an article almost 2 years ago

Article

Tokenization Is A Dead Weight (Tokun Part 1)

apehex

•

Jun 27, 2024

• 18

upvoted 2 papers almost 2 years ago

Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published Jun 17, 2024 • 16

CroissantLLM: A Truly Bilingual French-English Language Model

Paper • 2402.00786 • Published Feb 1, 2024 • 26

upvoted an article almost 2 years ago

Article

🥐CroissantLLM: A Truly Bilingual French-English Language Model

manu

•

Feb 5, 2024

• 15

upvoted a collection almost 2 years ago

FrenchBench Evaluation datasets

These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper) • 11 items • Updated Jun 7, 2024 • 8

upvoted an article almost 2 years ago

Article

Introducing the Open Arabic LLM Leaderboard

+3

alielfilali01, Hamza-Alobeidli, rcojocaru, basma-b, clefourrier

•

May 14, 2024

• 103

upvoted a paper almost 2 years ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 43

upvoted an article almost 2 years ago

Article

Let's talk about LLM evaluation

clefourrier

•

May 23, 2024

• 209

upvoted an article about 2 years ago

Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

+3

asoria, tdoehmen, senwu, lorr, vpm238

•

Apr 4, 2024

• 29

upvoted a paper about 2 years ago

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published Apr 17, 2024 • 46

upvoted 2 papers over 2 years ago

BloombergGPT: A Large Language Model for Finance

Paper • 2303.17564 • Published Mar 30, 2023 • 32

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 58