ldwang

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

upvoted a collection about 1 hour ago

Qwen3.6

liked a model about 5 hours ago

Qwen/Qwen3.5-9B

upvoted a paper 1 day ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Organizations

Collections 8

Papers 8

models 4

datasets 3

ldwang/lighteval-ceval-exam

Updated Nov 14, 2024 • 31

ldwang/OpenHermes-2.5-zh

Preview • Updated Sep 2, 2024 • 27 • 1

ldwang/lighteval-cmmlu

Updated Aug 13, 2024 • 19

Collections 8

Scaling test-time compute

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook

FineVision: Open Data is All You Need

OpenPipe/art-e-008

corbt/enron_emails_sample_questions

smolagents LLM leaderboard

models 4

ldwang/DeepScaleR-1.5B-Preview-Reproduce

ldwang/fasttext-oh-zh

ldwang/mamba-1.4b-aquila-400b-sft

ldwang/mamba-1.4b-aquila-400b
