arxiv:2405.01573
Anmol Agarwal
anmolagarwal999
·
AI & ML interests
None yet
Organizations
models 307
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-30
0.5B • Updated • 2
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-20
0.5B • Updated • 2
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-10
0.5B • Updated • 1
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_560
Text Generation • 0.5B • Updated • 2
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_550
Text Generation • 0.5B • Updated
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_540
Text Generation • 0.5B • Updated
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_530
Text Generation • 0.5B • Updated • 1
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_520
Text Generation • 0.5B • Updated
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_510
Text Generation • 0.5B • Updated • 2
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_504
Text Generation • 0.5B • Updated • 1
datasets 9
anmolagarwal999/validation_countdown_sft_deepseek_qwen_distilled_32b_dataset_v2
Viewer • Updated • 4.37k • 8
anmolagarwal999/train_countdown_sft_deepseek_qwen_distilled_32b_dataset_v2
Viewer • Updated • 4.37k • 8
anmolagarwal999/qwq_rl_train_dataset_countdown_v2
Viewer • Updated • 4.37k • 9
anmolagarwal999/math_dataset_train_based_on_qwen_distilled_r1_32b
Viewer • Updated • 3.64k • 4
anmolagarwal999/math_dataset_test_based_on_gt_reasoning_trace
Viewer • Updated • 500 • 7
anmolagarwal999/math_dataset_train_based_on_gt_reasoning_trace
Viewer • Updated • 3.64k • 5
anmolagarwal999/qwq_rl_train_dataset_countdown
Viewer • Updated • 4.37k • 6
anmolagarwal999/validation_countdown_sft_deepseek_qwen_distilled_32b_dataset
Viewer • Updated • 440 • 5
anmolagarwal999/train_countdown_sft_deepseek_qwen_distilled_32b_dataset
Viewer • Updated • 2.72k • 5