DCAgent2/terminal_bench_2_exp_psu_stackoverflow_10K_glm_4_7_traces_20260311_170344 Updated about 1 hour ago
DCAgent2/swebench_verified_random_100_folders_Kimi_K2T_neulab_agenttuning_webshop_sandbod07c3d59 Updated about 2 hours ago
DCAgent2/swebench_verified_random_100_folders_Kimi_K2T_neulab_agenttuning_kg_sandboxes_me5f27cd1 Updated about 2 hours ago
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_316_glm_4_7_traces_20260311_170339 Updated about 2 hours ago
DCAgent2/medagentbench_laion_r2egym-nl2bash-stack-bugsseq Viewer • Updated about 3 hours ago • 899 • 9
laion/rl_r2egym-nl2bash-stack-bugsseq-fixthink-again_lr1e-5_curriculum-medium 8B • Updated about 5 hours ago
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_1_0_traces_20260311_010108 Updated about 8 hours ago