PolyPythias
EleutherAI/pile-preshuffled-seeds
Updated • 325 • 1
Note: Training data information for each seed.
EleutherAI/pythia-14m-deduped
Text Generation • 39.2M • Updated • 69.4k • 28
EleutherAI/pythia-14m-seed1
Text Generation • Updated • 172
EleutherAI/pythia-14m-seed2
Text Generation • Updated • 230
EleutherAI/pythia-14m-seed3
Text Generation • Updated • 90
EleutherAI/pythia-14m-seed4
Text Generation • Updated • 89
EleutherAI/pythia-14m-seed5
Text Generation • Updated • 113
EleutherAI/pythia-14m-seed6
Text Generation • Updated • 99
EleutherAI/pythia-14m-seed7
Text Generation • Updated • 101
EleutherAI/pythia-14m-seed8
Text Generation • Updated • 99
EleutherAI/pythia-14m-seed9
Text Generation • Updated • 99
EleutherAI/pythia-31m-deduped
Text Generation • 55.7M • Updated • 3.59k • 5
EleutherAI/pythia-31m-seed1
Text Generation • Updated • 191
EleutherAI/pythia-31m-seed2
Text Generation • Updated • 191
EleutherAI/pythia-31m-seed3
Text Generation • Updated • 80
EleutherAI/pythia-31m-seed4
Text Generation • Updated • 74
EleutherAI/pythia-31m-seed5
Text Generation • Updated • 65
EleutherAI/pythia-31m-seed6
Text Generation • Updated • 65
EleutherAI/pythia-31m-seed7
Text Generation • Updated • 70
EleutherAI/pythia-31m-seed8
Text Generation • Updated • 67
EleutherAI/pythia-31m-seed9
Text Generation • Updated • 67
EleutherAI/pythia-70m
95.6M • Updated • 167k • 79
EleutherAI/pythia-70m-seed1
Text Generation • Updated • 996
EleutherAI/pythia-70m-seed2
Text Generation • Updated • 673
EleutherAI/pythia-70m-seed3
Text Generation • Updated • 595
EleutherAI/pythia-70m-seed4
Text Generation • Updated • 559
EleutherAI/pythia-70m-seed5
Text Generation • Updated • 546
EleutherAI/pythia-70m-seed6
Text Generation • Updated • 520
EleutherAI/pythia-70m-seed7
Text Generation • Updated • 551
EleutherAI/pythia-70m-seed8
Text Generation • Updated • 529
EleutherAI/pythia-70m-seed9
Text Generation • Updated • 514
EleutherAI/pythia-160m
Text Generation • Updated • 2.43M • 38
EleutherAI/pythia-160m-seed1
Text Generation • 0.2B • Updated • 1.4k
EleutherAI/pythia-160m-seed2
Text Generation • 0.2B • Updated • 1.3k
EleutherAI/pythia-160m-seed3
Text Generation • 0.2B • Updated • 1.19k
EleutherAI/pythia-160m-seed4
Text Generation • Updated • 901 • 1
EleutherAI/pythia-160m-seed5
Text Generation • Updated • 629
EleutherAI/pythia-160m-seed6
Text Generation • Updated • 608
EleutherAI/pythia-160m-seed7
Text Generation • Updated • 614
EleutherAI/pythia-160m-seed8
Text Generation • Updated • 603
EleutherAI/pythia-160m-seed9
Text Generation • Updated • 600
EleutherAI/pythia-410m
Text Generation • 0.5B • Updated • 88.8k • 36
EleutherAI/pythia-410m-seed1
Text Generation • Updated • 670
EleutherAI/pythia-410m-seed2
Text Generation • Updated • 1.33k
EleutherAI/pythia-410m-seed3
Text Generation • Updated • 621
EleutherAI/pythia-410m-seed4
Text Generation • Updated • 554
EleutherAI/pythia-410m-seed5
Text Generation • Updated • 542
EleutherAI/pythia-410m-seed6
Text Generation • Updated • 885 • 1
EleutherAI/pythia-410m-seed7
Text Generation • Updated • 582
EleutherAI/pythia-410m-seed8
Text Generation • Updated • 553
EleutherAI/pythia-410m-seed9
Text Generation • Updated • 745
EleutherAI/pythia-160m-data-seed1
Text Generation • Updated • 120
Note: Version in which the data-order and weight-initialization seeds are decoupled. Only the data seed differs from pythia-160m ("seed 0").
EleutherAI/pythia-160m-data-seed2
Text Generation • Updated • 103
Note: Version in which the data-order and weight-initialization seeds are decoupled. Only the data seed differs from pythia-160m ("seed 0").
EleutherAI/pythia-160m-data-seed3
Text Generation • Updated • 102
Note: Version in which the data-order and weight-initialization seeds are decoupled. Only the data seed differs from pythia-160m ("seed 0").
EleutherAI/pythia-160m-weight-seed1
Text Generation • Updated • 257
Note: Version in which the data-order and weight-initialization seeds are decoupled. Only the weight-initialization seed differs from pythia-160m ("seed 0").
EleutherAI/pythia-160m-weight-seed2
Text Generation • Updated • 207
Note: Version in which the data-order and weight-initialization seeds are decoupled. Only the weight-initialization seed differs from pythia-160m ("seed 0").
EleutherAI/pythia-160m-weight-seed3
Text Generation • Updated • 198
Note: Version in which the data-order and weight-initialization seeds are decoupled. Only the weight-initialization seed differs from pythia-160m ("seed 0").
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Paper • 2503.09543 • Published
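
The repository names in this collection follow a regular pattern, so they can be generated rather than typed by hand. The following is a minimal sketch in Python that assumes only the names as listed above; the helper names `seed_repo_ids` and `decoupled_repo_ids` are hypothetical and not part of any library. It does not cover the base ("seed 0") repositories, whose names are less uniform (some carry a `-deduped` suffix).

```python
from itertools import product

# Model sizes and seed ranges as they appear in the collection listing.
SIZES = ["14m", "31m", "70m", "160m", "410m"]
SEEDS = range(1, 10)            # seed1 .. seed9
DECOUPLED_SEEDS = range(1, 4)   # data-seed1..3 and weight-seed1..3


def seed_repo_ids(org="EleutherAI"):
    """Return the IDs of the per-seed runs for every model size (hypothetical helper)."""
    return [
        f"{org}/pythia-{size}-seed{seed}"
        for size, seed in product(SIZES, SEEDS)
    ]


def decoupled_repo_ids(org="EleutherAI", size="160m"):
    """Return the IDs of the runs where only the data seed or only the
    weight-initialization seed differs from the base model (hypothetical helper)."""
    return [
        f"{org}/pythia-{size}-{kind}-seed{seed}"
        for kind in ("data", "weight")
        for seed in DECOUPLED_SEEDS
    ]


if __name__ == "__main__":
    for repo_id in seed_repo_ids() + decoupled_repo_ids():
        print(repo_id)
```

Each resulting string can then be passed to whatever loader you use for Hub checkpoints, for example `AutoModelForCausalLM.from_pretrained(repo_id)` from the `transformers` library, assuming the checkpoints are stored in a format it supports.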