PARTAGES-dev's Collections
Qwen3+PDAPT+SLERP

updated 3 days ago

Experiments conducted for the LREC paper ()

  • PARTAGES-dev/Qwen3-8B-PDAPT-SLERP

    Text Generation • 8B • Updated Dec 3, 2025 • 238

  • PARTAGES-dev/Qwen3-4B-PDAPT-SLERP

    Text Generation • 4B • Updated Dec 3, 2025 • 40

  • Qwen/Qwen3-8B-Base

    Text Generation • 8B • Updated May 21, 2025 • 1.13M • 90

  • Qwen/Qwen3-4B-Base

    Text Generation • 4B • Updated Jul 26, 2025 • 991k • 82

  • Qwen/Qwen3-1.7B-Base

    Text Generation • 2B • Updated Jul 26, 2025 • 339k • 65

  • Qwen/Qwen3-0.6B-Base

    Text Generation • Updated Jul 26, 2025 • 223k • 154

  • PARTAGES-dev/Qwen3-1.7B-PDAPT-SLERP

    Text Generation • 2B • Updated 16 days ago • 13

  • PARTAGES-dev/Qwen3-0.6B-PDAPT-SLERP

    Text Generation • 0.8B • Updated Dec 4, 2025 • 14