Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
biki96 's Collections
image-text-to-video
I2I
Face Swap
Embedding
A2A
TTS
Text2Image
LLM
IT3D
OCR
I2V
STT
diffusion

STT

updated Feb 5
Upvote
-

  • Running on CPU Upgrade
    Featured
    1.3k

    Open ASR Leaderboard

    🏆
    1.3k

    Explore speech model benchmarks and request new evaluations


  • nvidia/canary-qwen-2.5b

    Automatic Speech Recognition • 3B • Updated Dec 15, 2025 • 133k • 408

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • Updated Nov 27, 2025 • 272k • 761

  • nvidia/parakeet-tdt-0.6b-v2

    Automatic Speech Recognition • Updated Nov 27, 2025 • 169k • 1.45k

  • stabilityai/stable-video-diffusion-img2vid

    Image-to-Video • Updated Jul 10, 2024 • 51k • 1.02k

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • 1B • Updated 12 days ago • 300 • 346

  • mistralai/Voxtral-Mini-4B-Realtime-2602

    Automatic Speech Recognition • 4B • Updated 28 days ago • 888k • 800
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs