Yang Yang's picture

19

Yang Yang

yangyang14641

·

yangyang14641

AI & ML interests

None yet

Organizations

None yet

liked a model 9 months ago

mistralai/Mistral-Small-3.2-24B-Instruct-2506

Updated Dec 22, 2025 • 434k • 570

liked a Space about 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked 17 models about 1 year ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 1.43M • • 13.1k

hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 8.99M • • 5.82k

deepseek-ai/DeepSeek-Coder-V2-Instruct

Text Generation • 236B • Updated Aug 21, 2024 • 7.4k • 682

deepseek-ai/DeepSeek-Coder-V2-Lite-Base

Text Generation • 16B • Updated Jul 3, 2024 • 2.71k • 105

deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct

Text Generation • 16B • Updated Jul 3, 2024 • 267k • • 563

deepseek-ai/DeepSeek-Coder-V2-Base

Text Generation • 236B • Updated Jul 3, 2024 • 238 • 81

deepseek-ai/DeepSeek-V3

Text Generation • Updated Mar 27, 2025 • 979k • • 4.01k

deepseek-ai/DeepSeek-V3-Base

Updated Mar 27, 2025 • 16.5k • 1.68k

nvidia/OpenMath2-Llama3.1-70B

Text Generation • 71B • Updated Nov 25, 2024 • 63 • 21

nvidia/OpenMath2-Llama3.1-8B

Text Generation • 8B • Updated Nov 25, 2024 • 5.46k • • 32

nvidia/OpenMath2-Llama3.1-70B-nemo

Updated Nov 25, 2024 • 9

nvidia/OpenMath2-Llama3.1-8B-nemo

Updated Nov 25, 2024 • 6

TheBloke/Llama-2-7B-Chat-GGUF

Text Generation • 7B • Updated Oct 14, 2023 • 156k • 513

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • 71B • Updated Apr 13, 2025 • 11.4k • 2.06k

foduucom/stockmarket-pattern-detection-yolov8

Object Detection • Updated Apr 2, 2025 • 19.9k • 396

meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 580k • • 2.67k

openai/whisper-large-v3-turbo

Automatic Speech Recognition • Updated Oct 4, 2024 • 4.76M • • 2.86k