Inference Providers
Active filters: awq
QuantTrio/Qwen3.6-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 30.4k
• 11
Brooooooklyn/Qwen3.6-35B-A3B-UD-Q4_K_XL-mlx
Text Generation
• 7B • Updated • 1.39k
• 5
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 387k
• 43
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 92.7k
• 6
Qwen/Qwen2.5-14B-Instruct-AWQ
Text Generation
• 15B • Updated • 1.84M
• 33
mratsim/MiniMax-M2.5-FP8-INT4-AWQ
Text Generation
• 39B • Updated • 9k
• 21
QuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ
Image-Text-to-Text
• 28B • Updated • 54k
• 12
QuantTrio/Qwopus3.5-27B-v3-AWQ
Image-Text-to-Text
• 27B • Updated • 22.7k
• 9
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation
• Updated • 152k
• 109
Qwen/Qwen2.5-Coder-7B-Instruct-AWQ
Text Generation
• 8B • Updated • 257k
• 22
Qwen/Qwen2.5-Coder-32B-Instruct-AWQ
Text Generation
• 33B • Updated • 713k
• 35
casperhansen/llama-3.3-70b-instruct-awq
Text Generation
• 71B • Updated • 273k
• 42
kaitchup/QwQ-32B-AWQ-4bit
Text Generation
• 33B • Updated • 246
• 3
Text Generation
• 8B • Updated • 3
• 1
stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ
Text Generation
• 8B • Updated • 596
• 6
qdzzzxc/RuadaptQwen3-32B-Instruct-AWQ
33B • Updated • 722
• 3
sionic-ai/bge-reasoner-embed-qwen3-8b-0923-AWQ-4bit
Text Ranking
• 8B • Updated • 12
• 6
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 712k
• 42
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
• 31B • Updated • 88.6k
• 12
openbmb/MiniCPM-o-4_5-awq
Any-to-Any
• 9B • Updated • 2.65k
• 19
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 75.7k
• 25
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated • 89.9k
• 15
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 152k
• 18
Text Generation
• 586B • Updated • 5.08k
• 6
Image-Text-to-Text
• 5B • Updated • 43.9k
• 8
Brooooooklyn/Qwen3.5-35B-A3B-UD-Q8_K_XL-mlx
Text Generation
• 10B • Updated • 428
• 3
Brooooooklyn/Qwen3.5-9B-UD-Q8_K_XL-mlx
Text Generation
• 3B • Updated • 505
• 1
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 12.7k
• 7
alonsoko/gemma-4-31b-it-abliterated-heretic-ara-AWQ
Image-Text-to-Text
• 32B • Updated • 2.24k
• 2
groxaxo/gemma4-31b-abliterated-multimodal-awq8
Image-Text-to-Text
• 34B • Updated • 489
• 3