Inference Providers
Active filters: fp8
RedHatAI/Meta-Llama-3-70B-Instruct-FP8
Text Generation
• 71B • Updated • 1.45k
• 13
comaniac/Meta-Llama-3-70B-Instruct-FP8-v1
Text Generation
• 71B • Updated • 9
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v1
Text Generation
• 47B • Updated • 6
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v2
Text Generation
• 47B • Updated • 7
Skywork/Skywork-MoE-Base-FP8
Text Generation
• 146B • Updated • 50
• 7
RedHatAI/Qwen2-72B-Instruct-FP8
Text Generation
• 73B • Updated • 1.27k
• • 15
comaniac/Meta-Llama-3-70B-Instruct-FP8-v2
Text Generation
• 71B • Updated • 5
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v3
Text Generation
• 47B • Updated • 5
comaniac/Mixtral-8x22B-Instruct-v0.1-FP8-v2
Text Generation
• 141B • Updated • 3
RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8
Text Generation
• 141B • Updated • 38
• 3
Text Generation
• 8B • Updated • 10
RedHatAI/Qwen2-0.5B-Instruct-FP8
Text Generation
• 0.5B • Updated • 772
• • 4
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
• 2B • Updated • 55.2k
• RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
• 8B • Updated • 3.81k
• • 2
anyisalin/L3-70B-Euryale-v2.1-FP8
Text Generation
• 71B • Updated • 7
yentinglin/Llama-3-Taiwan-70B-Instruct-FP8
Text Generation
• 71B • Updated • 14
kuotient/llama3-instrucTrans-enko-8b-FP8
Text Generation
• 8B • Updated • 8
• 2
FlorianJc/Hermes-2-Pro-Mistral-7B-vllm-fp8
Text Generation
• 7B • Updated • 4
FlorianJc/openchat-3.6-8b-20240522-vllm-fp8
Text Generation
• 8B • Updated • 3
FlorianJc/Llama3-ChatQA-1.5-8B-vllm-fp8
Text Generation
• 8B • Updated • 11
TechxGenus/Codestral-22B-v0.1-FP8
Text Generation
• 22B • Updated • 128
Model-SafeTensors/Meta-Llama-3-70B-FP8-Dynamic
Text Generation
• 71B • Updated • 7
Model-SafeTensors/Qwen-Qwen2-72B-FP8-Dynamic
Text Generation
• 73B • Updated • 7
RedHatAI/Meta-Llama-3-70B-Instruct-FP8-KV
Text Generation
• 71B • Updated • 14
• 3
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
• 7B • Updated • 1.52k
• 3
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
• 7B • Updated • 72
RedHatAI/Phi-3-mini-128k-instruct-FP8
Text Generation
• 4B • Updated • 67
RedHatAI/Phi-3-medium-128k-instruct-FP8
Text Generation
• 14B • Updated • 199
• 5
Rallio67/llama-3-70B-actions-FP8
Text Generation
• 71B • Updated • 6
FlorianJc/google-gemma-2-9b-it-vllm-fp8
Text Generation
• 9B • Updated • 12
• 1