Inference Providers
Active filters: exl3
Jellon/Lyra-Gutenberg-12b-exl3-4bpw
Text Generation
• Updated • 1
Volko76/Llama-3.2-1B-Instruct-exl3
Volko76/Qwen2.5-Coder-0.5B-Instruct-exl3
Text Generation
• Updated • 6
• 1
Volko76/Lucie-7B-Instruct-abliterated-exl3
async0x42/cogito-v1-preview-qwen-32B-exl3_4.5bpw
Text Generation
• 10B • Updated • 2
async0x42/DeepCoder-1.5B-Preview-exl3_4.5bpw
Text Generation
• Updated • 1
async0x42/QwQ-32B-Snowdrop-v0-exl3_4.5bpw
Text Generation
• 10B • Updated • 3
async0x42/QwQ-32B-ArliAI-RpR-v1-exl3_4.5bpw
10B • Updated • 4
• 1
Jellon/Pantheon-RP-1.8-24b-Small-3.1-exl3-4bpw
6B • Updated • 4
• 1
Jellon/Pantheon-RP-1.8-24b-Small-3.1-exl3-3bpw
5B • Updated • 1
• 1
kaitchup/Llama-3.3-70B-Instruct-exl3-1.75bpw
Text Generation
• 9B • Updated • 5
kaitchup/Llama-3.3-70B-Instruct-exl3-4.0bpw
Text Generation
• 19B • Updated • 3
• 1
Text Generation
• 9B • Updated • 6
• 2
ThijsL202/Pantheon-RP-1.8-24b-Small-3.1-exl3-6.0bpw
9B • Updated • 7
turboderp/Llama-3.1-Nemotron-Ultra-253B-v1-exl3
turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3
Updated • 14
• 17
ArtusDev/nvidia_Llama-3_1-Nemotron-Ultra-253B-v1_EXL3_1.35bpw_H6
Text Generation
• 24B • Updated • 5
MikeRoz/Mistral-Large-Instruct-2407-exl3
Updated • 13
• 2
gghfez/L3.3-San-Mai-R1-70b-exl3-2.1bpw
Text Generation
• 10B • Updated • 1
MikeRoz/Electranova-70B-v1.0-exl3
Updated
isogen/reka-flash-3-exl3-3bpw
5B • Updated • 2
LatentWanderer/THUDM_GLM-4-32B-0414-6.5bpw-h8-exl3
Text Generation
• 14B • Updated • 1
isogen/reka-flash-3-exl3-4bpw
6B • Updated • 4
LatentWanderer/THUDM_GLM-4-9B-0414-6.5bpw-h8-exl3
Text Generation
• Updated • 2
isogen/Mistral-Nemo-Instruct-2407-exl3-6bpw
5B • Updated isogen/Mistral-Nemo-Instruct-2407-exl3-4bpw
turboderp/c4ai-command-r-08-2024-exl3
Updated • 6
• 6
Panchovix/Llama-3_1-Nemotron-Ultra-253B-v1-3.6bpw-h6-exl3
Text Generation
• 59B • Updated • 4
Panchovix/Llama-3_1-Nemotron-Ultra-253B-v1-3.25bpw-h6-exl3
Text Generation
• 54B • Updated • 3
turboderp/c4ai-command-r7b-12-2024-exl3
Updated • 2
• 1