Gene Library Courses Download Pricing Contact Sign in
Hugging Face

Hugging Face

North America Est. 2016 Free tier for public models; pay-per-compute for Inference Endpoints; enterprise contracts.
model-hubopen-source-platforminference-providercommunity

The 'GitHub of AI' — an open platform hosting 500,000+ models, 100,000+ datasets, and Spaces for demos.

What They Do

Hugging Face hosts the world's largest public repository of machine-learning models and datasets, anchored by the Transformers library used by millions of researchers. Its Inference API lets developers call models over HTTPS without managing GPU infrastructure.

Mission

Democratise good machine learning for researchers and practitioners everywhere.

Available Models

ModelFamilyContextInput /MOutput /M
Andycurrent/Gemma-3-1B-it-GLM-4.7-Flash-Heretic-Uncensored-Thinking_GGUF
Doradus-AI/RnJ-1-Instruct-FP8
Dream-org/Dream-v0-Instruct-7B
EleutherAI/gpt-neo-125m
EleutherAI/gpt-neo-2.7B
EleutherAI/gpt-neox-20b
EleutherAI/pythia-160m
EleutherAI/pythia-70m-deduped
GSAI-ML/LLaDA-8B-Instruct
HuggingFaceTB/SmolLM2-135M
HuggingFaceTB/SmolLM2-135M-Instruct
HuggingFaceTB/SmolLM3-3B
IlyaGusev/saiga_llama3_8b
KomeijiForce/bart-large-emojilm
Maykeye/TinyLLama-v0
MiniMaxAI/MiniMax-M2.5
MiniMaxAI/MiniMax-M2.7
NexVeridian/Qwen3-Coder-Next-8bit
OBLITERATUS/gemma-4-E4B-it-OBLITERATED
QuantTrio/DeepSeek-V3.2-AWQ
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Qwen/Qwen2-0.5B
Qwen/Qwen2-0.5B-Instruct
Qwen/Qwen2-1.5B-Instruct
Qwen/Qwen2-7B-Instruct
Qwen/Qwen2.5-0.5B
Qwen/Qwen2.5-0.5B-Instruct
Qwen/Qwen2.5-1.5B
Qwen/Qwen2.5-1.5B-Instruct
Qwen/Qwen2.5-1.5B-Instruct-AWQ
Qwen/Qwen2.5-14B-Instruct
Qwen/Qwen2.5-14B-Instruct-AWQ
Qwen/Qwen2.5-32B-Instruct
Qwen/Qwen2.5-32B-Instruct-AWQ
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int4
Qwen/Qwen2.5-3B
Qwen/Qwen2.5-3B-Instruct
Qwen/Qwen2.5-72B-Instruct
Qwen/Qwen2.5-72B-Instruct-AWQ
Qwen/Qwen2.5-7B
Qwen/Qwen2.5-7B-Instruct
Qwen/Qwen2.5-7B-Instruct-AWQ
Qwen/Qwen2.5-Coder-1.5B-Instruct
Qwen/Qwen2.5-Coder-14B-Instruct
Qwen/Qwen2.5-Coder-14B-Instruct-AWQ
Qwen/Qwen2.5-Coder-32B-Instruct
Qwen/Qwen2.5-Coder-32B-Instruct-AWQ
Qwen/Qwen2.5-Coder-3B
Qwen/Qwen2.5-Coder-7B-Instruct
Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int4
Qwen/Qwen2.5-Math-1.5B
Qwen/Qwen3-0.6B
Qwen/Qwen3-0.6B-Base
Qwen/Qwen3-0.6B-FP8
Qwen/Qwen3-1.7B
Qwen/Qwen3-1.7B-Base
Qwen/Qwen3-1.7B-GPTQ-Int8
Qwen/Qwen3-14B
Qwen/Qwen3-14B-AWQ
Qwen/Qwen3-235B-A22B
Qwen/Qwen3-30B-A3B
Qwen/Qwen3-30B-A3B-Instruct-2507
Qwen/Qwen3-32B
Qwen/Qwen3-32B-AWQ
Qwen/Qwen3-32B-FP8
Qwen/Qwen3-4B
Qwen/Qwen3-4B-Base
Qwen/Qwen3-4B-Instruct-2507
Qwen/Qwen3-4B-Instruct-2507-FP8
Qwen/Qwen3-4B-Thinking-2507
Qwen/Qwen3-8B
Qwen/Qwen3-8B-AWQ
Qwen/Qwen3-8B-Base
Qwen/Qwen3-8B-FP8
Qwen/Qwen3-Coder-30B-A3B-Instruct
Qwen/Qwen3-Coder-30B-A3B-Instruct-FP8
Qwen/Qwen3-Coder-Next
Qwen/Qwen3-Coder-Next-FP8
Qwen/Qwen3Guard-Gen-0.6B
RedHatAI/Llama-3.2-1B-Instruct-FP8
RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8
RedHatAI/Qwen2.5-1.5B-quantized.w8a8
TheBloke/TinyLlama-1.1B-Chat-v0.3-GPTQ
TinyLlama/TinyLlama-1.1B-Chat-v1.0
VLTX/VertaLily-1.2-1B-GGUF
allenai/OLMo-2-0425-1B
ansulev/Darwin-9B-NEG
antirez/deepseek-v4-gguf
apple/OpenELM-1_1B-Instruct
bigscience/bloom-560m
bigscience/bloomz-560m
casperhansen/llama-3.3-70b-instruct-awq
casperhansen/mistral-nemo-instruct-2407-awq
cyankiwi/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit
datajuicer/LLaMA-1B-dj-refine-150B
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
deepseek-ai/DeepSeek-R1
deepseek-ai/DeepSeek-R1-0528
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
deepseek-ai/DeepSeek-V2-Lite
deepseek-ai/DeepSeek-V2-Lite-Chat
deepseek-ai/DeepSeek-V3
deepseek-ai/DeepSeek-V3-0324
deepseek-ai/DeepSeek-V3.2
deepseek-ai/DeepSeek-V4-Flash
deepseek-ai/DeepSeek-V4-Pro
deepseek-ai/deepseek-coder-7b-instruct-v1.5
distilbert/distilgpt2
dphn/dolphin-2.9.1-yi-1.5-34b
facebook/opt-1.3b
facebook/opt-125m
google/gemma-2-9b-it
google/gemma-3-1b-it
google/gemma-3-270m
h2oai/h2ovl-mississippi-2b
h2oai/h2ovl-mississippi-800m
hmellor/tiny-random-BambaForCausalLM
hmellor/tiny-random-Gemma2ForCausalLM
hmellor/tiny-random-LlamaForCausalLM
hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
ibm-granite/granite-4.0-h-small
ibm-research/PowerMoE-3b
kaitchup/Phi-3-mini-4k-instruct-gptq-4bit
llamafactory/tiny-random-Llama-3
lmstudio-community/DeepSeek-R1-0528-Qwen3-8B-MLX-4bit
meta-llama/Llama-2-7b-hf
meta-llama/Llama-3.1-70B-Instruct
meta-llama/Llama-3.1-8B
meta-llama/Llama-3.1-8B-Instruct
meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3B
meta-llama/Llama-3.2-3B-Instruct
meta-llama/Llama-3.3-70B-Instruct
meta-llama/Meta-Llama-3-8B
meta-llama/Meta-Llama-3-8B-Instruct
microsoft/Phi-3-mini-4k-instruct
microsoft/Phi-3.5-mini-instruct
microsoft/Phi-4-mini-instruct
microsoft/Phi-tiny-MoE-instruct
microsoft/phi-2
microsoft/phi-4
mistralai/Mistral-7B-Instruct-v0.2
mistralai/Mistral-7B-v0.1
mlabonne/Qwen3-30B-A3B-abliterated
mlx-community/gpt-oss-20b-MXFP4-Q8
moonshotai/Kimi-K2-Instruct
moonshotai/Kimi-K2-Instruct-0905
nm-testing/SmolLM-1.7B-Instruct-quantized.w4a16
nvidia/DeepSeek-R1-0528-NVFP4-v2
nvidia/Gemma-4-26B-A4B-NVFP4
nvidia/Gemma-4-31B-IT-NVFP4
nvidia/Kimi-K2.5-NVFP4
nvidia/Kimi-K2.6-NVFP4
nvidia/Llama-3.1-8B-Instruct-FP8
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
nvidia/MiniMax-M2.7-NVFP4
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
nvidia/NVIDIA-Nemotron-Nano-9B-v2
nvidia/Nemotron-Labs-Diffusion-8B-Base
nvidia/Nemotron-Mini-4B-Instruct
nvidia/Qwen3.5-397B-A17B-NVFP4
nvidia/Qwen3.6-35B-A3B-NVFP4
openai-community/gpt2
openai-community/gpt2-large
openai-community/gpt2-medium
openai/gpt-oss-120b
openai/gpt-oss-20b
peft-internal-testing/tiny-random-OPTForCausalLM
prefeitura-rio/Rio-3.0-Open
prefeitura-rio/Rio-3.0-Open-Mini
sakamakismile/Qwen3.6-27B-Text-NVFP4-MTP
sshleifer/tiny-gpt2
state-spaces/mamba-130m-hf
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ
tiiuae/falcon-7b
trl-internal-testing/tiny-GptOssForCausalLM
trl-internal-testing/tiny-Qwen2ForCausalLM-2.5
trl-internal-testing/tiny-Qwen3ForCausalLM
trl-internal-testing/tiny-random-LlamaForCausalLM
unsloth/GLM-4.7-Flash
unsloth/Llama-3.2-1B-Instruct
unsloth/Meta-Llama-3.1-8B-Instruct
unsloth/Qwen2.5-7B-Instruct-bnb-4bit
unsloth/Qwen3-Coder-Next-GGUF
unsloth/mistral-7b-v0.3-bnb-4bit
zai-org/GLM-4.7-Flash
zai-org/GLM-5-FP8
zai-org/GLM-5.1-FP8

FAQ