开源AI模型硬件需求排行榜:你的显卡能跑哪些模型?

按显存需求从低到高排列77款开源AI模型,帮你快速定位自己硬件的"天花板"。

排行榜说明

所有数据基于Q4_K_M量化(最流行的量化格式,质量保留约88%)。显存需求 = 模型文件大小 + 0.5GB运行时开销 + 10%安全余量。MoE模型按总参数计算显存(需加载所有专家)。

超轻量级(<2GB)—— 任何设备都能跑

排名模型参数显存上下文许可证任务
1Qwen3 0.6B0.6B0.8 GB32KApache 2.0chat, edge
2Qwen3.5 0.8B0.8B0.9 GB32KApache 2.0chat, edge
3Llama3.2 1B1B1.0 GB128KLlama 3.2chat, edge
4Gemma3 1B1B1.0 GB32KGemmachat, edge
5TinyLlama 1.1B1.1B1.1 GB2KApache 2.0chat, edge
6Qwen2.5 Coder 1.5B1.5B1.3 GB32KApache 2.0code
7Deepseek R1 1.5B1.5B1.3 GB64KMITreasoning
8Qwen3 1.7B1.7B1.4 GB32KApache 2.0chat, multilingual
9Qwen3.5 2B2B1.5 GB32KApache 2.0chat, multilingual
10Gemma2 2B2B1.5 GB8KGemmachat, edge

适合设备: 集成显卡、4GB显存笔记本、树莓派5

轻量级(2-5GB)—— 核显笔记本可跑

排名模型参数显存上下文许可证任务
11Llama3.2 3B3B2.0 GB128KLlama 3.2chat, code
12SmolLM3 3B3B2.0 GB128KApache 2.0chat, reasoning
13Phi-3.5 Mini3.8B2.4 GB128KMITreasoning, code, chat
14Phi-4 Mini Reasoning3.8B2.4 GB16KMITreasoning
15Qwen3 4B4B2.5 GB32KApache 2.0chat, code
16Gemma3 4B4B2.5 GB128KGemmachat, vision
17Qwen3.5 4B4B2.5 GB32KApache 2.0chat, multilingual
18Gemma4 E2B IT5B3.1 GB256KGemmachat, vision
19Gemma4 E2B5B3.1 GB256KGemmavision
20Mistral 7B v0.37B4.1 GB32KApache 2.0chat, reasoning
21Qwen2.5 7B7B4.1 GB128KApache 2.0chat, multilingual, code
22Qwen2.5 Coder 7B7B4.1 GB128KApache 2.0code
23Deepseek R1 Distill 7B7B4.1 GB64KMITreasoning
24Gemma4 E4B IT8B4.6 GB256KGemmachat, vision
25Gemma4 E4B8B4.6 GB256KGemmavision

适合设备: 8GB显存显卡(RTX 4060)、16GB Mac

主流级(5-8GB)—— 甜品参数区间

排名模型参数显存上下文许可证任务
26Llama3.1 8B8B4.6 GB128KLlama 3.1chat, code, reasoning
27Qwen3 8B8B4.6 GB128KApache 2.0chat, code, reasoning
28Ministral 8B8B4.6 GB32KMRLchat
29Gemma2 9B9B5.1 GB8KGemmachat, reasoning
30GLM-4 9B9B5.1 GB128KGLM-4chat, multilingual, code
31Nemotron Nano 9B v29B5.1 GB128KNVIDIA Openreasoning
32Qwen3.5 9B9B5.1 GB32KApache 2.0chat, vision
33Llama3.2 11B Vision11B6.1 GB128KLlama 3.2chat, vision
34Gemma3 12B12B6.6 GB128KGemmachat, vision, reasoning

| 35 | Mistral Nemo 12B | 12B | 6.6 GB | 128K | Apache 2.0 | chat, multilingual |

适合设备: 12GB显存显卡(RTX 4070)、24GB Mac

进阶级(8-18GB)—— 高质量体验区

排名模型参数显存上下文许可证任务
36Qwen2.5 14B14B7.7 GB128KApache 2.0chat, multilingual, reasoning
37Phi-4 14B14B7.7 GB16KMITreasoning, code
38Qwen3 14B14B7.7 GB128KApache 2.0chat, code, reasoning
39DeepSeek R1 Distill 14B14B7.7 GB64KMITreasoning
40GPT-OSS 20B21B MoE11.3 GB128KApache 2.0chat, reasoning, code
41LFM2 24B24B MoE12.8 GB32KLiquid AIchat, edge, rag
42Devstral Small 2 24B24B12.8 GB256KApache 2.0code
43Mistral Small 3.1 24B24B12.8 GB128KApache 2.0chat, vision, code
44Gemma2 27B27B14.3 GB8KGemmachat, reasoning
45Gemma3 27B27B14.3 GB128KGemmachat, vision, reasoning
46Gemma4 26B-A4B IT27B MoE14.3 GB256KGemmachat, vision, reasoning
47Qwen3.5 27B27.8B14.7 GB256KApache 2.0chat, vision, reasoning
48Qwen3 30B-A3B30B MoE15.9 GB128KApache 2.0chat, reasoning
49Nemotron3 Nano 30B30B MoE15.9 GB1024KNVIDIA Openchat, reasoning
50Qwen2.5 32B32B16.9 GB128KApache 2.0chat, multilingual, reasoning
51Qwen2.5 Coder 32B32B16.9 GB128KApache 2.0code
52Qwen3 32B32B16.9 GB128KApache 2.0chat, code, reasoning
53DeepSeek R1 Distill 32B32B16.9 GB64KMITreasoning
54EXAONE 4.0 32B32B16.9 GB128KEXAONE AIchat, reasoning
55OLMo 2 32B32B16.9 GB4KApache 2.0chat, reasoning
56Gemma4 31B IT33B17.4 GB256KGemmachat, vision, reasoning
57Gemma4 31B33B17.4 GB256KGemmavision, reasoning
58Command R 35B35B18.4 GB128KCC BY-NC 4.0chat, rag
59Qwen3.5 35B-A3B35B MoE18.4 GB256KApache 2.0chat, vision

适合设备: 24GB显存显卡(RTX 4090/RTX 5090)、36GB+ Mac

旗舰级(>20GB)—— 需要高端硬件

排名模型参数显存上下文许可证任务
60Mixtral 8x7B47B MoE24.6 GB32KApache 2.0chat, code
61Llama3.3 70B70B36.4 GB128KLlama 3.3chat, reasoning, code
62Qwen2.5 72B72B37.4 GB128KQwenchat, multilingual, reasoning, code
63Llama4 Scout 17B109B MoE56.3 GB128KLlama 4chat, vision, reasoning
64GPT-OSS 120B117B MoE60.4 GB128KApache 2.0chat, reasoning, code
65Devstral 2 123B123B63.5 GB256KMRLcode
66Qwen3.5 122B-A10B122B MoE63.0 GB256KApache 2.0chat, vision, reasoning
67Mixtral 8x22B141B MoE72.7 GB64KApache 2.0chat, code, reasoning
序号模型名称参数量模型大小缓存大小许可证功能
68Qwen3 235B-A22B235B MoE120.9 GB128KApache 2.0chat, code, reasoning
69Qwen3.5 397B-A17B397B MoE203.9 GB256KApache 2.0chat, vision, reasoning, code
70Llama4 Maverick400B MoE205.4 GB1024KLlama 4chat, vision, reasoning, code
71Llama3.1 405B405B208.0 GB128KLlama 3.1chat, reasoning, code
72Qwen3 Coder 480B480B MoE246.4 GB256KApache 2.0code
73DeepSeek R1671B MoE344.2 GB64KMITreasoning
74DeepSeek V3.1671B MoE344.2 GB128KMITchat, code, reasoning
75DeepSeek V3.2685B MoE351.4 GB128KMITchat, code, reasoning
76Kimi K21T MoE512.7 GB128KKimichat, reasoning, code

适合设备: 多卡方案、Mac Studio M4 Ultra 192GB、云端部署

按显卡型号快速匹配

RTX 4060 (8GB / 272 GB/s)

可跑前25名全部模型。最佳选择:Llama3.1 8B(~50 tok/s)、Qwen3.5 9B

RTX 4070 (12GB / 504 GB/s)

可跑前35名。最佳选择:Gemma3 12B(~50 tok/s)、Mistral Nemo 12B

RTX 4090 (24GB / 1008 GB/s)

可跑前61名。最佳选择:Qwen3 32B(~40 tok/s)、Mistral Small 3.1 24B

Mac M4 Max (36GB / 546 GB/s)

可跑前62名(统一内存可用75%≈27GB)。最佳选择:Llama3.3 70B用Q2_K量化、Qwen3 32B Q4_K_M。

Mac M4 Ultra (192GB)

可跑几乎所有模型。Kimi K2DeepSeek V3.2仍需更低量化。


数据来源:CanIRun.ai,基于Q4_K_M量化,统计截至2026年5月