Model Formats
• llama: Meta's latest 70B Llama model, matching the performance of Llama 3.1 405B.
• qwen: 14B version of the code-specific Qwen 2.5 for code generation, code reasoning, and code fixing.
• qwen: 32B version of the code-specific Qwen 2.5 for code generation, code reasoning, and code fixing.
• qwen: 3B version of the code-specific Qwen 2.5 for code generation, code reasoning, and code fixing.
• llama: The new and small Llama model from Meta, optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.
• llama: A tiny and speedy Llama model from Meta, optimized for multilingual dialogue use cases.
• qwen: Code-specific LLMs for code generation, code reasoning, and code fixing, supporting a context length of up to 128K tokens.
• qwen: An LLM specializing in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g., tables), and generating JSON structured outputs (a JSON-output sketch follows this list).
• llama: Yi Coder is a Llama fine-tune with an expanded size, trained for code. It supports up to 128K tokens and "52 major programming languages".
• llama: A fine-tune of Meta's Llama 3.1, Hermes is further trained on hand-curated datasets as well as synthetic data. It excels in dialogue and code generation.
• internlm2: InternLM 2.5 offers strong reasoning across the board as well as tool use for developers, while sitting at the sweet spot of size for those with 24GB GPUs.
• llava: The original LLaVA vision-enabled model, supporting image input and textual instruction following.
• llama: The Meta Llama 3.1 collection of multilingual large language models (LLMs) comprises pretrained and instruction-tuned generative models in 8B, 70B, and 405B sizes (text in/text out).
• llama: The latest in Meta's long-running Llama series, Llama 3.1 is another jack of all trades and master of some, now in 8 languages and up to 128k tokens.
• mistral: A slightly larger 12B-parameter model from Mistral AI, NeMo offers a long 128k-token context length, advanced world knowledge, and function calling for developers (a function-calling sketch follows this list).
• mistral: A state-of-the-art 12B model with 128k context length, built in collaboration with NVIDIA, and released under the Apache 2.0 license.
• gemma2: "Google's Llama", Gemma benefits from Google's experience training its flagship Gemini model to provide excellent performance on low-power hardware or for autocompletion/drafting tasks.
• mistral: A scientific specialist fine-tune of Mistral AI's popular 7B model, Mathstral excels at STEM chats and tasks.
• gemma: The mid-sized option of the Gemma 2 model family. Built by Google from the same research and technology used to create the Gemini models.
• deepseek: DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence.
• llama: A HuggingFace original model, SmolLM lives up to its name in size and will fit on just about any device. A slightly larger option at 1.7B parameters is also available.
• phi: Phi-3-Mini-4K-Instruct is a 3.8B-parameter, lightweight, state-of-the-art open model trained on the Phi-3 datasets, which include both synthetic data and filtered publicly available website data, with a focus on high-quality, reasoning-dense properties.
• gemma: The large option of the Gemma 2 model family. Built by Google from the same research and technology used to create the Gemini models.
• mistral: Mistral AI's latest coding model, Codestral can handle both instructions and code completions with ease in over 80 programming languages.
• phi3: Microsoft's latest Phi Mini model supports a whopping context length of 128k tokens in a small size, offering extremely long chats for cheap.
• deepseek2: The younger sibling of the GPT-4-beating 236B DeepSeek Coder V2 model, this model also comes out strong, with support for 338 programming languages!
• qwen: Qwen is an open-source LLM family from Alibaba.
• qwen: A small model from Alibaba's Qwen2 family that punches above its weight in mathematical and multi-step logical reasoning.
• qwen2: Promising 27 languages and lightning-fast responses, this is the smallest entry in Alibaba's Qwen2 family, which scales up to 72B parameters.
• command-r: Aya 23 is an open-weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities.
• mistral: One of the most popular open-source LLMs, Mistral's 7B Instruct model's balance of speed, size, and performance makes it a great general-purpose daily driver.
• stablelm: From the team behind Stable Diffusion, this small code model offers an excellent coding assistant for those with lighter hardware.
• command-r: C4AI Command-R is a research release of a 35-billion-parameter, highly performant generative model.
• cohere: Able to chat in more than 10 languages, Cohere AI's Command-R is optimized for RAG but can perform well in any task.
• starcoder2: Also coming in 3B, 15B, and Chat versions, the StarCoder2 family offers a diverse portfolio of local coding assistants.
• deepseek: Promising comparable performance to GPT-4 on mathematical reasoning, DeepSeek Math also offers the ability to write code to solve and prove mathematical problems.
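The JSON structured outputs mentioned in the Qwen entry above are easiest to see in code. Below is a minimal sketch of requesting JSON from a locally served model through an OpenAI-compatible endpoint; the base URL, the model identifier, and the server's support for `response_format` are all assumptions for illustration, not details from this catalog.

```python
# Minimal sketch: JSON-structured output from a locally hosted Qwen model.
# The base_url, model name, and response_format support are assumptions.
import json

from openai import OpenAI

# Hypothetical OpenAI-compatible local server; any key string works locally.
client = OpenAI(base_url="http://localhost:1234/v1", api_key="not-needed")

response = client.chat.completions.create(
    model="qwen2.5-7b-instruct",  # hypothetical model identifier
    messages=[
        {"role": "system", "content": "Reply only with a JSON object."},
        {"role": "user", "content": "Summarize in JSON: Qwen supports long text generation."},
    ],
    response_format={"type": "json_object"},  # ask the server to enforce JSON output
)

# The model's reply arrives as a string; parse it into a Python dict.
data = json.loads(response.choices[0].message.content)
print(data)
```

If the server does not support `response_format`, the system prompt alone often suffices, but the output then needs validation before parsing.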
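Likewise, the function calling noted in the Mistral NeMo entry can be sketched against the same hypothetical OpenAI-compatible server. The `tools` schema below follows the standard chat-completions format; the endpoint, the model name, and the `get_weather` tool are assumptions for illustration.

```python
# Minimal sketch: function calling with a locally served Mistral NeMo model.
# The endpoint, model name, and tool definition are assumptions.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="not-needed")

# A single hypothetical tool, described as a JSON Schema the model can target.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="mistral-nemo-12b-instruct",  # hypothetical model identifier
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
    tools=tools,
)

# If the model chose to call the tool, the arguments arrive as a JSON string.
for call in response.choices[0].message.tool_calls or []:
    print(call.function.name, call.function.arguments)
```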