Best StepFun Step 3.5 Flash Alternatives in 2026

StepFun Step 3.5 Flash scores 7.8/10 on our tests. Here are 16 alternatives worth considering in the Local & Open-Weight LLMs space.

StepFun Step 3.5 Flash

B

StepFun's (China) agent-focused open-weight model -- Step 3.5 Flash launched 2026-02-01. 196B sparse MoE, ~11B active. Benchmarks slightly ahead of DeepSeek V3.2 at over 3x smaller total size. Step 3 (321B / 38B active, Apache 2.0) and Step3-VL-10B multimodal also in the family

7.8

Current pick

Top Alternatives, Ranked

1

A

+1.0 higher

Alibaba's open-weights + API family -- Qwen3.6-27B dense (Apr 22 2026 Apache 2.0, beats the 397B MoE flagship on coding from a single consumer GPU), Qwen 3.6-Max-Preview (Apr 20 2026 closed-weights #1 on SWE-bench Pro/Terminal-Bench 2.0/SciCode), Qwen3.6-35B-A3B (Apr 16 open-weights MoE), plus Qwen 3.6-Plus API flagship and the new Qwen 3.7 Max preview (May 2026, proprietary/API-only)

Overall: 8.8/10Free tier availableFrom $0

2

A

+0.6 higher

MiniMax's open-weights self-evolving agent flagship -- M2.7 (released 2026-03-18) scores 56.22% SWE-Pro and 57.0% Terminal Bench 2 from a 229B/10B-active MoE

Overall: 8.4/10Free tier availableFrom $0

3

Gemma 4 (Google)

A

+0.5 higher

Google DeepMind's open-weights model family -- multimodal, 256K context, runs on edge devices

Overall: 8.3/10Free tier availableFrom $0

4

IBM Granite 4.0

A

+0.4 higher

IBM's enterprise-focused open-weight family -- Granite 4.0 hybrid Mamba-2 + transformer architecture (70-80% memory reduction vs pure transformer), 3B to 32B sizes, Apache 2.0. First open model family to secure ISO 42001 certification. Nano 350M runs on CPU with 8-16GB RAM. 3B Vision variant landed 2026-04-01

Overall: 8.2/10Free tier availableFrom $0

5

Kimi K2.6 (Moonshot)

A

+0.3 higher

Moonshot's 1T-parameter MoE open-weights flagship -- Kimi K2.6 (GA 2026-04-20) is #1 open-weights on Artificial Analysis Intelligence Index v4.0 (score 54, ranked #4 overall). Native video input, 256K context, Modified MIT license

Overall: 8.1/10Free tier availableFrom $0

6

gpt-oss (OpenAI)

A

+0.3 higher

OpenAI's FIRST open-weight models -- gpt-oss-120b (single 80GB GPU, near parity with o4-mini on reasoning) and gpt-oss-20b (runs on 16GB edge devices). Apache 2.0. Launched 2025-08-05. gpt-oss-safeguard ships in 2026 as the safety-tuned variant

Overall: 8.1/10Free tier availableFrom $0

7

Arcee Trinity-Large-Thinking

A

+0.3 higher

Arcee AI's US-made open-weight frontier reasoning model -- launched 2026-04-01. 398B total params, ~13B active. Sparse MoE (256 experts, 4 active = 1.56% routing). Apache 2.0, trained from scratch. #2 on PinchBench trailing only Claude 3.5 Opus. ~96% cheaper than Opus-4.6 on agentic tasks

Overall: 8.1/10Free tier availableFrom $0

8

A

+0.2 higher

DeepSeek V4 shipped 2026-04-24: V4-Pro (1.6T/49B active MoE) + V4-Flash (284B/13B active), 1M native context, Hybrid Attention Architecture, open-source on HF. Trails only Gemini 3.1 Pro on world knowledge

Overall: 8.0/10Free tier availableFrom $0

9

GLM / Z.ai (Zhipu AI)

A

+0.2 higher

Zhipu AI's open-weights family -- GLM-5.1 (launched 2026-04-07) is 744B MoE / 40B active, topped SWE-Bench Pro at 58.4 (beating GPT-5.4 and Claude Opus 4.6), MIT licensed, 200K context. Trained entirely on 100K Huawei Ascend 910B chips -- first frontier model with zero Nvidia in the training stack

Overall: 8.0/10Free tier availableFrom $0

10

A

+0.2 higher

AI21 Labs' hybrid SSM-Transformer (Mamba-style) open-weight family -- Jamba2 launched 2026-01-08. Two sizes: 3B dense (runs on phones / laptops) and Jamba2 Mini MoE (12B active / 52B total). Apache 2.0, 256K context, mid-trained on 500B tokens

Overall: 8.0/10Free tier availableFrom $0

11

B

+0.1 higher

Meta's open-weights flagship family -- Scout (10M context), Maverick (multimodal 400B MoE), Behemoth in preview

Overall: 7.9/10Free tier availableFrom $0

12

B

+0.1 higher

Allen Institute for AI's fully-open frontier reasoning models -- Olmo 3 family (2025-11-20) includes 7B and 32B sizes, four variants (Base, Think, Instruct, RLZero). Apache 2.0 with fully open data + checkpoints + training logs. Olmo 3-Think 32B matches Qwen3-32B-Thinking at 6x fewer training tokens

Overall: 7.9/10Free tier availableFrom $0

13

Nemotron (Nvidia)

B

Nvidia's open-weights family -- hybrid Mamba-Transformer MoE architecture, optimized for efficient reasoning on Nvidia hardware

Overall: 7.8/10Free tier availableFrom $0

14

B

European AI lab with open and commercial models -- Mistral Medium 3.5 SHIPPED 2026-04-29 (128B dense, 256k context, 77.6% SWE-Bench Verified) plus Vibe Remote Agents + Le Chat Work Mode. Earlier 2026 line: Small 4 (Mar 2026 119B MoE Apache 2.0 unified), Medium 3 (Apr 9 2026), Voxtral TTS (Mar 2026 open-source speech)

Overall: 7.5/10Free tier availableFrom $0

15

Cohere Command A

B

Cohere's enterprise-multilingual flagship -- 111B params, 256K context, runs on 2x H100. 23 languages. CC-BY-NC 4.0 on weights (research / non-commercial), commercial requires Cohere enterprise contract. Follow-ups: Command A Reasoning + Command A Vision

Overall: 7.5/10Free tier availableFrom $0

16

B

UAE's Technology Innovation Institute open-weights family -- Falcon 3 optimized for efficient sub-10B deployment on consumer hardware

Overall: 7.1/10Free tier availableFrom $0

Score Comparison

Tool	Ease of Use	Output Quality	Value	Features	Overall
StepFun Step 3.5 Flash(current)	6.0	8.0	9.0	8.0	7.8
Qwen (Alibaba)	7.0	9.0	10.0	9.0	8.8
MiniMax M2.7	6.5	9.0	9.5	8.5	8.4
Gemma 4 (Google)	7.0	8.0	10.0	8.0	8.3
IBM Granite 4.0	7.0	8.0	9.5	8.5	8.2
Kimi K2.6 (Moonshot)	6.0	9.0	8.5	9.0	8.1
gpt-oss (OpenAI)	7.0	8.5	10.0	7.0	8.1
Arcee Trinity-Large-Thinking	6.0	9.0	9.5	8.0	8.1
DeepSeek	7.5	8.0	9.5	7.0	8.0
GLM / Z.ai (Zhipu AI)	6.5	8.5	9.0	8.0	8.0
AI21 Jamba2	6.5	8.0	9.0	8.5	8.0
Llama 4 (Meta)	5.0	8.5	9.0	9.0	7.9
Olmo 3 (AI2)	6.0	8.0	9.5	8.0	7.9
Nemotron (Nvidia)	6.5	8.0	8.0	8.5	7.8
Mistral AI	6.0	8.0	9.0	7.0	7.5
Cohere Command A	6.5	8.5	7.0	8.0	7.5
Falcon (TII)	7.0	6.5	9.0	6.0	7.1

The Tier List Tuesday

Weekly newsletter: tier movers, new entrants, and the VS of the week. Built from our daily AI-tool sweeps. No spam, unsubscribe anytime.

Not sure which to pick?

Read our full reviews or use the comparison tool to see how they stack up head-to-head.

Full StepFun Step 3.5 Flash Review All Local & Open-Weight LLMs