
Add MiniMax-M3 model (#1933) #1940

Merged
rekram1-node merged 1 commit into anomalyco:dev from matstrange:add-minimax-m3
Jun 2, 2026


@matstrange

Adds the MiniMax-M3 model to the five providers where it should be available.

What this PR does

MiniMax-M3 is MiniMax's new frontier multimodal coding model: 1M context window, native text/image/video input, tool calling, reasoning, and open weights.

Adds MiniMax-M3.toml (and minimax-m3.toml for ollama-cloud) to:

  • providers/minimax/ — pay-as-you-go, global
  • providers/minimax-cn/ — pay-as-you-go, China
  • providers/minimax-coding-plan/ — token plan subscription, global
  • providers/minimax-cn-coding-plan/ — token plan subscription, China
  • providers/ollama-cloud/ — ollama cloud, minimax-m3:cloud

Source

Closes #1933.

Notes for reviewers

  • Cost on minimax and minimax-cn matches M2.7. MiniMax's own docs state the M3 pricing is unchanged from M2.7.
  • The 1M/512K context divergence on ollama-cloud is intentional. Ollama advertises M3 with a 1M context window and a 512K minimum, so 512K is the safe floor; declaring 1M there could expose opencode users to silent mid-request rejections.
  • output = 131072 on the limit block is inherited from the existing M2.7 files in this repo. MiniMax's published M3 docs only advertise the 1M input context window, not a separate output cap. Worth verifying against the M3 changelog when reviewers have access.
  • minimax-coding-plan variant verified end-to-end in opencode against the MiniMax token plan API (auth, streaming, tool calls, reasoning) by the author before opening this PR.
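
For orientation, the limit settings discussed in these notes might look roughly like the following in the ollama-cloud file. This is a sketch rather than the file from the changeset: only the `limit` block and its `output` key are named above, while the `context` key name and the exact integer behind "512K" (taken here as 512 × 1024) are assumptions.

```toml
# Sketch of the limit block in providers/ollama-cloud/minimax-m3.toml.
# Assumption: the context key is named `context`.
# Assumption: "512K" means 512 * 1024 = 524288 tokens.
[limit]
context = 524288
output = 131072  # inherited from the existing M2.7 files
```

Declaring the lower of the two advertised figures means a client that trusts this file should stay below the context size that Ollama treats as the minimum.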

Validation

bun validate fails on a pre-existing issue (Unable to resolve extends.from: mistral/mistral-medium-latest) that also occurs on a clean dev checkout, so it is not introduced by this changeset. CI may need to be re-run once that's fixed upstream.

The five new TOML files parse cleanly and conform to the schema in packages/core/src/schema.ts.

MiniMax-M3 is MiniMax's new frontier multimodal coding model: 1M context
window (512K minimum on ollama-cloud), native text/image/video input,
tool calling, reasoning, and open weights.

Adds the model to all five providers where it should be available:

  - minimax (pay-as-you-go)
  - minimax-cn (pay-as-you-go, China)
  - minimax-coding-plan (token plan subscription)
  - minimax-cn-coding-plan (token plan subscription, China)
  - ollama-cloud

Closes anomalyco#1933.

Notes for reviewers:
  - Cost fields on minimax/minimax-cn match M2.7; M3 docs state the
    pricing is unchanged from M2.7.
  - The 1M/512K context divergence on ollama-cloud is intentional —
    ollama advertises 1M with a 512K minimum, so 512K is the safe floor
    that won't surprise opencode users with mid-request rejections.
  - output = 131072 is inherited from the existing M2.7 files; MiniMax's
    published M3 docs only advertise the 1M input context, not a separate
    output cap.
  - The minimax-coding-plan variant has been verified end-to-end in
    opencode against the MiniMax token plan API.
@rekram1-node
Collaborator

MiniMax is temporarily limiting the context window to 500k, so this PR caps the MiniMax-M3 entries at 500k for now.

@Moximxxx

Moximxxx commented Jun 2, 2026

thank you

rekram1-node pushed a commit that referenced this pull request Jun 2, 2026
The M3 entries added in #1940 copied M2.7's cost values. Correct them to
the official M3 pricing and limits:

- minimax / minimax-cn (pay-as-you-go): input 0.30 -> 0.60,
  output 1.20 -> 2.40, cache_read 0.06 -> 0.12, and remove cache_write
  (M3 has no active prompt-cache-write tier).
- max output 131072 -> 128000 across all four providers.
- coding-plan variants keep their subscription-plan zero pricing; only
  max output is corrected.

Context (512K), modalities, and the other flags are unchanged.
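
As a rough illustration, the pay-as-you-go entries after this correction might read as below. The key names `input`, `output`, `cache_read`, and `cache_write` come from the commit message; the `[cost]` and `[limit]` table layout is an assumption, and the price unit is not stated in this thread, so check the actual files before relying on this.

```toml
# Sketch of the corrected values for providers/minimax/ and providers/minimax-cn/.
# Assumption: the cost keys live in a [cost] table.
[cost]
input = 0.60       # was 0.30 (copied from M2.7)
output = 2.40      # was 1.20
cache_read = 0.12  # was 0.06
# cache_write is removed: M3 has no active prompt-cache-write tier

[limit]
output = 128000    # was 131072
```

The coding-plan variants would keep their zero pricing and change only the `output` limit.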
Development

Successfully merging this pull request may close these issues.

Missing MiniMax-M3 for MiniMax token plan
