AI Model Profile

DeepSeek V4 Flash

DeepSeek V4 Flash is listed in DeepSeek API docs as a current model supporting thinking and non-thinking modes, JSON output, tool calls, and a 1M context length.

Official model page Official docs Compare models

Family

DeepSeek V4

Release date

Unknown

Status

Current

Context window

1M tokens

Output limit

384K tokens maximum

API

yes

Open weights

Local/self-hosted

Pricing

Usage-based DeepSeek API pricing; verify the official DeepSeek pricing page before budgeting.

Verification

verified

Verification & Sources

Status: Verified
Source links: 2
Freshness: Verified June 18, 2026
Last verified: June 18, 2026
Last updated: June 17, 2026

Key source checks

Official site Pricing

Suggest a correction

Benchmark Caveat

Provider documentation and provider-published capability notes are directional, not universal rankings. Real results depend on prompts, tools, latency targets, pricing tier, safety filters, context length, and workload mix.

Best for

Candidate for cost-sensitive DeepSeek API workflows, agent integrations, and reasoning/coding tests.

Skip if

Skip if your team needs independently benchmarked performance claims or contractual guarantees not present in the official provider documentation.

Strengths

Official docs list a long context window, tool calls, JSON output, and low listed API prices.

Weaknesses

This profile does not independently validate benchmark quality, reliability, or latency.

Agent suitability

Candidate for agent tools because DeepSeek docs mention agent integrations and tool calls.

Kingy AI take

Use this profile as a source-backed reference point, not a ranking. Re-check official provider docs before making production or budget decisions.

Full Model Notes

DeepSeek V4 Flash is listed in DeepSeek API docs as a current model supporting thinking and non-thinking modes, JSON output, tool calls, and a 1M context length.

Coding notes

Evaluate coding workflows directly, especially if replacing other API backends.

Reasoning notes

Official docs describe thinking and non-thinking mode support.

Creative notes

Not primarily a creative/media model in this profile.

Research notes

Candidate for long-context and low-cost research tests with current API docs.

API pricing notes

$0.14 cache-miss input MTok and $0.28 output MTok in DeepSeek pricing docs as checked on 2026-06-18.

License notes

Review the official provider terms and model documentation before relying on license or redistribution assumptions.

Official Model Links

Official model page Docs Pricing