AI Model Profile

DeepSeek V4 Flash

DeepSeek V4 Flash is listed in DeepSeek API docs as a current model supporting thinking and non-thinking modes, JSON output, tool calls, and a 1M context length.

Family
DeepSeek V4
Release date
Unknown
Status
Current
Context window
1M tokens
Output limit
384K tokens maximum
API
yes
Open weights
no
Local/self-hosted
no
Pricing
Usage-based DeepSeek API pricing; verify the official DeepSeek pricing page before budgeting.
Verification
verified

Verification & Sources

Status
Verified
Source links
2
Freshness
Verified June 18, 2026
Last verified
June 18, 2026
Last updated
June 17, 2026

Key source checks

Suggest a correction

Form submissions, correction notes, score details, URLs, and analytics events may be stored for editorial review, spam prevention, product improvement, and follow-up. Avoid sending secrets, private customer data, unreleased financials, or regulated personal data through these forms.

Benchmark Caveat

Provider documentation and provider-published capability notes are directional, not universal rankings. Real results depend on prompts, tools, latency targets, pricing tier, safety filters, context length, and workload mix.

Best for

Candidate for cost-sensitive DeepSeek API workflows, agent integrations, and reasoning/coding tests.

Skip if

Skip if your team needs independently benchmarked performance claims or contractual guarantees not present in the official provider documentation.

Strengths

Official docs list a long context window, tool calls, JSON output, and low listed API prices.

Weaknesses

This profile does not independently validate benchmark quality, reliability, or latency.

Agent suitability

Candidate for agent tools because DeepSeek docs mention agent integrations and tool calls.

Kingy AI take

Use this profile as a source-backed reference point, not a ranking. Re-check official provider docs before making production or budget decisions.

Full Model Notes

DeepSeek V4 Flash is listed in DeepSeek API docs as a current model supporting thinking and non-thinking modes, JSON output, tool calls, and a 1M context length.

Coding notes

Evaluate coding workflows directly, especially if replacing other API backends.

Reasoning notes

Official docs describe thinking and non-thinking mode support.

Creative notes

Not primarily a creative/media model in this profile.

Research notes

Candidate for long-context and low-cost research tests with current API docs.

API pricing notes

$0.14 cache-miss input MTok and $0.28 output MTok in DeepSeek pricing docs as checked on 2026-06-18.

License notes

Review the official provider terms and model documentation before relying on license or redistribution assumptions.