DeepSeek V4 Flash
Standard
The DeepSeek-V4 series incorporates several key upgrades in architecture and optimization: (1) a hybrid attention architecture that combines Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) to improve long-context efficiency; (2) Manifold-Constrained Hyper-Connections (mHC) that enhance conventional residual connections; and (3) the Muon optimizer for faster convergence and greater training stability.
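To give a rough sense of what a Muon-style update does, here is a minimal sketch in plain Python. It is not DeepSeek's implementation, and every function name and hyperparameter below (`orthogonalize`, `muon_like_step`, `lr`, `beta`, `steps`) is an assumption made for illustration. The general idea is to keep a momentum buffer of gradients and replace it with an approximately orthogonal matrix before applying it to the weights. Published Muon implementations use a tuned higher-order Newton-Schulz polynomial; this sketch substitutes the classic cubic Newton-Schulz iteration because it is easier to verify.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def orthogonalize(m, steps=30):
    """Approximate the orthogonal polar factor of m.

    Uses the classic cubic Newton-Schulz iteration X <- 1.5 X - 0.5 X X^T X.
    This is a stand-in chosen for clarity, not the polynomial used by Muon.
    """
    # Scale by the Frobenius norm so every singular value is at most 1,
    # which keeps the iteration inside its convergence region.
    norm = math.sqrt(sum(v * v for row in m for v in row)) or 1.0
    x = [[v / norm for v in row] for row in m]
    for _ in range(steps):
        xxt_x = matmul(matmul(x, transpose(x)), x)
        x = [[1.5 * a - 0.5 * b for a, b in zip(ra, rb)] for ra, rb in zip(x, xxt_x)]
    return x


def muon_like_step(weight, grad, momentum, lr=0.02, beta=0.95):
    """One hypothetical Muon-style update for a single 2-D weight matrix."""
    # Accumulate the gradient into the momentum buffer.
    momentum = [[beta * mo + g for mo, g in zip(rm, rg)] for rm, rg in zip(momentum, grad)]
    # Orthogonalize the buffer so the update has roughly uniform singular values.
    update = orthogonalize(momentum)
    weight = [[w - lr * u for w, u in zip(rw, ru)] for rw, ru in zip(weight, update)]
    return weight, momentum
```

A call such as `muon_like_step(weight, grad, momentum)` returns the updated weight matrix and momentum buffer. Because the update is orthogonalized, its scale depends on `lr` rather than on the raw gradient magnitude.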
Features
Vision
PDF Comprehension
Image generation
Model details
- Provider: DeepSeek
- Context: Unknown
- Multimodal: No
Benchmark performance
via Artificial Analysis
Intelligence
Coding
Math
Knowledge
81%
Creative Writing
72%
Speed
15%
Value
65%