🇨🇳 DeepSeek just open-sourced a 1.6T AI model built for coding agents and million-token memory

DeepSeek unveiled its V4 generation with two new foundation models: DeepSeek-V4-Pro (1.6 trillion parameters, activating only 49B per token) and DeepSeek-V4-Flash (284B parameters focused on efficiency). The release pushes open-source AI closer to the frontier dominated by closed labs.

What makes V4 different:
• Introduces DeepSeek Sparse Attention (DSA), a new architecture designed to cut memory and compute costs.
• Uses token-wise compression, allowing far longer context windows without the usual infrastructure burden.
• Result: 1 million token context becomes standard across official DeepSeek services.

Built for coding and reasoning:
• V4-Pro is positioned as a new open-source leader in agentic coding and advanced reasoning.
• Designed to compete directly with top proprietary models from OpenAI, Anthropic, and Google.
• Supports Thinking mode for deeper step-by-step reasoning and Non-Thinking mode for faster responses.

DeepSeek’s strategy is clear: don’t just match frontier labs, make frontier AI cheaper, open, and easier to deploy.

Source. @aipost 🏴