DeepSeek cuts V4-Pro prices by 75%


The promotional discount runs until 5 May 2026. Even at full price, V4-Pro already undercuts GPT-5.5, Claude Opus 4.7, and Gemini 3.1 Pro on per-token costs.

The move is a direct challenge to the pricing strategy of US AI providers at a moment when the Trump administration has accused Chinese firms of distilling American AI models on an industrial scale.


DeepSeek announced on Monday that it is offering a 75% discount on its newly released DeepSeek-V4-Pro model to developers until 5 May 2026, and is simultaneously cutting the price of input cache hits across its entire API suite to one-tenth of previous levels, effective immediately.

The discount was announced in a post on X. The move intensifies a pricing competition with US AI providers that DeepSeek first triggered in January 2025 with its R1 model, which claimed frontier-level reasoning performance at a fraction of the cost of comparable OpenAI products.

The 💜 of EU tech

The latest rumblings from the EU tech scene, a story from our wise ol’ founder Boris, and some questionable AI art. It’s free, every week, in your inbox. Sign up now!

The pricing context is important. At full price, before any promotional discount, DeepSeek-V4-Pro already costs $0.145 per million input tokens and $3.48 per million output tokens, undercutting OpenAI’s GPT-5.5, Google’s Gemini 3.1 Pro, and Anthropic’s Claude Opus 4.7 on per-token basis.

The 75% promotional discount on input tokens reduces the V4-Pro input price to approximately $0.036 per million tokens. The Flash variant, V4’s smaller, faster model, costs $0.14 per million input tokens and $0.28 per million output tokens at full price, already undercutting GPT-5.4 Nano, Gemini 3.1 Flash, GPT-5.4 Mini, and Claude Haiku 4.5.

The cache-hit price cut to one-tenth of prior levels specifically targets frequent users and enterprise developers who send similar or repeated requests, which is the dominant pattern in production agentic applications.

The strategic logic is explicit and well-documented in how DeepSeek has operated since R1. Open-source availability removes the model access barrier entirely; aggressive API pricing removes the cost barrier for production deployment; a 1 million-token context window makes the model viable for enterprise use cases involving large codebases or long documents that would otherwise require multiple API calls.

V4-Pro also integrates natively with Claude Code, OpenClaw, and OpenCode, the dominant agentic coding frameworks used by developers already in the Western AI ecosystem.

The combined effect is to lower the friction of switching from an OpenAI, Anthropic, or Google API to a DeepSeek API for any developer whose primary constraint is cost. Akshar Keremane, co-founder of Bangalore-based AI startup O-Health, described the combination of pricing, open-source availability, and the 1 million-token context window as lowering barriers “for developers, startups and small enterprises.”

The V4-Pro model, launched last Friday, is a mixture-of-experts model with 1.6 trillion total parameters and 49 billion active parameters per task, the largest open-weight model currently available, outstripping Moonshot AI’s Kimi K2.6 and MiniMax’s M1.

Its Hybrid Attention Architecture is designed to maintain coherence across long contexts. It is trained on and optimised for Huawei’s Ascend 950 chips and Cambricon hardware rather than Nvidia GPUs.

Zhang Yi, founder of tech research firm iiMedia, told AFP that V4’s architecture represents a “genuine inflection point” for long-context AI processing, predicting that ultra-long context support will move beyond research labs into mainstream commercial applications.

Wei Sun, principal analyst at Counterpoint Research, noted that V4 running on domestic chips “allows AI systems to be built and deployed without relying solely on Nvidia” and could “accelerating adoption domestically and contributing to faster global AI development overall.”

The pricing move arrives in a charged geopolitical context. On Thursday last week, White House Director of Science and Technology Policy Michael Kratsios accused foreign entities, primarily based in China, of conducting “industrial-scale” campaigns to distil frontier AI models from US companies, a process in which a smaller model is trained using the outputs of a larger model to acquire similar capabilities at lower cost.

Kratsios’s memo did not directly name DeepSeek, but DeepSeek has previously been accused by both Anthropic and OpenAI of distilling their models. CNN reported it has reached out to DeepSeek for comment on those accusations.

The US government’s distillation crackdown, alongside China’s parallel move to restrict US investment in its AI firms, was announced the day before V4’s launch.

DeepSeek’s response, three days later, is to cut prices rather than respond to the accusations directly: a competitive move that is also a political statement about where it believes the AI race will ultimately be decided.

OpenAI has cut API prices multiple times; Anthropic has introduced tiered pricing for different Claude model sizes; Google has progressively reduced Gemini API costs.

DeepSeek’s Monday announcement is the latest move in that ongoing compression, but it is distinctive in its scale, a 75% promotional discount on top of a model that already undercuts the US frontier at standard pricing, and in its timing, which positions the Hangzhou startup as the low-cost challenger in the same week that OpenAI shipped GPT-5.5 and the US government moved to restrict Chinese model distillation.



Source link

Leave a Reply

Subscribe to Our Newsletter

Get our latest articles delivered straight to your inbox. No spam, we promise.

Recent Reviews


If you’ve bought a new Raspberry Pi, or just got your hands on an older model that someone else didn’t want, there are many ways to put that little computer to good use, and here are six of them.

Retro gaming galore

Recalbox running on a Raspberry Pi 500+. Credit: Tim Brookes / How-To Geek

One of the most popular uses for Raspberry Pi computers is as a retro gaming emulation system. Which systems can be emulated depends on which specific model of Pi you have, but even the oldest ones can do a great job with retro 8-bit and 16-bit titles, or MAME arcade titles. In fact, building your own arcade cabinet with a Pi at its heart is a common project, and you’ll find lots of instructional guides on the web to that effect.

8bitdo arcade stick for Nintendo Switch.

8/10

Number of Colors

1

Control Types

Arcade Stick


Build your own NAS

A Raspberry Pi configured as a NAS. Credit: Raspberry Pi Foundation

A NAS or Network-Attached Storage device is effectively a local file server that lets you store and access data on your local network using hard drives. You can go out and buy a NAS or you can follow the official Raspberry Pi NAS tutorial and turn your old USB hard drives into a NAS using stuff you already have, or can get for just a few dollars.

Everyone loves local streaming tools like Plex or Jellyfin, but not everyone wants to dedicate an expensive computer to act as the streaming server. Well, as long as your requirements aren’t too fancy, you can use a Raspberry Pi as a Plex server.

Just don’t expect it to handle heavy-duty transcoding. The good news is that most of your client devices can probably play back videos without the need for transcoding.

Turn your Pi into a home automation hub

The Home Assistant Green smart home hub surrounded by smart home devices. Credit: home-assistant.io

Home automation hub devices can cost hundreds of dollars, but if you have an old Raspberry Pi, you can run your smart home off it. The most common and effective solution is an open-source app called Home Assistant.

Raspberry Pi logo above a photo of Raspberry Pi boards.


I Run My Smart Home Off a Raspberry Pi, Here’s How It Works

Make your home smarter on a budget with a Raspberry Pi.

Build a weather station

If you’re interested in the weather, want to contribute to weather data, or are just sick of getting rained on when you least expect it, you have the option of getting a weather station kit for your Raspberry Pi or using something like the Raspberry Pi Sense HAT, which can detect pressure, humidity, and temperature, but not wind speed. However, there are also generic wind and rain sensors you can buy, and, of course, don’t forget an outdoor project enclosure.

There are a few guides on the web, but this weather station guide for Raspberry Pi is a good place to get some ideas.

Create a home web server

Another fun project to do is hosting your own little web server using a Raspberry Pi. You can make a website that only works on your home LAN, or even host something that people from outside your home network can access. Using open source software to host your own web resources is highly educational, and it can also be a way to do something genuinely useful without having to rely on a cloud service somewhere on the internet.

Imagine having your own little bulletin board at home, or hosting content like ebooks, music, or audiobooks?


Infinite possibilities

Despite lacking in the raw power department, all Raspberry Pi devices are little miracles—single board computers that can (in principle) do anything their bigger cousins can. Just more slowly. So if you have a few old Raspberry Pis hanging around, don’t be too quick to retire them yet.



Source link