DeepL launches real-time voice-to-voice translation in 40+ languages



The Cologne-based translation company best known for its text tools has unveiled a full voice product suite covering meetings, conversations, group settings, and an API for enterprise integration. A live demo in Seoul showed one-to-two sentence delays, and DeepL’s CPO acknowledged word order differences between languages remain a fundamental challenge.

DeepL, the Cologne-based language AI company that built its reputation on high-quality text translation, has launched DeepL Voice-to-Voice: a real-time spoken translation suite designed for live business communication.

The product covers four distinct use cases, virtual meetings, mobile and web conversations, group settings for frontline workers, and enterprise applications through an API, and supports more than 40 languages including all 24 official EU languages and additions such as Vietnamese, Thai, Arabic, Norwegian, Hebrew, Bengali, and Tagalog.

The suite’s four components are at different stages of availability. Voice for Conversations, which enables real-time translation across mobile and web without requiring app installation, is now generally available.

Voice for Meetings, which integrates with Microsoft Teams and Zoom so participants can speak in their native language while others hear simultaneous translation in theirs, is opening an early access programme in June.

The Voice-to-Voice API, which lets businesses embed DeepL’s translation engine into their own customer-facing applications such as call centres, is in ongoing early access. A customisation feature, Spoken Terms, which allows the system to learn industry-specific vocabulary, company names, and personal names, is scheduled to become generally available on 7 May.

Jarek Kutylowski, DeepL’s founder and CEO, described the launch as reaching “another frontier in translation.”

“DeepL Voice-to-Voice allows everyone to speak naturally in their own language without the friction or cost of interpreters,” he said.

DeepL has positioned the product as an enterprise tool rather than a consumer one: the company said its voice technology never uses customer data to train its models, and does not permanently store transcription or translation data after a call ends, a security framing that distinguishes it from consumer AI voice products and is aimed at regulated industries.

The current system works through a three-step pipeline: speech is converted to text, the text is translated using DeepL’s established translation engine, and the output is then converted back to speech.

DeepL’s competitive argument rests on the quality of the middle step: the company says its text translation models outperform alternatives, and that advantage propagates through to the voice output.

In blind evaluations commissioned by DeepL and conducted independently by Slator, a language industry research firm, 96% of professional linguists preferred DeepL Voice over the native translation solutions in Google Meet, Microsoft Teams, and Zoom, citing superior fluency and contextual accuracy. DeepL Voice scored 96.4 out of 100 for Zoom and 96.3 for Microsoft Teams.

However, a live demonstration by Chief Product Officer Gonzalo Gaiolas at the company’s DeepL Connect Seoul event, held on 15 April, exposed the system’s current limitation: a visible delay of one to two sentences between the speaker finishing and the translation being delivered.

Gaiolas acknowledged the lag directly. “Different languages have different word orders and sentence structures, which causes delays in real-time interpretation,” he said, according to Seoul Economic Daily.

The company plans to reduce latency through continued model development. On the voice quality side, the current system translates using a fixed synthetic voice; DeepL said it plans to release a voice-preservation feature, which maintains the speaker’s original voice characteristics in the translated output, by the end of 2026.

DeepL is entering a market with multiple well-funded competitors. Sanas, which uses AI to modify speakers’ accents in real time for call centre applications, raised $65 million in a round led by Quadrille Capital.

Dubai-based Camb.AI focuses on speech synthesis and translation for media dubbing. Palabra, backed by Reddit co-founder Alexis Ohanian’s Seven Seven Six, is developing a real-time speech translation engine focused on preserving speaker voice characteristics.

Google, Microsoft, and Zoom all offer their own meeting translation features, the platforms DeepL is simultaneously challenging and integrating with. DeepL’s strategic bet is that translation quality, its longest-established differentiator, can outweigh the structural advantages incumbents hold in platform distribution.



Source link

Leave a Reply

Subscribe to Our Newsletter

Get our latest articles delivered straight to your inbox. No spam, we promise.

Recent Reviews


Google Maps has a long list of hidden (and sometimes, just underrated) features that help you navigate seamlessly. But I was not a big fan of using Google Maps for walking: that is, until I started using the right set of features that helped me navigate better.

Add layers to your map

See more information on the screen

Layers are an incredibly useful yet underrated feature that can be utilized for all modes of transport. These help add more details to your map beyond the default view, so you can plan your journey better.

To use layers, open your Google Maps app (Android, iPhone). Tap the layer icon on the upper right side (under your profile picture and nearby attractions options). You can switch your map type from default to satellite or terrain, and overlay your map with details, such as traffic, transit, biking, street view (perfect for walking), and 3D (Android)/raised buildings (iPhone) (for buildings). To turn off map details, go back to Layers and tap again on the details you want to disable.

In particular, adding a street view and 3D/raised buildings layer can help you gauge the terrain and get more information about the landscape, so you can avoid tricky paths and discover shortcuts.

Set up Live View

Just hold up your phone

A feature that can help you set out on walks with good navigation is Google Maps’ Live View. This lets you use augmented reality (AR) technology to see real-time navigation: beyond the directions you see on your map, you are able to see directions in your live view through your camera, overlaying instructions with your real view. This feature is very useful for travel and new areas, since it gives you navigational insights for walking that go beyond a 2D map.

To use Live View, search for a location on Google Maps, then tap “Directions.” Once the route appears, tap “Walk,” then tap “Live View” in the navigation options. You will be prompted to point your camera at things like buildings, stores, and signs around you, so Google Maps can analyze your surroundings and give you accurate directions.

Download maps offline

Google Maps without an internet connection

Whether you’re on a hiking trip in a low-connectivity area or want offline maps for your favorite walking destinations, having specific map routes downloaded can be a great help. Google Maps lets you download maps to your device while you’re connected to Wi-Fi or mobile data, and use them when your device is offline.

For Android, open Google Maps and search for a specific place or location. In the placesheet, swipe right, then tap More > Download offline map > Download. For iPhone, search for a location on Google Maps, then, at the bottom of your screen, tap the name or address of the place. Tap More > Download offline map > Download.

After you download an area, use Google Maps as you normally would. If you go offline, your offline maps will guide you to your destination as long as the entire route is within the offline map.

Enable Detailed Voice Guidance

Get better instructions

Voice guidance is a basic yet powerful navigation tool that can come in handy during walks in unfamiliar locations and can be used to ensure your journey is on the right path. To ensure guidance audio is enabled, go to your Google Maps profile (upper right corner), then tap Settings > Navigation > Sound and Voice. Here, tap “Unmute” on “Guidance Audio.”

Apart from this, you can also use Google Assistant to help you along your journey, asking questions about your destination, nearby sights, detours, additional stops, etc. To use this feature on iPhone, map a walking route to a destination, then tap the mic icon in the upper-right corner. For Android, you can also say “Hey Google” after mapping your destination to activate the assistant.

Voice guidance is handy for both new and old places, like when you’re running errands and need to navigate hands-free.

Add multiple stops

Keep your trip going

If you walk regularly to run errands, Google Maps has a simple yet effective feature that can help you plan your route in a better way. With Maps’ multiple stop feature, you can add several stops between your current and final destination to minimize any wasted time and unnecessary detours.

To add multiple stops on Google Maps, search for a destination, then tap “Directions.” Select the walking option, then click the three dots on top (next to “Your Location”), and tap “Edit Stops.” You can now add a stop by searching for it and tapping “Add Stop,” and swap the stops at your convenience. Repeat this process by tapping “Add Stops” until your route is complete, then tap “Start” to begin your journey.

You can add up to ten stops in a single route on both mobile and desktop, and use the journey for multiple modes (walking, driving, and cycling) except public transport and flights. I find this Google Maps feature to be an essential tool for travel to walkable cities, especially when I’m planning a route I am unfamiliar with.


More to discover

A new feature to keep an eye out for, especially if you use Google Maps for walking and cycling, is Google’s Gemini boost, which will allow you to navigate hands-free and get real-time information about your journey. This feature has been rolling out for both Android and iOS users.



Source link