Antzoki TTS

Antzoki-TTS: Adding Emotion and Acting Capabilities

Traditional Text-to-Speech technology gives us solid voices today, but offers no way to guide the interpretation of the audio. If we want to use these models to produce audiobooks, stories, game audio, etc., we soon realize the results are not ideal. We get cold, neutral voices. Robotic. Boring. Lately, however, models that allow us to specify paralinguistic cues (pauses, breathing, and emotional variations) are beginning to emerge. Thanks to this technology, by providing script-like instructions, we can create emotionally rich synthetic voices. ...

May 16, 2026

New Text-to-Speech Models for Basque: Maider, Antton and OmniVoice

In recent years, text-to-speech (TTS) models have advanced enormously. Month after month, new releases improved pronunciation, prosody, and overall audio quality. Unfortunately, most of those models were designed for English, or at best compatible with only a few languages. The tools available for generating Basque voices used to be quite limited: paid proprietary systems (such as Elhuyar’s neural TTS, developed by Orai), or older robotic voices that had clearly fallen behind. ...

April 4, 2026