Artificial Intelligence Newswire Startups Technology

Hume’s EVI 3 AI Now Creates Custom Voices Faster Than Ever

The Wiz May 30, 2025Last Updated: May 30, 2025

2 minutes read

A smiling person sits at a desk, working on a computer surrounded by colorful speech bubbles representing online communication.

▼ Summary

– Hume has launched EVI 3, an AI voice model designed for customer support, health coaching, storytelling, and companionship, offering natural and empathetic interactions.
– EVI 3 allows users to create custom voices by speaking to it and focuses on emotional understanding, tone adjustment, and faster responses.
– Developers can soon access EVI 3 via Hume’s API to integrate it into apps, though pricing details are pending, with usage-based models expected.
– Hume’s internal benchmarks show EVI 3 outperforms OpenAI’s GPT-4o and Google’s Gemini in naturalness, expressiveness, and emotional intelligence.
– While lacking voice cloning, Hume plans to add this feature to its Octave TTS model, prioritizing ethical safeguards before release.

Hume’s latest AI voice technology delivers unprecedented customization and emotional intelligence, setting a new benchmark for natural conversational experiences. The New York-based startup has launched EVI 3, its most advanced empathic voice interface yet, designed to transform industries from customer service to entertainment with lifelike synthetic voices.

Unlike conventional voice assistants, EVI 3 generates unique vocal personalities on demand—users simply describe their desired tone and characteristics, from warm confidence to playful mischief. During testing, the system produced customized voices within seconds, outperforming standard offerings from tech giants in realism and responsiveness.

Businesses and developers gain powerful tools with this release. The platform enables precise control over vocal emotion, speech patterns, and conversational style, allowing tailored experiences for diverse applications. Whether crafting an urgent French-accented mouse character for gaming or developing compassionate health coaching tools, EVI 3 adapts to creative and commercial needs alike.

While currently accessible through Hume’s demo platform, full API integration arrives soon, letting teams embed these capabilities directly into their products. Early benchmarks suggest significant advantages over competing models—users consistently preferred EVI 3 over OpenAI’s GPT-4o across multiple metrics including emotional resonance and interruption handling.

Key technical differentiators include:

300-millisecond response times for fluid dialogue
Multilingual support beginning with English and Spanish
Real-time voice modulation during conversations
Expressive prosody generation that captures human speech nuances

Notably absent is voice cloning functionality—a deliberate choice by Hume as it develops ethical safeguards. The company plans to introduce this separately through its Octave text-to-speech system, requiring just five seconds of sample audio for replication.

Pricing follows a flexible model across Hume’s product suite. While EVI 3’s exact rates remain undisclosed, the existing EVI 2 structure suggests cost-effective, usage-based plans with enterprise options. The Octave TTS platform already offers tiered subscriptions, from free basic access to high-volume business solutions.

Behind this innovation stands Hume’s unique research approach. The team, led by ex-Google DeepMind scientist Alan Cowen, trained their models on extensive behavioral datasets capturing vocal patterns and facial expressions. This foundation allows EVI 3 to interpret subtle emotional cues and respond with appropriate tonality—whether mirroring a user’s excitement or adjusting to frustration.

The technology builds on Hume’s previous milestones, including February’s Octave launch for emotion-controlled narration and early 2024’s EVI 2 update with faster processing. With developer access imminent, EVI 3 positions Hume at the forefront of emotionally intelligent voice interfaces, offering tools to reshape how humans interact with machines through speech.

(Source: VentureBeat)

Topics

evi 3 ai voice model 95% custom voice creation 90% emotional understanding tone adjustment 88% api integration developers 85% Performance Benchmarks 80% voice cloning future plans 75% Multilingual Support 70% real-time voice modulation 68% pricing models 65% humes research approach 60%

Hume’s EVI 3 AI Now Creates Custom Voices Faster Than Ever

Topics

The Wiz

Read Next

Perplexity AI Browser Now on Windows, Android Coming Soon

CoinMarketCap hacked: Fake Web3 popup drained crypto wallets

Sonic Racing CrossWorlds Adds SpongeBob, TMNT & Avatar Characters

Perplexity AI Browser Now on Windows, Android Coming Soon

CoinMarketCap hacked: Fake Web3 popup drained crypto wallets

Sonic Racing CrossWorlds Adds SpongeBob, TMNT & Avatar Characters

Empathy in AI: The Key to Overcoming Fear and Boosting Fluency

Master Programming Faster with AI: A Beginner’s Guide

Why AI Chatbots Keep Users Hooked

Your New AI Coworker Is Here. Getting Along Is Complicated.

TinyAI: Intelligence in Your Pocket

Protect Your Legacy: How to Draft a Will for the AI Age

3 Future-Proof Careers to Avoid AI Job Loss, Says Bill Gates

Part 4: Key Players in the Semiconductor Industry

Part 3: Core Technologies and Trends in the Semiconductor Industry

Topics

Read Next

Perplexity AI Browser Now on Windows, Android Coming Soon

CoinMarketCap hacked: Fake Web3 popup drained crypto wallets

Sonic Racing CrossWorlds Adds SpongeBob, TMNT & Avatar Characters

Adblock Detected