Zac Zuo

16d ago

TADA - 1:1 text-acoustic alignment for 5x faster speech generation

TADA (Text-Acoustic Dual Alignment) is Hume AI's open-source speech-language model. It synchronizes text and speech into a single continuous stream via 1:1 token alignment. It generates audio at 5x the speed of conventional LLM-based TTS systems, with no skipped words or content hallucinations across 1000+ tests.
Ben Lang

1yr ago

Octave TTS - Describe any AI voice and prompt its emotional delivery

The first LLM for text-to-speech. While other TTS systems just “read” words, Octave grasps their meaning. Create any AI voice with a descriptive prompt, guide its emotional delivery (angrier! more sarcasm!), and bring your stories to life with human-like expression.
Ankit Sharma

1yr ago

Hume OCTAVE - A next-generation speech-language model

A frontier speech-language model with new emergent capabilities, such as on-the-fly voice and personality creation.
Ankit Sharma

1yr ago

Hume AI - The foundational voice model for any interface

Empathic AI research lab building multimodal AI with emotional intelligence. Experience our API: https://demo.hume.ai