Dia
UltraDialogue-oriented TTS with voice cloning and nonverbal sounds
About Dia
Dia by Nari Labs is a 1.6B parameter dialogue-focused text-to-speech model. It excels at generating natural conversational speech with support for nonverbal sounds like laughter, sighs, and coughs. Dia supports multi-speaker dialogue generation and voice cloning from 5-10 seconds of reference audio, making it ideal for creating realistic conversations and character voices.
Key Features
Dialogue Generation
Generate natural multi-speaker conversations with distinct voices and turn-taking.
Nonverbal Sounds
Add [laughs], [sighs], [coughs], (gasps) for natural paralinguistic expression.
Voice Cloning
Clone any voice from 5-10 seconds of reference audio for personalized speech.
Natural Conversation
1.6B parameters produce highly natural conversational prosody and intonation.
Use Cases
Frequently Asked Questions
Technical Specs
- Generation Speed Medium
- Output Quality Excellent
- Voice Cloning Supported
- Languages 1
- GPU VRAM 10GB
- Credits/1000 chars 50