Zero-shot expressive voice cloning and speech generation. Describe how a voice sounds and feels, write what it should say, and the model generates a full vocal performance.
Built by Scenema AI, the AI filmmaking platform. GitHub | Demos & Samples