Speech Synthesis 2.0
With this release, the speech synthesis will get even better! We have added changes to how we train the model resulting in even better results on longer fragments. Our core changes include:
- Support for cased input, this makes it easier for the model to read names (like OpenAI or ChatGPT), construct pauses between fragments or names
- Longer & better training - the model seems to perform better on our long-form benchmarks
- Necessary components to support infilling - contextual changes to fragments
- Necessary components to extend the model across languages on the same platform
It is expected that your cloned or default Voices may have minor changes. Enjoy!