ChatTTS
AI Voice Generator

Introduction
ChatTTS is a voice generation model designed for conversational scenarios. It is ideal for applications such as dialogue tasks for large language model assistants, as well as conversational audio and video introductions. The model supports both Chinese and English, demonstrating high quality and naturalness in speech synthesis. This level of performance is achieved through training on approximately 100,000 hours of Chinese and English data. The project team plans to open-source a basic model trained with 40,000 hours of data, which will aid the academic and developer communities in further research and development.
How To Use
To use ChatTTS:
1. Download the code from GitHub.
2. Install the necessary dependencies (torch and ChatTTS).
3. Import the required libraries and initialize ChatTTS.
4. Prepare your text.
5. Generate speech using the infer method.
6. Play the generated audio using the Audio class from IPython.display.
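The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's official example: the helper names (`load_chat`, `prepare_texts`, `synthesize`, `demo`) are hypothetical, the 24 kHz playback rate is an assumption, and the loader method has been named differently across ChatTTS releases, so the sketch tries `load` first and falls back to `load_models`. Check the project README for the API of the version you install.

```python
from typing import List, Sequence

# Assumed playback rate; confirm against the ChatTTS version you install.
SAMPLE_RATE = 24_000


def load_chat():
    """Create and initialize a ChatTTS.Chat instance.

    Assumes the dependencies are already installed, e.g. `pip install torch ChatTTS`.
    """
    import ChatTTS  # imported lazily so the helpers below work without the package

    chat = ChatTTS.Chat()
    # The loader name differs between releases (assumption: `load` or `load_models`).
    loader = getattr(chat, "load", None) or getattr(chat, "load_models")
    loader()
    return chat


def prepare_texts(raw: Sequence[str]) -> List[str]:
    """Strip surrounding whitespace and drop empty entries before synthesis."""
    return [t.strip() for t in raw if t and t.strip()]


def synthesize(chat, raw_texts: Sequence[str]):
    """Run the model's infer method on the cleaned texts (one waveform per text)."""
    texts = prepare_texts(raw_texts)
    if not texts:
        raise ValueError("no non-empty text to synthesize")
    return chat.infer(texts)


def demo():
    """Notebook-only demo: synthesize one sentence and play it inline."""
    from IPython.display import Audio, display

    chat = load_chat()
    wavs = synthesize(chat, ["Hello, this is a short ChatTTS test."])
    display(Audio(wavs[0], rate=SAMPLE_RATE, autoplay=True))
```

Calling `demo()` in a Jupyter notebook should load the model and play the result inline. Because `synthesize` takes the chat object as an argument, the text handling can be exercised with a stand-in object before downloading any model weights.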
Pricing
Packages | Pricing | Features |
---|---|---|
Free Edition | Free | Unlimited public repositories, limited private repositories |
Team Edition | $4/user/month | Unlimited private repositories, basic features |
Enterprise Edition | $21/user/month | Advanced security and auditing features |