Overview
This page explains how to create characters and clone voices, and offers usage tips.
To clone a voice instantly, create a character and upload or record a short audio sample for it. For professional voice cloning, provide 1 to 60 minutes of audio samples; training completes within 3 to 60 minutes.
You can then assign these characters to different passages of text in speech synthesis so that the AI reads each passage in that character's voice.
To create a character, use the "Add Character" button on the Character Management page or the "Quick Create New Character..." button in the bottom left corner of the Speech Generation page.
Instant Cloning
Instant cloning clones a voice almost instantly from a very short sample. It does not create or train a new model from the samples you provide; instead, the AI infers and imitates the voice based on the large body of data it has already learned. Because our model has been trained on a large amount of ordinary speech, it should work well for most natural speech.
However, our model is not perfect. If your voice sample is unusual and the AI has not learned similar voices, the results may be poor or the voice may not be reproduced well. For details on each model and its shortcomings and limitations, please refer to Model Introduction.
Professional Cloning
Professional voice cloning requires voice samples totaling at least one minute and at most 60 minutes. Within 3 to 60 minutes, our AI trains in depth on the samples you provide and learns their details, including tone, pronunciation, rhythm, and prosody. The result is top-level cloned speech that is indistinguishable from the original voice while retaining all the advanced features of the Vocu large speech model, such as language understanding and emotional expressiveness.
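Before uploading, it can help to check locally whether a recording falls within the 1 to 60 minute range stated above. The following is a minimal sketch, not part of the Vocu product or its API: it assumes the sample is an uncompressed WAV file, uses only Python's standard library, and the function and constant names are hypothetical.

```python
import wave

# Limits taken from the text above: professional cloning accepts
# between 1 and 60 minutes of sample audio.
PRO_MIN_SECONDS = 60
PRO_MAX_SECONDS = 60 * 60


def wav_duration_seconds(path):
    """Return the duration of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def meets_professional_length(duration_seconds):
    """Hypothetical helper: True if the duration is within 1 to 60 minutes."""
    return PRO_MIN_SECONDS <= duration_seconds <= PRO_MAX_SECONDS
```

For example, `meets_professional_length(wav_duration_seconds("sample.wav"))` reports whether a local file named `sample.wav` is long enough for professional cloning. Other audio formats would need a different way of reading the duration.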
Both professional and instant cloning currently support only Chinese and English sample audio. Make sure your sample audio contains clearly recognizable Chinese or English speech and no content in other languages; otherwise character creation may fail or other problems may occur.
Do not use our services to clone or generate any content that infringes copyright, violates ethics, or violates the laws and regulations of the People's Republic of China or your local area. All content we generate is subject to detailed logging, automatic and manual review, and traceable invisible audio watermarks. If we find that you have violated the relevant rules, we reserve the right to terminate your service and report the violation to government agencies and other institutions.
For more information, please refer to the Service Agreement, Account Agreement, and Privacy Statement.