Instant Cloning
Learn here how to add a character and specify a voice sample for instant cloning
Through instant voice cloning, you only need to provide 5-30 seconds of any sample, and without any training of the model, cloning can be completed instantly. Our AI will instantly imitate the tone, speed, emotion, pauses, loudness, acoustic environment, breathing sounds, accent, vocalization and other characteristics of the cloned audio sample based on millions of hours of experience during generation, and try to understand the context of the target text as much as possible, and synthesize them to produce the most expressive and matching speech.
Currently, you can summon the character creation panel through the "Add Character" button on the Character Management page, or use the "Quick Create New Character..." button in the bottom left corner of the Speech Generation page to create a character.

You need to specify a name for the character you create, and optionally specify a description and an avatar. Currently, names, descriptions and avatars are for display only and do not affect voice cloning behavior.
Subsequently, you need to upload an audio file or record an audio as the default style guide sample for this cloning; this default style sample will be used to define the character's default voice performance, including voice line, emotion, speed, tone, prosody, etc. (you can add more different style samples later in the character details page).
After the audio upload is completed, click the Add button in the bottom right corner and wait for processing to complete.
Currently, we only support Chinese and English sample audio. Please ensure that the sample audio you provide contains correctly recognizable Chinese or English content and does not contain content in other languages, otherwise it will cause character creation to fail or lead to other various problems.
For detailed considerations and best practices regarding instant cloning sample audio, please refer to this page.
Do not use our services to clone or generate any content that infringes copyright, violates ethics, or violates the laws and regulations of the People's Republic of China and your local area. All content we generate comes with detailed logs, automatic/manual review, and traceable invisible audio watermarks. If we find that you have violated relevant rules, we reserve the right to terminate your service and report to government agencies and other institutions.
For more information, please refer to Service Agreement, Account Agreement, Privacy Statement.
Last updated
Was this helpful?