> For the complete documentation index, see [llms.txt](https://docs.vocu.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.vocu.ai/voices/overview.md).

# Overview

You can perform [instant voice cloning](/voices/create.md) by creating a character and uploading or recording a short audio sample for it. You can also perform [professional voice cloning](/voices/pro-clone.md) by providing 1-60 minutes of audio samples, completing it within 3-60 minutes.

After that, you can assign these characters to different texts in speech synthesis to let AI use these character voices for reading.

Currently, you can summon the character creation panel by clicking the **"Add Character"** button on the [Voice Management](https://app.vocu.ai/voices) page, or by selecting the **"Create New Character..."** button in the popup when selecting a character on the [Vocu Studio](https://app.vocu.ai/generate) page, and use it to create a character.

### Instant Cloning

[Instant cloning](/voices/create.md) allows you to clone a voice almost instantly from a very short sample. It should be noted that the basic principle of instant cloning is not to create or train a new model based on the provided voice sample, but to let AI reasonably guess and imitate based on the massive data it has learned. Our model has been trained on a large amount of regular speech, so it should be very effective for most natural speech processing in theory.

However, our model still has some imperfections. If the voice sample you provide is relatively unique and our AI has not learned similar voices before, it may lead to poor generation results or inability to replicate the voice well. Currently, for specific introductions, shortcomings and limitations of our various models, please refer to [Model Introduction](/introduction/models.md)

{% hint style="info" %}
Sample quality is more important than length. Noisy samples may produce poor results. Please provide high-quality sample speech as much as possible. Currently, sample speech length needs to be **greater than 2 seconds** and **file size not exceeding 20M**. You can also try to obtain high-quality vocal audio samples from any audio using **vocal separation/audio noise reduction/vocal beautification/loudness normalization** and other functions of some **audio editing software**.
{% endhint %}

### Professional Cloning

Through [professional voice cloning](/voices/pro-clone.md), you only need to provide one minute or longer (up to 60 minutes) voice samples, and our AI will deeply train and learn every detail of the voice samples you provide, including every tone, pronunciation method, rhythm, prosody, etc., within 3-60 minutes, achieving top-level cloning and synthesis effects indistinguishable from the original voice, while retaining all cutting-edge features of the Vocu voice large model such as language understanding, emotional expressiveness, etc.

{% hint style="warning" %}
V2 series models **(V2.9) only support Chinese and English**. When using V2 series models, please ensure that the input text does not contain any non-Chinese and English characters, such as Japanese and Korean, otherwise it may cause **generation failure** and other issues.

Starting from V3 series, we have added Cantonese, Japanese, Korean, French, German, Spanish and Portuguese in addition to Chinese and English, as well as more than 30 accent variants of these languages in total. Please ensure that the model version and text content you use are in line with the corresponding support capabilities.
{% endhint %}


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.vocu.ai/voices/overview.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.