Introduction
CosyVoice2-0.5B is a high-quality multilingual text-to-speech model with voice cloning capabilities, provided by FunAudioLLM.
Available NPU Models
cosyvoice2-0.5b-ax650
- Supported languages: Chinese, English, Japanese, Korean, etc.
- Supported platform: AI Pyramid
- Maximum single output audio length: 27s
- TTFT: 401.84ms
- RTF: 1.36
Installation
apt install llm-model-cosyvoice2-0.5b-ax650