pdf-icon

StackFlow AI Platform

Module LLM Applications

CV Vision Application

Vision Language Model (VLM)

Large Language Model (LLM)

Voice Assistant

CosyVoice2-0.5B

Introduction

CosyVoice2-0.5B is a high-quality multilingual text-to-speech model with voice cloning capabilities, provided by FunAudioLLM.

Available NPU Models

cosyvoice2-0.5b-ax650

  • Supported languages: Chinese, English, Japanese, Korean, etc.
  • Supported platform: AI Pyramid
  • Maximum single output audio length: 27s
  • TTFT: 401.84ms
  • RTF: 1.36

Installation

apt install llm-model-cosyvoice2-0.5b-ax650
On This Page