TTS API Server Implementation #141
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
主要功能
实现了一个基于 FastAPI 的文本转语音服务,提供音色管理、语音合成和声音克隆功能。核心功能包括三个主要接口:
1. 音色管理接口
GET /voices
: 获取已注册音色列表POST /register_voice
: 注册新音色到系统(需提供参考音频和文本)2. 文本转语音接口
POST /tts
: 使用已注册音色进行语音合成speaker
(指定音色名称),tts_text
(合成文本)3. 即时克隆接口
POST /clone
: 单次语音克隆合成prompt_wav
(参考音频),prompt_text
(参考文本),tts_text
(目标文本)使用示例