{"id":2975,"date":"2024-01-18T09:39:37","date_gmt":"2024-01-18T01:39:37","guid":{"rendered":"https:\/\/www.1ai.net\/?p=2975"},"modified":"2024-01-18T09:39:37","modified_gmt":"2024-01-18T01:39:37","slug":"gpt-sovits%ef%bc%9a%e4%b8%80%e4%b8%aa%e5%bc%ba%e5%a4%a7%e7%9a%84%e9%9b%b6%e6%a0%b7%e6%9c%ac%e8%af%ad%e9%9f%b3%e8%bd%ac%e6%8d%a2%e5%92%8c%e6%96%87%e6%9c%ac%e5%88%b0%e8%af%ad%e9%9f%b3webui","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/2975.html","title":{"rendered":"GPT-SoVITS: A Robust Zero-Shot Speech Conversion and Text-to-Speech WebUI"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-2976\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/01\/202401180900378890.png\" alt=\"\" width=\"1440\" height=\"900\" \/><\/p>\n<p><a href=\"https:\/\/www.1ai.net\/en\/tag\/gpt-sovits\" title=\"[See article with [GPT-SoVITS] label]\" target=\"_blank\" >GPT-SoVITS<\/a>-WebUI is a powerful zero-shot speech conversion and text-to-speech WebUI. It features zero-shot TTS, few-shot TTS, cross-lingual support, and WebUI tools. It supports English, Japanese, and Chinese, and provides integrated tools, including vocal and accompaniment separation, automatic training set segmentation, Chinese ASR, and text annotation, to help beginners create training datasets and GPT\/SoVITS models. Users can experience instant text-to-speech conversion by providing a 5-second voice sample, and can fine-tune the model with only 1 minute of training data to improve voice similarity and fidelity. 
The project documentation covers environment preparation, supported Python and PyTorch versions, quick and manual installation, pretrained models, the dataset format, a to-do list, and acknowledgements.<\/p>\n<div class=\"detail-dl-div-item\" data-v-86be4cee=\"\">\n<p class=\"detail-dl-div-item-t\" data-v-86be4cee=\"\">Target group:<\/p>\n<p class=\"detail-dl-div-item-c\" data-v-86be4cee=\"\">\u201cSuitable for users working on voice conversion, speech synthesis, speech processing, and similar tasks.\u201d<\/p>\n<\/div>\n<div class=\"detail-dl-div-item\" data-v-86be4cee=\"\">\n<p class=\"detail-dl-div-item-t\" data-v-86be4cee=\"\">Example usage scenarios:<\/p>\n<p class=\"detail-dl-div-item-c\" data-v-86be4cee=\"\">Users can experience instant text-to-speech conversion by providing a 5-second voice sample<\/p>\n<p class=\"detail-dl-div-item-c\" data-v-86be4cee=\"\">Users can fine-tune the model with only 1 minute of training data to improve voice similarity and fidelity<\/p>\n<p class=\"detail-dl-div-item-c\" data-v-86be4cee=\"\">Users can run inference in languages different from those in the training dataset; English, Japanese, and Chinese are currently supported<\/p>\n<p data-v-86be4cee=\"\">Official website address: <a href=\"https:\/\/github.com\/RVC-Boss\/GPT-SoVITS\">https:\/\/github.com\/RVC-Boss\/GPT-SoVITS<\/a><\/p>\n<p>&nbsp;<\/p>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>GPT-SoVITS-WebUI is a powerful zero-shot speech conversion and text-to-speech WebUI. It features zero-shot TTS, few-shot TTS, cross-lingual support, and WebUI tools. It supports English, Japanese, and Chinese, and provides integrated tools, including vocal and accompaniment separation, automatic training set segmentation, Chinese ASR, and text annotation, to help beginners create training datasets and GPT\/SoVITS models. 
Users can experience instant text-to-speech conversion by providing a 5-second voice sample, and can fine-tune the model with only 1 minute of training data to improve voice similarity and fidelity. The project documentation covers environment preparation, supported Python and PyTorch versions, quick and manual installation, pretrained models<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[138,147],"tags":[952,953],"collection":[],"class_list":["post-2975","post","type-post","status-publish","format-standard","hentry","category-product","category-yinpin","tag-gpt-sovits","tag-953"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/2975","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=2975"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/2975\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=2975"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=2975"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=2975"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=2975"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}