模型:
microsoft/speecht5_hifigan
这是与SpeechT5文本到语音和语音转换模型配套使用的HiFi-GAN声码器。
SpeechT5首次发布于 this repository 年, original weights 年。使用的许可证是 MIT 。
声明:发布SpeechT5的团队没有为此模型编写模型卡片,因此这个模型卡片是由Hugging Face团队编写的。
BibTeX:
@inproceedings{ao-etal-2022-speecht5, title = {{S}peech{T}5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing}, author = {Ao, Junyi and Wang, Rui and Zhou, Long and Wang, Chengyi and Ren, Shuo and Wu, Yu and Liu, Shujie and Ko, Tom and Li, Qing and Zhang, Yu and Wei, Zhihua and Qian, Yao and Li, Jinyu and Wei, Furu}, booktitle = {Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)}, month = {May}, year = {2022}, pages={5723--5738}, }