Datasets:

hanamizuki-ai/genshin-voice-v3.3-mandarin

Languages:

zh

Multilinguality:

monolingual

Source datasets:

original

Dataset Card for Genshin Voice

Dataset Description

Dataset Summary

The Genshin Voice dataset is a text-to-speech dataset of voice lines from different Genshin Impact characters, unpacked from the game.

Languages

The text in the dataset is in Mandarin.

Dataset Creation

Source Data

Initial Data Collection and Normalization

The data was obtained by unpacking the Genshin Impact game.

Who are the source language producers?

The language producers are employees of Hoyoverse and contractors from EchoSky Studio.

Annotations

The dataset contains official annotations from the game, including in-game speaker names and transcripts.
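
As a rough illustration of how the speaker and transcript annotations might be used, the sketch below tallies voice lines per speaker. The field names `speaker` and `transcript` are assumptions made for illustration and are not confirmed by this card, and the sample records are placeholders rather than data from the dataset.

```python
from collections import Counter
from typing import Iterable, Mapping


def count_lines_per_speaker(
    records: Iterable[Mapping[str, str]],
    speaker_key: str = "speaker",  # hypothetical field name, check the real columns
) -> Counter:
    """Count how many records each speaker has."""
    counts: Counter = Counter()
    for record in records:
        # Records with a missing or empty speaker are grouped under "<unknown>".
        counts[record.get(speaker_key) or "<unknown>"] += 1
    return counts


# Placeholder records standing in for dataset rows (not real data).
sample = [
    {"speaker": "SpeakerA", "transcript": "placeholder line 1"},
    {"speaker": "SpeakerB", "transcript": "placeholder line 2"},
    {"speaker": "SpeakerA", "transcript": "placeholder line 3"},
    {"speaker": "", "transcript": "placeholder line 4"},
]

speaker_counts = count_lines_per_speaker(sample)
print(speaker_counts.most_common())
```

Rows from a split loaded with the Hugging Face `datasets` library (for example via `load_dataset("hanamizuki-ai/genshin-voice-v3.3-mandarin")`) could presumably be passed in the same way, once the actual column names have been checked and `speaker_key` adjusted to match.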

Additional Information

Dataset Curators

The dataset was initially created by w4123 in his GitHub repository.

Licensing Information

Copyright © COGNOSPHERE. All Rights Reserved.