数据集:
Jzuluaga/uwb_atcc
UWB-ATCC语料库由捷克西波希米亚大学控制工程系提供。该语料库包含空中交通管制员与飞行员之间的通信录音。音频经过手动转录,并用发言人的信息(飞行员/控制员,而不是完整的个人身份)进行标注。当前语料库规模较小(20小时),但我们计划在明年寻找更多的数据。音频数据格式为:8kHz、16bit PCM、单声道。
从以下的方括号``字段中,您可以获取到发言人的角色信息。例如:
文本和录音为英文。作者利用了他们的一个工业合作伙伴的优势,该合作伙伴为多个ATC机构和机场开发复杂的IT解决方案,并且能够获得在捷克领空收集的ATC通信录音。这个合作伙伴能够获取到以下数据:
(并非所有数据都已发布。请检查他们的网站 here )
数据集的许可状态取决于 UWB-ATCC corpus 创建者的法律地位。
他们采用了 Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) 许可证。
准备、处理、规范化和上传数据集到HuggingFace的贡献者:
@article{zuluaga2022how, title={How Does Pre-trained Wav2Vec2. 0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications}, author={Zuluaga-Gomez, Juan and Prasad, Amrutha and Nigmatulina, Iuliia and Sarfjoo, Saeed and others}, journal={IEEE Spoken Language Technology Workshop (SLT), Doha, Qatar}, year={2022} } @article{zuluaga2022bertraffic, title={BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications}, author={Zuluaga-Gomez, Juan and Sarfjoo, Seyyed Saeed and Prasad, Amrutha and others}, journal={IEEE Spoken Language Technology Workshop (SLT), Doha, Qatar}, year={2022} } @article{zuluaga2022atco2, title={ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications}, author={Zuluaga-Gomez, Juan and Vesel{\`y}, Karel and Sz{\"o}ke, Igor and Motlicek, Petr and others}, journal={arXiv preprint arXiv:2211.04054}, year={2022} }
数据集作者:
@article{vsmidl2019air, title={Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development}, author={{\v{S}}m{\'\i}dl, Lubo{\v{s}} and {\v{S}}vec, Jan and Tihelka, Daniel and Matou{\v{s}}ek, Jind{\v{r}}ich and Romportl, Jan and Ircing, Pavel}, journal={Language Resources and Evaluation}, volume={53}, number={3}, pages={449--464}, year={2019}, publisher={Springer} }