Dataset:

msarmi9/korean-english-multitarget-ted-talks-task

Languages:

en ko

Language creators:

other

Annotations creators:

expert-generated

Dataset Card for english-korean-multitarget-ted-talks-task

Dataset Summary

  • Parallel English-Korean Text Corpus
  • Text was originally transcribed in English from various TED Talks, then translated into Korean by TED translators
  • Approximately 166k training, 2k validation, and 2k test sentence pairs
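
The split sizes listed above can be checked after loading the corpus. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that the repository id shown at the top of this card resolves on the Hub. The helper names `load_ted_en_ko` and `split_sizes` are hypothetical and not part of the dataset or the library.

```python
def load_ted_en_ko(repo_id="msarmi9/korean-english-multitarget-ted-talks-task"):
    """Download the corpus from the Hugging Face Hub (needs network access).

    The import is done lazily so that split_sizes() below can be used
    without the `datasets` package installed.
    """
    from datasets import load_dataset

    # With no split argument, load_dataset returns a DatasetDict,
    # a dict-like mapping from split name to Dataset.
    return load_dataset(repo_id)


def split_sizes(dataset_dict):
    """Return {split name: number of rows} for any mapping of sized splits."""
    return {name: len(split) for name, split in dataset_dict.items()}
```

Calling `split_sizes(load_ted_en_ko())` should report counts close to the approximate figures in the summary. The split names and column names are not stated in this card, so inspect the returned object before relying on them.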

Supported Tasks and Leaderboards

  • Machine Translation

Languages

  • English
  • Korean

Additional Information

Dataset Curators

Kevin Duh, "The Multitarget TED Talks Task", http://www.cs.jhu.edu/~kevinduh/a/multitarget-tedtalks/, 2018

Licensing Information

TED makes its collection available under the Creative Commons BY-NC-ND license. Please acknowledge TED when using this data. We acknowledge the authorship of TED Talks (BY condition). We are not redistributing the transcripts for commercial purposes (NC condition) nor making derivative works of the original contents (ND condition).

Citation Information

@misc{duh18multitarget,
  author = {Kevin Duh},
  title = {The Multitarget TED Talks Task},
  howpublished = {\url{http://www.cs.jhu.edu/~kevinduh/a/multitarget-tedtalks/}},
  year = {2018},
}