Datasets:

myanmar_news

Languages:

my

Multilinguality:

monolingual

Size:

1K<n<10K

Language creators:

found

Annotation creators:

found

Source datasets:

original

License:

gpl-3.0

Dataset Card for Myanmar_News

Dataset Summary

The Myanmar news dataset contains article snippets in four categories: Business, Entertainment, Politics, and Sport.

These were collected in October 2017 by Aye Hninn Khine.

Languages

Myanmar/Burmese language

Dataset Structure

Data Fields

  • text - text from article
  • category - a topic: Business, Entertainment, Politic, or Sport (note the spellings: Politic and Sport are singular)
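Because the category values are plain strings with slightly unusual spellings, a classifier pipeline typically needs an explicit label mapping. The following is a minimal sketch, not part of the dataset itself: the names `CATEGORIES`, `LABEL_TO_ID`, and `encode_category` are hypothetical, and the integer id order is an assumption chosen for illustration.

```python
# Category names as spelled in the dataset (note "Politic" and "Sport").
CATEGORIES = ["Business", "Entertainment", "Politic", "Sport"]

# Hypothetical mapping; the id order is an assumption, not taken from the dataset.
LABEL_TO_ID = {name: i for i, name in enumerate(CATEGORIES)}
ID_TO_LABEL = {i: name for name, i in LABEL_TO_ID.items()}


def encode_category(category: str) -> int:
    """Return the integer id for a category, tolerating surrounding whitespace."""
    key = category.strip()
    if key not in LABEL_TO_ID:
        # Fail loudly on misspellings such as "Politics" instead of guessing.
        raise ValueError(f"unknown category: {category!r}")
    return LABEL_TO_ID[key]
```

Raising on unknown labels, rather than silently normalizing them, makes a mismatch such as "Politics" versus "Politic" visible early.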

Data Splits

One training set (8,116 total rows)

Source Data

Initial Data Collection and Normalization

Data was collected by Aye Hninn Khine and shared on GitHub with a GPL-3.0 license.

Multiple text files were consolidated into one labeled CSV file by Nick Doiron.
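The consolidation step could look roughly like the sketch below. It is not the script that was actually used: it assumes a hypothetical layout with one subdirectory per category, each holding UTF-8 `.txt` files, and the function name `consolidate` and the `text`/`category` column order are illustrative choices.

```python
import csv
from pathlib import Path


def consolidate(source_dir: Path, output_csv: Path) -> int:
    """Write one (text, category) row per .txt file; return the row count."""
    rows = 0
    with output_csv.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["text", "category"])
        # Each subdirectory name is treated as the category label (assumed layout).
        for category_dir in sorted(p for p in source_dir.iterdir() if p.is_dir()):
            for text_file in sorted(category_dir.glob("*.txt")):
                text = text_file.read_text(encoding="utf-8").strip()
                if text:  # skip empty files
                    writer.writerow([text, category_dir.name])
                    rows += 1
    return rows
```

Reading and writing with an explicit `encoding="utf-8"` matters here, since Burmese text would otherwise depend on the platform's default encoding.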

Additional Information

Dataset Curators

Contributors to original GitHub repo:

Licensing Information

GPL-3.0

Citation Information

See https://github.com/ayehninnkhine/MyanmarNewsClassificationSystem

Contributions

Thanks to @mapmeld for adding this dataset.