数据集:

fscheffczyk/2D_20newsgroups_embeddings

中文

Dataset Card for feature vector embeddings of the 20newsgroup dataset

Dataset Summary

This dataset contains dimensional reduced vector embeddings of the 20newsgroups dataset . This dataset contains two dimensions.

The dimensional reduced embeddings were created with the TruncatedSVD function from the scikit-learn library . These reduced feature vectors are based on the fscheffczyk/20newsgroup_embeddings dataset .

Supported Tasks and Leaderboards

[More Information Needed]

Languages

[More Information Needed]

Dataset Structure

Data Instances

[More Information Needed]

Data Fields

[More Information Needed]

Data Splits

[More Information Needed]

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

[More Information Needed]

Citation Information

[More Information Needed]

Contributions

Thanks to @github-username for adding this dataset.