allenai/led-large-16384 | ATYUN.COM official site: a full-service platform for AI tutorials and news

Model:

allenai/led-large-16384

Task:

Text-to-text generation

Libraries:

PyTorch, TensorFlow, Transformers

Language:

Other:

led, AutoTrain Compatible

Preprint:

arxiv:2004.05150

License:

apache-2.0

Introduction

AllenAI's Longformer Encoder-Decoder (LED).

As described in Longformer: The Long-Document Transformer by Iz Beltagy, Matthew E. Peters, and Arman Cohan, led-large-16384 was initialized from bart-large, since both models share the exact same architecture. To be able to process 16K tokens, bart-large's position embedding matrix was simply copied 16 times.
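The copying step can be illustrated with a small sketch. This is not the authors' conversion script: the helper name tile_position_embeddings and the toy 4 x 3 matrix are assumptions made for illustration, and the only figure carried over from the real models is the position count (1,024 positions in bart-large, repeated 16 times to reach 16,384).

```python
def tile_position_embeddings(base, copies):
    """Return a new matrix made of `copies` back-to-back copies of `base`.

    `base` is a list of rows (one row per position). Hypothetical helper,
    written only to illustrate the "copied 16 times" initialization.
    """
    tiled = []
    for _ in range(copies):
        # Copy each row so the extended matrix does not alias the original rows.
        tiled.extend(list(row) for row in base)
    return tiled


# Toy stand-in for a learned position embedding matrix: 4 positions, 3 dims.
base = [[float(pos * 10 + dim) for dim in range(3)] for pos in range(4)]
extended = tile_position_embeddings(base, copies=16)

# Position 4 of the extended matrix reuses the embedding of position 0,
# position 5 reuses position 1, and so on.
print(len(extended))            # 64
print(extended[4] == base[0])   # True

# The same arithmetic at real scale.
BART_LARGE_POSITIONS = 1024
LED_POSITIONS = BART_LARGE_POSITIONS * 16
print(LED_POSITIONS)            # 16384
```

Because every block of positions starts from the same learned values, the extended model begins with sensible local position information and can then adapt the later positions during fine-tuning.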

This model is especially interesting for long-range summarization and question answering.
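A minimal sketch of how the checkpoint might be loaded for long-document summarization with the Transformers library follows. The function names first_token_global_mask and summarize, the beam count, and the output length are illustrative assumptions, not settings from the model card. Placing global attention on the first token only follows the general recommendation in the Transformers LED documentation for summarization. Note that this checkpoint is not fine-tuned, so its raw output may not be a useful summary until it has been trained on a downstream task.

```python
def first_token_global_mask(seq_len):
    """Return a 0/1 list that enables global attention on position 0 only."""
    if seq_len <= 0:
        raise ValueError("seq_len must be positive")
    return [1] + [0] * (seq_len - 1)


def summarize(text, model_name="allenai/led-large-16384", max_new_tokens=128):
    """Generate text from a long input (hypothetical wrapper, assumed settings)."""
    # Imported lazily so the mask helper above works without these packages.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    # Truncate at the 16,384-token limit the model was extended to.
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=16384)
    seq_len = inputs["input_ids"].shape[1]

    # Shape (1, seq_len): one row for the single input document.
    global_attention_mask = torch.tensor([first_token_global_mask(seq_len)])

    with torch.no_grad():
        output_ids = model.generate(
            inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            global_attention_mask=global_attention_mask,
            max_new_tokens=max_new_tokens,  # assumed value
            num_beams=2,                    # assumed value
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


print(first_token_global_mask(5))  # [1, 0, 0, 0, 0]
```

Calling summarize(long_document) downloads the model weights on first use, so it is best tried on a machine with enough disk space and memory for a large checkpoint.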

Fine-tuning for downstream tasks

This notebook shows how led-large-16384 can effectively be fine-tuned on a downstream task.

Author:

Allen Institute for AI

Dataset size:

3.43 GB