BigBird

Year: 2,020
Journal: Conference on Neural Information Processing Systems
Languages: All Languages
Programming languages: Python
Input data:

text

BigBird, is a sparse-attention based transformer which extends Transformer based models, such as BERT to much longer sequences. Moreover, BigBird comes along with a theoretical understanding of the capabilities of a complete transformer that the sparse model can handle.

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.