XLNet

Year: 2019
Journal: NeurIPS / Conference on Neural Information Processing Systems
Languages: English
Programming languages: Python
Input data: Plain text, format: https://github.com/zihangdai/xlnet

XLNet is a generalized autoregressive pretraining method that (1) enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and (2) overcomes the limitations of BERT thanks to its autoregressive formulation. Furthermore, XLNet integrates ideas from Transformer-XL, the state-of-the-art autoregressive model at the time, into pretraining.
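The idea of training over permutations of the factorization order can be illustrated with a minimal sketch. This is not the code from the XLNet repository: the function name `permutation_mask` and its parameters are hypothetical, and the sketch only shows how one sampled order determines which positions each token may condition on. Because a different order is sampled each time, every position eventually gets to see context on both its left and its right.

```python
import random


def permutation_mask(seq_len, rng):
    """Hypothetical helper (not part of XLNet's code base).

    Samples one factorization order over the positions 0..seq_len-1 and
    returns it together with a visibility matrix: can_see[i][j] is True
    when position j comes before position i in the sampled order, so the
    token at position i may condition on the token at position j.
    """
    order = list(range(seq_len))
    rng.shuffle(order)  # one random factorization order

    # rank[pos] = step at which position pos is predicted in this order
    rank = [0] * seq_len
    for step, pos in enumerate(order):
        rank[pos] = step

    can_see = [
        [rank[j] < rank[i] for j in range(seq_len)]
        for i in range(seq_len)
    ]
    return order, can_see


rng = random.Random(0)  # fixed seed only to make the sketch repeatable
order, can_see = permutation_mask(4, rng)
print("factorization order:", order)
for i, row in enumerate(can_see):
    visible = [j for j, ok in enumerate(row) if ok]
    print(f"position {i} conditions on positions {visible}")
```

The original token positions are left unchanged here; only the visibility pattern depends on the sampled order. The first position in the order sees nothing, and the last one sees every other position regardless of whether those positions lie to its left or right in the text.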
