XLNet
Year: 2,019
Journal: NeurIPS / Conference on Neural Information Processing Systems
Languages: English
Programming languages: Python
Input data:
Plain text, format: https://github.com/zihangdai/xlnet
Project website: https://github.com/zihangdai/xlnet
XLNet, a generalized autoregressive pretraining method that (1) enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and (2) overcomes the limitations of BERT thanks to its autoregressive formulation. Furthermore, XLNet integrates ideas from Transformer-XL, the state-of-the-art autoregressive model, into pretraining