Bort
Year: 2020
Languages: English
Programming languages: Python
Input data: text
Project website: https://github.com/alexa/bort/
Bort is an optimal subset of architectural parameters for the BERT architecture, extracted by applying a fully polynomial-time approximation scheme (FPTAS) for neural architecture search. Its effective size (that is, not counting the embedding layer) is 5.5% of that of the original BERT-large architecture, and its net size (embedding layer included) is 16% of BERT-large's.
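To make the two percentages concrete, the following is a rough sketch of the arithmetic rather than the authors' own accounting. It assumes the commonly cited approximate figure of 340 million parameters for BERT-large, and the usual BERT-large embedding dimensions (vocabulary of 30,522 tokens, hidden size 1,024, 512 positions, 2 token types). None of these numbers appear in this entry, and the helper names `embedding_params` and `implied_sizes` are hypothetical. Small terms such as the embedding LayerNorm are ignored.

```python
# Assumed, approximate figures for BERT-large (not stated in this entry).
BERT_LARGE_TOTAL_PARAMS = 340_000_000
VOCAB_SIZE = 30_522
HIDDEN_SIZE = 1_024
MAX_POSITIONS = 512
TOKEN_TYPES = 2


def embedding_params(vocab, hidden, positions, token_types):
    """Count parameters in the token, position, and token-type embedding tables."""
    return (vocab + positions + token_types) * hidden


def implied_sizes(total, embedding, effective_ratio=0.055, net_ratio=0.16):
    """Apply the ratios quoted in the description to a reference model.

    effective size = total parameters minus the embedding layer
    net size       = total parameters, embedding layer included
    """
    effective = total - embedding
    return {
        "bert_effective": effective,
        "bort_effective": effective_ratio * effective,
        "bort_net": net_ratio * total,
    }


if __name__ == "__main__":
    emb = embedding_params(VOCAB_SIZE, HIDDEN_SIZE, MAX_POSITIONS, TOKEN_TYPES)
    sizes = implied_sizes(BERT_LARGE_TOTAL_PARAMS, emb)
    print(f"BERT-large embedding parameters: {emb:,}")
    for name, value in sizes.items():
        print(f"{name}: {value:,.0f}")
```

Under these assumptions the embedding tables account for roughly 32 million parameters, which is why the effective ratio and the net ratio differ so much: the embedding layer is a small share of BERT-large but a large share of a much smaller model.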