Year: 2,020
Languages: English
Programming languages: Python
Input data:


Bort is an optimal subset of architectural parameters for the BERT architecture, extracted by applying a fully polynomial-time approximation scheme (FPTAS) for neural architecture search. Bort has an effective (that is, not counting the embedding layer) size of 5.5% the original BERT-large architecture, and 16% of the net size.

Sign In


Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.