Scaling laws for neural language models

6 years ago