Scaling laws for neural language models

6 years ago 28
Read Entire Article