Scaling laws for neural language models

6 years ago 13
Read Entire Article