Training Compute-Optimal Large Language Models
by AI Reference, Mar 29, 2022
Tags: Transformer, DeepMind, AI, cs.CL, cs.LG, arXiv
arXiv V1: Training Compute-Optimal Large Language Models