Transformer AI arXiv Cramming - Training a Language Model on a Single Gpu in One Day by AI Reference Dec 28, 2022 arXiv V1: CRAMMING: TRAINING A LANGUAGE MODEL ON A SINGLE GPU IN ONE DAY Previous Article Constitutional AI - Harmlessness From AI Feedback Next Article Llama - Open and Efficient Foundation Language Models