A concise, personal comparison of key LLM architectures developed over the past few years. This document reflects my individual understanding and curiosity-driven research from the year 2017 to February 2025. This is by no means an exhaustive list, and many other excellent models exist in the field. List of LLMs Covered (2017–2025) Transformer, BERT, […]
