ModernBERT: Smarter, Better, Faster and with Longer context

Large Language Models and Transformers (Videos, Simons Institute for the Theory of Computing)