Quantisation and co. Reducing inference times on LLMs by 80%
medium.com
Post date: October 27, 2023
Tags: ai, data-science, deep-dives, large-language-models, python