The maths you need to start understanding LLMs
Archives
Categories
Blogroll
This article is the second of three "state of play" posts that explain how Large Language
Models work, aimed at readers with the level of understanding I had in mid-2022: techies
with no deep AI knowledge. It grows out
of part 19 in my series
working through Sebastian Raschka's book
"Build a Large Language Model (from Scratch)".
You can read the first post in the series here.
Actually coming up with ideas like GPT-based LLMs and doing serious AI research requires
ser...
Read more at gilesthomas.com