Resources to brush up from 'Intro to ML' to current LLMs/generative AI?

Question

My 'AI' experience to date consists of basically a prolog course, and an 'Intro to ML'. I could say handwavy things about regression and SVMs. I'm pretty sure we covered convolutional neural nets but I barely recall that at all.I'm interested in understanding more about transformers/GPT/LLMs and more 'media-rich' generative AI like DALLE and Midjourney etc. (I assume they're linked, because they seemed to have breakthroughs and blow up at the same time, but I don't understand that at all) - but not 'prompt engineering' or specifics about tuning model parameters etc.Can anyone recommend any resources for 'writing a CMS' vs. 'how to configure Wordpress and install a plugin', as it were?(Prefer text, but understand it might be too new for good ones to have established themselves. In that case, something like OCW preferred to 'screamface'.)Cheers!

tayo42 · Accepted Answer

I really like Andrej Karpathys contentyeah its youtube i know... but, its hand on toofor gpt/llm/ml in general https://karpathy.ai/zero-to-hero.htmlit starts with writing back prop from scratch and take you through writing everything you need and training a gpt2 equivalent model in the endI also thought his lectures at standford, on youtube cs231n 2016, were really good. they cover GAN for generation, but I think after that you can read the papers that were the source for diffusion models dalle and midjouney use

vikp · Answer

I'm building a course that teaches deep learning from the ground up - https://github.com/VikParuchuri/zero_to_gpt .
It balances theory and code, and builds from the foundation up, so you're never typing something without understanding it. Teaching method is text, diagrams, and code. Most lessons have optional videos, too.
It focuses on text models over image models (rnn, transformer, etc).
It's not 100% finished, but has enough to get you very far.

jonnycoder · Answer

https://www.deeplearning.ai/This is probably what you&rsquo;re looking for.

cellis · Answer

Contrarian view. Don't bother with that, just use a GPT (ideally 4) to "write a neural network to do where the input is and explain your reasoning". You'll learn way more from doing this and actually get a lot further than "starting from scratch".

Tomte · Answer

The handouts at https://fleuret.org/dlc/ are fantastic.Also two books: Data Science from Scratch and Deep Learning from Scratch. They are more hand-on, but you'll build all the low-level things in Python and learn a lot.

tikkun · Answer

I think you're asking for some of the things mentioned in the 'research science' section here - https://news.ycombinator.com/item?id=36195527 - is that right?

Metalic · Answer

Depending on your math background, i would go for Bishop's Deep Learning Foundations and Concepts and Simon Prince's Understanding Deep Learning.

kaycebasques · Answer

https://course.fast.ai/

naijaboiler · Answer

Good resources here. Thanks guys