Do I need to know the basics? Shall I just utilize the existing solutions like gpt-3, openAI, stable diffusion and built applications with them? Can I make those tools tailored for my uses cases(model training) or I should built similar from the scratch?
Looking for advice!
From there, if you're interested in how it works, I highly recommend the last 4 videos on Jeremy Howard's youtube channel: https://www.youtube.com/user/howardjeremyp/videos
He's currently teaching a class on stable diffusion from the ground up and these lectures give a really good introduction to how it all works.
2. Learn to use the Hugging Face library, and use their stuff on your Notebooks.
3. Learn some ML theory so you can understand hyperparameters better, and can tweak them in a better way.
____
If you want to get into training models by yourself from scratch, you have to learn in a deeper manner, and cannot overlook learning ML theory in a deeper manner.
____
The most obvious ways would be:
1. Looking into stuff that John Whitaker does [0] and his elaborate free course on AI Art [1].
2. Learning ML from scratch starting from Andrew Ng ML, then going to DL, then learning about GANs.
3. Learning from fast.ai through their two-part course on Deep Learning, where Stable Diffusion is now being taught. Then learn PyTorch from another place like Sebastian Raschka's book.
4. Watching old videos from Stanford CS231n when Karpathy was a TA, and taught in the class. Then Deep Dream was standard.
_____
If you are a responsible, mature person, and you are in it for the long term, and have deep pockets, buy some GPU. 2x 3090 is reasonable, and should be enough.
____
Let me know if you have any further questions.
[0]: https://datasciencecastnet.home.blog/
[1]: https://youtube.com/playlist?list=PL23FjyM69j910zCdDFVWcjSIK...