2023 AI-ML Bookshelf

Textbooks

Wow, an excellent textbook: https://d2l.ai/

Articles

Stable Diffusion from Scratch

AI-ML Youtube Channels

Three blue one brown
two-minute papers
StatQuest
Prompt Muse

From the illustrated transformer

Read the Attention Is All You Need paper, the Transformer blog post (Transformer: A Novel Neural Network Architecture for Language Understanding), and the Tensor2Tensor announcement.
Watch Łukasz Kaiser’s talk walking through the model and its details
Play with the Jupyter Notebook provided as part of the Tensor2Tensor repo
Explore the Tensor2Tensor repo.

Bert

https://huggingface.co/blog/bert-101

Pytorch

Huggingface

Andrej Karpathy

https://github.com/karpathy
https://cs.stanford.edu/people/karpathy/
https://karpathy.ai/
Deep Reinforcement Learning: Pong from Pixels: https://karpathy.github.io/2016/05/31/rl/

Datasets

Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM

Follow-up works

Fundamental Tutorials

Book :: DAVID SILVER :: UCL Course on RL https://www.davidsilver.uk/teaching/

Self-instruct:

https://github.com/yizhongw/self-instruct
Self-Instruct: Aligning Language Models with Self-Generated Instructions https://arxiv.org/abs/2212.10560
https://github.com/tatsu-lab/stanford_alpaca
LLaMA: Open and Efficient Foundation Language Models (not actually open?)
- https://arxiv.org/abs/2302.13971v1 (saved)
Not really open then: https://crfm.stanford.edu/2023/03/13/alpaca.html
- https://github.com/tatsu-lab/stanford_alpaca#data-generation-process

EleutherAI

https://www.eleuther.ai/releases
https://en.wikipedia.org/wiki/EleutherAI
Pythia, A suite of models designed to enable controlled scientific research on transparently trained LLMs https://github.com/EleutherAI/pythia

Code