Learn how to use LLMs and create datasets with simple and reproducible notebooks.
Note
All notebooks were last checked and updated on June 12, 2025.
I try to execute them once a month to make sure they still work on Google Colab. If you find any bugs or issues, please let me know! I'll try to fix them as soon as possible.
- Browse the notebooks.
- Open them in Google Colab using the links.
- Some notebooks require API keys or your Hugging Face token to interact with LLMs from OpenAI, Anthropic, and Google, or with the Hugging Face Hub.
- Run, explore, and modify to suit your needs!
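Before running a notebook that calls a hosted model, it can help to check which credentials are already available in the session. The snippet below is a minimal sketch, not part of the notebooks themselves. The variable names (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, `GOOGLE_API_KEY`, `HF_TOKEN`) are common conventions and are assumed here; each notebook may expect different names, and the helper `missing_keys` is hypothetical.

```python
import os

# Assumed, conventional variable names. Check each notebook for the names it
# actually reads; these are illustrative only.
EXPECTED_KEYS = ("OPENAI_API_KEY", "ANTHROPIC_API_KEY", "GOOGLE_API_KEY", "HF_TOKEN")


def missing_keys(names=EXPECTED_KEYS, env=None):
    """Return the names that are unset or blank in the given environment mapping."""
    env = os.environ if env is None else env
    return [name for name in names if not env.get(name, "").strip()]


# Report what still needs to be set, without ever printing a secret value.
for name in missing_keys():
    print(f"{name} is not set")
```

To set a missing value for the current session only, one option is `os.environ["HF_TOKEN"] = getpass.getpass("HF token: ")` (after `import getpass`), which keeps the token out of the saved notebook output.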
Try out Datafast for a convenient way to create datasets.
Proprietary models
- Simple Question Generation with Distilabel and OpenAI
- Getting Started with Genstruct7B
- SelfInstruct with Distilabel and OpenAI
Created with ❤️ by Patrick Fleith.