unslothai/notebooks

📒 Fine-tuning Notebooks

Below are Colab notebooks, organized by model. You can also view all notebooks in our docs.
The notebooks run locally and feature data prep, training and inference. Read our fine-tuning guide.

Main Notebooks

Model Type Notebook Link
Unsloth Studio Chat UI Open in Colab
Gemma 4 (E2B) Vision Open in Colab
Qwen3.5 (4B) Vision Open in Colab
Qwen3.5 (2B) Vision Open in Colab
gpt-oss (20B) Fine-tuning Open in Colab
gpt-oss (20B) GRPO Open in Colab
Qwen3 (14B) Conversational Open in Colab
Qwen3-VL (8B) Vision Open in Colab
Qwen3-Embedding (0.6B) Embeddings Open in Colab
Qwen3: Advanced GRPO GRPO Open in Colab
Gemma 3 (4B) Vision Open in Colab
Gemma 3N (4B) Audio Open in Colab
embeddinggemma (300M) Embeddings Open in Colab
Mistral Ministral 3 (3B) Vision Open in Colab
Mistral v0.3 (7B) Vision Open in Colab
Llama 3.1 (8B) Alpaca Alpaca Open in Colab
Llama 3.2 (1B + 3B) Conversational Open in Colab
Phi-4 (14B) Conversational Open in Colab
Orpheus-TTS (3B) TTS Open in Colab

Gemma 4 Notebooks

Model Type Notebook Link
Gemma4 (E2B) Sudoku (GRPO RL) Open In Colab
Gemma4 (E2B) Auto Kernel Creation (GRPO RL) Open In Colab
Gemma4 (E2B) 2048 Game (GRPO RL) Open In Colab
Gemma4 (31B) Vision Open In Colab
Gemma4 (31B) Conversational Open In Colab
Gemma4 (E4B) Vision Open In Colab
Gemma4 (E4B) Conversational Open In Colab
Gemma4 (E4B) Audio Open In Colab
Gemma4 (26B A4B) Vision Open In Colab
Gemma4 (26B A4B) Conversational Open In Colab
Gemma4 (E2B) Vision Open In Colab
Gemma4 (E2B) Conversational Open In Colab
Gemma4 (E2B) Audio Open In Colab

GRPO & Reinforcement Learning Notebooks

Model Type Notebook Link
Llama3.1 (8B) GSM8K Math + vLLM Open In Colab
NeMo Gym Sudoku Sudoku Open In Colab
NeMo Gym Multi Environment Multi Environment Open In Colab
gpt oss BF16 (20B) 2048 Game Open In Colab
gpt oss (20B) Minesweeper Game Open In Colab
gpt oss (20B) Auto Kernel Creation Open In Colab
Qwen3 (8B) DAPO Math + vLLM Open In Colab
Llama3 (8B) ORPO Open In Colab
Openenv wordle Wordle + vLLM Open In Colab
Qwen2.5 VL (7B) Vision Math + vLLM Open In Colab
Zephyr (7B) DPO Open In Colab
(OpenEnv) gpt oss BF16 (20B) 2048 Game Open In Colab
gpt oss (20B) Auto Kernel Creation Open In Colab
gpt oss (20B) 2048 Game Open In Colab
(OpenEnv) gpt oss (20B) 2048 Game Open In Colab
(DGX Spark) gpt oss (20B) 2048 Game Open In Colab
(A100) gpt oss (20B) Auto Kernel Creation Open In Colab
Qwen2.5 (3B) GSM8K Math + vLLM Open In Colab
Llama3.2 (1B) DAPO Math + vLLM Open In Colab
Qwen3 VL (8B) Vision Math Open In Colab
Mistral v0.3 (7B) GSM8K Math + vLLM Open In Colab
Qwen3 5 (4B) Vision Math Open In Colab
Gemma3 (4B) Vision Math Open In Colab
Phi 4 (14B) GSM8K Math + vLLM Open In Colab
Gemma3 (1B) GSM8K Math Open In Colab
DeepSeek R1 0528 Qwen3 (8B) DAPO Math + vLLM Open In Colab
LFM2.5 (1.2B) DAPO Math Open In Colab
Gemma4 (E2B) Sudoku Open In Colab
Gemma4 (E2B) Auto Kernel Creation Open In Colab
Gemma4 (E2B) 2048 Game Open In Colab
Qwen3 (4B) DAPO Math + vLLM Open In Colab
Ministral3 (3B) Sudoku Open In Colab

Tool Calling Notebooks

Model Type Notebook Link
Qwen2.5 Coder (1.5B) Tool Calling Open In Colab
FunctionGemma (270M) Tool Calling Open In Colab
FunctionGemma (270M) Mobile Actions Open In Colab
FunctionGemma (270M) Inference Open In Colab
FunctionGemma (270M) Conversational Open In Colab

Text-to-Speech (TTS) Notebooks

Model Type Notebook Link
Spark TTS (0.5B) TTS Open In Colab
Llasa TTS (1B) TTS Open In Colab
Orpheus (3B) TTS Open In Colab
Llasa TTS (3B) TTS Open In Colab
Sesame CSM (1B) TTS Open In Colab
Oute TTS (1B) TTS Open In Colab

Vision (Multimodal) Notebooks

Model Type Notebook Link
Qwen2.5 VL (7B) Vision Math + vLLM (GRPO RL) Open In Colab
Qwen2.5 VL (7B) Vision Open In Colab
ERNIE 4 5 VL 28B A3B PT Vision Open In Colab
Qwen3 VL (8B) Vision Open In Colab
Qwen3 VL (8B) Vision Math (GRPO RL) Open In Colab
Qwen3 5 (4B) Vision Open In Colab
Qwen3 5 (4B) Vision Math (GRPO RL) Open In Colab
Gemma3 (4B) Vision Math (GRPO RL) Open In Colab
Qwen3 5 (0 8B) Vision Open In Colab
Gemma4 (31B) Vision Open In Colab
Qwen2 VL (7B) Vision Open In Colab
Gemma4 (E4B) Vision Open In Colab
Llama3.2 (11B) Vision Open In Colab
Qwen3 5 (2B) Vision Open In Colab
Gemma4 (26B A4B) Vision Open In Colab
Gemma4 (E2B) Vision Open In Colab
Pixtral (12B) Vision Open In Colab
LFM2.5 VL (1.6B) Vision Open In Colab
Ministral3 VL (3B) Vision Open In Colab
Gemma3 (4B) Vision Open In Colab
Gemma3N (4B) Vision Open In Colab

Embedding Notebooks

Model Type Notebook Link
ModernBert Classification Open In Colab
Qwen3 Embedding (4B) Embeddings Open In Colab
Qwen3 Embedding (0 6B) Embeddings Open In Colab
EmbeddingGemma (300M) Embeddings Open In Colab
ModernBERT (Large) Classification Open In Colab
BGE M3 Embeddings Open In Colab
All MiniLM L6 v2 Embeddings Open In Colab

Speech-to-Text (STT) Notebooks

Model Type Notebook Link
Whisper (Large) Fine Tuning Open In Colab

OCR Notebooks

Model Type Notebook Link
Deepseek OCR (3B) Fine Tuning Open In Colab
Deepseek OCR (3B) Evaluation Open In Colab
Deepseek OCR (3B) Eval Open In Colab
Deepseek OCR 2 (3B) Fine Tuning Open In Colab
Paddle OCR (1B) Vision Open In Colab

BERT Notebooks

Model Type Notebook Link
ModernBert Classification Open In Colab
ModernBERT (Large) Classification Open In Colab

Deepseek Notebooks

Model Type Notebook Link
Deepseek OCR (3B) Fine Tuning Open In Colab
Deepseek OCR (3B) Evaluation Open In Colab
Deepseek OCR (3B) Eval Open In Colab
Deepseek OCR 2 (3B) Fine Tuning Open In Colab

ERNIE Notebooks

Model Type Notebook Link
ERNIE 4 5 VL 28B A3B PT Vision Open In Colab
ERNIE 4 5 21B A3B PT Conversational Open In Colab

GLM Notebooks

Model Type Notebook Link
(A100) GLM Flash(80GB) Conversational Open In Colab

GPT-OSS Notebooks

Model Type Notebook Link
gpt oss MXFP4 (20B) Inference Open In Colab
gpt oss (20B) Fine Tuning Open In Colab
gpt oss (20B) Fine Tuning Open In Colab
gpt oss BNB (20B) Inference Open In Colab
(A100) gpt oss (120B) Fine Tuning Open In Colab

Gemma Notebooks

Model Type Notebook Link
Gemma3 (4B) Conversational Open In Colab
(A100) Gemma3 (27B) Conversational Open In Colab
Gemma3 (270M) Phone Deployment Open In Colab
Gemma3 (270M) Conversational Open In Colab
Gemma3N (4B) Multimodal Open In Colab
Gemma3N (4B) Audio Open In Colab
Gemma3N (2B) Inference Open In Colab
Gemma3 (4B) Vision Open In Colab
Gemma3N (4B) Vision Open In Colab
EmbeddingGemma (300M) Embeddings Open In Colab
Gemma2 (9B) Alpaca Open In Colab
Gemma2 (2B) Alpaca Open In Colab
CodeGemma (7B) Conversational Open In Colab

Granite Notebooks

Model Type Notebook Link
Granite4.0 (3B) Conversational Open In Colab
Granite4.0 (350M) Conversational Open In Colab

Hybrid Attention Notebooks

Model Type Notebook Link
LFM2.5 (1.2B) Conversational Open In Colab
Liquid LFM2 (1.2B) Conversational Open In Colab
Liquid LFM2 Conversational Open In Colab
LFM2.5 VL (1.6B) Vision Open In Colab
LFM2.5 (1.2B) Translation Open In Colab
Falcon H1 (0.5B) Alpaca Open In Colab
Falcon H1 Alpaca Open In Colab

Llama Notebooks

Model Type Notebook Link
Llama3.1 (8B) Inference Open In Colab
Llama3 (8B) Ollama Open In Colab
Llama3 (8B) Alpaca Open In Colab
Llama3.2 (1B) RAFT Open In Colab
Llama3.2 (1B and 3B) Conversational Open In Colab
Llama3.1 (8B) Alpaca Open In Colab
(A100) Llama3.3 (70B) Conversational Open In Colab
Llama3.2 (11B) Vision Open In Colab
Llama3 (8B) Conversational Open In Colab
TinyLlama (1.1B) Alpaca Open In Colab

Mistral Notebooks

Model Type Notebook Link
Mistral v0.3 (7B) Conversational Open In Colab
Magistral (24B) Reasoning Conversational Open In Colab
Pixtral (12B) Vision Open In Colab
Mistral Small (22B) Alpaca Open In Colab
Ministral3 VL (3B) Vision Open In Colab
Mistral v0.3 (7B) Alpaca Open In Colab
Mistral Nemo (12B) Alpaca Open In Colab

Nemotron Notebooks

Model Type Notebook Link
(A100) Nemotron Nano 3 30B A3B Conversational Open In Colab
(A100) Nemotron 3 Nano 30B A3B Conversational Open In Colab

Paddle Notebooks

Model Type Notebook Link
Paddle OCR (1B) Vision Open In Colab

Phi Notebooks

Model Type Notebook Link
Phi 4 Conversational Open In Colab
Phi 3.5 Mini Conversational Open In Colab
Phi 3 Medium Conversational Open In Colab

Qwen Notebooks

Model Type Notebook Link
Qwen3 (0.6B) Reasoning Conversational Open In Colab
Qwen3 (0 6B) Phone Deployment Open In Colab
Qwen3 (4B) QAT Open In Colab
Qwen3 (4B) Conversational Open In Colab
Qwen2.5 VL (7B) Vision Open In Colab
Qwen3 VL (8B) Vision Open In Colab
Qwen3 5 MoE MoE Open In Colab
(A100) Qwen 3 5 27B(80GB) Conversational Open In Colab
(A100) Qwen3 (32B) Reasoning Conversational Open In Colab
Qwen3 5 (4B) Vision Open In Colab
Qwen3 (14B) Reasoning Conversational Open In Colab
Qwen3 (14B) Conversational Open In Colab
Qwen3 5 (0 8B) Vision Open In Colab
Qwen2 VL (7B) Vision Open In Colab
Qwen3 (4B) Thinking Open In Colab
Qwen3 MoE MoE Open In Colab
Qwen3 5 (2B) Vision Open In Colab
Qwen2.5 (7B) Alpaca Open In Colab
Qwen2.5 Coder (14B) Conversational Open In Colab
Qwen3 6 MoE MoE Open In Colab
Qwen3 Embedding (4B) Embeddings Open In Colab
Qwen3 Embedding (0 6B) Embeddings Open In Colab
Qwen3 (14B) Alpaca Open In Colab
Qwen2 (7B) Alpaca Open In Colab
TinyQwen3 MoE MoE Open In Colab

Text Completion / Continued Pretraining Notebooks

Model Type Notebook Link
LFM2.5 (1.2B) Text Completion Open In Colab
Mistral v0.3 (7B) CPT Open In Colab
Mistral (7B) Text Completion Open In Colab

Specific use-case Notebooks

Use Case Model Notebook Link
Text Classification Llama 3.1 (8B) Open In Colab
Tool Calling Qwen2.5-Coder (1.5B) Open In Colab
Multiple Datasets Open In Colab
KTO Qwen2.5-Instruct (1.5B) Open In Colab
Inference Chat UI Llama 3.2 Vision Open In Colab
Conversational Llama 3.2 (1B and 3B) Open In Colab
ChatML Mistral (7B) Open In Colab
Text Completion Mistral (7B) Open In Colab

Other Notebooks

Model Type Notebook Link
CodeForces CoT Reasoning Open In Colab
Synthetic Data Hackathon Synthetic Data Open In Colab
Unsloth Studio Open In Colab

📒 Kaggle Notebooks

Click for all our Kaggle notebooks categorized by model:

GRPO & Reinforcement Learning Notebooks

Model Type Notebook Link
Llama3.1 (8B) GSM8K Math + vLLM Open in Kaggle
gpt oss (20B) Minesweeper Game Open in Kaggle
gpt oss (20B) Auto Kernel Creation Open in Kaggle
Qwen3 (8B) DAPO Math + vLLM Open in Kaggle
Llama3 (8B) ORPO Open in Kaggle
Qwen2.5 VL (7B) Vision Math + vLLM Open in Kaggle
Ministral3 (3B) Sudoku Open in Kaggle
Zephyr (7B) DPO Open in Kaggle
gpt oss (20B) Auto Kernel Creation Open in Kaggle
(A100) gpt oss (20B) Auto Kernel Creation Open in Kaggle
Qwen2.5 (3B) GSM8K Math + vLLM Open in Kaggle
Llama3.2 (1B) DAPO Math + vLLM Open in Kaggle
Qwen3 VL (8B) Vision Math Open in Kaggle
Mistral v0.3 (7B) GSM8K Math + vLLM Open in Kaggle
Gemma3 (4B) Vision Math Open in Kaggle
Phi 4 (14B) GSM8K Math + vLLM Open in Kaggle
Gemma3 (1B) GSM8K Math Open in Kaggle
DeepSeek R1 0528 Qwen3 (8B) DAPO Math + vLLM Open in Kaggle
Qwen3 (4B) DAPO Math + vLLM Open in Kaggle

Tool Calling Notebooks

Model Type Notebook Link
Qwen2.5 Coder (1.5B) Tool Calling Open in Kaggle

Text-to-Speech (TTS) Notebooks

Model Type Notebook Link
Spark TTS (0.5B) TTS Open in Kaggle
Llasa TTS (1B) TTS Open in Kaggle
Orpheus (3B) TTS Open in Kaggle
Llasa TTS (3B) TTS Open in Kaggle
Sesame CSM (1B) TTS Open in Kaggle
Oute TTS (1B) TTS Open in Kaggle

Vision (Multimodal) Notebooks

Model Type Notebook Link
Qwen2.5 VL (7B) Vision Math + vLLM (GRPO RL) Open in Kaggle
Qwen2.5 VL (7B) Vision Open in Kaggle
ERNIE 4 5 VL 28B A3B PT Vision Open in Kaggle
Qwen3 VL (8B) Vision Open in Kaggle
Qwen3 VL (8B) Vision Math (GRPO RL) Open in Kaggle
Gemma3 (4B) Vision Math (GRPO RL) Open in Kaggle
Qwen2 VL (7B) Vision Open in Kaggle
Llama3.2 (11B) Vision Open in Kaggle
Pixtral (12B) Vision Open in Kaggle
Ministral3 VL (3B) Vision Open in Kaggle
Gemma3 (4B) Vision Open in Kaggle
Gemma3N (4B) Vision Open in Kaggle

Embedding Notebooks

Model Type Notebook Link
ModernBert Classification Open in Kaggle
Qwen3 Embedding (4B) Embeddings Open in Kaggle
Qwen3 Embedding (0 6B) Embeddings Open in Kaggle
EmbeddingGemma (300M) Embeddings Open in Kaggle
ModernBERT (Large) Classification Open in Kaggle
BGE M3 Embeddings Open in Kaggle
All MiniLM L6 v2 Embeddings Open in Kaggle

Speech-to-Text (STT) Notebooks

Model Type Notebook Link
Whisper (Large) Fine Tuning Open in Kaggle

OCR Notebooks

Model Type Notebook Link
Deepseek OCR (3B) Fine Tuning Open in Kaggle
Deepseek OCR (3B) Evaluation Open in Kaggle
Deepseek OCR (3B) Eval Open in Kaggle
Deepseek OCR 2 (3B) Fine Tuning Open in Kaggle
Paddle OCR (1B) Vision Open in Kaggle

BERT Notebooks

Model Type Notebook Link
ModernBert Classification Open in Kaggle
ModernBERT (Large) Classification Open in Kaggle

Deepseek Notebooks

Model Type Notebook Link
Deepseek OCR (3B) Fine Tuning Open in Kaggle
Deepseek OCR (3B) Evaluation Open in Kaggle
Deepseek OCR (3B) Eval Open in Kaggle
Deepseek OCR 2 (3B) Fine Tuning Open in Kaggle

ERNIE Notebooks

Model Type Notebook Link
ERNIE 4 5 VL 28B A3B PT Vision Open in Kaggle
ERNIE 4 5 21B A3B PT Conversational Open in Kaggle

GPT-OSS Notebooks

Model Type Notebook Link
gpt oss MXFP4 (20B) Inference Open in Kaggle
gpt oss (20B) Fine Tuning Open in Kaggle
gpt oss (20B) Fine Tuning Open in Kaggle
gpt oss BNB (20B) Inference Open in Kaggle
(A100) gpt oss (120B) Fine Tuning Open in Kaggle

Gemma Notebooks

Model Type Notebook Link
Gemma3 (4B) Conversational Open in Kaggle
(A100) Gemma3 (27B) Conversational Open in Kaggle
Gemma3 (270M) Conversational Open in Kaggle
Gemma3N (4B) Multimodal Open in Kaggle
Gemma3N (4B) Audio Open in Kaggle
Gemma3N (2B) Inference Open in Kaggle
Gemma3 (4B) Vision Open in Kaggle
Gemma3N (4B) Vision Open in Kaggle
EmbeddingGemma (300M) Embeddings Open in Kaggle
Gemma2 (9B) Alpaca Open in Kaggle
Gemma2 (2B) Alpaca Open in Kaggle
CodeGemma (7B) Conversational Open in Kaggle

Granite Notebooks

Model Type Notebook Link
Granite4.0 (3B) Conversational Open in Kaggle
Granite4.0 (350M) Conversational Open in Kaggle

Hybrid Attention Notebooks

Model Type Notebook Link
Liquid LFM2 (1.2B) Conversational Open in Kaggle
Falcon H1 (0.5B) Alpaca Open in Kaggle

Llama Notebooks

Model Type Notebook Link
Llama3.1 (8B) Inference Open in Kaggle
Llama3 (8B) Ollama Open in Kaggle
Llama3 (8B) Alpaca Open in Kaggle
Llama3.2 (1B) RAFT Open in Kaggle
Llama3.2 (1B and 3B) Conversational Open in Kaggle
Llama3.1 (8B) Alpaca Open in Kaggle
(A100) Llama3.3 (70B) Conversational Open in Kaggle
Llama3.2 (11B) Vision Open in Kaggle
Llama3 (8B) Conversational Open in Kaggle
TinyLlama (1.1B) Alpaca Open in Kaggle

Mistral Notebooks

Model Type Notebook Link
Mistral v0.3 (7B) Conversational Open in Kaggle
Magistral (24B) Reasoning Conversational Open in Kaggle
Pixtral (12B) Vision Open in Kaggle
Mistral Small (22B) Alpaca Open in Kaggle
Ministral3 VL (3B) Vision Open in Kaggle
Mistral v0.3 (7B) Alpaca Open in Kaggle
Mistral Nemo (12B) Alpaca Open in Kaggle

Nemotron Notebooks

Model Type Notebook Link
(A100) Nemotron Nano 3 30B A3B Conversational Open in Kaggle
(A100) Nemotron 3 Nano 30B A3B Conversational Open in Kaggle

Paddle Notebooks

Model Type Notebook Link
Paddle OCR (1B) Vision Open in Kaggle

Phi Notebooks

Model Type Notebook Link
Phi 4 Conversational Open in Kaggle
Phi 3.5 Mini Conversational Open in Kaggle
Phi 3 Medium Conversational Open in Kaggle

Qwen Notebooks

Model Type Notebook Link
Qwen3 (4B) QAT Open in Kaggle
Qwen3 (4B) Conversational Open in Kaggle
Qwen2.5 VL (7B) Vision Open in Kaggle
Qwen3 VL (8B) Vision Open in Kaggle
(A100) Qwen3 (32B) Reasoning Conversational Open in Kaggle
Qwen3 (14B) Reasoning Conversational Open in Kaggle
Qwen3 (14B) Conversational Open in Kaggle
Qwen2 VL (7B) Vision Open in Kaggle
Qwen3 (4B) Thinking Open in Kaggle
Qwen2.5 (7B) Alpaca Open in Kaggle
Qwen2.5 Coder (14B) Conversational Open in Kaggle
Qwen3 Embedding (4B) Embeddings Open in Kaggle
Qwen3 Embedding (0 6B) Embeddings Open in Kaggle
Qwen3 (14B) Alpaca Open in Kaggle
Qwen2 (7B) Alpaca Open in Kaggle

Text Completion / Continued Pretraining Notebooks

Model Type Notebook Link
Mistral v0.3 (7B) CPT Open in Kaggle
Mistral (7B) Text Completion Open in Kaggle

Other Notebooks

Model Type Notebook Link
CodeForces CoT Reasoning Open in Kaggle
Unsloth Studio Open in Kaggle

🐧 AMD Notebooks

These notebooks target AMD ROCm GPUs and are not available in Colab. View / download them directly from GitHub:

Model Type Notebook
Unsloth Studio Chat UI GitHub
Gemma4 (E2B) Vision GitHub
Qwen3 5 (4B) Vision GitHub
Qwen3 5 (2B) Vision GitHub
gpt oss (20B) Fine Tuning GitHub
gpt oss (20B) Auto Kernel Creation GitHub

Click for all our AMD ROCm notebooks:

Model Type Notebook
Qwen3 (0 6B) Phone Deployment GitHub
Qwen3 (0.6B) Reasoning Conversational GitHub
Llama3.1 (8B) Inference GitHub
Llama3.1 (8B) GSM8K Math + vLLM GitHub
NeMo Gym Sudoku Sudoku GitHub
NeMo Gym Multi Environment Multi Environment GitHub
Whisper (Large) Fine Tuning GitHub
gpt oss MXFP4 (20B) Inference GitHub
gpt oss BNB (20B) Inference GitHub
gpt oss BF16 (20B) 2048 Game GitHub
gpt oss (20B) 2048 Game GitHub
gpt oss (20B) Minesweeper Game GitHub
gpt oss (20B) Auto Kernel Creation GitHub
gpt oss (20B) Fine Tuning GitHub
(OpenEnv) gpt oss BF16 (20B) 2048 Game GitHub
(OpenEnv) gpt oss (20B) 2048 Game GitHub
(DGX Spark) gpt oss (20B) 2048 Game GitHub
Spark TTS (0.5B) TTS GitHub
Qwen3 (8B) DAPO Math + vLLM GitHub
Llama3 (8B) Ollama GitHub
Llama3 (8B) ORPO GitHub
Llama3 (8B) Alpaca GitHub
Openenv wordle Wordle + vLLM GitHub
gpt oss (20B) Auto Kernel Creation GitHub
gpt oss (120B) Fine Tuning GitHub
Qwen2.5 (3B) GSM8K Math + vLLM GitHub
ModernBert Classification GitHub
Qwen3 (4B) QAT GitHub
Qwen3 (4B) Conversational GitHub
Qwen2.5 VL (7B) Vision Math + vLLM GitHub
Qwen2.5 VL (7B) Vision GitHub
Llasa TTS (1B) TTS GitHub
Llama3.2 (1B) DAPO Math + vLLM GitHub
Llama3.2 (1B) RAFT GitHub
Deepseek OCR (3B) Fine Tuning GitHub
Deepseek OCR (3B) Evaluation GitHub
Deepseek OCR (3B) Eval GitHub
Paddle OCR (1B) Vision GitHub
ERNIE 4 5 VL 28B A3B PT Vision GitHub
Deepseek OCR 2 (3B) Fine Tuning GitHub
Qwen3 VL (8B) Vision GitHub
Qwen3 VL (8B) Vision Math GitHub
Mistral v0.3 (7B) GSM8K Math + vLLM GitHub
Mistral v0.3 (7B) Conversational GitHub
Qwen3 5 MoE MoE GitHub
Orpheus (3B) TTS GitHub
Llasa TTS (3B) TTS GitHub
Meta Synthetic Data Llama3.1 (8B) GRPO GitHub
Meta Synthetic Data Llama3 2 (3B) GRPO GitHub
Llama3.2 (1B and 3B) Conversational GitHub
Qwen 3 5 27B(80GB) Conversational GitHub
Qwen3 (32B) Reasoning Conversational GitHub
Llama3.1 (8B) Alpaca GitHub
Qwen3 5 (4B) Vision Math GitHub
Qwen3 (14B) Conversational GitHub
Qwen3 (14B) Reasoning Conversational GitHub
CodeForces CoT Reasoning GitHub
Llama3.3 (70B) Conversational GitHub
Synthetic Data Hackathon Synthetic Data GitHub
Gemma3 (4B) Conversational GitHub
Gemma3 (4B) Vision Math GitHub
Phi 4 (14B) GSM8K Math + vLLM GitHub
Phi 4 Conversational GitHub
Gemma3 (27B) Conversational GitHub
Qwen3 5 (0 8B) Vision GitHub
GLM Flash(80GB) Conversational GitHub
Sesame CSM (1B) TTS GitHub
Gemma4 (31B) Conversational GitHub
Gemma4 (31B) Vision GitHub
Qwen2 VL (7B) Vision GitHub
Qwen3 (4B) Thinking GitHub
Qwen3 MoE MoE GitHub
Gemma3 (1B) GSM8K Math GitHub
Nemotron Nano 3 30B A3B Conversational GitHub
Nemotron 3 Nano 30B A3B Conversational GitHub
Gemma4 (E4B) Conversational GitHub
Gemma4 (E4B) Vision GitHub
Gemma4 (E4B) Audio GitHub
Llama3.2 (11B) Vision GitHub
Phi 3.5 Mini Conversational GitHub
Gemma4 (26B A4B) Conversational GitHub
Gemma4 (26B A4B) Vision GitHub
Magistral (24B) Reasoning Conversational GitHub
Qwen2.5 (7B) Alpaca GitHub
DeepSeek R1 0528 Qwen3 (8B) DAPO Math + vLLM GitHub
Qwen2.5 Coder (1.5B) Tool Calling GitHub
Gemma3 (270M) Conversational GitHub
Gemma3 (270M) Phone Deployment GitHub
Qwen2.5 Coder (14B) Conversational GitHub
FunctionGemma (270M) Conversational GitHub
FunctionGemma (270M) Inference GitHub
FunctionGemma (270M) Tool Calling GitHub
FunctionGemma (270M) Mobile Actions GitHub
Gemma3N (4B) Multimodal GitHub
Gemma3N (4B) Audio GitHub
Gemma3N (2B) Inference GitHub
LFM2.5 (1.2B) DAPO Math GitHub
LFM2.5 (1.2B) Conversational GitHub
Gemma4 (E2B) Conversational GitHub
Gemma4 (E2B) Sudoku GitHub
Gemma4 (E2B) 2048 Game GitHub
Gemma4 (E2B) Auto Kernel Creation GitHub
Gemma4 (E2B) Audio GitHub
Qwen3 6 MoE MoE GitHub
Qwen3 Embedding (4B) Embeddings GitHub
Qwen3 (4B) DAPO Math + vLLM GitHub
Pixtral (12B) Vision GitHub
Qwen3 Embedding (0 6B) Embeddings GitHub
Mistral Small (22B) Alpaca GitHub
Liquid LFM2 (1.2B) Conversational GitHub
Liquid LFM2 Conversational GitHub
LFM2.5 VL (1.6B) Vision GitHub
Ministral3 VL (3B) Vision GitHub
Ministral3 (3B) Sudoku GitHub
Gemma3 (4B) Vision GitHub
Oute TTS (1B) TTS GitHub
Llama3 (8B) Conversational GitHub
ERNIE 4 5 21B A3B PT Conversational GitHub
Granite4.0 (3B) Conversational GitHub
Qwen3 (14B) Alpaca GitHub
LFM2.5 (1.2B) Translation GitHub
LFM2.5 (1.2B) Text Completion GitHub
Gemma3N (4B) Vision GitHub
Granite4.0 (350M) Conversational GitHub
TinyLlama (1.1B) Alpaca GitHub
Falcon H1 (0.5B) Alpaca GitHub
Falcon H1 Alpaca GitHub
Phi 3 Medium Conversational GitHub
EmbeddingGemma (300M) Embeddings GitHub
Gemma2 (9B) Alpaca GitHub
Gemma2 (2B) Alpaca GitHub
Mistral v0.3 (7B) CPT GitHub
Mistral v0.3 (7B) Alpaca GitHub
Mistral (7B) Text Completion GitHub
Qwen2 (7B) Alpaca GitHub
Zephyr (7B) DPO GitHub
ModernBERT (Large) Classification GitHub
Mistral Nemo (12B) Alpaca GitHub
CodeGemma (7B) Conversational GitHub
BGE M3 Embeddings GitHub
TinyQwen3 MoE MoE GitHub
All MiniLM L6 v2 Embeddings GitHub

🍃 Molab Notebooks

Run any of these on molab, Marimo's hosted GPU notebooks. They're reactive: change a value in one cell, and the cells that depend on it recompute on their own.

Model Type Notebook
Unsloth Studio Chat UI Open in molab
Gemma4 (E2B) Vision Open in molab
Qwen3 5 (4B) Vision Open in molab
Qwen3 5 (2B) Vision Open in molab
gpt oss (20B) Fine Tuning Open in molab
gpt oss (20B) Auto Kernel Creation Open in molab

Click for all our molab notebooks:

Model Type Notebook
All MiniLM L6 v2 Embeddings Open in molab
BGE M3 Embeddings Open in molab
CodeForces CoT Reasoning Open in molab
CodeGemma (7B) Conversational Open in molab
Deepseek OCR (3B) Fine Tuning Open in molab
Deepseek OCR (3B) Eval Open in molab
Deepseek OCR (3B) Evaluation Open in molab
Deepseek OCR 2 (3B) Fine Tuning Open in molab
ERNIE 4 5 21B A3B PT Conversational Open in molab
ERNIE 4 5 VL 28B A3B PT Vision Open in molab
EmbeddingGemma (300M) Embeddings Open in molab
Falcon H1 Alpaca Open in molab
Falcon H1 (0.5B) Alpaca Open in molab
FunctionGemma (270M) Conversational Open in molab
FunctionGemma (270M) Inference Open in molab
FunctionGemma (270M) Mobile Actions Open in molab
FunctionGemma (270M) Tool Calling Open in molab
GLM Flash(80GB) Conversational Open in molab
gpt oss BNB (20B) Inference Open in molab
gpt oss MXFP4 (20B) Inference Open in molab
Gemma2 (2B) Alpaca Open in molab
Gemma2 (9B) Alpaca Open in molab
Gemma3N (2B) Inference Open in molab
Gemma3N (4B) Audio Open in molab
Gemma3N (4B) Multimodal Open in molab
Gemma3N (4B) Vision Open in molab
Gemma3 (270M) Conversational Open in molab
Gemma3 (270M) Phone Deployment Open in molab
Gemma3 (27B) Conversational Open in molab
Gemma3 (4B) Conversational Open in molab
Gemma3 (4B) Vision Open in molab
Gemma4 (26B A4B) Conversational Open in molab
Gemma4 (26B A4B) Vision Open in molab
Gemma4 (31B) Conversational Open in molab
Gemma4 (31B) Vision Open in molab
Gemma4 (E2B) Audio Open in molab
Gemma4 (E2B) Conversational Open in molab
Gemma4 (E2B) 2048 Game Open in molab
Gemma4 (E2B) Sudoku Open in molab
Gemma4 (E4B) Audio Open in molab
Gemma4 (E4B) Conversational Open in molab
Gemma4 (E4B) Vision Open in molab
Granite4.0 (3B) Conversational Open in molab
Granite4.0 (350M) Conversational Open in molab
LFM2.5 (1.2B) Conversational Open in molab
LFM2.5 (1.2B) Text Completion Open in molab
LFM2.5 (1.2B) Translation Open in molab
LFM2.5 VL (1.6B) Vision Open in molab
Liquid LFM2 Conversational Open in molab
Liquid LFM2 (1.2B) Conversational Open in molab
Llama3.1 (8B) Alpaca Open in molab
Llama3.1 (8B) Inference Open in molab
Llama3.2 (11B) Vision Open in molab
Llama3.2 (1B) RAFT Open in molab
Llama3.2 (1B and 3B) Conversational Open in molab
Llama3.3 (70B) Conversational Open in molab
Llama3 (8B) Alpaca Open in molab
Llama3 (8B) Conversational Open in molab
Llama3 (8B) ORPO Open in molab
Llama3 (8B) Ollama Open in molab
Llasa TTS (1B) TTS Open in molab
Llasa TTS (3B) TTS Open in molab
Magistral (24B) Reasoning Conversational Open in molab
Ministral3 (3B) Sudoku Open in molab
Ministral3 VL (3B) Vision Open in molab
Mistral (7B) Text Completion Open in molab
Mistral Nemo (12B) Alpaca Open in molab
Mistral Small (22B) Alpaca Open in molab
Mistral v0.3 (7B) Alpaca Open in molab
Mistral v0.3 (7B) CPT Open in molab
Mistral v0.3 (7B) Conversational Open in molab
ModernBert Classification Open in molab
NeMo Gym Multi Environment Multi Environment Open in molab
NeMo Gym Sudoku Sudoku Open in molab
Nemotron 3 Nano 30B A3B Conversational Open in molab
Nemotron Nano 3 30B A3B Conversational Open in molab
(OpenEnv) gpt oss (20B) 2048 Game Open in molab
(OpenEnv) gpt oss BF16 (20B) 2048 Game Open in molab
Openenv wordle Wordle + vLLM Open in molab
Orpheus (3B) TTS Open in molab
Oute TTS (1B) TTS Open in molab
Paddle OCR (1B) Vision Open in molab
Phi 3.5 Mini Conversational Open in molab
Phi 3 Medium Conversational Open in molab
Phi 4 Conversational Open in molab
Pixtral (12B) Vision Open in molab
Qwen2.5 (7B) Alpaca Open in molab
Qwen2.5 Coder (1.5B) Tool Calling Open in molab
Qwen2.5 Coder (14B) Conversational Open in molab
Qwen2.5 VL (7B) Vision Open in molab
Qwen2 (7B) Alpaca Open in molab
Qwen2 VL (7B) Vision Open in molab
Qwen3 (0.6B) Reasoning Conversational Open in molab
Qwen3 (0 6B) Phone Deployment Open in molab
Qwen3 (14B) Conversational Open in molab
Qwen3 (14B) Alpaca Open in molab
Qwen3 (14B) Reasoning Conversational Open in molab
Qwen3 (32B) Reasoning Conversational Open in molab
Qwen3 (4B) Conversational Open in molab
Qwen3 (4B) Thinking Open in molab
Qwen3 (4B) QAT Open in molab
Qwen3 5 (0 8B) Vision Open in molab
Qwen3 5 (4B) Vision Math (GRPO RL) Open in molab
Qwen3 5 MoE MoE Open in molab
Qwen3 6 MoE MoE Open in molab
Qwen3 Embedding (0 6B) Embeddings Open in molab
Qwen3 Embedding (4B) Embeddings Open in molab
Qwen3 MoE MoE Open in molab
Qwen3 VL (8B) Vision Open in molab
Qwen3 VL (8B) Vision Math (GRPO RL) Open in molab
Qwen 3 5 27B(80GB) Conversational Open in molab
Sesame CSM (1B) TTS Open in molab
Spark TTS (0.5B) TTS Open in molab
Synthetic Data Hackathon Synthetic Data Open in molab
TinyLlama (1.1B) Alpaca Open in molab
TinyQwen3 MoE MoE Open in molab
Whisper (Large) Fine Tuning Open in molab
Zephyr (7B) DPO Open in molab
ModernBERT (Large) Classification Open in molab
gpt oss (120B) Fine Tuning Open in molab
(A100) gpt oss (20B) Auto Kernel Creation Open in molab
gpt oss (20B) Fine Tuning Open in molab
gpt oss (20B) Auto Kernel Creation Open in molab
gpt oss (20B) 2048 Game Open in molab
gpt oss BF16 (20B) 2048 Game Open in molab
(DGX Spark) gpt oss (20B) 2048 Game Open in molab
gpt oss (20B) Minesweeper Game Open in molab

✨ Contributing to Notebooks

If you'd like to contribute to our notebooks, here's a guide to get you started:

  1. Find the Template: We've provided a template notebook called Template_Notebook.ipynb in the root directory of this project. This template contains the basic structure and formatting guidelines for all notebooks in this collection.
  2. Create Your Notebook:
    • Make a copy of Template_Notebook.ipynb.
    • Rename the copied file to follow this naming convention:
      • LLM Notebooks: <Model Name>-<Type>.ipynb (e.g., Mistral_v0.3_(7B)-Alpaca.ipynb)
      • Vision Notebooks: <Model Name>-Vision.ipynb (e.g., Llava_v1.6_(7B)-Vision.ipynb)
      • Examples of <Type>: Alpaca, Conversational, CPT, DPO, ORPO, Text_Completion, CSV, Inference, Unsloth_Studio
  3. Place in original_template: Once your notebook is ready, move it to the original_template directory.
  4. Update Notebooks: Run the following command in your terminal:
    python update_all_notebooks.py
    This script will automatically:
    • Copy your notebook from original_template to the notebooks directory.
    • Update the notebook's internal sections (like Installation, News) to ensure consistency.
    • Add your notebook to the appropriate list in this README.md file.
  5. Create a Pull Request: Open a pull request (PR) to merge your changes so your notebook becomes available to everyone.
    • We appreciate your contributions and look forward to reviewing your notebooks!
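
The naming convention in step 2 can be checked before you open a PR. The following is a minimal sketch, not part of the repository: `parse_notebook_name` and `KNOWN_TYPES` are hypothetical names, and treating the example types listed above (plus Vision) as an exhaustive set is an assumption made only for illustration.

```python
from __future__ import annotations

import re

# Types taken from the examples in the contributing guide, plus "Vision".
# ASSUMPTION: the guide calls these examples, so this set may be incomplete.
KNOWN_TYPES = {
    "Alpaca", "Conversational", "CPT", "DPO", "ORPO",
    "Text_Completion", "CSV", "Inference", "Unsloth_Studio", "Vision",
}

# <Model Name>-<Type>.ipynb, split on the last hyphen in the file name.
_NAME_PATTERN = re.compile(r"^(?P<model>.+)-(?P<type>[^-]+)\.ipynb$")


def parse_notebook_name(filename: str) -> tuple[str, str]:
    """Return (model_name, notebook_type) or raise ValueError.

    Hypothetical helper: it only checks the file name, not the notebook body.
    """
    match = _NAME_PATTERN.match(filename)
    if match is None:
        raise ValueError(
            f"{filename!r} does not look like '<Model Name>-<Type>.ipynb'"
        )
    model, nb_type = match["model"], match["type"]
    if nb_type not in KNOWN_TYPES:
        raise ValueError(f"unrecognized notebook type {nb_type!r} in {filename!r}")
    return model, nb_type


if __name__ == "__main__":
    # The two example names given in the guide above.
    for name in ("Mistral_v0.3_(7B)-Alpaca.ipynb", "Llava_v1.6_(7B)-Vision.ipynb"):
        print(name, "->", parse_notebook_name(name))
```

Splitting on the last hyphen keeps model names that themselves contain hyphens intact. If the project accepts types beyond those listed, extend `KNOWN_TYPES` rather than loosening the pattern.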

About

250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.
