Learning AI

This repository contains notes and code examples related to AI/ML, with a focus on understanding the fundamentals of large language models, inference engines, and hardware acceleration.

In-progress

Hailo Hailo-10H AI accelerator (NPU), Raspberry Pi AI HAT+
Parakeet — Supporting Parakeet in whisper.cpp
Kimi-Linear
CUDA FA exploration

Notes

Model Architectures

Model Formats & Quantization

Attention & Embeddings

Attention
Attention Sink
Flash Attention
Sage Attention
Ring Attention
MLA
Position Embeddings
- RoPE
- ALiBi
- Positional Encoding
- YARN
- XPOS
- LongRoPE
- P-ROPE
- MRL
- PLE
- GGML RoPE
- Embeddings
Tokenization
- BPE
- WordPiece
- SentencePiece
- Unigram
- RWKV
- Tiktoken
Word Embeddings
Normalization
Softmax / Logits / Logits
Residual Connections
Activation Functions
Loss Functions
Exp
One-Hot Encoding
Control Vectors
GRITLM

Inference & Decoding

Training & Fine-tuning

Hardware & Acceleration

CPU

GPU

NPU / Other

Audio & Speech

Whisper
VAD
Audio Notes
- Conformer
- DTW
- Mel
- LRC
- SRT
- VTT
- SDL2
- Whisper Stream

Vision & Multimodal

Agents & Applications

Miscellaneous Topics

LLM Overview
Diffusion / Stable Diffusion / Stable Diffusion
Apache Arrow
ONNX
PyTorch
vLLM
TRT-LLM
Mistral
Bloom
Granite Model
Mod
Minja
Trie
Symbols
Variables
Count-based
Background
Security
Memory
Android
Colab
Groq
ROC
zDNN
Spark
Copilot

Code

Fundamentals

Exploration code for core AI/ML concepts, libraries, and frameworks.

Project	Description
GGML	GGML C++ library exploration
Llama.cpp	Llama.cpp library exploration (inference, finetuning)
Python	Python ML examples
Rust	Rust ML examples (llm-chains, tch-rs, etc.)
vLLM	vLLM exploration
OpenVINO	OpenVINO Python examples
OpenVINO C++	OpenVINO C++ examples
PyTorch	PyTorch & pybind examples
SIMD	SIMD instruction exploration
SIMD Assembly	Low-level SIMD assembly
SVE	ARM SVE exploration
NEON	ARM NEON examples
AMX	Intel AMX exploration
VNNI	VNNI instruction exploration
BLAS	OpenBLAS exploration
ROCm	AMD ROCm examples
SYCL	SYCL examples
KleidiAI	KleidiAI examples
Grammars	LLaGuidance grammar exploration
Tokenization	Tokenization examples
Data Structures	ML-relevant data structures
Image Processing	Image processing examples
JavaScript	TensorFlow.js examples
WASM	WebAssembly NN examples
Whisper	Whisper.cpp exploration
Templates	Minja template engine

GPU Code

GPU compute exploration across multiple APIs.

Project	Description
CUDA	CUDA examples in C++
OpenCL	OpenCL examples
Vulkan	Vulkan examples
Kompute	Kompute (Vulkan compute) examples
Metal	Metal examples
ROCm	AMD ROCm/HIP examples
WebGPU	WebGPU examples
XRT	XRT examples

NPU Code

Neural Processing Unit exploration (Hailo).

Project	Description
Hailo	Hailo-10H AI accelerator, Raspberry Pi AI HAT+

Vector Databases

Vector database examples and exploration.

Project	Description
Qdrant	Qdrant examples (Python, Rust)
LanceDB	LanceDB examples (Python, Rust)

Embeddings

Word and sentence embedding examples.

Project	Description
Rust	Embeddings examples in Rust

Audio Code

Audio processing and speech-to-text.

Project	Description
Silero VAD	Silero Voice Activity Detection
Whisper.cpp	Whisper.cpp submodule

Agents Code

AI agent frameworks and examples.

Project	Description
llama-cpp-agent	AI agent using llama.cpp

Huggingface API

Language	Description
Python	Huggingface API example
Rust	Candle example

Notes Index

For a complete list of all notes, see the notes directory.

Name		Name	Last commit message	Last commit date
Latest commit History 2,614 Commits
.github		.github
agents		agents
audio		audio
embeddings		embeddings
fundamentals		fundamentals
gpu		gpu
hugging-face		hugging-face
notes		notes
npu/hailo		npu/hailo
vector-databases		vector-databases
.env.sh		.env.sh
.gitmodules		.gitmodules
README.md		README.md

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Learning AI

In-progress

Table of Contents

Notes

Code

Notes

Model Architectures

Model Formats & Quantization

Attention & Embeddings

Inference & Decoding

Training & Fine-tuning

Hardware & Acceleration

CPU

GPU

NPU / Other

Audio & Speech

Vision & Multimodal

Agents & Applications

Miscellaneous Topics

Code

Fundamentals

GPU Code

NPU Code

Vector Databases

Embeddings

Audio Code

Agents Code

Huggingface API

Notes Index

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages