Basics of Transformers
Learn tokens, vectors, embeddings, attention, positional information, and why Transformer architecture changed sequence modeling.
- Vocabulary
- Transformer intuition
- Attention math
- Paper reading
Learn the artificial intelligence engineering skills you actually need for real jobs: Transformers, large language models, retrieval-augmented generation, AI agents, fine-tuning, model literacy, and production infrastructure. No encyclopedia-style detours; each topic is chosen because it shows up in professional AI systems.
The curriculum starts with Transformer basics, then expands into LLMs, RAG, agents, data pipelines, training, fine-tuning, and production systems. Every module is selected for job-ready understanding, implementation, or debugging work.
Learn tokens, vectors, embeddings, attention, positional information, and why Transformer architecture changed sequence modeling.
Understand next-token prediction, context windows, decoding, prompting, instruction tuning, model limits, and evaluation basics.
Build systems that retrieve outside knowledge, rank evidence, cite sources, and reduce hallucinations with measurable grounding.
Design model workflows that plan, call tools, inspect results, recover from errors, and stop safely.
Prepare, clean, version, and evaluate the data that powers retrieval, training, fine-tuning, and production feedback loops.
Understand datasets, loss, optimization, checkpoints, validation, regression tests, and how model behavior changes during training.
Learn when to adapt a model with examples, how to format training data, and how to compare the result against prompting or RAG.
Operate AI features with serving, latency, caching, monitoring, tracing, cost controls, and incident response.
After the foundations, build small projects that exercise the professional skills: evaluation, retrieval, tool use, dataset planning, and release gates.
Create a tiny evaluation harness that sends the same task examples to two prompts or models, scores the outputs, and prints a pass/fail report.
First buildBuild a local question-answering app over a small document folder, retrieve source chunks, and answer only when evidence is available.
First buildBuild a step-limited agent with one read-only tool, strict argument validation, structured observations, and a trace of every decision.
BuilderPrepare a small supervised fine-tuning dataset plan, split it correctly, define quality checks, and decide whether fine-tuning is justified.
Production gateAdd tracing, metrics, and a pre-release evaluation gate to one AI workflow so production changes are measurable and reversible.