What is the ChatGPT Bootcamp?

The ChatGPT Bootcamp is a free beginner-friendly course that teaches ChatGPT fundamentals, prompting skills, and how to use AI tools effectively.

Who is this course for?

The course is designed for beginners, students, professionals, and anyone who wants to learn how to use ChatGPT to improve productivity and creativity.

Is this course really free?

Yes. The ChatGPT Bootcamp is 100% free with no subscription required.

How long does the course take?

The course can be completed at your own pace and typically takes a few hours depending on your learning speed.

Do I get a certificate?

The course does not currently provide a certificate, but it offers practical skills and hands-on exercises.

Start your AI Journey

Build an LLM from Scratch 2: Working with text data

staff

May 21, 2026

Links to the book:
– https://amzn.to/4fqvn0D (Amazon)
– https://mng.bz/M96o (Manning)

Link to the GitHub repository: https://github.com/rasbt/LLMs-from-scratch

This is a supplementary video going over text data preparations steps (tokenization, byte pair encoding, data loaders, etc.) for LLM training.

00:00 2.2 Tokenizing text
14:02 2.3 Converting tokens into token IDs
23:56 2.4 Adding special context tokens
30:26 2.5 Byte pair encoding
44:00 2.6 Data sampling with a sliding window
1:07:10 2.7 Creating token embeddings
1:15:45 2.8 Encoding word positions

You can find additional bonus materials on GitHub:

Byte Pair Encoding (BPE) Tokenizer From Scratch, https://github.com/rasbt/LLMs-from-scratch/blob/main/ch02/05_bpe-from-scratch/bpe-from-scratch.ipynb

Comparing Various Byte Pair Encoding (BPE) Implementations, https://github.com/rasbt/LLMs-from-scratch/blob/main/ch02/02_bonus_bytepair-encoder/compare-bpe-tiktoken.ipynb

Understanding the Difference Between Embedding Layers and Linear Layers, https://github.com/rasbt/LLMs-from-scratch/blob/main/ch02/03_bonus_embedding-vs-matmul/embeddings-and-linear-layers.ipynb

Data sampling with a sliding window with number data, https://github.com/rasbt/LLMs-from-scratch/blob/main/ch02/04_bonus_dataloader-intuition/dataloader-intuition.ipynb

A video on the effect of random seeds: https://www.youtube.com/watch?v=ii89_SqKB08&feature=youtu.be

source

Categories: OpenAI

Start your AI Journey

Build an LLM from Scratch 2: Working with text data

Like this:

Related Posts :-

Your cart (items: 0)

Build an LLM from Scratch 2: Working with text data

Share this:

Like this:

Related Posts :-

Parts of Dialysis machine #nephrologist #dialysis #dialysisstudy #viralshorts #kidney

Get Free Google & Microsoft Swags in 2026 | 5 Official Programs Explained

police 😀 22Age Gemini prompt 👇 #gemini #ai #viral #army #police #photoediting #indianarmy #trending