How are LLMs built?



In this video, we walk through the 5-step process of building LLMs: data curation, tokenization, model architecture, model training, and evaluation.

⭐️ Timestamps ⭐️
00:00 Introduction
00:34 Data curation
06:50 Tokenization
08:03 Model architecture
12:08 Model training
21:52 Evaluation

Transformer architecture: https://youtu.be/ZhAz268Hdpw?si=XQWQ56HPj4PXbpEk

Datasets:
– Math Solving: https://huggingface.co/datasets/open-r1/OpenR1-Math-220k/viewer/default/train
– Physics Q&A: https://huggingface.co/datasets/herronej/SciTrust2-PhysicsQA
– Instruction Following: https://huggingface.co/datasets/HuggingFaceH4/instruction-dataset
– Chat: https://huggingface.co/datasets/Isotonic/human_assistant_conversation

Reference videos:
– https://youtu.be/OkEGJ5G3foU?si=K-gadWZMz7HQmWQU&t=585
– https://youtu.be/ZLbVdvOoTKM?si=BOnGPxLlngz0Q04n

Do you want to learn technology from me? Check https://codebasics.io/?utm_source=description&utm_medium=yt&utm_campaign=description&utm_id=description for my affordable video courses.

Need help building software or data analytics/AI solutions? My company https://www.atliq.com/ can help. Click on the Contact button on that website.

🎥 Codebasics Hindi channel: https://www.youtube.com/channel/UCTmFBhuhMibVoSfYom1uXEg

#️⃣ Social Media #️⃣

🧑‍🤝‍🧑 Discord for Community Support: https://discord.gg/r42Kbuk
📸 Codebasics’ Instagram: https://www.instagram.com/codebasicshub/
📝 Codebasics’ LinkedIn: https://www.linkedin.com/company/codebasics/

——

📝 Dhaval’s LinkedIn: https://www.linkedin.com/in/dhavalsays/
📝 Hem’s LinkedIn: https://www.linkedin.com/in/hemvad/

📽️ Hem’s Instagram for daily tips: https://www.instagram.com/hemvadivel/
📸 Dhaval’s Personal Instagram: https://www.instagram.com/dhavalsays/

🔗 Patreon: https://www.patreon.com/codebasics?fan_landing=true
