Lightning AI转发了
How do you build a LLM from Scratch? This is the key question that I discussed with Sebastian Raschka, PhD on the AI Stories Podcast! For those who still don’t him, Sebastian is a Senior Staff Research Engineer at Lightning AI and a bestselling book author. He recently released a new book: "Build A Large Language Model From Scratch". This is a technical conversation around LLMs, we cover various topics including:? ?? His new book: "How to Build A LLM From Scratch" (link in the comments) ?? Differences in architecture between GPT2 and more recent LLMs like Llama 3.1 ?? How to build high performing LLMs ?? How to train multimodal LLMs ?? Lightning AI, PyTorch Lightning and litgpt libraries to train, finetune & deploy LLMs ?? Long context windows vs RAG And much more ... What are you waiting for? Go and listen to our conversation on your favourite platforms ??: Youtube: https://lnkd.in/e4h9vWGS Spotify: https://lnkd.in/exCbPMis Apple Podcast: https://lnkd.in/eXpDrdxg Have you read Sebastian's book? Keen to get your thoughts in the comments! What other LLM books do you recommend? Please share them in the comments as well :)
Intriguing post. Fascinating insights into LLM architecture evolution.
Insightful
Very informative, thanks a lot for sharing
Interesting, looking forward to reading the book.
Love this
Thanks again Sebastian Raschka, PhD, it was a real pleasure to have you on the show!
Data Scientist & Machine Learning Engineer Specialized in Deep Learning , NLP & LLMs
2 天前Neil Leiser Thaks for sharing ... here are the three more books that are highly recommended with this book ... 1- Deep Learning for Natural Language Processing" by Yoav Goldberg 2- "Natural Language Processing (almost) from Scratch" by Collobert et al. 3- "Transformers: State-of-the-Art Natural Language Processing" by Lewis et al.