AI Stories转发了
How do you build a LLM from Scratch? This is the key question that I discussed with Sebastian Raschka, PhD on the AI Stories Podcast! For those who still don’t him, Sebastian is a Senior Staff Research Engineer at Lightning AI and a bestselling book author. He recently released a new book: "Build A Large Language Model From Scratch". This is a technical conversation around LLMs, we cover various topics including:? ?? His new book: "How to Build A LLM From Scratch" (link in the comments) ?? Differences in architecture between GPT2 and more recent LLMs like Llama 3.1 ?? How to build high performing LLMs ?? How to train multimodal LLMs ?? Lightning AI, PyTorch Lightning and litgpt libraries to train, finetune & deploy LLMs ?? Long context windows vs RAG And much more ... What are you waiting for? Go and listen to our conversation on your favourite platforms ??: Youtube: https://lnkd.in/e4h9vWGS Spotify: https://lnkd.in/exCbPMis Apple Podcast: https://lnkd.in/eXpDrdxg Have you read Sebastian's book? Keen to get your thoughts in the comments! What other LLM books do you recommend? Please share them in the comments as well :)