What is an LLM?
Atlanta in the future

What is an LLM?

LLM’s learn patterns from giant datasets of media and produce multi-modal content.

An LLM is a kind of neural network that is trained on a massive dataset of human-created media (text, audio, images, and more). They are designed so that when you give it an input (otherwise known as a prompt), it will generate an output based on your prompt and the patterns it learned from its training dataset.

LLM’s can be used to create queries (SQL,KQL), code (Python scripts, Java), images and videos that can capture your imagination, audio that sounds like celebrities, and text that can be either creative or research-focused in nature. Keep in mind that while the text it produces is human-sounding, it is not entirely accurate. LLM’s merely predict the next word based on three factors: the previous words they have generated in the interaction, the prompt, and its training data.

Fine Tuning

Most LLM’s were created to be general-purpose, and based on their giant dataset and the Deep Learning techniques that trained them, they have a lot of connected information they rely upon. However, when you need an LLM to be very specific, you can give it data for fine tuning, focusing its informational universe on the tasks you want it to execute. Good examples of this are a document containing company’s knowledge base that helps them execute customer service requests. By being able to have data that is directly related to what the company does, and focusing all queries on that data, the LLM is able to extract more information in a faster and more accurate way.

Agentic Workflows

A fascinating application of LLM’s are agentic workflows. Usually, people go into an LLM’s interface and type in what they want and get an output. What an agentic workflow does is it divides a single prompt into multiple input-output pairs: input #1’s output becomes prompt #2’s input and so on until you have completed all the steps related to the task. It is a way of ensuring more thought and computational energy goes into producing the overall result. Think of it like this: you’re using a single prompt, converting it into 2 (or more) prompts, and creating a single, more refined output. This can be done on the backend in a programming-defined way to make things far more interesting. The game with this is just beginning. Add fine-tuning data that maps on to your goals and you’re off to the races.

For example, when you tell the LLM: plan a project for writing a new application. It will divide that prompt into multiple prompts and steps that create the final output, like: 1) “state the goal of creating a project for a new application”, 2) “outline all the necessary steps that are necessary to complete the project from start to finish”, and 3) “then generate a succinct outline of all the steps”. I’m sure you can get more creative than that. Each time, the outputs can be used to give the model additional context. Now imagine creating plans for: creating a new material that replaces plastic, designing a GPU, and picking AI companies to invest in.

Giving Context

A cool idea I’ve always thought about is teaching an LLM to become a wizard, giving it fine-tuning data that will make it respond as a wizard and infuse additional customized context into your responses. You can give the wizard .txt files that show how the wizard has operated in the past and the things is should know (like where the medicinal herbs are in the forest). The LLM will draw from its knowledge of the world to ‘understand’ what a wizard is through a contextual search of its training data and the fine-tuning data you have provided. You could add agentic workflows in the code in order to get more powerful and refined answers.

Conclusion

Large Language Models are a new technology that are revolutionizing the way data is created. Companies like IBM are getting after it to ensure that business are maximizing the value of this new and dynamic tech stack. I’m excited to continue writing about this topic. Feel free to leave any questions or additional topics you’d like me to research in the comments. This technology is just the beginning of a new revolution. Stay tuned, stay safe, and make sure you use it right.

Janine Vici-Senior

private practice, craniosacral therapist.

9 个月

I always learn something innovative and creative from you. I confess I'm wary of all this AI innovation. I grew up with scary Sci Fi that ????????would be our overlords!!

要查看或添加评论,请登录

Mattias Acosta的更多文章

  • Learn How AI Agents Use Knowledge to Make Smarter, Faster Decisions (Without Jargon)

    Learn How AI Agents Use Knowledge to Make Smarter, Faster Decisions (Without Jargon)

    Knowledgeable people run the world. But what if AI had the capacity to hold and utilize knowledge? Spoiler: It already…

  • Medicine in the Age of AI

    Medicine in the Age of AI

    Medicine is evolving. Every day, we understand more about our bodies at the cellular, physiological, and psychological…

  • ML vs Deep Learning vs Generative AI

    ML vs Deep Learning vs Generative AI

    Machine Learning, Deep Learning, and Generative AI are accelerating business. It is important to understand them so…

    15 条评论
  • Learn how to create a strategy and organize your life

    Learn how to create a strategy and organize your life

    Cal Newport is a successful distributed computing theoretician and productivity writer. Recently, I have been reading…

  • How to Secure your Data

    How to Secure your Data

    A world that leverages data to do just about everything needs infrastructure. The most competitive companies of our…

    8 条评论
  • An Adventure to the Tallest Waterfall on the Planet

    An Adventure to the Tallest Waterfall on the Planet

    This is not one of my traditional posts. This is an adventure.

    1 条评论
  • What is Retrieval Augmented Generation?

    What is Retrieval Augmented Generation?

    Retrieval Augmented Generation — this AI technique uses an external authoritative data source in order to enhance a an…

    1 条评论
  • Get an Automated Sensitive Data Inventory for your Cloud

    Get an Automated Sensitive Data Inventory for your Cloud

    As you and your development team continue building out your cloud infrastructure to manage your company’s compute, you…

    1 条评论
  • Why is Reading Valuable?

    Why is Reading Valuable?

    Reading gives us the perspective of others, bringing us thoughts, experiences, and knowledge. It is a medium through…

    1 条评论
  • What is a computer network?

    What is a computer network?

    Without data pathways connecting networks of computers, computing would be a very solitary enterprise. A network is a…

社区洞察

其他会员也浏览了