Large Language Models - LLMs
Jose OLIMPIO dos Santos FILHO
Electronics Engineering Technologist | Manufacturing Process | IT Automation | 6σ Black Belt
Definition: Large language models (LLMs) are a type of language model that can perform various natural language processing (NLP) tasks, such as text generation, summarization, translation, question answering, and more. LLMs are trained on massive amounts of data, usually from the web, and learn billions of parameters that capture the patterns and structures of natural language. LLMs use a neural network architecture called transformers, which rely on self-attention to process long text sequences efficiently and effectively.
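To make the self-attention idea concrete, here is a minimal sketch of scaled dot-product self-attention, the core operation inside a transformer layer. This is not any production model's code; the matrix names, sizes, and single-head setup are illustrative assumptions.

```python
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """Minimal single-head scaled dot-product self-attention.

    x           : (seq_len, d_model) input token embeddings
    w_q/w_k/w_v : (d_model, d_k) learned projection matrices
    """
    q = x @ w_q                                   # queries
    k = x @ w_k                                   # keys
    v = x @ w_v                                   # values
    scores = q @ k.T / np.sqrt(k.shape[-1])       # pairwise similarity, scaled
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over each row
    return weights @ v                            # each position mixes all values

# Toy example: 4 tokens, 8-dimensional embeddings
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
w_q, w_k, w_v = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(x, w_q, w_k, w_v).shape)     # -> (4, 8)
```

Because every position attends to every other position in one matrix operation, long sequences can be processed without the step-by-step recurrence older architectures needed.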
Some examples of LLMs are:
- GPT-3: This is a generative pre-trained transformer model developed by OpenAI. It has 175 billion parameters and can generate coherent, diverse texts on various topics and domains. It can also perform zero-shot or few-shot learning, which means it can adapt to new tasks with little or no additional training data, simply by being shown instructions or examples in the prompt (see the short prompting sketch after this list). When it was released in 2020, GPT-3 was among the most advanced LLMs available.
- PaLM 2: This is a pre-trained language model developed by Google, reportedly with around 340 billion parameters (Google has not published an official figure). It can perform multiple NLP tasks, such as natural language understanding, natural language generation, and natural language reasoning, and it is notably strong at multilingual tasks and coding. PaLM 2 is a Transformer-based model trained with a mixture of objectives on a large multilingual corpus.
- BLOOM: This is an open-access multilingual language model developed by the BigScience research collaboration coordinated by Hugging Face. It has 176 billion parameters and is a decoder-only Transformer trained for causal language modelling, which means it predicts the next token in a text. BLOOM was trained on 46 natural languages and 13 programming languages, making it one of the most multilingual openly available LLMs.
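As a concrete illustration of the few-shot learning mentioned for GPT-3, here is a minimal prompting sketch. The Hugging Face pipeline call is real, but the small "gpt2" checkpoint and the translation task are stand-ins chosen so the snippet runs locally; a GPT-3-class model would be called through its own hosted API instead.

```python
# Few-shot prompting: the task is "taught" entirely through the prompt,
# with no additional training. "gpt2" is only a small local stand-in.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

few_shot_prompt = (
    "Translate English to French.\n"
    "English: cheese\nFrench: fromage\n"
    "English: bread\nFrench: pain\n"
    "English: apple\nFrench:"
)

result = generator(few_shot_prompt, max_new_tokens=5, do_sample=False)
print(result[0]["generated_text"])
```

The point is the pattern, not the quality of this tiny model's answer: the examples inside the prompt are what steer the model toward the task.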
Is an LLM artificial intelligence, or is it just a computer's logical response?
Large language models (LLMs) are a type of artificial intelligence (AI) specialised in natural language processing tasks such as text generation, summarization, translation, and question answering. AI is the theory and development of computer systems capable of performing tasks that historically required human intelligence. LLMs are therefore a subset of AI that takes natural language as its main input and output modality.
However, not all computer logical responses are AI or LLMs. A computer logical response is any output that a computer system produces based on some input and some rules or algorithms. For example, a calculator produces a logical response by applying arithmetic operations to numbers, but this is not considered AI, let alone an LLM. AI and LLMs require more complex and sophisticated algorithms that can learn from data and mimic aspects of human intelligence.
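A toy sketch of that distinction: the calculator-style function below is fully specified by hand-written rules, while the second function's behaviour comes from parameters fitted to data, a very loose stand-in for the billions of parameters an LLM learns from text. The example is illustrative only.

```python
import numpy as np

# A fixed, hand-written rule: every behaviour is spelled out by the programmer.
def calculator_add(a, b):
    return a + b

# A learned mapping: behaviour comes from parameters fitted to examples.
inputs = np.array([1.0, 2.0, 3.0, 4.0])
targets = 2.0 * inputs + 1.0                  # the "data" to learn from
w, b = np.polyfit(inputs, targets, deg=1)     # fit slope and intercept

def learned_model(x):
    return w * x + b

print(calculator_add(2, 3))    # 5, by rule
print(learned_model(5.0))      # ~11.0, by fitted parameters
```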
So is there any cognitive action in LLMs?
Depending on how the LLM is used and what kind of tasks it performs, there may be some cognitive involvement in an LLM application. Cognitive involvement refers to the degree to which a task requires mental effort, attention, and processing. Some LLM applications may require more cognitive involvement than others, such as those that involve reasoning, planning, or problem-solving. For example, an LLM application that generates a summary of a text may require less cognitive involvement than one that generates a plan for a trip.
However, cognitive involvement in an LLM application does not necessarily imply that the LLM itself has cognitive abilities or human-like intelligence. LLMs are trained on large amounts of text data and learn to produce outputs that are coherent and relevant to their inputs, but they do not have a deep understanding of the meaning, context, or implications of the texts they process or generate. Nor do they have any intrinsic goals, motivations, or emotions that drive their behaviour. Therefore, LLMs are not cognitively equivalent to humans, and their outputs should not be taken at face value without verification and evaluation.
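One simple way to put that verification into practice is to score a model's answers against reference answers before trusting it on a task. The sketch below assumes a hypothetical ask_llm helper standing in for whatever model call you actually use.

```python
# Minimal sketch of verifying LLM outputs against reference answers.
# `ask_llm` is a hypothetical placeholder; swap in your real model call.
def ask_llm(question: str) -> str:
    raise NotImplementedError("replace with a call to your LLM of choice")

def evaluate(test_cases: list[tuple[str, str]]) -> float:
    """Return the fraction of questions the model answers exactly right."""
    correct = 0
    for question, expected in test_cases:
        answer = ask_llm(question).strip().lower()
        correct += int(answer == expected.strip().lower())
    return correct / len(test_cases)

sample_cases = [
    ("What is the capital of France?", "paris"),
    ("How many days are in a week?", "7"),
]
# score = evaluate(sample_cases)   # run once ask_llm is wired to a real model
```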
Even though they are not truly intelligent, LLMs are robust and highly productive tools. The recent imbroglio involving OpenAI may or may not have much to do with this. Who knows?
References
Artificial intelligence (AI) | Definition, Examples, Types .... https://www.britannica.com/technology/artificial-intelligence
What Is Artificial Intelligence? Definition, Uses, and Types. https://www.coursera.org/articles/what-is-artificial-intelligence
Artificial intelligence - Wikipedia. https://en.wikipedia.org/wiki/Artificial_intelligence
Artificial intelligence Definition & Meaning - Merriam-Webster. https://www.merriam-webster.com/dictionary/artificial%20intelligence
Conversation with Bing, 2023-11-30.
The enterprise generative AI application lifecycle with Azure AI .... https://azure.microsoft.com/en-us/blog/building-for-the-future-the-enterprise-generative-ai-application-lifecycle-with-azure-ai/
This one is very good: Evaluating Cognitive Maps in Large Language Models with CogEval: No .... https://www.microsoft.com/en-us/research/uploads/prod/2023/09/cogEval_Cognitive_Maps_and_planning_in_LLMs.pdf
Step-by-Step Guide to Integrate Azure Cognitive’s Vector ... - Medium. https://medium.com/@akshaykokane09/building-knowledge-base-for-your-llm-powered-app-using-azure-cognitive-search-part-1-4686127c49cb
Capabilities and alignment of LLM cognitive. https://www.alignmentforum.org/posts/ogHr8SvGqg9pW5wsT/capabilities-and-alignment-of-llm-cognitive-architectures
The Future of Large Language Models (LLMs): Strategy .. - INDIAai. https://indiaai.gov.in/article/the-future-of-large-language-models-llms-strategy-opportunities-and-challenges