A Brief History Of AI (part 1)
Sometimes, in a single day, there are so many reports on groundbreaking AI discoveries and novel AI-supported applications that it becomes challenging to process everything and grasp where the field is heading.
In response, I've decided to step back and explore the historical origins of AI. This 3-part series, presented in non-technical language, concisely lists some fundamental AI achievements. It is necessarily incomplete and somewhat subjective, but I hope it serves as a useful guide through the current media frenzy surrounding AI.
Each part of the series, starting from 1950, 1997, and 2017 respectively, will highlight key milestones and developments, offering a chronological journey through AI's evolution. The division into three parts is tailored to fit the image constraints of a LinkedIn article.
I warmly welcome feedback on any aspect of the articles, including any topics you feel are missing or wish to see discussed more deeply, in the comments section. Let's embark on this exploration together with this first installment.
1950 - Alan Turing Proposes a Test for Machine Intelligence
In 1950 Alan Turing published "Computing Machinery and Intelligence," proposing what is now known as the Turing Test. The Turing Test is performed by having a human evaluator interact with two entities, one a machine and the other a human, through a computer interface that conceals their identities. The evaluator then asks questions or engages in conversation, and based on the responses, must determine which entity is the machine. If the evaluator consistently cannot distinguish the machine from the human based on the responses, the machine is considered to have passed the test, demonstrating human-like intelligence.
1956 - The Term "Artificial Intelligence" (AI) is coined
This happened at the famous Dartmouth conference held in 1956. The proposal for the conference, written by John McCarthy, Marvin Minsky, Nathaniel Rochester, and Claude Shannon, provided an early definition of artificial intelligence. They described it as:
"Every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it."
This definition was groundbreaking as it suggested that any feature of human intelligence could be simulated by a machine.
1957 - Rosenblatt's Perceptron - A single-layer network that can learn to classify simple patterns
The perceptron is a fundamental type of artificial neural network and one of the earliest models developed for machine learning. Invented in 1957 by Frank Rosenblatt, it was designed to mimic the way a human brain processes information. A perceptron consists of input nodes, each associated with a weight, which are combined into a weighted sum. This sum is then passed through an activation function, which determines the output of the perceptron. Initially conceived for tasks like pattern recognition, the perceptron laid the groundwork for more complex neural networks, despite its limitation of only being able to solve linearly separable problems (problems with two classes whose examples can be perfectly separated by a line or, more generally, a hyperplane). The formal proof that the perceptron learning algorithm learns any linearly separable problem in a finite number of steps caused huge excitement and raised expectations of immensely smart learning machines. But this came to a sudden end in 1969 (see below).
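For readers who like to see the mechanics, here is a minimal Python sketch of perceptron-style learning on a toy, linearly separable problem (logical AND); the data, learning rate, and number of epochs are illustrative choices, not Rosenblatt's original setup.

```python
import numpy as np

# Toy example: a perceptron learning the logical AND function,
# which is linearly separable (illustrative setup, not Rosenblatt's original).

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 0, 0, 1])            # AND labels

w = np.zeros(2)                       # one weight per input
b = 0.0                               # bias (threshold)

for epoch in range(10):
    for xi, target in zip(X, y):
        pred = 1 if xi @ w + b > 0 else 0   # weighted sum + step activation
        # perceptron learning rule: adjust weights only on mistakes
        w += (target - pred) * xi
        b += (target - pred)

print([1 if xi @ w + b > 0 else 0 for xi in X])   # -> [0, 0, 0, 1]
```

The essential point is the learning rule: the weights change only when the prediction is wrong, and for linearly separable data this procedure is guaranteed to stop after finitely many corrections.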
1964-1967 - The Program Eliza - A mock Psychotherapist
Eliza was an early computer program created in the 1960s to chat with people. Made by Joseph Weizenbaum at MIT, it worked by recognizing keywords and rephrasing the user's own words, making it seem as if it understood what was being said. The most well-known version acted like a therapist, turning statements back into questions to keep people talking. Eliza was simple and followed fixed rules, but it was one of the first programs to show how computers could mimic human conversation. Weizenbaum invited students and colleagues to interact with the system, and they quickly became engaged in deep (from their point of view) conversations with the program. At one point his secretary, who had watched him program the system for many months, even insisted that he leave the room so she could talk to Eliza in private.
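To give a flavor of how little machinery is involved, here is a tiny Python sketch in the spirit of Eliza's keyword-and-substitution rules; the patterns below are invented for illustration and are not Weizenbaum's original script.

```python
import re

# Illustrative Eliza-style rules: match a keyword pattern, reflect the
# user's own words back, and wrap them in a canned question.

REFLECTIONS = {"i": "you", "my": "your", "am": "are", "me": "you"}

RULES = [
    (re.compile(r"i need (.*)", re.I), "Why do you need {0}?"),
    (re.compile(r"i am (.*)", re.I),   "How long have you been {0}?"),
    (re.compile(r"my (.*)", re.I),     "Tell me more about your {0}."),
]

def reflect(text: str) -> str:
    # swap first-person words for second-person ones ("my" -> "your", ...)
    return " ".join(REFLECTIONS.get(w.lower(), w) for w in text.split())

def respond(sentence: str) -> str:
    for pattern, template in RULES:
        match = pattern.match(sentence)
        if match:
            return template.format(reflect(match.group(1)))
    return "Please, tell me more."    # default prompt to keep the user talking

print(respond("I am worried about my exams"))
# -> "How long have you been worried about your exams?"
```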
1969 - Proof that Perceptrons are (very) limited!
In 1969, Marvin Minsky and Seymour Papert published "Perceptrons," a seminal work that critically analyzed the limitations of perceptrons, an early form of neural network. They demonstrated that perceptrons are incapable of representing non-linearly separable functions, such as the XOR function. This revelation significantly dampened the enthusiasm for neural networks, contributing to reduced funding and interest in AI research. This shift in perception played a crucial role in the onset of the first AI Winter, a period of stagnation in AI development that lasted into the early 1980s. It was already clear at the time that multi-layer networks were more capable, but since no efficient learning algorithm for them was known, this insight did not help.
1975-early 1980s - First AI Winter
The AI Winter of the late 1970s was a period of reduced funding and interest in artificial intelligence, caused by the failure to meet the overly ambitious expectations set in the field's early days. Key challenges in natural language processing, machine learning, and computer vision, coupled with the era's limited computational power, led to widespread skepticism and a consequent downturn in AI research and development. This period marked a recalibration in the AI community, shifting focus to more realistic goals.
1980s - Expert Systems
Expert systems are rule-based AI programs that simulate the decision-making ability of a human expert in specific domains. Developed primarily in the 1980s, these systems combine a comprehensive knowledge base with an inference engine to apply rules to data and solve complex problems. They were particularly significant in fields like medicine and engineering, where they assisted experts in diagnosis and decision-making. Their importance in AI history lies in their demonstration of how machines can use rules and knowledge to address specialized tasks.
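As a rough illustration of the knowledge-base-plus-inference-engine idea, here is a minimal forward-chaining sketch in Python; the medical-style facts and rules are invented for this example and not taken from any real 1980s system.

```python
# A minimal rule-based "expert system": a small knowledge base of facts
# plus a forward-chaining inference engine (illustrative rules only).

facts = {"fever", "cough"}

rules = [
    ({"fever", "cough"}, "possible_flu"),
    ({"possible_flu"}, "recommend_rest"),
]

# forward chaining: repeatedly fire rules whose conditions are all satisfied
changed = True
while changed:
    changed = False
    for conditions, conclusion in rules:
        if conditions <= facts and conclusion not in facts:
            facts.add(conclusion)
            changed = True

print(facts)   # now also contains 'possible_flu' and 'recommend_rest'
```

Real expert systems of the era held thousands of such rules, but the basic loop of matching conditions against known facts and adding new conclusions is the same.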
1986 - Backpropagation for Training Multi-Layer Networks
Backpropagation, a method used for training artificial neural networks consisting of many layers, became widely known and influential in the 1980s, largely thanks to the work of Geoffrey Hinton and his colleagues (although the method itself can be traced back at least to the dissertation of Paul Werbos in 1974). This technique involves adjusting the weights of the neural network by propagating the error back through the network layers. It calculates the gradient of the error function with respect to the neural network's weights, allowing for efficient optimization. Backpropagation is still a core component in most modern neural network architectures (including Large Language Models like ChatGPT or Gemini).
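For the curious, the sketch below trains a small two-layer network on the XOR problem (the very problem a single perceptron cannot solve) with hand-coded backpropagation in NumPy; the network size, learning rate, and squared-error loss are illustrative choices rather than the original 1986 formulation.

```python
import numpy as np

# Toy backpropagation: a 2-layer network learning XOR (illustrative setup).

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

W1 = rng.normal(size=(2, 8))          # input -> hidden weights
b1 = np.zeros((1, 8))
W2 = rng.normal(size=(8, 1))          # hidden -> output weights
b2 = np.zeros((1, 1))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 1.0
for step in range(10000):
    # forward pass
    h = sigmoid(X @ W1 + b1)              # hidden activations
    out = sigmoid(h @ W2 + b2)            # network output

    # backward pass: propagate the error gradient layer by layer
    d_out = (out - y) * out * (1 - out)   # gradient at the output layer
    d_h = (d_out @ W2.T) * h * (1 - h)    # gradient at the hidden layer

    # gradient-descent weight updates
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0, keepdims=True)
    W1 -= lr * X.T @ d_h
    b1 -= lr * d_h.sum(axis=0, keepdims=True)

print(out.round(2).ravel())   # should approach [0, 1, 1, 0]
```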
1989 - Convolutional Neural Networks for Handwritten Digit Classification
LeNet-5, a pioneering convolutional neural network (CNN) developed in the late 1980s and early 1990s by Yann LeCun and colleagues, was designed primarily for postal digit recognition. CNNs apply small filters to local patches of pixels in one layer to identify patterns, which are then combined and abstracted in subsequent layers to recognize increasingly complex features in the data. LeNet-5 was adept at recognizing handwritten digits and was used by the United States Postal Service to automate the sorting of mail. Its architecture featured convolutional layers to detect local features in images, pooling layers to reduce spatial size, and fully connected layers for classification. Its success in accurately classifying digits on envelopes marked a significant advancement in the application of neural networks to practical, real-world tasks. LeNet-5 laid the foundational design principles for modern CNNs, which are now extensively used in various fields, including image and speech recognition.
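As a rough sketch of the architecture just described, here is a LeNet-style network written in PyTorch (a modern library that obviously did not exist in 1989); the layer sizes follow the commonly cited LeNet-5 description for 32x32 grayscale digit images.

```python
import torch
from torch import nn

# A LeNet-style CNN sketch: convolution -> pooling -> convolution -> pooling,
# followed by fully connected layers for classification.

class LeNetStyle(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, kernel_size=5),   # detect local features: 32x32 -> 28x28
            nn.Tanh(),
            nn.AvgPool2d(2),                  # pooling reduces spatial size: 28 -> 14
            nn.Conv2d(6, 16, kernel_size=5),  # 14x14 -> 10x10
            nn.Tanh(),
            nn.AvgPool2d(2),                  # 10 -> 5
        )
        self.classifier = nn.Sequential(      # fully connected layers for classification
            nn.Flatten(),
            nn.Linear(16 * 5 * 5, 120),
            nn.Tanh(),
            nn.Linear(120, 84),
            nn.Tanh(),
            nn.Linear(84, num_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x))

digits = torch.randn(8, 1, 32, 32)        # a dummy batch of 8 "digit" images
print(LeNetStyle()(digits).shape)         # torch.Size([8, 10]), one score per digit class
```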
1990s - Interest in Neural Networks Fades, Support Vector Machines Become Popular
In the 1990s, interest in neural networks began to diminish due to a lack of significant breakthroughs. Researchers, grappling with neural networks' limitations in complex problem-solving and computational constraints, shifted their attention to alternative methods. This shift was marked by the rise of support vector machines (SVMs), popularized by Vladimir Vapnik, and other kernel methods, known for their effectiveness in classification tasks. Simultaneously, the field saw advancements in reinforcement learning, notably through the work of Richard Sutton and Andrew Barto, and Bayesian networks, championed by researchers like Judea Pearl. These explorations led to significant progress, temporarily moving the spotlight away from neural networks until their resurgence in the 2000s, fueled by advancements in deep learning.