What’s New In Deep Learning Research: An IQ Test Proves that Neural Networks are Capable of Abstract Reasoning
Jesus Rodriguez
CEO of IntoTheBlock, Co-Founder of LayerLens, Faktory, and NeuralFabric, Founder of The Sequence AI Newsletter, Guest Lecturer at Columbia, Guest Lecturer at Wharton Business School, Investor, Author.
The ability to create abstractions from knowledge representations is one of the hallmarks of human intelligence. Arguably, the two most famous theories about the dynamics of the universe were derived as the result of abstractions. Legend has it that a young Isaac Newton was sitting under an apple tree when he was bonked on the head by a falling piece of fruit, a 17th-century “aha moment” that prompted him to suddenly come up with his law of gravity. In reality, things didn’t happen quite like that, but I will take the folklore version for the purposes of this article. The ability to relate two abstract concepts also allowed Albert Einstein to formulate the basics of the theory of relativity, as he reasoned that an equivalence exists between an observer falling in uniform acceleration and an observer in a uniform gravitational field. Abstract reasoning has long been used as an example of what separates human cognition from artificial intelligence (AI). Are AI systems capable of abstract thinking? A recent research paper from DeepMind suggests that they might be and proposes a methodology for evaluating abstract reasoning in deep neural networks.
Abstract thinking can be seen as a form of knowledge generalization, which is a widely used concept in deep learning systems. However, one of the main differences between abstraction and pure generalization is that the former is based on deriving new knowledge from seemingly unrelated data. One of the paradoxical characteristics of human abstract reasoning is that it is surprisingly quantifiable. In 1936, psychologist John Raven introduced the first version of an IQ test that is still widely used as a quantified estimator of human intelligence. One of the components of that test is the famous Raven’s Progressive Matrices (RPM), which consist of an incomplete 3x3 matrix of context images and a set of candidate answer images. The subject must decide which of the candidate images is the most appropriate choice to complete the matrix. To solve an RPM puzzle, the candidate must consider a large number of possible answers, which is why this test has long been considered a measure of eductive, fluid and, therefore, abstract reasoning.
An IQ Test for Neural Networks
In their research, DeepMind borrows some ideas from the RPM section of the IQ test in order to measure abstract reasoning in deep learning agents. Specifically, the researchers built a generator for creating RPM-like matrix problems involving a set of abstract factors such as the following:
· Relation types (R, with elements r): progression, XOR, OR, AND, consistent union.
· Object types (O, with elements o): shape, line.
· Attribute types (A, with elements a): size, type, color, position, number.
Using those primitives, DeepMind generated a dataset known as Procedurally Generated Matrices (PGM) that consists of triplets such as [progression, shape, color]. The relationship between the attributes in a triplet represents an abstract challenge. For instance, if the first attribute is progression, the values of the other two attributes must increase along rows or columns in the matrix.
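To make the structure of these triplets concrete, here is a minimal Python sketch that samples a (relation, object, attribute) triplet and illustrates the progression constraint on a single row. The set names and the toy progression_row helper are illustrative assumptions; the actual PGM generator described in the paper is considerably more involved.

```python
import random

# Hypothetical names for the primitive sets listed above.
RELATIONS = ["progression", "XOR", "OR", "AND", "consistent union"]
OBJECTS = ["shape", "line"]
ATTRIBUTES = ["size", "type", "color", "position", "number"]

def sample_triplet(rng=random):
    """Sample a single (relation, object, attribute) triplet."""
    return (rng.choice(RELATIONS), rng.choice(OBJECTS), rng.choice(ATTRIBUTES))

def progression_row(start, step, length=3):
    """Toy 'progression' relation: attribute values increase along a row."""
    return [start + i * step for i in range(length)]

if __name__ == "__main__":
    triplet = sample_triplet()
    print("sampled triplet:", triplet)
    if triplet[0] == "progression":
        # e.g. object sizes growing left-to-right across one row of the 3x3 matrix
        print("example row of attribute values:", progression_row(start=1, step=1))
```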
In order to show signs of abstract reasoning using PGM, a neural network must be able to explicitly compute relationships between different matrix images and evaluate the viability of each potential answer in parallel. To address this challenge, the DeepMind team created a new neural network architecture called the Wild Relation Network (WReN), in recognition of John Raven’s wife Mary Wild, who was also a contributor to the original IQ test.
In the WReN architecture, a convolutional neural network (CNN) processes each context panel and an individual answer choice panel independently to produce 9 vector embeddings. This set of embeddings is then passed to a relation network module, whose output is a single sigmoid unit encoding the “score” for the associated answer choice panel. Eight such passes are made through this network, one for each answer choice, and the scores are put through a softmax function to determine the model’s predicted answer.
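The following PyTorch sketch mirrors the scoring scheme described above: a shared CNN encodes the eight context panels plus one candidate panel into nine embeddings, a relation-network-style module aggregates all pairwise combinations into a single sigmoid score, and the eight candidate scores are normalized with a softmax. The layer sizes, module names (PanelEncoder, RelationModule), and input resolution are assumptions for illustration, not the hyperparameters used by DeepMind.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PanelEncoder(nn.Module):
    """Small CNN mapping one panel image to an embedding (sizes are illustrative)."""
    def __init__(self, embed_dim=64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(32, embed_dim)

    def forward(self, x):                       # x: (batch, 1, H, W)
        return self.fc(self.conv(x).flatten(1))  # (batch, embed_dim)

class RelationModule(nn.Module):
    """Relation-network-style scoring of a set of 9 panel embeddings."""
    def __init__(self, embed_dim=64, hidden=128):
        super().__init__()
        self.g = nn.Sequential(nn.Linear(2 * embed_dim, hidden), nn.ReLU())
        self.f = nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(),
                               nn.Linear(hidden, 1))

    def forward(self, embeddings):              # embeddings: (batch, 9, embed_dim)
        b, n, d = embeddings.shape
        # Consider all ordered pairs of panel embeddings, then aggregate them.
        e_i = embeddings.unsqueeze(2).expand(b, n, n, d)
        e_j = embeddings.unsqueeze(1).expand(b, n, n, d)
        pairs = torch.cat([e_i, e_j], dim=-1).reshape(b, n * n, 2 * d)
        relations = self.g(pairs).sum(dim=1)
        return torch.sigmoid(self.f(relations))  # one score per candidate panel

def score_candidates(encoder, relation, context, candidates):
    """context: (batch, 8, 1, H, W); candidates: (batch, 8, 1, H, W)."""
    ctx_emb = torch.stack([encoder(context[:, i]) for i in range(8)], dim=1)
    scores = []
    for k in range(candidates.size(1)):         # one pass per answer choice
        cand_emb = encoder(candidates[:, k]).unsqueeze(1)
        panel_set = torch.cat([ctx_emb, cand_emb], dim=1)  # 8 context + 1 candidate
        scores.append(relation(panel_set))
    return F.softmax(torch.cat(scores, dim=1), dim=1)      # distribution over answers

if __name__ == "__main__":
    enc, rel = PanelEncoder(), RelationModule()
    ctx = torch.randn(2, 8, 1, 80, 80)
    cand = torch.randn(2, 8, 1, 80, 80)
    print(score_candidates(enc, rel, ctx, cand).shape)      # torch.Size([2, 8])
```

Scoring each candidate panel independently and only comparing the scores at the softmax stage is what allows the model to evaluate the viability of each potential answer in parallel, as described above.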
The experiments conducted by the DeepMind team used a series of PGM datasets with different deep neural network models such as CNNs, Long Short-Term Memory (LSTM) networks, ResNets, and the new WReN. The results showed that WReN was able to outperform the other architectures, but all of them exhibited different levels of abstract reasoning.
To pass the PGM experiment, a deep neural network needed to be able to solve complex visual reasoning questions, and to do so, it needed to induce and detect from raw pixel input the presence of abstract notions such as logical operations and arithmetic progressions, and apply these principles to never-before-observed data. The WReN architecture was able to excel at those tasks partly because it models the relationships between different parts of the input from the first layers of the network.
The DeepMind experiment produced a lot of interesting results that could help us understand how deep neural networks abstract knowledge. For instance, the different models generalized relatively well when required to reason using attribute values ‘interpolated’ between previously seen attribute values, and also when applying known abstract relations in unfamiliar combinations. That wasn’t the case in ‘extrapolation’ scenarios, in which attribute values in the test set did not lie within the same range as those seen during training. An example of this occurs in puzzles that contain dark-colored objects during training and light-colored objects during testing. Despite the variety of results in the experiment, one thing was clear: neural networks exhibit primitive forms of abstract reasoning.
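As a rough illustration of the interpolation and extrapolation regimes, the sketch below splits a list of attribute values (for example, color intensities) into training and test sets in two different ways. The specific split rules and the ten-value range are assumptions made here for clarity; the paper defines its held-out regimes in more detail.

```python
def split_attribute_values(values, regime):
    """Return (train_values, test_values) for a sorted list of attribute values."""
    values = sorted(values)
    mid = len(values) // 2
    if regime == "extrapolation":
        # Train on the lower half (e.g. dark colors), test on the upper half
        # (light colors): test values lie outside the range seen during training.
        return values[:mid], values[mid:]
    if regime == "interpolation":
        # Train on alternating values, test on the ones in between:
        # test values fall inside the training range.
        return values[0::2], values[1::2]
    raise ValueError(f"unknown regime: {regime}")

if __name__ == "__main__":
    intensities = list(range(10))                 # e.g. 10 color intensities
    for regime in ("interpolation", "extrapolation"):
        train, test = split_attribute_values(intensities, regime)
        print(regime, "train:", train, "test:", test)
```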