登录查看更多内容

FPGAs and Deep Machine Learning

Avi Chami

Principal Application Engineer at Cadence

发布日期: 2016年9月15日

The concept of machine learning is not new. Attempts at systems emulating intelligent behavior, like expert systems, go as far back as the early 1980's. And the very notion of modern Artificial Intelligence has a long history. The name itself was coined at a Dartmouth College conference (1956), but the idea of an "electronic brain" was born together with the development of modern computers. AI as an idea accompanies us from the dawn of human history.

Three latest development are pushing forward "Machine Learning":

Powerful distributed processors
Cheap and high volume storage
High bandwidth interconnection to bring the data to the processors

As in many other fields, development of Machine Learning is also seeing development on algorithms that take advantage of the new hardware capabilities.

There are four types of algorithms used in machine learning:

Supervised - The vast majority of systems today. These systems are 'trained' based on past data on an attempt to predict future outcomes.
Unsupervised - These systems try to build models, by themselves, of the process analyzed.
Semi supervised - is a combination of the first two, where a small amount of data is 'labeled' (i.e. related to known training rules) and the machine uses this as a seed to label the rest of the data
Reinforcement - The algorithm creates its rules through trial and error.

According Wikipedia, Deep Learning is "a part of a broader family of machine learning methods based on learning representations of data. An observation (e.g., an image) can be represented in many ways such as a vector of intensity values per pixel, or in a more abstract way as a set of edges, regions of particular shape, etc. Some representations are better than others at simplifying the learning task".

Many Deep Learning solutions were based on the use of GPUs. However, FPGAs are being seen as a valid alternative for GPU based Deep Learning solutions.

The main reason for that is the lower cost and lower power consumption of FPGAs compared to GPUs in Deep Learning applications.

Microsoft adopted Altera Arria 10 devices for their Convolutional Neural Network (CNNs), estimating that the usage of FPGAs would increase their system throughput roughly at 70% with the same power consumption.

A recent article on Next Platform comments on how Baidu has also adopted FPGAs for deep learning solutions.

Teradeep is another company (startup) developing CNNs, and one among the first of those adopting FPGAs as an alternative to GPUs. In May this year Xilinx announced it invested in Teradeep and continue working closely together to optimize its technology.

For some time Altera has been pushing OpenCL for the implementation of Neural Networks. New devices from Altera and Xilinx are specifically oriented for distributed processing applications, efficient integration of FPGAs with high end processors and/or high bandwidth throughput (see my blog articles: Stratix 10MX – High memory bandwidth on SiP package and Intel announces Xeon processor with FPGA accelerator)

要查看或添加评论，请登录

Avi Chami的更多文章

Demystifying the Machine Brain: Neural Networks, Deep Neural Networks and Neuromorphic chips (*)

2024年4月22日

Demystifying the Machine Brain: Neural Networks, Deep Neural Networks and Neuromorphic chips (*)

The human brain, with its intricate web of neurons and synapses, has long inspired scientists and engineers. Artificial…
A little bit of portable storage media history

2020年12月18日

A little bit of portable storage media history

I found almost all these portable storage media in the trash, and I thought of recovering them for these photos. In a…

2 条评论
Introduction to NVMe and NVMe/TCP

2020年12月4日

Introduction to NVMe and NVMe/TCP

NVMe stands for Non-Volatile Memory access through the PCIe (PCI Express) bus. This standard defines how to access…

1 条评论
FPGA based Robot Motion Planning is lightning fast

2016年11月5日

FPGA based Robot Motion Planning is lightning fast

Motion planning is determining how a robot should move to achieve a goal; for example, a to move the arm of a robot to…

1 条评论
SoC FPGA for IoT Edge Computing

2016年9月30日

SoC FPGA for IoT Edge Computing

One of the reasons for the explosive growth of IoT is that embedded devices with networking capabilities and sensor…
FPGA Design Flow

2016年9月22日

FPGA Design Flow

Are you going to make an FPGA design? Are you asking yourself where to start, how to continue, and finish? These are…

1 条评论

See all articles

FPGAs and Deep Machine Learning

Avi Chami

Principal Application Engineer at Cadence

Avi Chami的更多文章

社区洞察

其他会员也浏览了

Maximizing Efficiency: Leveraging Neural Network Scalable Vector Graphics with AWS and Azure for Cost-Effective AI Visualization

Beyond Binary: 23 Leading-Edge Computing Concepts Shaping the Future of AI & Information Processing

Deep Dive into Deep Learning Frameworks: A Technical Perspective

AI Research News Updates: Issue 11 (Feb 2-Feb 13, 2022)

Unveiling the Tensor Dimensions: A Journey from Scalars to Higher-Dimensional Data in Machine Learning

AI Accelerators: Driving Efficiency and Performance in Machine Learning

Superfast Matrix-Multiplication-Free LLMs Are Finally Here

Unleashing the Power of Innovative Thinking in Machine Learning Research and Development

Deep Learning: Train fast and Pay less.

Generating AI Images Using Google Colab and Stable Diffusion Model

Avi Chami的更多文章

Demystifying the Machine Brain: Neural Networks, Deep Neural Networks and Neuromorphic chips (*)

A little bit of portable storage media history

Introduction to NVMe and NVMe/TCP

FPGA based Robot Motion Planning is lightning fast

SoC FPGA for IoT Edge Computing

FPGA Design Flow

社区洞察

其他会员也浏览了

Maximizing Efficiency: Leveraging Neural Network Scalable Vector Graphics with AWS and Azure for Cost-Effective AI Visualization

Beyond Binary: 23 Leading-Edge Computing Concepts Shaping the Future of AI & Information Processing

Deep Dive into Deep Learning Frameworks: A Technical Perspective

AI Research News Updates: Issue 11 (Feb 2-Feb 13, 2022)

Unveiling the Tensor Dimensions: A Journey from Scalars to Higher-Dimensional Data in Machine Learning

AI Accelerators: Driving Efficiency and Performance in Machine Learning

Superfast Matrix-Multiplication-Free LLMs Are Finally Here

Unleashing the Power of Innovative Thinking in Machine Learning Research and Development

Deep Learning: Train fast and Pay less.

Generating AI Images Using Google Colab and Stable Diffusion Model