Unleashing the Potential of AI Inference Engines - What is an AI inference engine?


Before looking at AI inference engines, it helps to first understand AI inference itself. Let's start there.

AI inference: AI inference refers to the process of using a trained machine learning model to make predictions or draw conclusions from new data.

AI inference : [ new data -> pre-trained model -> conclusion ]        
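To make the idea concrete, here is a minimal sketch of inference in code: a "pre-trained" model is applied to new data to produce a conclusion. The weights and bias below are hypothetical stand-ins for parameters that would be learned during a real training phase.

```python
# AI inference: feed new data to a pre-trained model -> conclusion.
# The weights here are illustrative stand-ins for trained parameters.

def predict(features, weights=(0.6, 0.4), bias=-0.5):
    """Pre-trained linear classifier: returns 1 if the weighted
    sum of the features exceeds the threshold, else 0."""
    score = sum(w * x for w, x in zip(weights, features)) + bias
    return 1 if score > 0 else 0

# Inference step: apply the learned knowledge to a new, unseen instance.
new_data = (0.9, 0.8)
print(predict(new_data))  # -> 1
```

No training happens here; inference is purely the forward application of already-learned parameters.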

AI inference time: Inference time, also known as inference latency or prediction time, is the amount of time a trained machine learning model takes to process a new input and generate a prediction or output. In other words, it is the time required to apply the model's learned knowledge to a single new instance of data.

AI inference time : prediction time        
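Inference time can be measured directly by timing one forward pass on a new input. The sketch below uses a trivial placeholder in place of a real model, so the absolute number is meaningless; the measurement pattern is what matters.

```python
import time

# Measure inference latency: wall-clock time for one prediction.
# model() is a trivial placeholder for a trained model's forward pass.

def model(x):
    return sum(x) / len(x)

start = time.perf_counter()
prediction = model([1.0, 2.0, 3.0])
latency = time.perf_counter() - start

print(f"prediction={prediction}, inference time={latency * 1e6:.1f} microseconds")
```

In practice you would average over many runs (and warm up the model first), since a single measurement is dominated by noise, caching, and startup effects.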


An AI inference engine, also known as an inference system or reasoning engine, is a component of an AI system that uses rules and logical reasoning to make predictions, draw conclusions, or generate insights from data.

AI inference engine : optimizes and accelerates the inference process (often on GPUs)        
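The classical, rules-and-logic sense of an inference engine can be sketched as a tiny forward-chaining loop: rules fire whenever their premises are all present in the fact base, adding new conclusions until nothing changes. The facts and rules below are purely illustrative, not from any real system.

```python
# A toy rule-based inference engine (forward chaining).
# Each rule is (set of premises, conclusion); facts are strings.

rules = [
    ({"has_fever", "has_cough"}, "flu_suspected"),
    ({"flu_suspected"}, "recommend_rest"),
]

def infer(facts, rules):
    """Repeatedly fire any rule whose premises are all satisfied,
    adding its conclusion to the fact base, until a fixed point."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if premises <= facts and conclusion not in facts:
                facts.add(conclusion)
                changed = True
    return facts

print(infer({"has_fever", "has_cough"}, rules))
```

Note how the second rule fires only because the first one added "flu_suspected": chained reasoning is what distinguishes an inference engine from a simple lookup.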


AI inference engines are also a key component of driver-assistance systems and autonomous vehicles, enabling lane-departure detection, collision avoidance, and other capabilities.

Types of AI inference engines

  • Rule-based inference engines
  • Bayesian inference engines
  • Fuzzy logic inference engines
  • Neural network inference engines
  • Genetic algorithm inference engines
  • Decision tree inference engines
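As one example from the list above, a decision-tree inference engine just walks a learned tree from root to leaf. The tree below is hand-written for illustration; in a real system its splits would be learned from training data.

```python
# Sketch of decision-tree inference: walk from the root to a leaf.
# The tree structure and thresholds are illustrative, not learned.

tree = {
    "feature": "temperature",
    "threshold": 30,
    "left": {"label": "stay_in"},   # temperature <= 30
    "right": {                      # temperature > 30
        "feature": "humidity",
        "threshold": 70,
        "left": {"label": "go_out"},
        "right": {"label": "stay_in"},
    },
}

def classify(node, sample):
    """Follow the branch chosen by each node's test until a leaf."""
    while "label" not in node:
        branch = "left" if sample[node["feature"]] <= node["threshold"] else "right"
        node = node[branch]
    return node["label"]

print(classify(tree, {"temperature": 35, "humidity": 60}))  # -> go_out
```

The other engine types in the list differ mainly in how they represent knowledge (probabilities, fuzzy membership degrees, network weights), but each performs the same job: mapping a new input to a conclusion.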


AI inference engines are being developed by

  • Technology companies: Google, Microsoft, IBM, Amazon, Meta & more
  • Startups: CognitiveScale, Ayasdi, and Numenta
  • Research institutions: universities and national labs
  • Open-source communities: TensorFlow, PyTorch


Nvidia has developed several AI inference engines, which are software libraries that are optimized for running AI models on Nvidia GPUs.

  1. TensorRT: TensorRT is an inference engine that is designed to optimize and deploy deep learning models for production environments. It can be used to accelerate a wide range of applications, including image and speech recognition, natural language processing, and recommendation systems.
  2. Triton Inference Server: Triton Inference Server is an open-source inference engine that is designed to support the deployment of deep learning models in production environments. It can be used to serve multiple models simultaneously and supports a wide range of input and output formats.
  3. DeepStream: DeepStream is an AI-based video analytics platform that includes an inference engine optimized for processing video data. It can be used to perform real-time object detection, classification, and tracking in video streams.
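To give a feel for how Triton Inference Server is used, here is a minimal sketch of a model configuration (config.pbtxt) that tells the server how to serve one model. The model name, platform, and tensor shapes are hypothetical placeholders, not taken from any real deployment.

```
name: "resnet50"
platform: "tensorrt_plan"
max_batch_size: 8
input [
  {
    name: "input"
    data_type: TYPE_FP32
    dims: [ 3, 224, 224 ]
  }
]
output [
  {
    name: "output"
    data_type: TYPE_FP32
    dims: [ 1000 ]
  }
]
```

Triton reads configurations like this from its model repository and then exposes the model over HTTP/gRPC, handling batching and concurrent requests on the server side.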

The first inference engine is often attributed to the expert system known as MYCIN, developed in the early 1970s by Edward Shortliffe and his team at Stanford University. MYCIN was designed to assist physicians in diagnosing and treating bacterial infections, and it used a rule-based system to make recommendations based on patient data and medical knowledge.
MYCIN was considered a groundbreaking achievement in the field of AI and paved the way for the development of more sophisticated expert systems and inference engines in the decades that followed.

Overall, AI inference engines are essential to modern AI systems and are likely to play an increasingly important role in shaping our technological future.


