What's the difference between artificial intelligence, machine learning, and natural language processing?

What's the difference between artificial intelligence, machine learning, and natural language processing?

It’s almost harder to understand all the acronyms that surround artificial intelligence (AI) than the underlying technology. Couple that with the different disciplines of AI as well as application domains and it’s easy for the average person to tune out and move on.

Below we attempt to explain the important parts of artificial intelligence and how they fit together. At Sonix we are specifically focused on automatic speech recognition so we explain the key technologies with that in mind.

First let’s start with some of the most commonly used acronyms and their definitions:

  • Artificial Intelligence (AI) - the broad discipline of creating intelligent machines
  • Machine Learning (ML) - refers to systems that can learn from experience
  • Deep Learning (DL) - refers to systems that learn from experience on large data sets
  • Artificial Neural Networks (ANN) - refers to models of human neural networks that are designed to help computers learn
  • Natural Language Processing (NLP) - refers to systems that can understand language
  • Automated Speech Recognition (ASR) - refers to the use of computer hardware and software-based techniques to identify and process human voice

Artificial intelligence (AI) is the overarching discipline that covers anything related to making machines smart. Whether it’s a robot, a refrigerator, a car, or a software application, if you are making them smart, then it’s AI. Machine Learning (ML) is commonly used alongside AI but they are not the same thing. ML is a subset of AI. ML refers to systems that can learn by themselves. Systems that get smarter and smarter over time without human intervention. Deep Learning (DL) is ML but applied to large data sets. Most AI work now involves ML because intelligent behavior requires considerable knowledge, and learning is the easiest way to get that knowledge. The image below captures the relationship between AI, ML, and DL.

There are many techniques and approaches to ML. One of those approaches is artificial neural networks (ANN), sometimes just called neural networks. A good example of this is Amazon’s recommendation engine.  Amazon uses artificial neural networks to generate recommendations for its customers. Amazon suggests products by showing you “customers who viewed this item also viewed” and “customers who bought this item also bought”. Amazon assimilates data from all its users browsing experiences and uses that information to make effective product recommendations.

At Sonix we convert audio to text using machines. The principle underlying technologies are automated speech recognition (ASR) and natural language processing (NLP). ASR is the processing of speech to text whereas NLP is the processing of the text to understand meaning. Because humans speak with colloquialisms and abbreviations it takes extensive computer analysis of natural language to drive accurate outputs.

ASR and NLP fall under AI. ML and NLP have some overlap as ML is often used for NLP tasks. ASR also overlaps with ML. It has historically been a driving force behind many machine learning techniques.

In summary, DL is subset of ML and both are subsets of AI. ASR & NLP are fall under AI and overlap with ML & DL.


Vicki Morgan

Audio Production Manager at Family Stations, Inc.

6 年

I helped produce the Kim Komando podcast on this subject. Very interesting. https://bit.ly/2z8rzjy

Andrea Arenas

Bilingual Communications Specialist

6 年

Rodolfo didididiificiicicufidiiei

回复

Great article Jaime ....let's touch base and collaborate. I will be in the Bay Area after 25 December.

Brett Hill

Brett Hill - "The Mindful Coach", Hakomi Somatic Coaching Certificate, ICF, and Mindful/Somatic Coach Certifications—founder of The Mindful Coach Association and Host of The Mindful Coach podcast.

6 年

Nice summary. May I suggest a post about other companies using this in a practical way that are relatable.

要查看或添加评论,请登录

Jamie Sutherland的更多文章

  • Revolutionizing video production: Edit video by editing text!

    Revolutionizing video production: Edit video by editing text!

    If you are a video producer, documentary filmmaker, or anyone that edits spoken-word video, our new integration with…

    3 条评论
  • Sonix ranked 1st! Automated Transcription Services Compared:

    Sonix ranked 1st! Automated Transcription Services Compared:

    Manual methods of transcribing audio and video files have traditionally been a costly and time-consuming act. And the…

    1 条评论
  • Is Voice the Next Major UI Layer for Software?

    Is Voice the Next Major UI Layer for Software?

    While speech recognition technology has been around since the 1950s, recent advances in computing power and machine…

    2 条评论
  • A short history of speech recognition

    A short history of speech recognition

    There has been more progress in speech recognition technology in the last 3 years than in the first 30 years. Computing…

  • Sonix + Patreon

    Sonix + Patreon

    We are excited to announce that Sonix is the first and only audio-to-text transcription partner in Patreon’s newly…

    3 条评论
  • TIPS FOR CAPTURING GREAT AUDIO

    TIPS FOR CAPTURING GREAT AUDIO

    RECORD INDOORS IF AT ALL POSSIBLE In windy or noisy outdoor environments, you simply want to record most of your audio…

    2 条评论
  • 5 REASONS YOU SHOULD TRANSCRIBE YOUR PODCAST

    5 REASONS YOU SHOULD TRANSCRIBE YOUR PODCAST

    1. IMPROVE DISCOVERABILITY Think about how you currently search for answers on the web.

    1 条评论

社区洞察

其他会员也浏览了