Day 94 – Multi-Speaker Speech Separation and Recognition Using SpeechBrain

Day 94 – Multi-Speaker Speech Separation and Recognition Using SpeechBrain

We saw a post in the previous blog about SpeechBrain, Features, PreTrained models, and Speech Recognition On Different Languages By SpeechBrain.

Today, we are going to see in detail about Multi-Speaker Separation and Recognition.

What is Multi-Speaker Separation and Recognition?

When you were listening to audio and found that there were many people talking on the audio.?However, you want to hear audio from a particular person.?This feature requires high-end software or need to work with sound engineers or audio professionals to extract only the voice which you want. The emergence of Artificialy Intelligence brings this task very easy in just 13 lines of code and produce multi-speaker separation.

Let’s get into a code to check simple Multi-Speaker Separation and Recognition.

I have used SpeechBrain Pretrained models and audio files and downloaded mixed audio files (Audacity) from Azure Github.

To check my full code in Google Colab as well as here.

#Install Torchaudio, SpeechBrain and Transformers
!pip install torchaudio==0.8.1 #Temporary (until pytorch 0.9 is supported in Colab)
!pip install speechbrain        
!pip install transformers'
        
#Import all libraries
import speechbrain as sb
from speechbrain.dataio.dataio import read_audio        
from IPython.display import Audio
        
#Download pretrained SepformerSeparation from SpeechBrain

from speechbrain.pretrained import SepformerSeparation as separator

model = separator.from_hparams(source="speechbrain/sepformer-wsj02mix", savedir='pretrained_models/sepformer-wsj02mix')        
est_sources = model.separate_file(path='speechbrain/sepformer-wsj02mix/test_mixture.wav')

        

Continue reading...

Further Reading

Posts on?Artificial Intelligence,?Deep Learning,?Machine Learning, and?Design Thinking?articles:

Autonomous Vehicle Environment Perception Task By Pandas Team

Deep Learning Basics: Introduction, Concepts, and Overview

TextStyleBrush by Facebook AI Research Team

Watercolor Painting Under 5 Mins Using Stylized Neural Painting Artificial Intelligence

Marker Pen Painting Under 5 Mins Using Stylized Neural Painting Artificial Intelligence

Detail Sentence Analyzer Using spaCy Natural Language Processing – Part II

Named Entity Recognition Using spaCy Natural Language Processing – Part III

AI Converts Your Photo Or Video Selfie To Animation – GANsNRoses

Artificial Intelligence Chatbot Using Neural Network and Natural Language Processing

Tensorflow Sobel Filter Image Processing and Computer Vision

Oil Painting Under 5 Mins Using Stylized Neural Painting Artificial Intelligence

COCO Keypoint Detection Detectron2 Computer Vision by Facebook AI Research (FAIR)

Translate 125 Plus Languages Using Google Artificial Intelligence – Part 1

Translate 125 Plus Languages Using Google Artificial Intelligence – Part 2

Posts on SAP:

How to Transform Your Business with SAP Data Intelligence?

SAP AI Business Services – Business Entity Recognition

SAP AI Business Services – Document Information Extraction

SAP AI Business Services – Service Ticket Intelligence

SAP Intelligent Robotic Process Automation, Use Case, Benefits, and Available Features

SAP Conversational AI

SAP AI Business Services

要查看或添加评论,请登录

社区洞察

其他会员也浏览了