Role of Sparse matrix in machine learning
Deepak Kumar
Why read this?
Large sparse matrices are common in general, and especially in applied machine learning: in data that contains counts, in data encodings that map categories to counts, and in whole subfields of machine learning such as natural language processing.
Technical explanation
A sparse matrix is a matrix in which most of the elements are zero. Storing and operating on it as if it were dense incurs significant space and time overheads.
There is no strict definition of how many elements must be zero for a matrix to be considered sparse, but a common criterion is that the number of non-zero elements is roughly equal to the number of rows or columns.
The sparsity of a matrix can be quantified with a score: the number of zero values in the matrix divided by the total number of elements in the matrix.
sparsity = number of zero elements / total number of elements
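As a minimal sketch of this score (using NumPy and a made-up 3×6 matrix), the count of zero elements divided by the total element count can be computed directly:

```python
import numpy as np

# Hypothetical example: a small matrix where most entries are zero
A = np.array([
    [1, 0, 0, 1, 0, 0],
    [0, 0, 2, 0, 0, 1],
    [0, 0, 0, 2, 0, 0],
])

# sparsity = number of zero elements / total number of elements
sparsity = 1.0 - np.count_nonzero(A) / A.size
print(f"sparsity = {sparsity:.2f}")  # 13 zeros out of 18 elements -> ~0.72
```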
Origin of sparse matrix in ML
While they occur naturally in some data collection processes, more often they arise when applying certain data transformation techniques such as one-hot encoding of categorical variables, count encoding, and TF-IDF encoding of text, as in the sketch below.
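As an illustration (a sketch assuming scikit-learn; the toy corpus is made up), text vectorisation already returns its result as a SciPy sparse matrix, because most words never appear in most documents:

```python
from sklearn.feature_extraction.text import CountVectorizer

# Hypothetical toy corpus: each document uses only a few words
# out of the whole vocabulary, so the count matrix is naturally sparse.
corpus = [
    "sparse matrices save memory",
    "machine learning loves sparse data",
    "dense matrices waste memory on zeros",
]

X = CountVectorizer().fit_transform(corpus)  # returns a SciPy sparse (CSR) matrix
print(X.shape, X.nnz)  # (3, vocabulary size) and the number of stored non-zeros
```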
Efficient data structures
There are many data structures for storing sparse matrices, such as the coordinate (COO), compressed sparse row (CSR), and compressed sparse column (CSC) formats. MATLAB stores sparse matrices in compressed sparse column format.
It is usually best to use a library for sparse matrix calculations, since libraries provide efficient implementations that are suitable for most cases.
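A minimal sketch with SciPy (the small dense matrix is made up) shows what the compressed sparse row/column formats store, and how arithmetic only touches the stored elements:

```python
import numpy as np
from scipy import sparse

# Dense matrix with mostly zero entries
dense = np.array([
    [0, 0, 3, 0],
    [4, 0, 0, 0],
    [0, 0, 0, 5],
])

csr = sparse.csr_matrix(dense)  # compressed sparse row: fast row slicing and matrix-vector products
csc = sparse.csc_matrix(dense)  # compressed sparse column: the format MATLAB uses internally

print(csr.data)     # stored non-zero values: [3 4 5]
print(csr.indices)  # column index of each stored value
print(csr.indptr)   # row pointer array

# Arithmetic stays in sparse form and only visits stored elements
y = csr @ np.array([1, 2, 3, 4])
print(y)  # [ 9  4 20]
```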
Use in machine learning
In recommender systems, we typically work with very sparse matrices: the item universe is very large, while a single user typically interacts with only a tiny subset of it.
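A sketch of how such a user-item matrix is typically built (the interaction triples and catalogue size are hypothetical), using SciPy's COO format for construction and CSR for computation:

```python
import numpy as np
from scipy import sparse

# Hypothetical interaction log: (user_id, item_id, rating) triples.
# Each user touches only a handful of items out of a large catalogue.
users   = np.array([0, 0, 1, 2, 2])
items   = np.array([10, 42, 7, 42, 99])
ratings = np.array([5.0, 3.0, 4.0, 2.0, 5.0])

# COO is convenient for construction; convert to CSR for computation.
interactions = sparse.coo_matrix(
    (ratings, (users, items)), shape=(3, 100)
).tocsr()

print(interactions.shape)  # (3, 100) -> 300 cells in total
print(interactions.nnz)    # only 5 of them are stored
```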
In image processing, a sparse matrix is a way of representing an image as a matrix in which most of the elements are zero; only the non-zero pixel values are stored, which avoids unnecessary processing of zero-valued pixels. This paper talks about it in detail.
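As a rough sketch (the mostly-black image below is synthetic), storing such an image in a sparse format means only the non-zero pixels are kept and operated on:

```python
import numpy as np
from scipy import sparse

# Hypothetical mostly-black grayscale image: only a few bright pixels.
image = np.zeros((100, 100), dtype=np.uint8)
image[10, 20] = 255
image[50, 60] = 128
image[75, 5]  = 64

sparse_image = sparse.csr_matrix(image)

# Only the 3 non-zero pixels are stored and processed, not all 10,000 cells.
print(sparse_image.nnz)                # 3
print(sparse_image.multiply(0.5).nnz)  # scaling touches only the stored pixels
```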
Relevance in neural networks
Sparse networks, that is, neural networks in which a large subset of the model parameters are zero, have emerged as one of the leading approaches for reducing model parameter count.
It has been shown empirically that deep neural networks can achieve state-of-the-art results under high levels of sparsity (Han et al., 2015; Louizos et al., 2017; Gale et al., 2019), and this property has been leveraged to significantly reduce the parameter footprint and inference complexity (Kalchbrenner et al., 2018) of densely connected neural networks. This paper talks about it in detail. This is another relevant article.
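A minimal sketch of magnitude-based pruning, the simplest way such sparse networks are obtained (in the spirit of Han et al., 2015), assuming plain NumPy weights rather than any particular framework:

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude weights until the target sparsity is reached."""
    k = int(sparsity * weights.size)                      # number of weights to remove
    threshold = np.sort(np.abs(weights), axis=None)[k]    # k-th smallest magnitude
    return np.where(np.abs(weights) < threshold, 0.0, weights)

rng = np.random.default_rng(0)
W = rng.normal(size=(256, 256))          # hypothetical dense weight matrix

W_sparse = magnitude_prune(W, sparsity=0.9)  # keep only the largest ~10% of weights
print(1.0 - np.count_nonzero(W_sparse) / W_sparse.size)  # ~0.9
```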
For transformer NLP models, memory and time complexity are a big issue. OpenAI used sparse factorisation of the attention matrices, which reduced the network complexity from O(N²) to O(N√N). This allows OpenAI to "model sequences with tens of thousands of elements using hundreds of layers," compared to other networks that can only handle sequences of "a few thousand elements."
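To get a feel for the difference (an illustrative back-of-the-envelope calculation, not OpenAI's code), for a hypothetical sequence of 16,384 elements the dense attention cost is about 128 times the factorised one:

```python
import math

N = 16_384                       # hypothetical sequence length
dense_cost  = N ** 2             # full attention: O(N^2)
sparse_cost = N * math.isqrt(N)  # factorised attention: O(N * sqrt(N))

print(dense_cost // sparse_cost)  # dense attention does ~128x more work here
```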
References
Thanks to these helping hands
https://machinelearningmastery.com/sparse-matrices-for-machine-learning/
https://dziganto.github.io/Sparse-Matrices-For-Efficient-Machine-Learning/
https://en.wikipedia.org/wiki/Sparse_matrix
https://www.quora.com/What-is-the-fastest-sparse-matrix-data-structure-for-speeding-up-multiplication-of-a-1000-x-1000-there-about-double-valued-matrix
https://towardsdatascience.com/why-we-use-sparse-matrices-for-recommender-systems-2ccc9ab698a4
https://arxiv.org/pdf/1906.10732.pdf
https://analyticsindiamag.com/rigl-google-algorithm-neural-networks/
https://www.ijert.org/research/image-interpolation-using-sparse-matrix-representation-IJERTV4IS020434.pdf
https://www.researchgate.net/publication/258499564_Sparse_Matrix_Factorization
https://www.infoq.com/news/2019/05/openai-sparse-transformers/
https://images.app.goo.gl/5senKK5qmieqzVNcA
https://matteding.github.io/2019/04/25/sparse-matrices/
https://images.app.goo.gl/oV7jkjPbPvxYtJCY6