Decomposition of data analytics
Data analytics is a combination of data science, deep learning and big data (7 TB and up)
Data science: math, structured data (text) and machine learning through algorithms
Deep learning: images, video and unstructured data (text); used for prediction of behaviour
Big data: distributed storage and distributed computation
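Best tools now (2017):
Data science: Python with the Spyder IDE and the sklearn (scikit-learn) package (started as a Google Summer of Code project)
Deep learning: TensorFlow (Google)
Big data: Apache Spark (Apache Software Foundation project; IBM is a major backer)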
Problems data analytics solves
Problems are of 2 types:
1. Predictive analysis
- classification
- regression
- recommendation
2. Descriptive analysis
- clustering
- anomaly detection
- dimensionality reduction
- market-basket analysis
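
The difference between the two types can be sketched in a few lines of plain Python. Predictive analysis learns from labeled examples and answers a question about a new case; descriptive analysis has no labels and only summarizes the structure of the data. The numbers and the helper names (fit_line, two_means) below are made up for illustration; in practice a package such as sklearn would do this work.

```python
import statistics

# Predictive (regression): labeled pairs, hours studied -> exam score.
# The values are invented for this sketch.
hours = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = [52.0, 54.0, 61.0, 63.0, 70.0]

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, my - slope * mx

slope, intercept = fit_line(hours, scores)
predicted = slope * 6.0 + intercept  # prediction for an unseen case (6 hours)

# Descriptive (clustering): unlabeled values, we only look for groups.
values = [1.0, 1.2, 0.8, 9.0, 9.5, 10.1]

def two_means(data, iterations=10):
    """Tiny 1-D k-means with k=2, starting from the min and max values."""
    centers = [min(data), max(data)]
    groups = [[], []]
    for _ in range(iterations):
        groups = [[], []]
        for v in data:
            nearest = 0 if abs(v - centers[0]) <= abs(v - centers[1]) else 1
            groups[nearest].append(v)
        # Keep the old center if a group happens to be empty.
        centers = [statistics.mean(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

centers, groups = two_means(values)

print("predicted score for 6 hours:", predicted)
print("cluster centers:", centers)
```

The regression part needs the scores (the labels) to learn from; the clustering part never sees a label and only reports which values sit together.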
1.1 Classification:
- majority
- random
- human discovery
- machine discovery
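
One possible reading of the four approaches above, sketched in plain Python: "majority" always predicts the most frequent class, "random" guesses a class, "human discovery" is a rule written by a person, and "machine discovery" is a rule learned from the data. This reading, the toy temperature data and all function names are assumptions made for illustration, not something these notes define. The learned rule is deliberately simple (one threshold, with the "warm above, cold below" direction fixed in advance).

```python
import random
from collections import Counter

# Toy labeled data: (temperature in C, label). Invented values.
train = [(12, "cold"), (15, "cold"), (18, "cold"), (22, "warm"),
         (25, "warm"), (14, "cold"), (27, "warm")]
test = [(10, "cold"), (16, "cold"), (24, "warm"), (30, "warm")]

def majority_classifier(data):
    """Majority: always answer with the most frequent training label."""
    most_common = Counter(label for _, label in data).most_common(1)[0][0]
    return lambda x: most_common

def random_classifier(data, seed=0):
    """Random: pick any known label, ignoring the input."""
    rng = random.Random(seed)
    labels = sorted({label for _, label in data})
    return lambda x: rng.choice(labels)

def human_rule(x):
    """Human discovery: a person picks the threshold by hand."""
    return "warm" if x >= 20 else "cold"

def learn_threshold(data):
    """Machine discovery: try midpoints between sorted values, keep the best."""
    xs = sorted(x for x, _ in data)
    candidates = [(a + b) / 2 for a, b in zip(xs, xs[1:])]

    def hits(t):
        return sum(("warm" if x >= t else "cold") == y for x, y in data)

    best = max(candidates, key=hits)
    return lambda x: "warm" if x >= best else "cold"

def accuracy(clf, data):
    """Fraction of examples the classifier labels correctly."""
    return sum(clf(x) == y for x, y in data) / len(data)

for name, clf in [("majority", majority_classifier(train)),
                  ("random", random_classifier(train)),
                  ("human", human_rule),
                  ("machine", learn_threshold(train))]:
    print(name, accuracy(clf, test))
```

The majority and random classifiers are useful mainly as baselines: a human or machine rule is only worth keeping if it beats them. sklearn ships comparable baselines in sklearn.dummy.DummyClassifier.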
Coming Soon ....