Why is Python so popular in machine learning?
Python is like the swiss army knife of machine learning.
It is not the fastest. Scala, Julia is faster and most of the heavy lifting in Python is actually done by C or Fortran libraries backend. Neither is it the easiest to learn. R is easier for beginners. Rather, it is a general language that does a little of everything at a good enough complexity-performance tradeoff with a full suite of tools for productionizing machine learning.
Tutorials on machine learning with python
A sample process of machine learning deployment with python:
- Build prototypes on laptops with scikit-learn: machine learning in Python,NumPy, Python Data Analysis Library in an interactive Project Jupyternotebook with python plotting - Matplotlib visualization. Examples - List of interesting notebooks, more interesting notebooks, Probabilistic-Programming-and-Bayesian-Methods-for-Hackers.
- Train the full model on Hadoop cluster with PySpark and MLlib | Apache Spark.Example - train a recommender on spark
- Setup an API server with CherryPy - Minimalist Python Web Framework, build aFlask (A Python Microframework) app around the model and serve the model out as a REST api to other applications. Examples: Deploy recommender | Dockerized Data Science
Furthermore, advanced Python modules extend the capabilities:
Interactive plot extensions
Deep learning
- Deep learning framework by Google: https://www.tensorflow.org/
- TensorFlow + scikit-lean = skflow
- Pylearn2 + Theano
Natural Language Processing (NLP) / text analytics / parts of speech tagging
- Natural Language Toolkit
- https://spacy.io/
- TextBlob: Simplified Text Processing
- Pineapple: proycon/pynlpl
- gensim: topic modelling for humans
For more data related posts, visit my quora page.?
Senior Specialist Data Engineer at LTIMindtree
7 年i like python programming
Let's Build AI Agents Together! Let’s Learn Agentic Workflows
7 年In the end, it is only a tool and popularity depends a lot on the community. Yes, it is very popular among individuals with a computer science background or working in Internet-related/tech industries. However, among social scientists, academics, statisticians and anyone who only wants to do (advanced) data analysis and nothing more, definitely R enjoys more popularity. In addition, if you are in a corporate setting and you have to use Windows, Python is not pleasant to work with, specially compared with Windows-friendly alternatives such as R.
Data Scientist - NLP & Advanced Analytics
7 年Haritha C
Founder at Absolute Collateral
7 年Riley O'Shea