Data Scraping + Data Science | Reverse Engineering Case study
This is a short story about how to mix data scraping and data science.
The other day Yohn Jairo Parra Bautista from Florida A&M University invited me to give a talk about a project I did some years ago (thanks!).
Pablo Alejandro Seibelt did the scraping and I did the data science. The interesting point here is how two topics that you usually read about in separate places can be unified in one project (that you can be paid for).
I share the idea in case you want to explore it. The project was about reverse engineering: imagine selling a product on Amazon. What are the rules that lead a product to appear on the 1st page?
From theory to practice: one approach is to collect the results of several queries on the site (e.g. "shampoo", "soap"), then assign TRUE to the products appearing on the 1st page and FALSE to the rest.
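A minimal Python sketch of that labeling step, under stated assumptions: the `Listing` fields, the `label_first_page` helper, and the sample rows are all hypothetical and made up for illustration, not the project's real schema or data. The scraper is assumed to record which result page each product showed up on.

```python
from dataclasses import dataclass


@dataclass
class Listing:
    """One scraped search result (hypothetical fields)."""
    query: str
    product_id: str
    page: int       # result page the product appeared on (1 = first page)
    price: float
    rating: float


def label_first_page(listings):
    """Return one dict per listing with a boolean `first_page` target."""
    rows = []
    for item in listings:
        rows.append({
            "query": item.query,
            "product_id": item.product_id,
            "price": item.price,
            "rating": item.rating,
            "first_page": item.page == 1,  # TRUE on the 1st page, FALSE otherwise
        })
    return rows


# Made-up rows for illustration only, not data from the project
scraped = [
    Listing("shampoo", "A1", 1, 7.5, 4.6),
    Listing("shampoo", "A2", 3, 4.0, 3.9),
    Listing("soap", "B1", 1, 2.5, 4.8),
    Listing("soap", "B2", 2, 3.0, 4.1),
]

dataset = label_first_page(scraped)
for row in dataset:
    print(row["query"], row["product_id"], row["first_page"])
```

The resulting rows are the input for the classification model: `first_page` is the target, and the remaining fields are candidate predictors.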
Now you can extract the rules with a simple classification #ML model (based on price, rating, description, etc.). I created a random forest and then explored the most important variables with the cross_plot function (funModeling package in #Rstats) https://tinyurl.com/y94ql2p9
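To give a feel for the exploration step, here is a rough plain-Python analog of looking at one variable against the target: split the variable into bins and compute the share of first-page products in each bin. This is only a sketch, not the funModeling implementation and not a random forest; the `first_page_rate_by_bin` helper, the bin edges, and the sample rows are hypothetical.

```python
from collections import defaultdict


def first_page_rate_by_bin(rows, variable, edges):
    """Share of first-page products per bin of `variable`.

    `edges` are ascending upper bounds; values above the last edge
    fall into a final open-ended bin.
    """
    counts = defaultdict(lambda: [0, 0])  # bin label -> [first-page hits, total]
    for row in rows:
        value = row[variable]
        label = f"> {edges[-1]}"          # default: the open-ended top bin
        for edge in edges:
            if value <= edge:
                label = f"<= {edge}"
                break
        counts[label][0] += int(row["first_page"])
        counts[label][1] += 1
    return {label: hits / total for label, (hits, total) in counts.items()}


# Made-up rows for illustration only, not data from the project
rows = [
    {"rating": 4.8, "first_page": True},
    {"rating": 4.6, "first_page": True},
    {"rating": 4.1, "first_page": False},
    {"rating": 4.4, "first_page": True},
    {"rating": 3.9, "first_page": False},
    {"rating": 3.5, "first_page": False},
]

rates = first_page_rate_by_bin(rows, "rating", edges=[4.0, 4.5])
for label, rate in rates.items():
    print(label, rate)
```

A variable whose first-page rate changes a lot from bin to bin is a good candidate for one of the "rules"; a flat profile suggests it matters little on its own.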
A fun, profitable, and multi-disciplinary project you can work on as a freelancer.
Questions? I will read the comments.
#datascientist