Day 8 of 365: Data Collection and Understanding Data
Ajinkya Deokate
Data Scientist | Researcher | Author | Public Speaking Expert @PlanetSpark | Freelancer
Welcome to Day 8 of my #365DaysOfDataScience journey! ??
As we continue our data science journey, today we dive into the exciting world of data collection! ?? Whether it’s through APIs, web scraping, or grabbing CSVs from online databases, knowing how to collect data is essential for any aspiring data scientist. ??
?? What We’re Learning Today:
1. Data Collection Methods: APIs, web scraping, CSV files, and databases.
2. Structured vs. Unstructured Data: What’s the difference and why it matters in data science.
?? Learning Resources:
- Watch: A quick YouTube tutorial on data collection methods.
- Read: An introductory article on APIs and web scraping from Towards Data Science to understand how to gather data programmatically.
领英推荐
?? Today’s Task:
Let’s get practical! Today, we’ll collect our own dataset:
- Option 1: Pull data from a public API (e.g., Twitter, weather data).
- Option 2: Use Python libraries like requests and BeautifulSoup to scrape data from a webpage.
I’m going to try scraping a simple webpage for data – join me, and we can troubleshoot together if things get tricky! ??
Data collection is one of the first steps in any project, and the quality of your data makes all the difference. Let’s see what we can gather today! If you’ve done this before or have any tips, feel free to share them in the comments – we’re all in this together. ??
***
Extra Resources ??
Follow this video by freeCodeCamp for Web Scraping with Python
#DataScience #365DaysOfDataScience #LearningTogether #DataCollection #WebScraping #Python