Types of data

Machine learning algorithms often get the majority of attention when people discuss Artificial Intelligence; however, success depends on good data.

There are mainly two types of data

·????????Structured data, and

·????????Unstructured data.

Understanding data is critical to our success. If we build a model based on bad data, our predictions will be inaccurate. We should also think about what data to include in our machine learning application.

Identify Relevant Data: Structured Data and Unstructured Data

Business decisions must be made based on constantly changing data from various sources. Our data sources can include both traditional systems of record data (such as customer, product, transactional, and financial data) and external data (for example, social media, news, weather data, image data, or geospatial data). Also, many data structures are essential for analyzing information, including structured data and unstructured data.

Structured Data Sources

Structured data is generally stored in traditional relational databases and refers to data that has defined a certain length and a format. Most organizations have a large amount of structured data in their on-premises data centers. Here are some examples of structured data:

  • Sensor data: Examples include radio frequency identification (RFID) tags, smart meters, medical devices, and global positioning system (GPS) data.
  • Blog data: when servers, applications, networks, etc. work, they capture all kinds of data about their activity.
  • Point of Sale Data: When the cashier swipes the barcode of any product we purchase, all data associated with the product is generated.
  • Financial data: Many financial systems are now programmatic; they operate according to predefined rules that automate the processes.
  • Weather data: Sensors to collect weather data are deployed in towns, cities, and regions to collect data on things like temperature, wind, barometric pressure, and precipitation. This data can help meteorologists create hyperlocal forecasts.
  • Click Flow Data: Data is generated every time we click a link on a website. This data can be analyzed to determine customer behavior and purchasing patterns.

?Unstructured Data Sources?

Although unstructured data has an implicit structure, it does not follow a specified format. Unstructured data is still vastly underutilized by businesses and offers a great opportunity for monetization.

Cloud, mobile and social media have contributed to a huge increase in unstructured data. Here are examples of unstructured data:

  • Internal text of the company: Think about all the text in documents, journals, survey results, and emails. Corporate information today represents a significant percentage of textual information in the world.
  • Social media data: As the name suggests this data is generated from social media platforms, such as Facebook, Twitter, YouTube, LinkedIn, etc.
  • Mobile data: This includes text messages, notes, calendar entries, images, videos, and data entered into third-party mobile apps.
  • Satellite imagery: This includes weather data or data that the government captures in its satellite surveillance imagery.
  • Photographs and video: This includes security, surveillance, and traffic data.
  • Radar or Sonar Data: This includes vehicle, weather, and oceanographic data.

?

Rafeh Hameed

Data Analyst @ Ray Analytics , Data Science/ML/Python

3 年

Really good read sir, Thanks for sharing. Would appreciate more insight on different models with respect to real-life use-cases based on your valuable experience.

回复

要查看或添加评论,请登录

Waqas Ali - FCMA, CAIS, ADMA, CDS的更多文章

社区洞察

其他会员也浏览了