登录查看更多内容

Let's Get The Iron

Diego Manssur

Data Analyst | SQL | Tableau | Excel | Data Visualization

发布日期: 2024年9月9日

Mining and manufacturing can be a very complicated process. Besides finding a proper location to dig, professionals also need to separate the materials they want from the ones they don’t. In this case, iron is the desired element to obtain. The problem is that it’s surrounded by dirt, silica or sand.

In the next case, we’ll be analyzing data from a mining company called Metals R’Us using Python. Our main objective is to check that there’s no anomalies in the scientific process of obtaining iron. One of the most important values we’ll be looking at is ‘% Iron Concentrate’, which represents the iron’s purity.

The Data Set:

We’ll be working with a real dataset taken from March to September 2017. There are 24 columns and 737453 rows. Here’s a brief description of each column:

Date: Date-time stamp
% Iron Feed: % of Iron that comes from the iron ore that is being fed into the flotation cells
% Silica Feed: % of silica (impurity) that comes from the iron ore that is being fed into the flotation cells
Starch Flow: Starch (reagent) Flow measured in m3/h; this is a chemical used for the flotation operation
Amina Flow: Amina (reagent) Flow measured in m3/h; another chemical used for the flotation operation
Ore Pulp Flow: You feed ore as pulp to the flotation operation
Ore Pulp pH: pH scale from 0 to 14; You can accept it as a condition for chemical reactions to occur
Ore Pulp Density: Flotation feed solid density; density scale from 1 to 3 kg/cm3
Flotation Column 01 to 07 Air Flow: These 7 columns show air flow that goes into the flotation cell measured in Nm3/h; this is a condition required for flotation where air turns into bubbles in the pulp and the amount of air adjusts the surface area of these bubbles
Flotation Column 01 to 07 Level: These 7 columns show froth level in the flotation cell measured in mm (millimeters); this gives us the thickness of the floats in the flotation, the lower the level, the higher the grade of concentration
% Iron Concentrate: The product of the flotation process: % of Iron which represents how much iron is presented in the end of the flotation process (0-100%, lab measurement)
% Silica Concentrate: The product of the flotation process: % of silica (impurity) which represents how much iron is presented in the end of the flotation process (0- 100%, lab measurement)

Questions

Where there any anomalies or important events in the month of June?
What were the min and max of Flotation Level 05 in the first month compared to the last month?
When was the Ore Pulp pH at its max and when at its min?
What’s the correlation between % Iron Concentrate and % Silica Concentrate in the first and last month?

Installing data libraries

In order to analyze and visualize our data, first we need to install our different libraries. This will allow us not only to clean and run different commands in our dataset, but also create easy graphs and visualizations in order to find trends and insights.

The data libraries we’ll install are:

Pandas
Seaborn
Matplotlib

Reading the Data Set

With our libraries installed, we can import, read and look at our data to have a general understanding our rows and columns.

Fixing the Date column

Since we’ll be working with the Date, it’s important to make sure that the data type is the right one. We can check the date data type with the following lines.

As suspected, the date data type is str. This is something we need to change in order to manipulate and group our rows. ?Let’s fix it with the following code lines:

We can also use the one line of code to check the count, mean, min, max and other important values in your dataset.

Before we start our analysis, I think it’s important that we check the min and max dates of our data set. That can be easily done with the following lines:

1. Important events in June

?In order to analyze our data from June, we can create a different table for that month only. This will make things easier in the next steps.?

We will also select just the columns that are important for our analysis. This can be achieved by narrowing down our June table and creating another one from it.

Now that we have the values we need, we can create a series of graphs in order to have a quick look at our correlations or trends. Using the following line, we can get 16 graphs.

领英推荐

MODEL VALIDATION IN MINERAL RESOURCE ESTIMATION

MEHMET AL? AKBABA,QP,CPG (Geology) 1 年前

Mining for Mining Data with Python

Cynthia Clifford 1 年前

The Application of Soft Sensors in the Mining Industry

Ali Soofastaei 1 年前

By looking at the charts, it’s quite obvious that there’s nothing to be alarmed about in the month of June. But that’s in terms of visuals. We could also run a correlation code in order to see the numbers.

Again, there doesn’t seem to be any trend or important correlation. Our decimal numbers indicate the probability of values being related. In this case, most of our numbers have small decimals, with the biggest one being 0.302 (That’s a 30% correlation). It confirms what we saw in the graphs above.

Now, let’s see what are the highs and lows of each value during the first day of June. If we run a loop, we can create four different line graphs that will indicate the levels of our values.?

For the % Iron Concentrate, there was a decline around 11am. Curiously, there was also a spike on % Silica Concentrate at the same time. This makes sense, since those 2 values are the most important ones when obtaining iron. The purer the iron, the smaller the percentage of Silica Concentrate is.

For the Pulp pH and the Flotation 05 Level, there was a decline and a spike between 2 and 5pm, in that order.

2. Flotation Level 05: First and Last Month

To check what were the min and max Flotation Levels from the first and last month, we will need to create two new tables. One for the first month, and one for the last one.

Now that we have both tables, we can easily check the min and max of the Flotation Level 05 and compared them to see if there’s anything to be worried about.

Seeing the results, all the numbers are really close to each other. That suggests that our mining process has been steady and consistent.

3. Ore Pulp pH: Min and Max

Now let’s check the min and max levels of Ore Pulp pH in our entire data frame. This constitute checking the values in the 6 months of records that our data set offers. In this case, we’ll create a table with our important values and use it to make a heatmap of the Ore Pulp pH from March to September.

Our heatmap offers us a general representation of our Ore Pulp pH values and suggests that it’s been stable during those 6 months of data. But it’s hard to see what the exact values are, and when they happened. For this, we’ll use the following code:

Now we can see that our min is 8.753 and happened on March 15th. Our max is 10.8081, and happened on July 20th.

4. Correlation between % Iron Concentrate and % Silica Concentrate

?At this point it’s well known that in order to get pure iron, our % Iron Concentrate needs to be high and our % Silica Concentrate low. It would be interesting to verify this by creating a scatterplot that shows us this. We will make a graph for the first month and a graph for the last one.?

These two graphs look quite similar, which indicates that there’s no anomaly in our mining process. ?We can see most of the points that are on the right side of the chart are also at the bottom, and the ones at the top are mostly on the left. That indicates that the smaller % Silica Concentrate is, the bigger the % Iron Concentrate is.

Findings:

We didn’t find any anomalies or important events in the month of June.
The min for the Flotation Level 05 was 166.99 in March and 167.82 in September. The max for the Flotation Level 05 was 675.644 in March and 657.257 in September.
The Ore Pulp pH was at its min (8.75) on March 15th, and at its max (10.80) on July 20th.
The correlation between % Iron Concentrate and % Silica Concentrate was quite similar in the first and last month.

要查看或添加评论，请登录

Diego Manssur的更多文章

Take The Shot!

2024年10月9日

Take The Shot!

When we talk about sports, we usually think of classic activities like soccer, football, hockey or baseball. But in the…

6 条评论
Hired or Fired?

2024年9月16日

Hired or Fired?

The job market has changed a lot through the years. While traditionally, employees would spend most of their careers in…

1 条评论
Dribble Pass & Shoot!

2024年9月3日

Dribble Pass & Shoot!

Have you ever been nervous before your favorite team plays a game? Have you wondered what the chances of winning are? I…

5 条评论
Health is Wealth!

2024年8月26日

Health is Wealth!

Do you remember your last stay at the hospital? Was it pleasant? Did you wait a long time to get attention? We all have…
Welcome To Canada

2024年8月19日

Welcome To Canada

In the past 8 years, immigration laws and the education system have drastically changed in order to welcome a higher…

2 条评论
Where is The Money?

2024年8月15日

Where is The Money?

Ever since I was a kid, I’ve always been conscious about money and its role in life. Growing up in Ecuador, there was a…

3 条评论
Analyzing DoorDash Sales Throughout The Year

2024年7月31日

Analyzing DoorDash Sales Throughout The Year

Ever since we experienced a lockdown for the first time, food delivery services have appeared and increased rapidly…

4 条评论
Soulmates

2015年5月26日

Soulmates

Hi!! I'm happy to show you my film called "Soulmates". I hope you like it!

See all articles

社区洞察

Mining Engineering

What are the most common errors in sampling and assaying, and how can they be avoided?

Let's Get The Iron

Diego Manssur

Data Analyst | SQL | Tableau | Excel | Data Visualization

The Data Set:

Questions

Installing data libraries

Reading the Data Set

Fixing the Date column

1. Important events in June

领英推荐

2. Flotation Level 05: First and Last Month

3. Ore Pulp pH: Min and Max

4. Correlation between % Iron Concentrate and % Silica Concentrate

Findings:

Diego Manssur的更多文章

社区洞察

其他会员也浏览了

Iron Ore Incident Investigation: An Analysis of Manufacturing Processes

Simple ways to Incorporate Fragmentation analysis to your mine: Advantage in view

How can AI help mining companies to predict iron ore demand?

The Role of Software in Resource Estimation Calculations

The Gritty Reality of Mining: Not Your Average Assembly Line

Application of X-ray Diffraction Topography system in XX antimony industry

Process Mining Preprocessing Tasks

Representativeness and Generalizability in Geometallurgy

Analysis of Iron Ore mining Data Analyst Project for Metals R’ Us using Python

Exploratory Analysis of Froth Characteristics in Iron Flotation

The Data Set:

Questions

Installing data libraries

Reading the Data Set

Fixing the Date column

1. Important events in June

领英推荐

2. Flotation Level 05: First and Last Month

3. Ore Pulp pH: Min and Max

4. Correlation between % Iron Concentrate and % Silica Concentrate

Findings:

Diego Manssur的更多文章

Take The Shot!

Hired or Fired?

Dribble Pass & Shoot!

Health is Wealth!

Welcome To Canada

Where is The Money?

Analyzing DoorDash Sales Throughout The Year

Soulmates

社区洞察

其他会员也浏览了

Iron Ore Incident Investigation: An Analysis of Manufacturing Processes

Simple ways to Incorporate Fragmentation analysis to your mine: Advantage in view

How can AI help mining companies to predict iron ore demand?

The Role of Software in Resource Estimation Calculations

The Gritty Reality of Mining: Not Your Average Assembly Line

Application of X-ray Diffraction Topography system in XX antimony industry

Process Mining Preprocessing Tasks

Representativeness and Generalizability in Geometallurgy

Analysis of Iron Ore mining Data Analyst Project for Metals R’ Us using Python

Exploratory Analysis of Froth Characteristics in Iron Flotation