EDA

EDA

Exploratory Data Analysis also known as EDA is a way to dive into your data and try to understand it, learn from it and use it to help drive forward. Some of the basic things you can get from this are: what is the mean or average of the data, how many samples are there, how many missing data are there, what is the median, mode, standard deviation, variance, coefficient of variation, interquartile range, Q1, Q3, min, max, etc.

Why would you want to do this?

It's a way to gain knowledge of all the data that you have been collecting and it can be done altogether, broken up into various domains, lithologies, by variable, etc. Understanding patterns, relationships and characteristics of datasets is needed before diving in and doing further complex analysis and model building. Key assumptions have to be checked, further work might be needed, and it can help drive what methodologies and decisions are made down the road throughout the estimation and other geostatistical workflows.

Whenever I'm trying to teach someone about something that I know about (or I think I know LOL) I try to relate it to something they know. I've been told that I'm great at simplifying complex problems into details that can be understood by most audiences. Don't forget, someday I want to be a professor...that is in my long-term plan once I have all of these fascinating stories to relate mathematics, statistics, geostatistics back to. I love to teach but more importantly, I love it when someone that tells me they won't ever be able to understand it, finally understands. That is a game for me, that's what excites me, makes me happy is when someone is winning...I'm winning. I try various different ways with words, visuals, stories, etc. to break it down and try to make it relatable for them as well as in the form they like to learn in. Anyways getting off track.

So, what can I relate this to. Let's relate it to buying a car. Most people in their lives will buy a car at some point, even if you are like me and try to constantly make your brother, dad or husband do it for you...they always want you to learn how to do it, so you have to do it. And oh, when I'm driving through the parking lot and see the salesman start walking toward my car as I'm increasing speed trying to get away from them because I don't want to talk to them LOL! They always try to wheel and deal. And I know that you all have done the same thing! This week, I went and looked at a Chevy Traverse and this guy named Chewy came up to me trying to shmooze me stating "Hey do you work here"...Really? That isn't going to work dude. "Oh yeah we have lots of Traverse" and they only had three. Then he said "oh I've only been here a week" now I used to say the same thing when I was giving tours at Montana Tech because I knew 1) it made them laugh when I told them actually, I had been doing it for a year or 2) I got better scores since they felt I did a great job being so new. Anyways, back to this.

When you are buying a car, what sort of analysis are you doing? You first are browsing all the vehicles; you should sort of have in mind what you want but maybe you don't. You have colors you like and ones you really don't like. You may need a bigger car for your family or maybe you just want a nice simple commuter that has great gas mileage. You look at the car - is it new or used? How many miles does it have? What sort of miles to the gallon does it get on highway and in town? What safety features does it have? Was it owned by someone previous? Was it a rental previously? Is there anything missing? (sometimes those car lots like to hide things---my 2nd car a Pontiac Grand Am had been wrecked prior and the bracket that holds the lights was broken). Are there any dents? scratches? or mechanical issues? So just like data you are cautious with data that has significant quality issues.

When looking at fuel efficiency, engine size, mileage and safety ratings, this is similar to exploring data for distributions and spread of values or to see what the typical values are. Does the car suit your needs? Is the data in a format that suits your needs?

When looking at a car you may look at relationships like the trade-off between fuel efficiency and engine power. In EDA you look at relationships among variables and how or if they impact each other - any dependencies in data.

When buying a car it may lead to further hypotheses like "A car with higher mileage should be cheaper" or "Toyotas can have infinite mileage and last forever" (no, just me? ha ha my car battery lasted 10 years...please don't try that at home!). EDA also does similar allowing you to generate insights and further theories about your data building further steps, analysis and model building.

Before finalizing the purchase on a car, you need to make sure the car fits your requirements much like you need to ensure that the data is ready for modeling. You may need to "tune" the data by transforming, further domaining, sending it back, or cleaning it, similar to prepping a car for optimal use.

In summary whether you are buying a car or building a model, ensure that you use EDA to ensure quality, understanding all the features and prepare for making well-informed decisions. Don't end up like I did with my broken headlight bracket that eventually caused my blinker to emit the noise erratically on a trip from Butte, MT to Seattle, WA. At first, I thought I accidentally left on my blinker but when the sound would speed up or slow down and be intermittent, that was when I knew there was something wrong.

Next week, we'll talk about different types and techniques for EDA, what it means and what to look for. I find that throughout my career, sometimes I would walk around and see different employees run EDA but just as a checkbox exercise and maybe that was because they had a historic deposit that was set in stone, or they had a recipe, or they were told to do it or they didn't really know what they needed it for. So, next week, let's take a look and see how we can use this information for our benefit and to push us further down the road.

#WisdomWednesdayWithCW #Geostatistics

André Hanekom

Resource Geologist (MAusIMM,Pr.sci.Nat,MSc.Eng)

3 个月

Thank you Celeste Wilson , looking forward to your article next week ??

Fabricio Franco Msc. geochemistry., Esp.mining-metallurgy,PQR.CBRR

Especialista / Coordenador Geólogo na Ligga com experiência em modelagem de recursos e gest?o de projetos.

3 个月

Muito útil

要查看或添加评论,请登录

Celeste Wilson的更多文章

  • Variography: Post VI - Variogram Maps

    Variography: Post VI - Variogram Maps

    This week I was going to go into variogram models but someone commented and brought up a good point about sharing a bit…

    3 条评论
  • Variography: Post V - Experimental Points

    Variography: Post V - Experimental Points

    Experimental points are calculated semivariance values that are plotted on a graph to create a variogram. If you've…

    3 条评论
  • Variography: Post IV - The Range

    Variography: Post IV - The Range

    Last week we touched on the Sill, there is a lot more I could have shared about it and maybe will in the future but…

    3 条评论
  • Variography: Post III - The Sill

    Variography: Post III - The Sill

    Let's continue on with learning about Variography. This week's post is about the sill.

    2 条评论
  • Imposter Syndrome

    Imposter Syndrome

    We've all been there and if you haven't ever experienced it, you are lucky. .

    5 条评论
  • Wisdom Wednesday Links

    Wisdom Wednesday Links

    Before we move on with more talk about Variography, I've had a few people ask me if there is an easy way on LinkedIn to…

    1 条评论
  • Variography: Post II - The Nugget Effect

    Variography: Post II - The Nugget Effect

    Welcome to 2025! Hopefully the new year for you all is starting out in the right direction. I know some years start and…

  • Variography: Post I

    Variography: Post I

    Variography is one of the most important pieces of building a block model. I would say Data is most important because…

    12 条评论
  • Compositing

    Compositing

    If you are new to Wisdom Wednesday with CW, welcome -- I'm so glad you are here to read along to my crazy brain…

    2 条评论
  • Thanksgiving

    Thanksgiving

    Instead of the normal wisdom, I'm going to share some other wisdom since I feel that to be a "good human" in my…

    2 条评论

社区洞察

其他会员也浏览了