A series on EV Charger Utilization - P. 1 - Data Bias
One of my least favorite types of classes at Georgia Tech was Lab classes.? In my entire college career, I only got 2 C’s.? One was in Physics 3 (where I excelled at doing time dilation calculations but the probability of electron appearing in a certain location completely blew my mind) and the other was in an "Intro to Machine Design Lab" (or something like that).?
In a Lab course, the goal is to follow extremely specific directions to run experiments, record the data, and then write a detailed and highly structured report on the results.? The reason I got the C, and the reason I didn’t like the labs is that 1 of 2 core concepts is the importance of following directions.? For instance, in a technical report it’s extremely important that if you’re presenting a Figure, then the “Figure” label goes BELOW the figure; whereas if you are presenting a Table then the “Table” label goes ABOVE the table.? In my Intro to Machine Design class I was in charge of writing the final report, and I screwed this up, and it knocked off a whole letter grade. (And if you’re a fan of this newsletter you know that I don’t like adhering to typical “rules of writing”.)
But the other core concept taught in labs, and a valuable life and professional lesson, is understanding the importance of gathering and interpreting data.? I learned how bad data can’t simply be thrown out if it doesn’t fit the curve that all your other classmates got and how data bias (or statistical bias) can destroy the outcome of experiments, or worse, provide a result that is completely misleading.?From Wikipedia on statistical bias :
“Statistical bias, in the mathematical field of statistics , is a systematic tendency in which the methods used to gather data and generate statistics present an inaccurate, skewed or biased depiction of reality. Statistical bias exists in numerous stages of the data collection and analysis process, including: the source of the data, the methods used to collect the data, the estimator chosen, and the methods used to analyze the data.”
One of the more famous examples of a type of data bias is the “survivorship bias ” of?bomber planes in WW2:
"A famous and early example of survivorship bias involves planes returning from missions during World War Two. The military wanted to put armor on the aircraft to protect vulnerable spots. However, they couldn’t place armor everywhere because it would be too heavy.
They looked at the bullet holes on the planes that returned, the survivors in this example. The military’s first inclination was to reinforce locations with the most hits. That seems to make sense. However, Abraham Wald, a mathematician, realized that survivorship bias was at work here.
The surviving planes got hit in the observed locations and still returned. Consequently, strengthening these locations aren’t top priorities. Instead, it’s critical to infer the missing data about where the non-returning planes were hit. Wald realized they needed to reinforce the locations on returning planes that were not hit. Clearly, the aircraft that got hit in those areas did not return!"
So how does this matter to EV Charging?
One example I've thought about is how methods of EV charger utilization prediction may be impacted by free charging offered by Charge Point Operators (CPO) (as I posted about last week ). Another good lesson I learned in school (Econ 101) is that when something is free, it gets consumed more than if it is not free. And so if a company like Electrify America provides free charging to buyers who purchase a Volkswagen ID.4, then those buyers are more likely to use EA's DCFC units and not install L2 in their homes (or at least use the DCFC more than they would otherwise).
Which means that data we may see advertised about utilization rates may be artificially higher than they would be otherwise without free charging (aka DATA BIAS).
And what's great about thinking about these topics and posting them to LinkedIn is I actually get knowledgable people adding value in the comments. For instance, on that post, Ryan Prazen commented:
"There was a study conducted by Energetics and EVEVWATTS Dashboard which analyzed data from a self-selected set of EV owners nationwide (excluding Tesla's Supercharger network), there were approximately 2.4 million charging sessions between June 30, 2020, and June 30, 2023. Out of these sessions, about 957,265 were free, and 1,412,050 were paid."
and then Ewan Pritchard, PhD, PE , who works at Energetics commented below Ryan:
领英推荐
"you can slice the data on the EV WATTS dashboard in a variety of ways (free versus paid, DCFC or L2. Take a look "
And so by posting a question about data bias, I find more sources of data to help uncover any bias!
And it's not just DCFC where data bias can skew results.
Using that same data set I dug into one of the reports on L2 MUD and it closes with:
“MUD is a broad classification that comprises varying housing (and even parking) options. Different structures that are classified as MUDs may exhibit very different charging use patterns. For example, notably high EVSE utilization takes place at MUDs in the Mountain region and at MUDs in densely populated cities, whereas lower EVSE use occurs at MUDs in New England and in rural areas. Therefore, MUD locations could be better categorized or broken down into sub-categories, each with its own usage pattern.”
And so it appears that the report acknowledges that the data isn’t granular enough to really draw conclusions and so should be applauded for that.??
But…
Shouldn’t we know going into a data gathering exercise that MUD is a broad classification that comprises varying housing and parking options and different demographics in different regions of the country with different rates of EV ownership???
In other words, do we need to spend time and money collecting data that yields results that we could have determined using common sense? Which is related to my criticism of this study that analyzed user reviews of public DCFC and found that many users find them unreliable...which isn't news to anyone.
And sure, it can be argued I'm splitting hairs, but I'm trying to get at something more, which is that the EV Charging industry need to get better faster. We need to ask better questions and get better data and better analyze that data with a critical mind AND ALL INVOLVED NEED TO ACTUALLY DRIVE EVs.
I was going to end this thinking I was smart and use a play on the political term “All politics is local” and when I googled that phrase I found this book titled "All data are local" and from the heading it sounds exactly like the issue I’m highlighting: “How to analyze data settings rather than data sets, acknowledging the meaning-making power of the local.”
DATA SETTINGS RATHER THAN DATA SETS!
How we design, setup, and analyze our data gathering exercises is just as important - if not more - as the data we get from those experiments.??
Large scale reports on EV Charger utilization across the country are - IMO - irrelevant due to the varying rates of EV adoption, local demographics, usage patterns, driver needs, etc.
What matters is hyper local data and then what conclusions we can draw from the analysis of that hyper local data. This impacts what businesses get funded, where chargers get placed, and ultimately the rate of EV adoption. Bad data in, bad data out --> worse EV adoption because the companies installing and operating EV Chargers didn't succeed because their business plans were built on a foundation misleading conclusions due to data bias.
Let's all get better faster.
Transport Electrification and Solar Advocate, IT Guy, and Cat Dad. All views expressed are my own.
2 个月"...ALL INVOLVED NEED TO ACTUALLY DRIVE EVS." This! As an EV owner and charging consumer, this drives me crazy. We have government agencies and private companies funding projects being built and maintained by other companies and it seems that few have any real world EV experience.
IT Director - COMEX member - P&L Leader of Data and Cloud Platform
2 个月Post about Tesla situation in 2024 ; please give me your comment on my post - https://www.dhirubhai.net/posts/olivierlehe_rien-de-va-plus-chez-tesla-tesla-fait-activity-7235526735566368768-XOfA?utm_source=share&utm_medium=member_ios
Engineer - Innovator - Entrepreneur - Incubator - Investor - Next ?
2 个月Very interesting Chris Kaiser ! One of the questions I am pondering about are the use cases of FCDC charging. Eg How many users are bulk charging (>80%) vs an emergency charge that gets them home/office/hotel for a longer Level 1 / Level 2 charge. What is the user experience particularly during longer wait times ? Where might one find authentic information/analysis on this? Perhaps something you could cover in your series?
Advancing eMobility by strategically leveraging data to help Discover, Acquire, Plan, Deliver, and Generate Insights.
2 个月Really enjoy this. Data and the ability to read it has become the norm to function in large parts of our world. Data literacy needs to be prioritized moving forward!
Clean Energy Consultant; Board Member; Former CEO
2 个月Great post Chris Kaiser! This very accurate assessment of data interpretation misses in the EV space, also is compounded by where we are in adoption, and the effect of “crossing the chasm” to majority adoption. Much of what was done in getting from 0-1% EV adoption - prepaid fast charging, free workplace charging, certain destination charging - will not make sense in going from 1-10%, let alone beyond that to majority adoption. Investments in infrastructure to support EV chargers will generally increase electric rates, but this impact on rates varies widely by application. It’s critical that the choices we make today, actually drive value for EV owners of tomorrow.