登录查看更多内容

Aggregated Data Dilemma

Bill Schmarzo

Dean of Big Data, CDO Chief AI Officer Whisperer, recognized global innovator, educator, and practitioner in Big Data, Data Science, & Design Thinking

发布日期: 2017年2月14日

Okay, I am weird (tell me something that I don’t know, say most of my friends). For Christmas I wanted a Nike Apple Watch to go with my existing FitBit and Garmin fitness trackers (I look sort of like a cyborg in the photo below…which is always cool).

While I was intrigued by the ability to do all sorts of cool things on the Apple Watch (like take a phone call and talk into my wrist watch like Dick Tracy), the thing that most intrigued me was the ability to buy third-party apps that could yield detailed exercise and health data. I was hoping that this detailed exercise and health data could help me understand what effect particular behaviors or activities (or lack of particular behaviors and activities) were having on my overall health.

Why is this important to me? You can thank articles like “Unexpected Heart Attack Triggers” for my health and exercise anxiety. The article highlighted several things that can trigger a heart attack including:

Lack of sleep (definitely an issue, especially when I’m traveling so much)
Migraine Headaches (how can you work in technology and not have headaches)
Cold Weather (need to find more clients in warmer weather)
Big, Heavy Meals (with the exception of Chipotle, right?)
Getting Out of Bed in the Morning (see, I knew that was a big danger!!)
Alcohol (just like to drink a beer now and then)
Coffee (I drink Chai Tea Lattes, that’s technically not coffee, and I know that I shouldn’t admit that I drink Chai Tea Lattes)

So there are many items on that above list that could trigger a heart attack, and I enjoy many of the things on that list (like sleeping and eating and the occasional beer). Consequently, I thought I’d put my data science experience to work to monitor my exercise and diet behaviors and predict potential health outcomes.

Personal Fitness Analytics

I tested the downloadable data from each of the three devices. The Fitbit offered the easiest way to download my fitness data (and I have TONS of useful fitness and diet tracking suggestions if anyone at Fitbit, Garmin or Apple ever read this blog!!). The problem with the fitness data is that I can only get daily level data (see Table 1).

Table 1: Daily Fitness Tracking Data

I can add more external data to the aggregated fitness data (e.g., days of the week, days when I travel, how much I travel on those travel days) to come up with some simple plots.

For example, Figure 2 shows a visual correlation between the calories that I burn per step and the days that I travel. My assumption is that I burn more calories per step when I am doing something that requires more exertion (like running or climbing steps), so it makes sense that on days when I am traveling, I have less opportunities for highly exertive activities.

Figure 2: How Many Calories I Burn Per Step When Traveling

While this information is “interesting,” unfortunately, data at the aggregated daily level is not actionable. If I had more detailed or granular fitness data, I’d like to chart what happens to my heart rate (and related stress levels):

During an airplane flight
When racing through an airport to catch a connecting flight
Waking up very early in the morning while traveling
Immediately after eating a large meal
While I’m doing my taxes (I hate doing my taxes)

The problem is that the data provided by my fitness band is aggregated to a level what is not actionable. If I had my fitness data at 5 or 10-minute intervals, then I could more easily spot unusual health outcomes and determine (and eventually predict?) what behaviors (e.g., flying in an airplane, eating large meals, heavy exercise exertion, waking up extremely early) might be causing health concerns.

Power of Granular Data

Big Data and data science are all about granular data because valuable performance and behavioral nuances can be buried in the aggregated data. For example, the chart in Figure 3 shows how additional performance nuances are being uncovered as we transition from a 5-minute to a 1-minute and finally to a 5-second interval in the capture of the performance data.

Figure 3: Performance Nuances Uncovered in Granular Data

As the data gets more granular, the behavioral and performance nuances buried in the data start to surface. Data at the 5 minute and 1 minute intervals in Figure 3 tell you very little. Aggregated data is the anti-data science. Data at the 5-second interval highlights some potential performance concerns. In this example, data at the 5-second interval starts to become actionable.

For example, I might notice too sedentary of a heart rate whenever I sit too long on a cross-country flight or my stress level jumping whenever I get another “flight delayed” message while trying to catch a connecting flight. I might then learn to perform some in-seat exercises and walking around during those long flights, or practicing controlled breathing and some simple yoga when enduring yet another flight delay (SFO airport does have a yoga room, and now I know why).

Preparing for an IOT World of Granular Data

Understanding the challenges of capturing and analyzing real-time granular machine and device-generated data will become even more critical as we move into the Internet of Things (IOT), where hundreds of sensors are kicking off tens, hundreds or even thousands of data points per minute. This will force two specific challenges upon those of us coming from the more traditional human-generated big data world:

Real-time data capture and compression
Real-time analytics at the edge

For my fitness focus, I might need to expand my Personal Fitness Analysis to capture and analyze more of this detailed data in (near) real-time so that I can become aware of behaviors that are hurting or improving my health and fitness. Ultimately, my goal is to change my behaviors, but I need to understand (and quantify?) what behaviors lead to desirable health and fitness outcomes (e.g., improved blood pressure, lower weight, less stress).

--------------------

Thanks for taking the time to read my post. I’m fortunate that I spend most of my time with very interesting clients which fuel many of my topics. I hope that you are able to leave a comment or some thoughts about the blog. If you would like to read my regular blogs, please follow me on LinkedIn and/or Twitter.

In case you are interested, here are some of my favorite posts:

· Determining the Economic Value of Data

· The Big Data Intellectual Capital Rubik’s Cube

· How to Avoid “Orphaned Analytics”

· To Achieve Big Data’s Potential, Get It Into The Boardroom

· Vision Workshop

· Big Data Business Model Maturity Index (animation)

· How I’ve Learned To Stop Worrying And Love The Data Lake

I am the author of two Big Data books: “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”. I also teach the "Big Data MBA" at the University of San Francisco (USF) School of Management, where I was named the School of Management’s first Executive Fellow. The opportunity to teach at USF gives me the perfect petri dish to test new ideas and concepts both in the classroom and in the field with clients.

John Raley LACP, FSCP, CLTC, MBA

Owner, John Raley Agency - American Family Insurance

8 年

Love the slow down granular distinction. Once we see the results of the change in our habits we are free to re-design the results we want in our lives. I do like my hot yoga daily and it affords me to still play in the alumni basketball game at 57 that's where the 20 year olds think I am weird. Keep up your great work. JOHN

1 次回应

查看更多评论

要查看或添加评论，请登录

Bill Schmarzo的更多文章

Why Everyone Needs to Think Like a Data Scientist in Today’s Environment

2022年7月16日

Why Everyone Needs to Think Like a Data Scientist in Today’s Environment

The rise of data is driving an unprecedented wave of business opportunity across all business areas. However, with such…

39 条评论
Data Management Sessions at Dell Technologies World 2022

2022年4月25日

Data Management Sessions at Dell Technologies World 2022

Data, data everywhere…not a byte to use! As much as enterprises are getting ready to brace for the Data Decade, it is a…

18 条评论
Mastering the Data Economic Multiplier Effect and Marginal Propensity to Reuse

2021年6月6日

Mastering the Data Economic Multiplier Effect and Marginal Propensity to Reuse

Note: this blog introduces the concept of the Marginal Propensity to Reuse which is the primary driver behind the Data…

29 条评论
Data Science 2.0: From Analytic Outputs to Business Outcomes

2021年4月25日

Data Science 2.0: From Analytic Outputs to Business Outcomes

The “Data Science Learning Roadmap for 2021” in Figure 1 created by FreeCodeCamp does a great job of articulating the…

5 条评论
Data Science 2.0: From Analytic Outputs to Business Outcomes

2021年3月9日

Data Science 2.0: From Analytic Outputs to Business Outcomes

The “Data Science Learning Roadmap for 2021” in Figure 1 created by FreeCodeCamp does a great job of articulating the…

5 条评论
Digital Transformation Requires Redefining Role of Data Governance

2021年2月8日

Digital Transformation Requires Redefining Role of Data Governance

I’m overjoyed to announce the release of my latest book “The Economics of Data, Analytics, and Digital Transformation.”…

17 条评论
Master Machine and Human Learning to Win the Digital Transformation Wars

2021年1月18日

Master Machine and Human Learning to Win the Digital Transformation Wars

The “Economies of Learning” are more powerful than the “Economies of Scale” This may be my most powerful concept…

12 条评论
Crossing the Analytics Chasm with Nanoeconomics

2021年1月11日

Crossing the Analytics Chasm with Nanoeconomics

“I love it when a plan comes together” – John (Hannibal) Smith, The A Team One of the biggest challenges that I…

16 条评论
Ethical AI, Monetizing False Negatives and Growing Total Addressable Market

2020年12月28日

Ethical AI, Monetizing False Negatives and Growing Total Addressable Market

What if I told you that companies that don’t embrace Ethical AI are leaving significant amounts of “Money on the…

5 条评论
Mastering Nanoeconomics in the Era of Digital Transformation

2020年12月21日

Mastering Nanoeconomics in the Era of Digital Transformation

As I state in the opening paragraph of my new book “The Economics of Data, Analytics, and Digital Transformation”: “The…

11 条评论

See all articles

Aggregated Data Dilemma

Bill Schmarzo

Dean of Big Data, CDO Chief AI Officer Whisperer, recognized global innovator, educator, and practitioner in Big Data, Data Science, & Design Thinking

Personal Fitness Analytics

Power of Granular Data

Preparing for an IOT World of Granular Data

Bill Schmarzo的更多文章

社区洞察

其他会员也浏览了

Fitbit’s Data-Driven Leap: Making Wellness Accessible for All

'New Year, New Marketing'

WWDC 22: Apple Watch Will Have 16 Major Updates

Transforming Fitness Businesses for Success with Wearable Technology

OmniWatch UK (United Kingdom) Reviews (Warning!) Know ALL the Facts Before Buy!!

Bellabeat Breakdown: Data-Driven Insights for Women's Fitness ????♀??

Best Fitness Watch for Android 2025

Best Smartwatch for Health Monitoring

Fitbit Charge 6 Review: The Ultimate Fitness Companion

Best Smartwatches for Strength Training

Personal Fitness Analytics

Power of Granular Data

Preparing for an IOT World of Granular Data

Bill Schmarzo的更多文章

Why Everyone Needs to Think Like a Data Scientist in Today’s Environment

Data Management Sessions at Dell Technologies World 2022

Mastering the Data Economic Multiplier Effect and Marginal Propensity to Reuse

Data Science 2.0: From Analytic Outputs to Business Outcomes

Data Science 2.0: From Analytic Outputs to Business Outcomes

Digital Transformation Requires Redefining Role of Data Governance

Master Machine and Human Learning to Win the Digital Transformation Wars

Crossing the Analytics Chasm with Nanoeconomics

Ethical AI, Monetizing False Negatives and Growing Total Addressable Market

Mastering Nanoeconomics in the Era of Digital Transformation

社区洞察

其他会员也浏览了

Fitbit’s Data-Driven Leap: Making Wellness Accessible for All

'New Year, New Marketing'

WWDC 22: Apple Watch Will Have 16 Major Updates

Transforming Fitness Businesses for Success with Wearable Technology

OmniWatch UK (United Kingdom) Reviews (Warning!) Know ALL the Facts Before Buy!!

Bellabeat Breakdown: Data-Driven Insights for Women's Fitness ????♀??

Best Fitness Watch for Android 2025

Best Smartwatch for Health Monitoring

Fitbit Charge 6 Review: The Ultimate Fitness Companion

Best Smartwatches for Strength Training