Faking it so it's real: synthetic data
Photo credit: Markus Spiske, Pexels

Faking it so it's real: synthetic data

Think back to a time where you had to confide in someone about another person’s situation (or maybe it was really you talking about yourself without letting the other person know for sure). In either case, you couldn’t reveal all of the details or specifics otherwise you’d betray the other person’s trust in you. Perhaps struggling, you had to convey just enough details to accurately characterize the situation in hopes of, perhaps, getting good advice from the person in which you’re confiding. Can you relate to this? I’ve gone through this and sometimes my efforts to mask the details didn’t always give me confidence in what I was feeding my friend in order to get a nuanced piece of advice from them.

So let’s shift from an interpersonal situation to an analytical situation and you find yourself butting up against why synthetic data has business merits. From an economics and compliancy view, this speaks to efficiencies and trust. Synthetic data generation holds merit for companies who have to predict future, representative outcomes based on limited information while avoiding introducing biases or spilling the privacy beans from past cases.

Synthetic data supports better data sharing by removing private or sensitive information while preserving critical elements needed to inform predictive models. Synthetic data also supports creating more cases where sparse past observations exist. And subsequently, synthetic data generation supports scalable access to more representative data.

Zero-in on multiplying images from pictures that need to feed a computer vision model. I really like this video since it sets up some of the challenges while providing ?clear examples about creating derivative, yet representative images for use in training computer vision models. At one point, you’ll see the point of how one original image spawns four others which may be used in a training data set. It makes me think of a past chat with one of my friends in the automotive industry who told me: “We have to basically train an autonomous car to decide, on the fly, if that small beige object 75 yards ahead of the car is a small child or a plastic grocery bag blowing in the lane.”

From HR to Marketing to Manufacturing, there are viable uses for synthetic data . Drawing from my past, economic gains from this technology can look like:

… third parties safely evaluating another supplier’s data to see if the data are worth buying. Based on sharing data that is holistic, unbiased and representative of future cases, the supplier increases its representation to potential buyers for use by others.

… marketers stumbling into next best offer recommendations after doing internal analyses that reveal few, but highly valuable purchase scenarios that are realistic amongst its target audience. In this case synthetic data lends a hand to deriving different purchase permutations residing in a data warehouse. Early in my career, I had to design market research studies that tested reactions to new product offerings. We used conjoint analyses and we struggled with thinking through all of the high-value combinations to test with a sample of respondents.

While analyst communities project continued growth in this space, I find having a sensible understanding of what this new type of data does at ground level is always useful. I hope this you in a pragmatic, conversational way.

Nice article Lonnie. We knew that all data were not created equal - we had mode, variance and outliers that gave us a sense of the relative position. But with GPTs now churning out more info, I wonder when there will be an inflection between raw (original) and synthetic data? Does this make a case for knowing the source of the data (like the good old days of having a citation / reference)?

回复

要查看或添加评论,请登录

Lonnie Miller的更多文章

  • "Field Cheats" to Making the Case for Analytics

    "Field Cheats" to Making the Case for Analytics

    I read a quote recently about sales roles and the reminder that "you're not hired to sell - you're hired to win."…

    3 条评论
  • Goals-to-Action: Why Is This So Hard?

    Goals-to-Action: Why Is This So Hard?

    A dear friend and colleague told me recently I'm good at putting action to goals. I took the kind compliment at…

    3 条评论
  • Work Backwards

    Work Backwards

    In my career in working with numerous clients (and internal teams) I find a recurring pattern when it comes to getting…

    2 条评论
  • Asset uptime - we want predictive, but it's really hard! (Part II)

    Asset uptime - we want predictive, but it's really hard! (Part II)

    Last month I shared some field observations after working with manufacturing and transportation companies who sought to…

    3 条评论
  • Asset uptime - we want predictive, but it's really hard! (Part I)

    Asset uptime - we want predictive, but it's really hard! (Part I)

    I'm fresh from two consecutive weeks of being part of in-depth meetings centered around a singular (and costly!) topic:…

    4 条评论
  • My Favorite Discovery Questions

    My Favorite Discovery Questions

    Having been part of enterprise Sales organizations most of my career (in different capacities and industries), I am…

    8 条评论
  • Supply Chain Leaders Give Gritty Feedback

    Supply Chain Leaders Give Gritty Feedback

    It's nearly impossible to not grow and learn when surrounded by bright, motivated and experienced individuals who are…

    4 条评论
  • Buying A Car: June/July 2021

    Buying A Car: June/July 2021

    After my recent vehicle shopping and buying experiences over the last 30 days (yes, plural - one new / one used), I'm a…

    30 条评论
  • How to Forecast the Unplannable: Ensuring Manufacturing and Supply Chain Demand Planning Stability During COVID-19

    How to Forecast the Unplannable: Ensuring Manufacturing and Supply Chain Demand Planning Stability During COVID-19

    Authored by Adrian Crawford, Principal Industry Consultant, US Manufacturing, SAS Here’s a question: How do you…

    2 条评论
  • Well Done

    Well Done

    With our current global climate, I bet many of you can relate to phrases that I still hear on phone calls with friends…

    4 条评论

社区洞察

其他会员也浏览了