Party Buzz Kill: modifying data

Party Buzz Kill: modifying data

So Steve (SQL), Marsha (C), Bob (Python), and I (R) are at this party. We have TOTALLY cleared the room, especially now that Steve and I are deep into a debate about saving native data objects to disk versus storing data in a database.

I see my friend Monica enter from the kitchen, carrying a bowl full of punch. It's an awkward task and the fruity, sticky liquid is sloshing on the floor. Monica does data science, so I'm hoping she'll come to my assist. Sure enough, she places the punch bowl on the table and joins us.

Monica is a real person! She does

She listens for a minute, then interrupts the pointless debate between Steve and I. "People who are math aficionados" she says, "are a lot more comfortable generating datasets on-the-fly. People like me enjoy relying on the safety and reliability of importing a structured dataset we checked earlier!"

Photo by Tima Miroshnichenko:

Steve is happy to hear someone is on his side. Steve thinks I'm a knucklehead. There are many people who agree.

But Monica isn't done. "But you are correct - the question is technically two sides of the same coin."

"Sure, but there are advantages to not messing around with unnecessary overhead," I say. "Let's play with an example."

Read more about this exciting installment...

#rstats #sqlite #punchbowl

Helen Wall

LinkedIn [in]structor | Data Science Consulting

10 个月

I like generating my own datasets from public data sources. I will say, though, that it can become time-consuming to do this every time. So, I often stick to a dataset (especially ones that update to the latest data like APIs and FTP folders) until I get tired of it and want to try something new...

回复
Monika Wahi

Epidemiology & Biostatistics Consultant a/k/a Data Scientist | Exclusive and innovative solutions for data science challenges in public health, research and education

10 个月

Hey everyone reading this - it's worth it to go to the blog post, because there is a function there that will knock your socks off!!! Thank you for making this argument actually interesting, Mark Niemann-Ross!

回复

要查看或添加评论,请登录

Mark Niemann-Ross的更多文章

  • Documenting My Code ... For Me

    Documenting My Code ... For Me

    There are two signs of old age: old age, and ..

  • R Meets Hardware

    R Meets Hardware

    R is a programming language for statistical computing and data visualization. It has been adopted in the fields of data…

    2 条评论
  • Rain - Evapotranspiration = mm Water

    Rain - Evapotranspiration = mm Water

    "Eeee-VAP-oooo-TRANS-PURR-ation," I savor the word as I release it into our conversation. I'm still at the party with…

  • Party Buzz Kill: Data Storage

    Party Buzz Kill: Data Storage

    I'm at this party where Bob and Marsha and I are discussing the best languages for programming a Raspberry Pi. Bob…

    5 条评论
  • R Waters My Garden

    R Waters My Garden

    I'm at a party, and the topic of programming languages comes up. A quarter of the room politely leaves, another half…

    10 条评论
  • Caning and Naming

    Caning and Naming

    We've been back from Port Townsend for a week. Progress on the boat isn't as dramatic as it is when we're spending the…

    1 条评论
  • Irrigate with R and Raspberry Pi

    Irrigate with R and Raspberry Pi

    I’m working on my irrigation system. This requires a controller to turn it on and off.

    3 条评论
  • 5 Reasons to Learn Natural Language Processing with R

    5 Reasons to Learn Natural Language Processing with R

    Why learn R? Why learn Natural Language Processing? Here's five reasons..

    1 条评论
  • Performing Natural Language Processing with R

    Performing Natural Language Processing with R

    I recently released a course on Educative covering topics in Natural Language Processing. Different Learners -…

    1 条评论
  • Pi Day

    Pi Day

    For years, I've assumed Raspberry Pi Ltd would release new versions of the Raspberry Pi on Pi Day (March 14. Aka 3.

    3 条评论

社区洞察

其他会员也浏览了