The @taylorswift13 #datascience Competition vs @jedludlow
@bentaylordata

The @taylorswift13 #datascience Competition vs @jedludlow

When it comes to prediction @jedludlow doesn't mess around. Jed came out swinging on day one with a 99.37% accuracy on the private validation set! For a relative comparison Google's prediction engine is only pulling in 94.86%.

The original competition was posted here and is attempting to predict if a sub second clip of audio will be a viral success for Taylor Swift.

The future applications are fun, go get your $1M seed for a new big data application.

Have you ever heard a cord or chorus where you thought "Now that is good...". Well, with enough data and the latest machine learning algorithms couldn't a computer do that for you? Imagine using big data to consult artists in realtime in the studio as their iterate on their creations. 98% confidence that this is a killer chorus, but your intro needs some work...

There are less than 2 days left for the competition to close. To help you I have included a full working code solution with the data here:
https://taylorswiftdata.s3-website-us-east-1.amazonaws.com/data_comp.html

With that you should be able to get a solution uploaded and scored in <2 minutes. An online kaggle-style platform was stood up for realtime scoring and peer ranking here

I Still Don't Care

Why should I care? What is in it for me? 2 things.

(1) Visibility/Marketability:

Employers love this stuff. They watch you fight and use results to compare who is epic and who is boring. Win enough data competitions and employer's pupils will dilate when they look at you instead of squint as they determine where to place you in the pecking order of applicants.

(2) You Will LEARN Something

The top contestants are doing things Google's ensemble prediction API wasn't. They are all beating the traditional random forest by a large margin. If your submission is higher than the default code you will be in the know. I will make sure you are. You will learn from the best in the game and learn new skills you can take to your next problem.

Competition ends this Friday at midnight.

Keywords: Random Forest, Deep Learning, Data Competition, Big Data, Predictive Modeling, Big Data Analytics, Big Data Analysis.

Dave Sewell

CDO | CIO | CTO | Founder | Entrepreneur | Emerging Technology Strategist | Board Advisor | Building Innovation Leaders One Project At A Time

10 å¹´

Proof once again that the real data gurus are located right here on the Silicon Slopes...

Adam Flugel

Lead Data Scientist at Enova International

10 å¹´

"Employers love this stuff." Quoted for emphasis.

赞
回复
Marissa Saunders

I build and lead high-performing teams through curiosity, collaboration, and trust. Together we use data, AI and ML to make better decisions and improve business outcomes.

10 å¹´

Have to say, its kind of depressing to hit 97.8 percent accuracy and still feel like I'm overmatched. Nice job though, Jed! I'm interested to hear how you tackled this after the competition ends.

Jed Ludlow

Chief Technologist | High-Accuracy Inspection Systems | Energy Pipelines

10 å¹´

Thanks, Ben. I guess the cat's out of the bag now!

要查看或添加评论,请登录

??Jepson Taylor的更多文章

  • TROLL Voices 2020: Data Science & AI

    TROLL Voices 2020: Data Science & AI

    I am debuting the 1st annual TROLL Voices list, a collection of trolls. As COVID-19 continues to upend our lives, these…

    35 条评论
  • Bump in the night? AI can help

    Bump in the night? AI can help

    This story starts with a scary clown. Not pennywise, a real one.

    8 条评论
  • Let's talk imposter syndrome

    Let's talk imposter syndrome

    Talking to people who are wanting to break into the data science space many have opened up about their concerns and…

    44 条评论
  • How to recognize AI snake oil... in academics

    How to recognize AI snake oil... in academics

    Here is my 10-minute digest of a piece making the rounds online: "How to recognize AI snake oil":…

    42 条评论
  • Millions dead thanks to HIPAA privacy

    Millions dead thanks to HIPAA privacy

    I upset people with my data rants sometimes. Several weeks ago I made the point online that #HIPAA kills people.

    74 条评论
  • The AI Shitshow: Hype To Reality #69

    The AI Shitshow: Hype To Reality #69

    The AI hype wave has been followed by disappointment. Hire a data science team! Buy some GPUs! Get ready for AI! You…

    90 条评论
  • AI Replaces Appraisers

    AI Replaces Appraisers

    All the data that matters: What data actually matters for appraising a property? The number of bedrooms? The number of…

    6 条评论
  • AI Thinks Men Are Shallow

    AI Thinks Men Are Shallow

    The data doesn't lie. We started noticing this several years ago, first with some of the attraction data that was…

    21 条评论
  • Fiction Today Is Reality Tomorrow

    Fiction Today Is Reality Tomorrow

    My concept of reality has always had boundaries. Moving outside of those boundaries in the past has been classified as…

    16 条评论
  • DeepXmas: AI knows if you are naughty or?nice

    DeepXmas: AI knows if you are naughty or?nice

    Who did this! I say as I look at the eggs, baking soda, flour, and real-lemon juice on the kitchen floor. The recipe…

    12 条评论

社区洞察

其他会员也浏览了