Building Data Apps with Python Workshop on June 6th

Tony Ojeda

Data Science & AI Executive

Published: May 11, 2015

Data Community DC and District Data Labs are hosting a full-day Building Data Apps with Python workshop on Saturday, June 6th. More info and registration can be found here. Register before May 23rd for an early bird discount!

Overview

Data products are usually software applications that derive their value from data by leveraging the data science pipeline, and that generate more data through their operation. They aren’t apps with data, nor are they one-time analyses that produce insights; they are operational and interactive. The rise of these types of applications has directly contributed to the rise of the data scientist and the idea that data scientists are professionals “who are better at statistics than any software engineer and better at software engineering than any statistician.”

These applications have largely been built with Python. Python is flexible enough to support very rapid development on many different types of servers, and it has a rich tradition in web applications. Python contributes to every stage of the data science pipeline, including real-time ingestion and the production of APIs, and it is powerful enough to perform machine learning computations. In this class we’ll produce a data product with Python, leveraging every stage of the data science pipeline to produce a book recommender.

What You Will Learn

Python is one of the most popular programming languages for data analysis. Because of this, it is important to have a basic working knowledge of the language in order to access more complex topics in data science and natural language processing. The purpose of this one-day course is to introduce the development process in Python using a project-based, hands-on approach. In particular, you will learn how to structure a data product using every stage of the data science pipeline: ingesting data from the web, wrangling data into a structured database, computing a non-negative matrix factorization with Python, and then producing a web-based report.
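To give a feel for the matrix factorization step, here is a minimal sketch of non-negative matrix factorization in NumPy using multiplicative updates. It is not taken from the workshop materials: the `nmf` function, its parameters, and the small `ratings` matrix are hypothetical and chosen only for illustration. Note that this simple version treats a zero as an actual rating of zero rather than as a missing value.

```python
import numpy as np


def nmf(V, rank, iterations=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix V as W @ H with W, H >= 0.

    Hypothetical helper for illustration: uses multiplicative updates
    that reduce the squared reconstruction error.
    """
    rng = np.random.default_rng(seed)
    n_rows, n_cols = V.shape
    # Start from small positive random factors.
    W = rng.random((n_rows, rank)) + eps
    H = rng.random((rank, n_cols)) + eps
    for _ in range(iterations):
        # Each update multiplies by a non-negative ratio, so the
        # factors never become negative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Made-up ratings: rows are readers, columns are books.
ratings = np.array([
    [5.0, 4.0, 0.0, 1.0],
    [4.0, 5.0, 1.0, 0.0],
    [0.0, 1.0, 5.0, 4.0],
    [1.0, 0.0, 4.0, 5.0],
])

W, H = nmf(ratings, rank=2)
approx = W @ H  # predicted scores for every reader/book pair
```

In a recommender, the rows of `W` can be read as each reader's affinity for a small number of latent "tastes", and the columns of `H` as how strongly each book expresses them. High values in `approx` for books a reader has not rated are candidate recommendations.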

Course Outline

The workshop will cover the following topics:

Basic project structure of a Python application
virtualenv & virtualenvwrapper
Managing requirements outside the stdlib
Creating a testing framework with nose
Ingesting data with requests
Wrangling data into SQLite Databases using SQLAlchemy
Building a recommender system with Python
Computing a matrix factorization with NumPy
Storing computational models using pickles
Reporting data with JSON
Data visualization with Jinja2
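The storage and reporting topics above can be sketched with the standard library alone. This is only an illustration under stated assumptions: it uses the built-in `sqlite3` module in place of SQLAlchemy to stay dependency-free, the table name, column names, and sample rows are hypothetical, and the "model" is a simple per-book average standing in for a real recommender.

```python
import json
import pickle
import sqlite3

# Hypothetical ratings; in practice these would come from ingested web data.
rows = [("alice", "Dune", 5), ("alice", "Emma", 2), ("bob", "Dune", 4)]

# Wrangle the raw tuples into a structured SQLite table (in memory here).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ratings (reader TEXT, book TEXT, rating INTEGER)")
conn.executemany("INSERT INTO ratings VALUES (?, ?, ?)", rows)
conn.commit()

# Stand-in "model": the mean rating per book, as a plain dict.
model = dict(conn.execute("SELECT book, AVG(rating) FROM ratings GROUP BY book"))
conn.close()

# Store the computed model as a pickle and load it back.
blob = pickle.dumps(model)
restored = pickle.loads(blob)

# Report the results as JSON, best-rated book first.
ranked = sorted(restored.items(), key=lambda item: item[1], reverse=True)
report = json.dumps({"books": ranked}, indent=2)
```

Pickles are convenient for caching a fitted model between runs, but they should only be loaded from sources you trust, since unpickling can execute arbitrary code. The JSON string in `report` is the kind of payload a web page or template could then render.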

After this course you should understand how to build a data product using Python and will have built a recommender system that implements the entire data science pipeline.

Instructor: Benjamin Bengfort

Benjamin is an experienced Data Scientist and Python developer who has worked in the military, industry, and academia for the past eight years. He is currently pursuing his PhD in Computer Science at the University of Maryland, College Park, doing research in Metacognition and Active Logic. He is also a Data Scientist at Cobrain Company in Bethesda, MD, where he builds data products including recommender systems and classifier models. He holds a Master's degree from North Dakota State University, where he taught undergraduate Computer Science courses. He is also adjunct faculty at Georgetown University, where he teaches Data Science and Analytics.

More info and registration
