Python for Data Science

Python is one of the most powerful and widely used programming languages available. Its easily understood syntax makes it a popular choice for new coders, while its many open source modules for everything from web development to data analysis make it the tool of choice for many data scientists. These courses will familiarize students with the use of Python for data science and with the common libraries used to perform typical data science operations.

   
All courses are taught online by live instructors.
 
  • Introduction to Python for Data Science Online (no prior experience required)
         Coming early 2022

  • Intermediate Python for Data Science Online
         Oct 19, 21, 26, 28 -  8:30AM-12:30PM Details and Registration

  • Advanced Python for Data Science Online
          Nov 30, Dec 2, 7, 9 - 8:30AM-12:30PM  Details and Registration

"Online is always tougher, but having a live instructor vs. a course that is just a series of videos/tutorials makes a huge difference." (from training course survey)
 
"I completed an Intermediate class in using Python for Data Science at the UC Center for Business Analytics. The courses, even when presented online, are excellent. Turns out that the 'math' and Python/programming side of Data Science is the easy stuff. I've picked up a few O'Reilly books to prep me for the last, advanced class in June. If you want to get comfortable with data modeling and python, I highly recommend these courses." (LinkedIn post)
 

Continuing Education Units

A Certificate of Completion is provided with each course, showing the number of contact hours (instruction) and the field of study as "Information Technology" for continuing education purposes. Most courses provide 12 contact hours and are equivalent to 1.2 CEUs.

The term CEU is not trademarked; therefore, any educational institution may use it to describe its courses. Professions and industries usually regulate their approved continuing education within their bylaws, and no single institute or accrediting body has become the accepted standard in this regard. Professionals should always consult their association or regulating body before embarking on continuing education and should not assume a CEU will be accepted as part of their professional development.


PYTHON FOR DATA SCIENCE CERTIFICATE OF COMPETENCY (3 Classes)

To achieve a Certificate of Competency, all classes must be taken through the Center for Business Analytics. In the event that professional experience meets the prerequisite for a class, another approved class may be substituted (limited to 1 class per Certificate of Competency).


Introduction to Python for Data Science (Early 2022)

This course is an introduction to Python and its uses as a data analytics tool, requiring no previous Python experience. It begins with the core concepts of Python itself: data types, functions, and objects. With that foundational knowledge, students will be introduced to the core tools in Python’s data science toolkit: the Pandas package for data wrangling and the matplotlib package for visualization. Students will spend some of their time working through a case study, in which they can apply the concepts they’ve learned while instructors are available to help with questions.

Open to all.  4 online sessions

No prior experience required

 

Introduction to Python for Data Science Course Outline

Day One
  • Introduction
  • Python and Jupyter Overview
  • Fundamentals
  • Packages, Modules, Methods, Functions
  • Importing Data
  • Selecting and Filtering Data
  • Working with Columns
  • Case Study: Part 1
  • Q & A

Day Two
  • Case Study: Part 1 Review
  • Summarizing Data
  • Summarizing Grouped Data
  • Joining Data
  • Exporting Data
  • Visualizing Data
  • Case Study: Part 2
  • Case Study: Part 2 Review
  • Q & A
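
To give a flavor of several outline topics (working with columns, selecting and filtering, summarizing grouped data, joining, and exporting), here is a minimal sketch using Pandas. It is not taken from the course materials: the tables, column names, and values are all hypothetical and exist only for illustration.

```python
import pandas as pd

# Hypothetical sales data; in practice, data would usually be imported
# from a file (for example with pd.read_csv).
sales = pd.DataFrame({
    "store": ["A", "A", "B", "B", "C"],
    "units": [10, 4, 7, 12, 3],
    "price": [2.5, 4.0, 2.5, 1.0, 4.0],
})
# Hypothetical lookup table used for the join below.
stores = pd.DataFrame({
    "store": ["A", "B", "C"],
    "city": ["Cincinnati", "Dayton", "Columbus"],
})

# Working with columns: derive revenue from two existing columns.
sales["revenue"] = sales["units"] * sales["price"]

# Selecting and filtering: keep rows with at least 5 units sold,
# and only the columns of interest.
large_orders = sales.loc[sales["units"] >= 5, ["store", "units", "revenue"]]

# Summarizing grouped data: total revenue per store.
revenue_by_store = sales.groupby("store", as_index=False)["revenue"].sum()

# Joining data: attach each store's city to the summary.
summary = revenue_by_store.merge(stores, on="store", how="left")

# Exporting data: to_csv with no path returns the CSV text as a string.
csv_text = summary.to_csv(index=False)
print(csv_text)
```

Passing a file path to `to_csv` would write the same content to disk instead of returning a string.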

 
  • 4 online sessions
  • Time commitment 12-16 hours     
  • Certificate of Completion.
  • Required class for "Python for Data Science" Certificate of Competency

Online Course Fees: See registration page 

These trainings can also be customized and delivered at your location.


Intermediate Python for Data Science

This workshop builds on concepts taught in the introductory course (i.e., the basics of Python and its data science stack). Students will learn how to integrate control flow into their code, write their own reusable functions, and build a variety of models using the scikit-learn library. They will also get more exposure to the practical concerns of using Python in a reliable and scalable way: how to manage multiple Python environments using conda, what other packages exist in the data science ecosystem, and the basics of running Python from the command line.

Open to all.  4 online sessions

PREREQUISITE: Attendance at the Introduction to Python for Data Science training or previous experience using Python for data analysis in a professional environment.

 

Intermediate Python for Data Science Course Outline

Day One
  • Introduction
  • Working with Data using Pandas
  • Conditions
  • Iterations
  • Functions
  • Applying Functions to Pandas Dataframes
  • Case Study: Part 1
  • Q & A

Day Two
  • Case Study: Part 1 Review
  • Python from the Shell
  • Kernels and Environments
  • Python Data Science Ecosystem
  • Modeling with Scikit-learn
  • Case Study: Part 2
  • Case Study: Part 2 Review
  • Q & A
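
As a rough illustration of the conditions, iterations, and functions topics, here is a minimal sketch using only the Python standard library. It is not taken from the course materials: the function names, the threshold, and the order data are hypothetical.

```python
def classify_order(units, threshold=10):
    """Return a size label for an order (hypothetical business rule)."""
    # Conditions: choose a label based on the number of units.
    if units >= threshold:
        return "large"
    elif units > 0:
        return "small"
    return "empty"


def count_labels(orders, threshold=10):
    """Count how many orders fall under each size label."""
    counts = {}
    # Iteration: visit each order and reuse classify_order for the label.
    for units in orders:
        label = classify_order(units, threshold)
        counts[label] = counts.get(label, 0) + 1
    return counts


# Hypothetical order sizes.
order_sizes = [12, 3, 0, 25, 7]
label_counts = count_labels(order_sizes)
print(label_counts)
```

A reusable function like `classify_order` could also be applied to a Pandas column with `Series.apply`, which connects to the "Applying Functions to Pandas Dataframes" topic.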
 
 
  • 4 online sessions
  • Time commitment 12-16 hours     
  • Certificate of Completion.
  • Required class for "Python for Data Science" Certificate of Competency

Online Course Fees: See registration page

These trainings can also be customized and delivered at your location.


Advanced Python for Data Science

This is a two-day course that introduces how to use Python for advanced machine learning applications. Most of the time will be spent working through example problems end-to-end in the classroom. Students will learn the fundamentals of the scikit-learn library and explore several other tools and methodologies for implementing a robust end-to-end machine learning workflow. Some additional time will be reserved for discussion of real programming challenges students have encountered, and for an overview of related technologies students may need in an industry setting (e.g., Git and GitHub).

Objectives

  1. Develop an intuition for the machine learning workflow and Python tooling.
  2. Build familiarity with common software engineering tooling and methodologies for implementing a machine learning project.
  3. Gain a high-level understanding of the function of data science-adjacent technologies that students will encounter in the workplace, focusing on Git and GitHub.


Prerequisites

  • Strong understanding of core Python concepts: variables, loops, conditionals, and functions
  • Some experience using Jupyter Notebooks or Jupyter Lab
  • Solid grasp of Pandas and how to use it for data manipulation: filtering, selecting, aggregating, slicing (indexing), and updating
  • High-level understanding of modeling concepts: training and test data, model accuracy, and overfitting

  • 4 online sessions
  • Time commitment 12-16 hours
  • Certificate of Completion.
  • Required class for "Python for Data Science" Certificate of Competency

Online Course Fees: See registration page

These trainings can also be customized and delivered at your location.


Python for Data Science Instructors

Ethan Swan is a lead data scientist at 84.51°, where he designs and builds tools that simplify the work of other data scientists, often in the form of Python packages. He has been teaching courses on Python, Linux, and other data science technologies since 2016. He holds a BS in computer science and an MBA from the University of Notre Dame, and is an adjunct instructor at UC.

Website: ethanswan.com

Twitter: @eswan18

Brad Boehmke, PhD, is the Director of Data Science at 84.51°, a professor at three universities, author of the book Data Wrangling with R, and creator of multiple open source R packages and data science short courses. He focuses on developing algorithmic processes, solutions, and tools that enable 84.51° and its analysts to efficiently extract insights from data and provide solution alternatives to decision-makers. He has a wide analytic skill set covering descriptive, predictive, and prescriptive analytic capabilities applied across multiple domains including retail, healthcare, cyber intelligence, finance, Department of Defense, and aerospace. A summary of his work is available online at bradleyboehmke.github.io.



For more information about these classes, or for custom training classes, please contact
Larry Porter

Training, Marketing and Sponsorship

513-556-4742