Python for Data Science
Python is one of the most powerful and widely used programming languages available. Its easily understood syntax makes it a popular choice for new coders, while its' many open source modules for everything from web development to data analysis make it the tool of choice for many data scientists. These courses will familiarize students with the use of Python for data science and the common libraries to perform typical data science operations.
- Intermediate Python for Data Science
March 30, April 1, 7, 8, 2021: 8:45AM–12:15PM Details and Registration
- Advanced Python for Data Science
June 15, 17, 22, 24 - 8:30AM-12:30PM Details and Registration
- Introduction to Python for Data Science (no prior experience required)
Aug 3, 5, 10, 12 - 8:30AM-12:30 PM Details and Registration
Continuing Education Units
A Certificate of Completion is provided with each course showing number of contact hours (instruction), and field of study as "Information Technology" for continuing education purposes. Most courses provides 12 contact hours and are equivalent to 1.2 CEU's.
The term CEU is not a trademarked term; therefore, any educational institution may use it to describe their courses. Professions and industries usually regulate their approved continuing education within their bylaws and not one institute or accrediting body has become a standard to accept in this regard. Professionals should always consult their Association or regulating body prior to embarking on continuing education and not assume a CEU will be accepted as part of their professional development.
PYTHON FOR DATA SCIENCE CERTIFICATE OF COMPETENCY (3 Classes)
- Introduction to Python for Data Science
- Intermediate Python for Data Science
- Advanced Python for Data Science or Fundamentals of Machine Learning
To achieve a Certificate of Competency, all classes must be taken through the Center for Business Analytics. In the event that professional experience meets the prerequisite for a class, another approved class may be substituted (limited to 1 class per Certificate of Competency)
This course is an introduction to Python and its uses as a data analytics tool, requiring no previous Python experience. It begins with the core concepts of Python itself: data types, functions, and objects. With that foundational knowledge, students will be introduced to the core tools in Python’s data science toolkit: the Pandas package for data wrangling and the matplotlib package for visualization. Students will spend some of their time working through a case study, in which they can apply the concepts they’ve learned while instructors are available to help with questions.
Open to all. 4 online sessions
No prior experience required
Day One | Day Two | |
---|---|---|
Introduction Python and Jupyter Overview Fundamentals Packages, Modules, Methods, Functions Importing Data Selecting and Filtering Data Working with Columns Case Study: Part 1Q & A |
Case Study: Part 1 Review Summarizing Data Summarizing Grouped Data Joining Data Exporting Data Visualizing Data Case Study: Part 2 Case Study: Part 2 Review Q & A |
- 4 online sessions
- Time commitment 12-16 hours
- Certificate of Completion.
- Required class for “Python for Data Sciencel” Certificate of Competency
Online Course Fees: See registration page
These trainings can also be customized and delivered at your location.
This workshop builds on concepts taught in the introductory course (i.e. the basics of Python and its data science stack). Students will learn how to integrate control flow into their code, write their own reusable functions, and build a variety of models using the cutting-edge scikit-learn library. They will also get more exposure to practical concerns of using Python in a reliable and scalable way: how to manage multiple Python environments using conda, what other packages exist in the data science ecosystem, and the basics of running Python from the command line.
Open to all. 4 online sessions
PREREQUISITE: Attendance at the Introduction to Python for Data Science training or previous experience using Python for data analysis in a professional environment.
Day One | Day Two | |
---|---|---|
Introduction Working with Data using Pandas Conditions Iterations Functions Applying Functions to Pandas Dataframes Case Study: Part 1Q & A |
Case Study Part 1 Review Python from the Shell Kernels and Environments Python Data Science Ecosystem Modeling with Scikit-learn Case Study: Part 2 Case Study: Part 2 ReviewQ & A |
- 4 online sessions
- Time commitment 12-16 hours
- Certificate of Completion.
- Required class for “Python for Data Sciencel” Certificate of Competency
Online Course Fees: See registration page
These trainings can also be customized and delivered at your location.
This is a two-day course that introduces how one can use Python for advanced data science tasks, such as deep learning and natural language processing. Most of the time will be spent working through example problems end-to-end in the classroom. Students will learn the fundamentals of the Keras package (for deep learning) and will explore several NLP packages and methodologies to see the strengths of each. Some additional time will be reserved for discussion of real programming challenges students have encountered, and for an overview of related relevant technologies students may need in an industry setting (e.g. Git and GitHub).
Objectives
- Develop an intuition for what problems are suited to deep learning- and/or NLP-based solutions.
- Build familiarity with the basic interfaces of key Python libraries for deep learning and NLP: Keras, FuzzyWuzzy, and gensim.
- Gain a high-level understanding of the function of data science-adjacent technologies that students will encounter in the workplace, focusing on Git and GitHub.
Prerequisites: Attendance at Introduction and Intermediate Python for Data Science classes or the following experience and skills.
- Strong understanding of core Python concepts: variables, loops, conditionals, and functions
- Some experience using Jupyter Notebooks or Jupyter Lab
- Solid grasp of Pandas and how to use it for data manipulation: filtering, selecting, aggregating, slicing (indexing), and updating
- High-level understanding of modeling concepts: training and test data, model accuracy, and overfitting
- 4 online sessions
- Time commitment 12-16 hours
- Certificate of Completion.
- Required class for “Python for Data Sciencel” Certificate of Competency
Online Course Fees: See registration page
These trainings can also be customized and delivered at your location.
Python for Data Science Instructors
Ethan Swan is a lead data scientist at 84.51°, where he designs and builds tools that simplify the work of other data scientists, often in the form of Python packages. He has been teaching courses on Python, Linux, and other data science technologies since 2016. He holds a BS in computer science and an MBA from the University of Notre Dame, and is an adjunct instructor at UC.
Website: ethanswan.com
Twitter: @eswan18
Instructor: Brad Boehmke, PhD, is the Director of Data Science at 84.51°, Professor at three universities, author of the Data Wrangling with R book, and creator of multiple R open source packages and data science short courses. He focuses on developing algorithmic processes, solutions, and tools that enable 84.51° and its analysts to efficiently extract insights from data and provide solution alternatives to decision-makers. He has a wide analytic skill set covering descriptive, predictive, and prescriptive analytic capabilities applied across multiple domains including retail, healthcare, cyber intelligence, finance, Department of Defense, and aerospace. Summary of his works is available online at bradleyboehmke.github.io.