Python Data Wrangling Training
Course 1273
3 DAY COURSE

Course Outline

In this Python Data Wrangling course, you will learn how to use Python to extract/transform data from various sources, including large database vaults and Excel financial tables.

You will also explore insights into why you should avoid traditional data cleaning methods, as done in other languages, and take advantage of the specialized functions from NumPy and Pandas.

Python Data Wrangling Training Benefits

  • In this Python Wrangling course, you will learn how to do the following:

    • Extract and parse data from various sources.
    • Transform and clean data using Numpy and Pandas.
    • Summarize and visualize data with Matplotlib.
    • Read HTML, XML, and JSON data from internet resources.
    • Search and filter data sets.
    • Apply Python tools and techniques to process data sets efficiently.
    • Continue learning and face new challenges with after-course one-on-one instructor coaching.
  • Prerequisites:

    You should know Python basics, including data structures, importing and using modules, creating functions, and using the Jupyter Notebook platform.

Python Data Wrangling Training Outline

Module 1: Introduction to Data Structure Using Python

In this module, you will learn about the following:

  • Python for Data Wrangling
  • Lists, Sets, Strings, Tuples, and Dictionaries

Module 2: Advanced Operations on Built-In Data Structure

In this module, you will learn about the following:

  • Advanced Data Structures
  • Basic File Operations in Python

Module 3: Introduction to NumPy, Pandas, and Matplotlib

In this module, you will learn about the following: 

  • NumPy Arrays 
  • Pandas DataFrames 
  • Statistics and Visualization with NumPy and Pandas 
  • Using NumPy and Pandas to Calculate Basic Descriptive Statistics on the DataFrame 

Module 4: Deep Dive into Data Wrangling with Python

In this module, you will learn about the following: 

  • Subsetting, Filtering, and Grouping 
  • Detecting Outliers and Handling Missing Values 
  • Concatenating, Merging, and Joining 
  • Useful Methods of Pandas 

Module 5: Getting Comfortable with Different Data Sources

In this module, you will learn about the following: 

  • Reading Data from Different Text-Based (and Non-Text-Based) Sources 
  • Introduction to BeautifulSoup4 and Web Page Parsing 

Module 6: Learning the Hidden Secrets of Data Wrangling

In this module, you will learn about the following: 

  • Advanced List Comprehension and the zip function 
  • Data Formatting 

Module 7: Advanced Web Scraping and Data Gathering

In this module, you will learn about the following: 

  • Basics of Web Scraping and BeautifulSoup libraries 
  • Reading Data from XML 

Module 8: RDBMS and SQL

In this module, you will learn about the following:

  • Refresher of RDBMS and SQL
  • Using an RDBMS (MySQL/PostgreSQL/SQLite)

Module 9: Application in Real Life and Conclusion of Course

In this module, you will learn about the following:

  • Applying Your Knowledge to a Real-life Data Wrangling Task
  • An Extension to Data Wrangling
Course Dates
Attendance Method
Note about the Certification Exam

When you register for the course, you will be prompted to choose Y/N to take the exam. Please select yes, as all HHS CISO employees are required to attempt the exam if one is offered for the course. Please be advised, if your course if funded by DIR, the Certification Organization has agreed to provide DIR the pass/fail status of your exam. DIR will only share this information in an aggregated report to state leadership that reflects total exam pass or fails. No individual names of any students will be included in any reports.

DIR requires that you submit the request for your exam voucher within one month of the last day of your course. DIR requires that you take your exam within six months of the last day of your course.

Additional comments or questions (optional)