Prerequisites

Before starting the 5-Hour Data Engineering Boot Camp, make sure you meet the following prerequisites:

  • Basic understanding of Python programming
    • Variables, data types, and control structures
    • Functions and classes
    • Working with libraries and modules
  • Familiarity with SQL basics
    • SELECT, INSERT, UPDATE, DELETE statements
    • JOIN operations
    • Basic data filtering and aggregation
  • Python 3.11 or higher installed
  • A code editor (VS Code recommended)
  • Git for version control
  • Basic command line usage
  • Understanding of basic data structures
    • Lists, dictionaries, and dataframes
    • Working with CSV and JSON files
  • Basic knowledge of data types
    • Strings, numbers, dates
    • Boolean values
    • Null/None values
  • Basic understanding of:
    • Data warehousing concepts
    • ETL (Extract, Transform, Load) processes
    • Data quality principles
    • Version control with Git
    • Working with APIs
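
The tooling items in the list above (Python 3.11 or higher, Git) can be checked with a short script. This is a minimal sketch, not part of the course materials: the helper names `check_python` and `check_tool` are hypothetical, and the script only confirms that an executable named `git` is on your PATH, not that it is configured.

```python
import shutil
import sys

# Minimum interpreter version, taken from the prerequisite list.
MIN_PYTHON = (3, 11)


def check_python(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


def check_tool(name):
    """Return True if an executable called `name` is found on PATH."""
    return shutil.which(name) is not None


if __name__ == "__main__":
    print(f"Python >= 3.11: {check_python()}")
    print(f"git on PATH:    {check_tool('git')}")
```

Save it under any file name and run it with the interpreter you plan to use for the boot camp. If either line prints `False`, that tool needs to be installed or upgraded first.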

If you need to brush up on any of these prerequisites, check out our Tools and Setup guide for detailed instructions on setting up your development environment.

Ready to begin? Let’s start with Data Engineering Fundamentals!