Prerequisites
Prerequisites
Section titled “Prerequisites”Before starting the 5-Hour Data Engineering Boot Camp, you should have the following prerequisites:
Programming Knowledge
Section titled “Programming Knowledge”- Basic understanding of Python programming
- Variables, data types, and control structures
- Functions and classes
- Working with libraries and modules
- Familiarity with SQL basics
- SELECT, INSERT, UPDATE, DELETE statements
- JOIN operations
- Basic data filtering and aggregation
Tools and Environment
Section titled “Tools and Environment”- Python 3.11 or higher installed
- A code editor (VS Code recommended)
- Git for version control
- Basic command line usage
Data Concepts
Section titled “Data Concepts”- Understanding of basic data structures
- Lists, dictionaries, and dataframes
- Working with CSV and JSON files
- Basic knowledge of data types
- Strings, numbers, dates
- Boolean values
- Null/None values
Optional but Helpful
Section titled “Optional but Helpful”- Basic understanding of:
- Data warehousing concepts
- ETL (Extract, Transform, Load) processes
- Data quality principles
- Version control with Git
- Working with APIs
Getting Started
Section titled “Getting Started”If you need to brush up on any of these prerequisites, check out our Tools and Setup guide for detailed instructions on setting up your development environment.
Ready to begin? Let’s start with Data Engineering Fundamentals!