Recently attended an interview for a Data Engineer role.
The interview had multiple rounds covering SQL, Python, Airflow, GCP, and ETL concepts.
Some of the questions asked:
• SQL joins, window functions, dense_rank vs rank
• Finding duplicates and highest salary by department
• Python list/dictionary handling and basic scripting
• Airflow DAG flow, XComs, operators, scheduling
• GCS, Pub/Sub, Cloud Functions, and service accounts in GCP
• ETL pipeline design and data quality checks
• Scenario-based questions on handling failures and monitoring
The discussion was more practical and scenario-based rather than purely theoretical. They focused a lot on understanding the flow end-to-end and explaining concepts clearly.
Overall, it was a good learning experience and helped me understand areas I should improve further.
Sharing this in case it helps someone preparing for Data Engineering interviews.