I applied online. I interviewed at PayPay in Dec 2024
Interview
Data Engineer DaaS position.
- CodeSignal test with SQL and coding tasks.
- 1st interview: live coding in Python to implement a function that mimics the behavior of Apache Hudi. If you show that you can code and make adjustments, you will be fine.
- 2nd interview: low-level system design, a deep dive into low-level problems related to Spark. You need to know the fundamentals of Spark in depth. Some questions may not be relevant to the way you work at all but are apparently relevant to PayPay. If you happen not to know the answer, you're out of luck. PayPay works with streaming data, and you should know all the concepts related to it (e.g. restarting from failure).
- I did not get a 3rd interview, but it would have been high-level system design. Probably worth brushing up on big data architectures, e.g. Lambda and Delta architectures.
You need a deep understanding of Spark internals. Be sure to know every item in the Spark UI and be able to explain what is happening.
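For anyone preparing for the first live coding round, here is a minimal sketch of what "a function that mimics Apache Hudi" might look like. The review does not say which Hudi behavior was asked for, so this assumes the upsert path: records are matched on a record key, and the one with the larger ordering ("precombine") value wins. The class name `ToyUpsertTable` and the field names `id` and `ts` are hypothetical, and the merge rule is a simplification of what Hudi actually does.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Iterable, List

Record = Dict[str, Any]


@dataclass
class ToyUpsertTable:
    """In-memory table keyed by a record key, loosely modeled on Hudi upserts.

    Assumption: 'id' is the record key and 'ts' is the ordering field.
    Both names are made up for this sketch.
    """
    key_field: str = "id"
    precombine_field: str = "ts"
    rows: Dict[Any, Record] = field(default_factory=dict)

    def upsert(self, batch: Iterable[Record]) -> None:
        """Insert new keys; update existing keys unless the stored row is newer."""
        for rec in batch:
            key = rec[self.key_field]
            current = self.rows.get(key)
            # The incoming record wins on ties; a strictly older one is dropped.
            if current is None or rec[self.precombine_field] >= current[self.precombine_field]:
                self.rows[key] = dict(rec)

    def snapshot(self) -> List[Record]:
        """Return the current state of the table, sorted by key."""
        return sorted(self.rows.values(), key=lambda r: r[self.key_field])


table = ToyUpsertTable()
table.upsert([{"id": 1, "ts": 1, "v": "a"}, {"id": 2, "ts": 1, "v": "b"}])
# id=1 is updated, the stale id=2 record is ignored, id=3 is inserted.
table.upsert([
    {"id": 1, "ts": 2, "v": "c"},
    {"id": 2, "ts": 0, "v": "stale"},
    {"id": 3, "ts": 5, "v": "d"},
])
```

Likely follow-up adjustments in a round like this are deletes, duplicate keys inside a single batch, or keeping a commit history, so it helps to keep the merge rule in one place.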
Interview questions [1]
Question 1
What is a job, a stage, and a task? What would they look like in this example code?
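The example code from the interview is not included here, so the following is only a toy model in plain Python of the usual answer: an action triggers a job, the job is split into stages at shuffle boundaries, and each stage runs one task per partition. The function `plan_stages`, the step names, and the partition counts are all hypothetical. It assumes a single linear chain of transformations and ignores joins, cached or skipped stages, and adaptive query execution.

```python
from typing import Dict, List, Tuple

# Each step is (name, is_wide). A wide step needs a shuffle; a narrow one does not.
Step = Tuple[str, bool]


def plan_stages(steps: List[Step], input_partitions: int,
                shuffle_partitions: int) -> List[Dict]:
    """Split a linear chain of transformations into stages at shuffle boundaries.

    Simplification: the wide step itself is placed in the stage after the shuffle.
    """
    stages = [{"steps": [], "tasks": input_partitions}]
    for name, is_wide in steps:
        if is_wide:
            # A shuffle closes the current stage and opens a new one,
            # which runs one task per post-shuffle partition.
            stages.append({"steps": [], "tasks": shuffle_partitions})
        stages[-1]["steps"].append(name)
    return stages


# One action on this hypothetical lineage is modeled as one job.
lineage = [("filter", False), ("map", False),
           ("reduceByKey", True), ("mapValues", False)]
job = plan_stages(lineage, input_partitions=4, shuffle_partitions=2)
total_tasks = sum(stage["tasks"] for stage in job)
```

In this model the job has two stages: `filter` and `map` run together as 4 tasks, then `reduceByKey` and `mapValues` run as 2 tasks after the shuffle, for 6 tasks in total.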
I applied online. I interviewed at PayPay (New York, NY) in Mar 2025
Interview
You walk through a previous project and deep dive into the technical details, then write SQL for a specific business situation and Python code that works with DataFrames. The interviewers heavily emphasize "how to" and "what if" questions, so make sure you understand each component of the data ingestion and transformation process.
Interview questions [1]
Question 1
What is the most challenging project you have ever done?
I applied through a recruiter. The process took 1 week. I interviewed at PayPay (Tokyo) in Mar 2023
Interview
After a quick talk with HR, I was assigned a simple coding task, followed by an interview with the user. The interviewer drilled fairly deep into data engineering concepts, Spark, and pipeline architecture.