Remote Data Engineer, Data Insights

Posted

Apply now
Please, let PayPay know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • PayPay is seeking a Data Engineer for the Data Insights department to support the rapid expansion of its product teams and the need for a robust Data Engineering Platform.
  • The mission of the Data Insights department is to drive product improvements through a scientific understanding of user and merchant behavior.
  • The Data Engineer will be responsible for designing, developing, and maintaining scalable data ingestion pipelines using AWS Glue, Step Functions, Lambda, and Terraform.
  • Responsibilities include optimizing and managing large-scale data pipelines for high performance, reliability, and efficiency.
  • The role involves implementing data processing workflows using Hudi, Delta Lake, Spark, and Scala.
  • The Data Engineer will maintain and enhance Lakeformation and Glue Data Catalog for effective data management and discovery.
  • The position requires designing, building, and maintaining infrastructure to support the improvement and deployment of machine learning models.
  • Collaboration with cross-functional teams is essential to ensure seamless data flow and integration across the organization.
  • Best practices for observability, data governance, security, and compliance must be implemented.

Requirements:

  • Candidates must have 5+ years of experience as a Data Engineer or in a similar role.
  • Hands-on experience with Apache Hudi, Delta Lake, Spark, and Scala is required.
  • Experience in designing, building, and operating a DataLake or Data Warehouse is necessary.
  • Knowledge of Data Orchestration tools such as Airflow, Dagster, or Prefect is essential.
  • Strong expertise in AWS services, including Glue, Step Functions, Lambda, and EMR, is required.
  • Familiarity with change data capture tools like Canal, Debezium, and Maxwell is needed.
  • Experience with data warehousing tools like AWS Athena, BigQuery, or Databricks is necessary.
  • Proficiency in Python and SQL (any variant) is required, with preferable experience in Scala and/or Java.
  • Experience with data cataloging and metadata management using AWS Glue Data Catalog, Lakeformation, or Unity Catalog is essential.
  • Proficiency in Terraform for infrastructure as code (IaC) is required.
  • An overall understanding of machine learning technologies and deep learning concepts is necessary.
  • Strong problem-solving skills and the ability to troubleshoot complex data issues are essential.
  • Excellent communication and collaboration skills are required.
  • The ability to work in a fast-paced, dynamic environment and manage multiple tasks simultaneously is necessary.

Benefits:

  • The position offers a full-time employment status with the option to work remotely from anywhere in Japan.
  • Employees enjoy super flex time with no core hours, typically working from 10:00 am to 6:45 pm.
  • Paid leave includes annual leave (up to 14 days in the first year) and personal leave (5 days each year).
  • An annual salary is paid in 12 installments, based on skills, experience, and abilities, with annual reviews and a special incentive based on company performance.
  • Additional benefits include a late overtime allowance, a Work from Anywhere allowance (JPY 100,000), and the option for digital salary payment through "PayPay Paycheck."
  • Employees are provided with social insurance (health insurance, employee pension, employment insurance, and compensation insurance), a 401K plan, translation/interpretation support, and VISA sponsorship with relocation support.
Apply now
Please, let PayPay know you found this job on RemoteYeah . This helps us grow 🌱.
About the job
Posted on
Job type
Salary
-
Report this job

Job expired or something else is wrong with this job?

Report this job
Leave a feedback