Remote Operations Engineer - Big Data

at Gate

Posted 3 days ago 3 applied

Description:

  • The Operations Engineer will be responsible for the deployment, monitoring, maintenance, and optimization of PB-scale big data clusters to ensure stability and performance.
  • The role involves monitoring and analyzing system logs to quickly locate and resolve faults.
  • The engineer will conduct in-depth research on components such as Doris, HBase, K8s, and Flink, continuously optimizing the service architecture.
  • Regular performance evaluations of big data components will be conducted, with improvement measures proposed to enhance platform stability.
  • The position includes driving the development of an automated operations and maintenance platform to improve operational efficiency.
  • The engineer will also be responsible for writing and maintaining operational documentation, including user manuals and emergency response procedures.

Requirements:

  • A bachelor's degree or higher in a computer-related field is required, along with at least 3 years of experience in big data platform operations and maintenance.
  • Familiarity with Doris, including knowledge of FE/BE architecture, data sharding, and table model optimization, is necessary.
  • Proficiency in Flink, particularly in job scheduling, state management, and resource monitoring (especially in Flink on YARN/K8s environments), is required.
  • Expertise in HBase, including read/write performance optimization, Phoenix integration, and HDFS storage layer tuning, is essential.
  • Knowledge of Kafka, including familiarity with the ISR mechanism, replica synchronization, consumer group management, and Kafka Connect applications, is needed.
  • Solid Linux system operation and maintenance skills are required, with proficiency in using Shell/Python to write operational scripts.
  • Experience in building monitoring systems with Prometheus and Grafana, and familiarity with tools such as Zabbix and ELK, are necessary.
  • An understanding of containerization technologies (Docker/K8s) and knowledge of CI/CD pipeline construction are required.
  • A strong sense of responsibility, excellent fault diagnosis skills, and the ability to work under pressure are essential.

Benefits:

  • The position offers the opportunity to work with a top-tier team at a leading global digital asset exchange, providing a first-class service experience in trading, security, and blockchain product innovation.
  • Employees will have unlimited room for professional development and future growth within the company.
  • The company values integrity, insight, innovation, knowledge, and collaboration, fostering an environment where employees can fully utilize their knowledge, vision, and autonomy.
  • Gate is recognized as one of the safest and most reliable cryptocurrency platforms globally, providing a stable and secure work environment.