The Operations Engineer will be responsible for the deployment, monitoring, maintenance, and optimization of petabyte-scale big data clusters to ensure stability and performance.
The role involves monitoring and analyzing system logs to quickly locate and resolve faults.
The engineer will conduct in-depth research on components such as Doris, HBase, Kubernetes (K8s), and Flink, continuously optimizing the service architecture.
Regular performance evaluations of big data components will be conducted, with improvement measures proposed to enhance platform stability.
The position includes promoting the construction of an automated operation and maintenance platform to improve operational efficiency.
The engineer will also be responsible for writing and maintaining operational documentation, including user manuals and emergency response procedures.
Requirements:
A bachelor's degree or higher in a computer-related field is required, along with at least 3 years of experience in big data platform operations and maintenance.
Familiarity with Doris, including knowledge of FE/BE architecture, data sharding, and table model optimization, is necessary.
Proficiency in Flink, particularly in job scheduling, state management, and resource monitoring (especially when running on YARN or K8s), is required.
Expertise in HBase, including read/write performance optimization, Phoenix integration, and HDFS storage layer tuning, is essential.
Knowledge of Kafka, including the in-sync replica (ISR) mechanism, replica synchronization, consumer group management, and Kafka Connect applications, is needed.
Solid Linux system operation and maintenance skills are required, with proficiency in using Shell/Python to write operational scripts.
Experience in building monitoring systems with Prometheus and Grafana, and familiarity with tools such as Zabbix and ELK, are necessary.
Understanding of containerization technologies (Docker/K8s) and knowledge of CI/CD pipeline construction are required.
A strong sense of responsibility, excellent fault diagnosis skills, and the ability to work under pressure are essential.
Benefits:
The position offers the opportunity to work with a top-tier team at a leading global digital asset exchange that provides a first-class service experience in trading, security, and blockchain product innovation.
Employees will have extensive room for professional development and future growth within the company.
The company values integrity, insight, innovation, knowledge, and collaboration, fostering an environment where employees can fully utilize their knowledge, vision, and autonomy.
Gate is recognized as one of the safest and most reliable cryptocurrency platforms globally, providing a stable and secure work environment.