Remote Site Reliability Engineer - II (SRE II)

Posted

This job is closed

This job post is closed and the position is probably filled. Please do not apply.  Automatically closed by a robot after apply link was detected as broken.

Description:

  • LivePerson is seeking a Site Reliability Engineer II (SRE II) to join their Global Product & Technology Division.
  • The role involves building and managing highly available, distributed systems within the LivePerson SRE team.
  • Responsibilities include ensuring product high uptime and reliability 24x7, managing Linux servers in a multi-cloud environment, and overseeing high availability Kubernetes resources using Helm charts.
  • The engineer will assist with deploying upgrades and patches using Chef, Ansible, Puppet, or Helm.
  • Monitoring and troubleshooting warnings and alerts related to the reporting platform’s performance is a key duty.
  • The role also includes developing monitoring resources and alerting systems such as Grafana, Prometheus, Kibana, DataDog, and PagerDuty.
  • Coordination with DBA and developers to manage SQL and NoSQL database systems, including MongoDB, ElasticSearch, Postgres, and MySQL, is required.
  • The engineer will manage message bus systems such as Kafka and Pulsar and build and maintain CI/CD pipelines using Jenkins, Gitlab, or TeamCity.

Requirements:

  • Candidates must have a minimum of 4+ years of experience managing cloud-based production environments (AWS, GCP, Azure, etc.).
  • Highly experienced in working in a Linux environment and proficient in scripting with Bash and Python.
  • Strong experience with configuration management systems like OpsCode Chef, Ansible, and Puppet is required.
  • Candidates should have experience in Terraform, CloudFormation, or other Infrastructure as Code (IAC) tools.
  • Proficiency in SQL, including DDL and complex queries, is necessary.
  • Experience working in the Kubernetes platform and in a microservices architecture using a message bus is essential.
  • Good knowledge of CI/CD pipeline orchestrators like TeamCity, Jenkins, and Gitlab is required.
  • Candidates must be able to integrate security best practices into the SRE workflow.
  • A highly motivated and independent work ethic is essential, along with being a team player with excellent interpersonal skills.
  • Excellent written and verbal communication skills are required.
  • A BS in Computer Science or a related field, or equivalent work experience, is necessary.
  • A strong background in cloud, network, and application security and compliance is preferred.
  • Experience with GPT or other large language models (LLMs) is a strong advantage.

Benefits:

  • Health benefits include medical, dental, and vision coverage.
  • Employees receive time away for vacation and holidays.
  • The company offers generous tuition reimbursement and access to internal professional development resources.
  • LivePerson is an equal opportunity employer, ensuring all qualified applicants receive consideration for employment without discrimination.
Leave a feedback