This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after apply link
was detected as broken.
Description:
LivePerson is seeking a Site Reliability Engineer II (SRE II) to join their Global Product & Technology Division.
The role involves building and managing highly available, distributed systems within the LivePerson SRE team.
Responsibilities include ensuring product high uptime and reliability 24x7, managing Linux servers in a multi-cloud environment, and overseeing high availability Kubernetes resources using Helm charts.
The engineer will assist with deploying upgrades and patches using Chef, Ansible, Puppet, or Helm.
Monitoring and troubleshooting warnings and alerts related to the reporting platform’s performance is a key duty.
The role also includes developing monitoring resources and alerting systems such as Grafana, Prometheus, Kibana, DataDog, and PagerDuty.
Coordination with DBA and developers to manage SQL and NoSQL database systems, including MongoDB, ElasticSearch, Postgres, and MySQL, is required.
The engineer will manage message bus systems such as Kafka and Pulsar and build and maintain CI/CD pipelines using Jenkins, Gitlab, or TeamCity.
Requirements:
Candidates must have a minimum of 4+ years of experience managing cloud-based production environments (AWS, GCP, Azure, etc.).
Highly experienced in working in a Linux environment and proficient in scripting with Bash and Python.
Strong experience with configuration management systems like OpsCode Chef, Ansible, and Puppet is required.
Candidates should have experience in Terraform, CloudFormation, or other Infrastructure as Code (IAC) tools.
Proficiency in SQL, including DDL and complex queries, is necessary.
Experience working in the Kubernetes platform and in a microservices architecture using a message bus is essential.
Good knowledge of CI/CD pipeline orchestrators like TeamCity, Jenkins, and Gitlab is required.
Candidates must be able to integrate security best practices into the SRE workflow.
A highly motivated and independent work ethic is essential, along with being a team player with excellent interpersonal skills.
Excellent written and verbal communication skills are required.
A BS in Computer Science or a related field, or equivalent work experience, is necessary.
A strong background in cloud, network, and application security and compliance is preferred.
Experience with GPT or other large language models (LLMs) is a strong advantage.
Benefits:
Health benefits include medical, dental, and vision coverage.
Employees receive time away for vacation and holidays.
The company offers generous tuition reimbursement and access to internal professional development resources.
LivePerson is an equal opportunity employer, ensuring all qualified applicants receive consideration for employment without discrimination.