This position is for a Site Reliability Engineer at Launchpad Technologies, posted by Jobgether, and is open to candidates in Latam.
The role involves joining a global team that focuses on the intersection of development and operations.
Responsibilities include designing, automating, and maintaining high-availability systems for leading international clients.
The position allows for remote work while addressing complex infrastructure challenges, enhancing system resilience, and ensuring consistent performance across cloud environments.
The engineer will contribute to mission-critical platforms by leveraging automation, monitoring, and modern DevOps tools in a collaborative culture that promotes continuous learning and professional growth.
Key accountabilities include collaborating with cross-functional teams, designing monitoring systems, automating infrastructure provisioning, managing containerized environments, performing root cause analysis, conducting capacity planning, and supporting CI/CD pipeline maintenance.
Requirements:
Candidates must have proven experience in Site Reliability, DevOps, or Infrastructure Engineering roles.
Strong scripting and automation skills in Python, Bash, or Go are required.
Proficiency with Docker and Kubernetes for container orchestration is necessary.
Hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud is essential.
Familiarity with monitoring and logging tools like Prometheus, Grafana, or the ELK stack is expected.
A solid understanding of performance tuning, fault tolerance, and system observability is required.
Strong communication skills in English and a collaborative problem-solving mindset are essential.
Bonus points for knowledge of networking and security best practices, experience managing incident response, relevant certifications, and a background in designing highly available and fault-tolerant systems.
Benefits:
This is a 100% remote position with flexible working arrangements.
The role offers competitive compensation in USD.
Personal hardware setup support for remote work is provided.
Employees receive paid time off, including vacation, study leave, and personal time.
There are opportunities to collaborate with global teams across North America, Europe, and Asia.
Ongoing training and development allowances are available.
The company promotes a supportive, people-first culture that values employee well-being.