This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after apply link
was detected as broken.
Description:
Kunai is a fast-growing digital consultancy focused on banking, payments, and fintech, powered by a global network of diverse talent.
The company has shipped over 150 products for notable clients including Visa, American Express, and Wells Fargo.
The Site Reliability Engineer (SRE) role involves managing and supporting large-scale, fault-tolerant systems in a cloud environment.
SRE engineers ensure system reliability and uptime while monitoring availability, capacity, and performance in a 24/7 environment.
Responsibilities include managing troubleshooting and recovery of production incidents, driving incident resolution, and participating in Agile project work.
Engineers will create and manage technical documentation, monitor applications and infrastructure, and influence resiliency in production environments using AWS.
The role requires identifying opportunities for automated monitoring and conducting Root Cause Analysis on outages.
Engineers will also implement automations for routine tasks and share knowledge with team members.
Requirements:
A Bachelor's degree or equivalent certification is required.
Candidates must have at least 2 years of experience managing and troubleshooting incident bridge calls.
A minimum of 2 years of experience with Python scripting is necessary.
Candidates should have at least 2 years of experience using and supporting public cloud environments such as AWS, Azure, or GCP.
Experience with monitoring tools like Splunk, New Relic, or DataDog for at least 2 years is required.
Preferred qualifications include AWS Associate level certification and experience with Linux, UNIX, Ruby, Go, JavaScript, or NoSQL.
Candidates should have 3+ years of experience with public cloud environments and 2+ years of experience with networks, load balancers, and web application firewalls.
Experience with web API services for at least 2 years is also preferred.
The role requires working 40 hours per week during nights and weekends, with a specific coverage schedule.
Benefits:
The position offers a mostly remote work environment.
Employees will have the opportunity to work with a diverse and innovative team.
The role provides a chance to solve challenging technical problems and contribute to client success.
Employees are encouraged to continuously develop new skills and share knowledge with others.