We are looking for a Senior Site Reliability Engineer with strong experience in AWS, system monitoring, and infrastructure automation.
The role involves maintaining and improving the reliability and performance of a cloud-based lending platform used by mid-market and large financial institutions.
The ideal candidate will have a solid background in systems engineering and software development, be comfortable working across teams, and take ownership of operational stability and tooling improvements.
Responsibilities include developing deep knowledge of the software and its functions, overseeing systems to ensure reliability for customers, monitoring distribution systems, and running the production environment. The role also covers building software and systems to manage platform infrastructure, improving reliability and quality, measuring and optimizing system performance, and partnering with development teams to enhance services.
Requirements:
A Bachelor's degree (B.A.) in Computer Science or Design, an equivalent four-year degree, or equivalent related experience is required.
Five to seven years of proven experience in a Site Reliability role, or similar experience, is necessary.
Excellent oral and written communication skills in English, including consulting skills and the ability to facilitate group presentations, are essential.
Deep technical experience with AWS, containerization technologies, automated deployment frameworks, monitoring, logging, alerting, system internals, networking, databases, distributed systems, and service-oriented architecture is required.
Hands-on technical leadership, with demonstrated business impact from combining software engineering and systems engineering skills to solve complex automation and reliability challenges, is expected.
Experience with infrastructure and application monitoring and logging tools such as New Relic, SumoLogic, Pingdom (uptime monitoring), CloudTrail, and CloudWatch Insights, and with AWS provisioning and deployment services such as CloudFormation, CodePipeline, and CodeDeploy, is necessary.
Extensive working knowledge of managing AWS environments and Linux operating systems is required.
Experience with MSSQL and MySQL in cloud-based environments, and knowledge of AWS-hosted database technologies such as Aurora and MySQL, is necessary.
Experience with NoSQL database technologies, ideally DynamoDB, is preferred.
Experience with pipeline automation scripting and tooling, such as Jenkins and Terraform, is required.
Knowledge of and experience with programming languages (e.g., C++, Java, PHP) and cloud platforms (e.g., AWS) is necessary.
The ability to learn new languages and technologies is strongly preferred.
A broad understanding of the lending industry, with the ability to become a subject matter expert on the job, is required.
Benefits:
The position offers the opportunity to work in a dynamic environment focused on cloud-based solutions for financial institutions.
Employees will have the chance to take ownership of projects and contribute to the operational stability of a critical platform.
The role encourages collaboration with both technical and business partners, fostering a team-oriented atmosphere.
There is potential for professional growth and development in a rapidly evolving field.
The company values strong communication and interpersonal skills, providing a supportive environment for team players.