Please let Spreedly know you found this job on RemoteYeah.
This helps us grow 🌱.
Description:
As a Senior Site Reliability Engineer (SRE) at Spreedly, you will focus on ensuring the reliability, observability, and scalability of our globally distributed payments platform.
You will lead efforts to stabilize and optimize our infrastructure, build platform services, and champion best practices that enhance system performance and resilience.
The role requires leveraging expertise in software development, infrastructure, and operations to ensure applications and systems are reliable, scalable, and efficient.
Responsibilities include ensuring system reliability and performance, collaborating with development teams, implementing observability solutions, leading incident management, developing automation tools, tuning database performance, providing thought leadership, and mentoring team members.
Requirements:
Hands-on experience with observability tools such as Datadog, OpenTelemetry, Sentry, and Sumo Logic, with a focus on actionable metrics and alerts, is required.
Strong proficiency in a modern programming language, with a proven ability to write clean, maintainable, and efficient code, is required; experience with Ruby, Rails, and Elixir is preferred.
Extensive experience with AWS services, including EC2 (Ubuntu Linux), S3, and RDS, is necessary.
In-depth knowledge of relational and key-value databases (e.g., CockroachDB, PostgreSQL, Riak), with experience in performance optimization and query tuning, is essential; experience with Kafka is a plus.
Excellent problem-solving skills, with experience diagnosing complex system issues in production environments, are required.
Proven ability to work cross-functionally with product teams and with application, infrastructure, and security engineering teams is necessary.
A strong understanding of DevOps practices, including CI/CD pipelines, configuration management, and infrastructure as code, is required.
Advanced knowledge of Docker and container orchestration best practices is a plus.
Strong written and verbal communication skills, with the ability to explain complex technical concepts to non-technical stakeholders, are essential.
Benefits:
Competitive salary plus equity is offered to US-based employees.
Outstanding medical and dental benefits, including 100% employer-paid options, are provided.
Company-paid life and disability insurance is included.
Optional vision and supplemental insurance options, along with various Flexible Spending Accounts (FSA), are available.
An open paid time off policy and 12 weeks of paid leave for new parents are offered.
A matching 401(k) plan (5% up to $5,000 yearly) is available.
A monthly home working/digital lifestyle stipend, a new MacBook, and a one-time accessory reimbursement are provided.
A LinkedIn Learning subscription is included.
Access to a company-paid professional coaching service is available.
Remote employees have the opportunity to visit the HQ in Durham, North Carolina.