Remote Site Reliability Engineer / Observability Engineer
Posted
Apply now
Please, let Rackspace know you found this job
on RemoteYeah.
This helps us grow 🌱.
Description:
Rackspace is building its Professional Services Center of Excellence on Application Performance Monitoring Suites.
The role involves solving complex business problems and contributing to the development of next-generation modern applications for customers.
The position focuses on helping customers understand the connections between application performance, user experience, and business outcomes.
Responsibilities include working with customers to implement Observability solutions and building scalable systems with robust automation.
The engineer will develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
Proactive analysis of metric and log data from systems and applications is required for anomaly detection, performance tuning, capacity planning, and fault isolation.
Collaboration with development teams is essential to implement and deploy new features and enhancements while ensuring reliability, security, and performance standards.
The engineer will document and share solutions and maintain a deep understanding of the customer’s business and technical environment.
Identifying performance bottlenecks and resolving root causes of service issues is a key responsibility.
Requirements:
A Bachelor’s degree in engineering/computer science or equivalent is required.
Candidates must have senior-level experience with Site Reliability Engineering, DevOps, and code-level application support and troubleshooting.
Experience in AWS Infrastructure design, implementation, and optimization is necessary.
Proficiency with observability solutions tools like Splunk, Datadog, and SignalFx is required.
Experience in deploying, maintaining, and supporting software applications/services in the AWS ecosystem is essential.
A proactive approach to identifying problems and solutions is expected.
Candidates should have experience writing code in one or more interpreted languages such as Python, PHP, Perl, Ruby, or Linux Shell.
Familiarity with Terraform or Cloud Formation scripting is required.
Experience with configuration management tools like Ansible, Chef, or Puppet is necessary.
Knowledge of standard software development best practices and tools, such as code repositories (Git preferred), is required.
Experience executing in an agile software development environment is essential.
A good understanding of pricing/cost models across AWS services, especially compute, storage, and database offerings, is necessary.
Candidates should have a clear understanding of network and system management solutions.
Excellent organizational, project management, communication, critical thinking, and analytical skills are required.
Benefits:
Rackspace Technology is recognized as a best place to work by Fortune, Forbes, and Glassdoor.
The company offers a commitment to equal employment opportunity without regard to various legally protected characteristics.
Rackspace fosters a culture that embraces unique perspectives to fuel innovation and better serve customers and communities.
Employees are encouraged to bring their whole selves to work and thrive through connection to a central goal.
The company is committed to accommodating individuals with disabilities or special needs.
Apply now
Please, let Rackspace know you found this job
on RemoteYeah
.
This helps us grow 🌱.