Rackspace is seeking a Site Reliability Engineer / Observability Engineer for their Professional Services Center of Excellence focused on Application Performance Monitoring Suites.
The role involves solving complex business problems and building modern applications that enhance customer experiences by understanding the connections between application performance, user experience, and business outcomes.
Responsibilities include working with customers to implement Observability solutions, building and maintaining scalable systems, developing monitoring tools, and collaborating with development teams to ensure reliability and performance standards.
The engineer will proactively analyze metric and log data for anomaly detection, performance tuning, and fault isolation, while maintaining a deep understanding of the customer's business and technical environment.
The position is full-time and remote, with a focus on using tools like Datadog, New Relic, AppDynamics, or Dynatrace.
Requirements:
A Bachelor’s degree in engineering/computer science or equivalent is required.
Candidates must have senior-level experience in Site Reliability Engineering, DevOps, and code-level application support and troubleshooting.
Experience with AWS infrastructure design, implementation, and optimization is necessary, along with automation for deployment, scaling, and reliability.
Proficiency in observability tools such as Splunk, Datadog, or SignalFx is required.
Candidates should have experience deploying and maintaining software applications/services in the AWS ecosystem.
A proactive approach to problem identification and solution development is essential.
Experience in coding with interpreted languages like Python, PHP, Perl, Ruby, or Linux Shell is required.
Familiarity with Terraform or Cloud Formation scripting and configuration management tools like Ansible, Chef, or Puppet is necessary.
Knowledge of software development best practices and tools, particularly Git, is required.
Experience in an agile software development environment is preferred.
A good understanding of AWS pricing/cost models and network & system management solutions is necessary.
Excellent organizational, project management, communication, critical thinking, and analytical skills are essential.
Benefits:
Rackspace offers a collaborative work environment that values unique perspectives and innovation.
The company is recognized as a best place to work by Fortune, Forbes, and Glassdoor, attracting world-class talent.
Rackspace is committed to equal employment opportunities and provides accommodations for individuals with disabilities or special needs.
Employees are encouraged to bring their whole selves to work and contribute to a mission-driven team.