Remote Site Reliability Engineer / Observability Engineer

Posted

Apply now
Please, let Rackspace know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • Rackspace is building its Professional Services Center of Excellence on Application Performance Monitoring Suites.
  • The role involves solving complex business problems and contributing to the development of next-generation modern applications for customers.
  • The position focuses on helping customers understand the connections between application performance, user experience, and business outcomes.
  • Responsibilities include working with customers to implement Observability solutions and building scalable systems with robust automation.
  • The engineer will develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
  • Proactive analysis of metric and log data from systems and applications is required for anomaly detection, performance tuning, capacity planning, and fault isolation.
  • Collaboration with development teams is essential to implement and deploy new features and enhancements while ensuring reliability, security, and performance standards.
  • The engineer will document and share solutions and maintain a deep understanding of the customer’s business and technical environment.
  • Identifying performance bottlenecks and resolving root causes of service issues is a key responsibility.

Requirements:

  • A Bachelor’s degree in engineering/computer science or equivalent is required.
  • Candidates must have senior-level experience with Site Reliability Engineering, DevOps, and code-level application support and troubleshooting.
  • Experience in AWS Infrastructure design, implementation, and optimization is necessary.
  • Proficiency with observability solutions tools like Splunk, Datadog, and SignalFx is required.
  • Experience in deploying, maintaining, and supporting software applications/services in the AWS ecosystem is essential.
  • A proactive approach to identifying problems and solutions is expected.
  • Candidates should have experience writing code in one or more interpreted languages such as Python, PHP, Perl, Ruby, or Linux Shell.
  • Familiarity with Terraform or Cloud Formation scripting is required.
  • Experience with configuration management tools like Ansible, Chef, or Puppet is necessary.
  • Knowledge of standard software development best practices and tools, such as code repositories (Git preferred), is required.
  • Experience executing in an agile software development environment is essential.
  • A good understanding of pricing/cost models across AWS services, especially compute, storage, and database offerings, is necessary.
  • Candidates should have a clear understanding of network and system management solutions.
  • Excellent organizational, project management, communication, critical thinking, and analytical skills are required.

Benefits:

  • Rackspace Technology is recognized as a best place to work by Fortune, Forbes, and Glassdoor.
  • The company offers a commitment to equal employment opportunity without regard to various legally protected characteristics.
  • Rackspace fosters a culture that embraces unique perspectives to fuel innovation and better serve customers and communities.
  • Employees are encouraged to bring their whole selves to work and thrive through connection to a central goal.
  • The company is committed to accommodating individuals with disabilities or special needs.
Apply now
Please, let Rackspace know you found this job on RemoteYeah . This helps us grow 🌱.
About the job
Posted on
Job type
Salary
-
Experience level
Report this job

Job expired or something else is wrong with this job?

Report this job
Leave a feedback