Please, let Tech Holding know you found this job
on RemoteYeah.
This helps us grow 🌱.
Description:
The System Reliability Engineer will be responsible for managing Linux and Windows environments, automating processes, and implementing robust monitoring and security practices to ensure high availability and performance across client systems.
Responsibilities include system updates, patches, security configurations, monitoring system performance, analyzing metrics, creating dashboards, developing automation scripts, enforcing security best practices, conducting vulnerability assessments, collaborating with teams, providing technical support, and identifying process improvements.
The role involves working with stakeholders, maintaining documentation, and enhancing system reliability and performance through continuous improvements.
Requirements:
Proficiency in managing and troubleshooting Linux (e.g., Amazon Linux, CentOS) and Windows Server operating systems.
Experience with system configuration, management, maintenance, and automation tools like Ansible, Puppet, or Chef.
Familiarity with monitoring solutions such as AWS CloudWatch, Dynatrace, Datadog, or similar tools.
Ability to analyze system performance metrics, implement optimizations, and conduct patch management and vulnerability assessments.
Proficiency in scripting languages (Bash, Python, PowerShell), version control systems (Git), AWS management (EC2 instances, lambdas, containers), incident response, and infrastructure as code tools (Terraform, AWS CloudFormation).
Benefits:
Remote work opportunities are available.
Flexible work hours are offered for a better work-life balance.
Apply now
Please, let Tech Holding know you found this job
on RemoteYeah
.
This helps us grow 🌱.