Remote LB - Cloud Infrastructure Engineer - 0040

at Thaloz

Posted 1 day ago 3 applied

Description:

  • We are seeking a highly skilled and experienced Senior Cloud Infrastructure Engineer to join our dynamic team.
  • This role is critical to designing, implementing, and managing scalable, reliable, and secure cloud infrastructure on Amazon Web Services (AWS).
  • The ideal candidate will play a pivotal role in enabling our organization to leverage cloud technologies effectively, ensuring robust infrastructure that supports our business goals.
  • This position requires a deep understanding of cloud architecture, infrastructure as code, monitoring, security, and continuous integration/continuous deployment (CI/CD) pipelines.
  • The successful candidate will collaborate closely with cross-functional teams to deliver innovative solutions that drive operational excellence and business growth.
  • Responsibilities include designing, deploying, and maintaining scalable and secure cloud infrastructure on AWS, developing and managing infrastructure as code (IaC) using Terraform and AWS CloudFormation, and implementing comprehensive monitoring and alerting solutions using AWS CloudWatch, Prometheus, and Grafana.
  • The role also involves managing logging and auditing systems with AWS CloudTrail and the ELK stack, collaborating with various teams to define infrastructure requirements, and building and maintaining CI/CD pipelines.
  • Ensuring adherence to security best practices and compliance standards is essential, along with providing technical leadership and mentorship to junior engineers.
  • Participation in on-call rotations to respond to infrastructure incidents and continuous research and adoption of new cloud technologies are also key aspects of the role.

Requirements:

  • Extensive experience designing, deploying, and managing cloud infrastructure on AWS, including core services such as EC2, S3, VPC, IAM, RDS, and Lambda.
  • Proficient in writing, testing, and maintaining infrastructure as code using Terraform to automate cloud resource provisioning and management.
  • Skilled in using AWS CloudFormation templates for infrastructure automation and orchestration.
  • Expertise in setting up monitoring, logging, and alerting using AWS CloudWatch to track system metrics and respond to operational issues.
  • Experience implementing Prometheus for monitoring containerized and microservices environments, including custom metrics collection.
  • Ability to create and maintain Grafana dashboards for visualizing metrics and logs to support operational decision-making.
  • Familiarity with Datadog for cloud infrastructure monitoring, log management, and alerting.
  • Knowledge of AWS CloudTrail for auditing and tracking API activity to ensure security and compliance.
  • Experience managing centralized logging solutions using the ELK stack to aggregate, analyze, and visualize logs.
  • Hands-on experience designing and maintaining CI/CD pipelines using GitHub Actions, Jenkins, or GitLab CI/CD.
  • Strong understanding of cloud networking concepts including VPC design, subnets, routing, VPN, security groups, and load balancing.
  • Deep knowledge of cloud security principles such as least privilege access, encryption, key management, vulnerability management, and incident response.
  • Familiarity with compliance frameworks and standards relevant to cloud infrastructure.
  • Excellent analytical and troubleshooting skills to diagnose and resolve complex infrastructure issues.
  • Strong verbal and written communication skills to effectively collaborate with technical and non-technical stakeholders.
  • Proven ability to work cross-functionally with engineering, security, and operations teams to deliver integrated solutions.
  • Willingness to participate in on-call rotations to provide timely response to infrastructure incidents.

Benefits:

  • Opportunity to work in a dynamic and innovative environment that leverages cutting-edge cloud technologies.
  • The role offers the chance to collaborate with cross-functional teams and contribute to significant business growth.
  • You will have the opportunity to provide technical leadership and mentorship to junior engineers.
  • Participation in on-call rotations allows for hands-on experience in incident response and infrastructure management.
  • Continuous research and adoption of new cloud technologies will enhance your professional development and expertise.
  • The position may offer competitive compensation and benefits packages, including potential for professional certifications and training.