Please let Mistral AI know you found this job on RemoteYeah. This helps us get more companies to post jobs here for you.
Description:
Mistral AI is seeking highly experienced Site Reliability Engineers (SRE) to enhance the reliability, scalability, and performance of their platform and customer-facing applications.
The role involves balancing day-to-day operations on production systems with long-term software engineering improvements to reduce operational toil.
Responsibilities include designing, building, and maintaining scalable infrastructures, ensuring high availability of platforms, operating systems, troubleshooting production issues, and implementing monitoring and incident response systems.
The position also requires driving continuous improvement in infrastructure automation, collaborating with AI/ML researchers, and documenting processes for knowledge sharing.
The role is based in Paris or London and reports to the Head of Engineering.
Requirements:
A Masterโs degree in Computer Science, Engineering, or a related field is required.
Candidates must have 7+ years of experience in a DevOps/SRE role.
Strong experience with cloud computing and highly available distributed systems is essential.
Exposure to site reliability issues in critical environments, including issue root cause analysis and on-call rotations, is necessary.
Experience working against reliability KPIs such as observability, alerting, and SLAs is required.
Hands-on experience with CI/CD, containerization, and orchestration tools like Docker and Kubernetes is needed.
Knowledge of monitoring, logging, alerting, and observability tools such as Prometheus and Grafana is important.
Familiarity with infrastructure-as-code tools like Terraform or CloudFormation is required.
Proficiency in scripting languages (Python, Go, Bash) and knowledge of software development best practices is necessary.
A strong understanding of networking, security, and system administration concepts is essential.
Excellent problem-solving and communication skills are required, along with the ability to work well in a fast-paced startup environment.
Additional experience in an AI/ML environment and high-performance computing systems is a plus.
Benefits:
The position offers a competitive salary and equity.
Health insurance is provided to all employees.
A transportation allowance and sport allowance are included.
Meal vouchers are offered as part of the benefits package.
Employees have access to a private pension plan.
A generous parental leave policy is in place for new parents.
Visa sponsorship is available for eligible candidates.