Remote Infrastructure Operations Engineer (GPU Computing) - Enterprise AI
Posted
Apply now
Please, let Aethir know you found this job
on RemoteYeah.
This helps us grow π±.
Description:
Aethir, a pioneering technology company, is looking for an Infrastructure Operations Engineer specializing in GPU-based compute infrastructure for diverse industries like AI and high-performance computing.
The role involves managing and optimizing GPU-based compute infrastructure across multiple locations and partners to ensure maximum performance, scalability, and reliability.
Responsibilities include deploying, configuring, and maintaining infrastructure, monitoring and optimizing performance, developing automation scripts, ensuring security and compliance, providing incident response, capacity planning, and knowledge sharing.
Requirements:
Experience in infrastructure operations, preferably in a DevOps, SRE, Sales Engineering, or Solution Architect role focused on GPU compute.
Proficiency in managing GPU-based compute infrastructure, including NVIDIA GPUs and CUDA programming.
Strong expertise in Linux system administration, shell scripting, configuration management tools, and version control systems.
Familiarity with containerization, orchestration technologies, networking concepts, and troubleshooting techniques.
Excellent analytical and problem-solving skills, effective communication, collaboration abilities, and knowledge of cloud computing platforms.
Understanding of HPC frameworks, GPU-accelerated libraries, cybersecurity principles, and bonus points for knowledge of Web3.
Benefits:
Competitive compensation structure with flexibility on fiat/token mix.
Flexible benefits, salary, work hours, and remote work options based on location and setup.
Apply now
Please, let Aethir know you found this job
on RemoteYeah
.
This helps us grow π±.