Welcome to RemoteYeah 2.0! Find out more about the new version here.

Remote Senior Staff Engineer, Generative AI

at Nagarro

Posted 1 day ago 0 applied

Description:

  • Nagarro is a Digital Product Engineering company that is rapidly scaling and focuses on building products, services, and experiences that inspire and delight.
  • The company has over 17,500 experts across 39 countries and promotes a dynamic, non-hierarchical work culture.
  • The Senior Staff Engineer, Generative AI will have a total experience of 10+ years.
  • The role requires strong experience with LLMs (such as LLaMA and DeepSeek) and an understanding of RAG pipelines.
  • Candidates must have hands-on experience in Python, Linux, and Shell scripting.
  • Experience with frameworks like OpenCV, PyTorch, YOLO, or TensorFlow is essential.
  • Familiarity with LLM inference engines like Ollama, vLLM, and llama.cpp is required.
  • A solid knowledge of model conversion and deployment is necessary.
  • The position involves working on AI Agents, LangChain, and retrieval-augmented generation (RAG).
  • Hands-on experience with Docker, Docker Compose, and integration into DevOps pipelines is expected.
  • Understanding of embedded platforms (such as Jetson, NXP, Qualcomm) and Yocto builds is important.
  • Experience in model optimization techniques (like quantization and pruning) is required.
  • A good grasp of CUDA kernels and GPU computing for acceleration is necessary.
  • Excellent communication skills and the ability to collaborate effectively with cross-functional teams are essential.

Requirements:

  • Candidates must have a Bachelor’s or master’s degree in computer science, Information Technology, or a related field.
  • A total experience of 10+ years in relevant fields is required.
  • Strong experience with LLMs and understanding of RAG pipelines is necessary.
  • Hands-on experience in Python, Linux, and Shell scripting is essential.
  • Experience with OpenCV, PyTorch, YOLO, or TensorFlow frameworks is required.
  • Familiarity with LLM inference engines like Ollama, vLLM, and llama.cpp is expected.
  • Solid knowledge of model conversion and deployment is necessary.
  • Experience working on AI Agents, LangChain, and retrieval-augmented generation (RAG) is required.
  • Hands-on experience with Docker and Docker Compose, along with integration into DevOps pipelines, is essential.
  • Understanding of embedded platforms and Yocto builds is important.
  • Experience in model optimization techniques is required.
  • A good grasp of CUDA kernels and GPU computing for acceleration is necessary.
  • Excellent communication skills and the ability to collaborate effectively with cross-functional teams are essential.

Benefits:

  • Nagarro offers a dynamic and non-hierarchical work culture that encourages collaboration and innovation.
  • Employees have the opportunity to work with a diverse team of experts from around the world.
  • The company provides a platform for personal and professional growth in the field of digital product engineering.
  • Employees can expect to be involved in exciting projects that inspire and delight clients.
  • Nagarro promotes a mindset of continuous improvement and supports its employees in addressing challenges effectively.