Nagarro is a rapidly scaling Digital Product Engineering company that builds products, services, and experiences that inspire and delight.
The company has over 17,500 experts across 39 countries and promotes a dynamic, non-hierarchical work culture.
The Senior Staff Engineer, Generative AI role calls for 10+ years of total experience.
The role requires strong experience with LLMs (such as LLaMA and DeepSeek) and an understanding of RAG pipelines.
Candidates must have hands-on experience in Python, Linux, and Shell scripting.
Experience with frameworks like OpenCV, PyTorch, YOLO, or TensorFlow is essential.
Familiarity with LLM inference engines like Ollama, vLLM, and llama.cpp is required.
A solid knowledge of model conversion and deployment is necessary.
The position involves working on AI Agents, LangChain, and retrieval-augmented generation (RAG).
Hands-on experience with Docker and Docker Compose, including their integration into DevOps pipelines, is expected.
Understanding of embedded platforms (such as Jetson, NXP, Qualcomm) and Yocto builds is important.
Experience in model optimization techniques (like quantization and pruning) is required.
A good grasp of CUDA kernels and GPU computing for acceleration is necessary.
Excellent communication skills and the ability to collaborate effectively with cross-functional teams are essential.
Requirements:
Candidates must have a bachelor’s or master’s degree in Computer Science, Information Technology, or a related field.
At least 10 years of total experience in relevant fields is required.
Strong experience with LLMs and an understanding of RAG pipelines are necessary.
Hands-on experience in Python, Linux, and Shell scripting is essential.
Experience with OpenCV, PyTorch, YOLO, or TensorFlow frameworks is required.
Familiarity with LLM inference engines like Ollama, vLLM, and llama.cpp is expected.
Solid knowledge of model conversion and deployment is necessary.
Experience working on AI Agents, LangChain, and retrieval-augmented generation (RAG) is required.
Hands-on experience with Docker and Docker Compose, along with integration into DevOps pipelines, is essential.
Understanding of embedded platforms and Yocto builds is important.
Experience in model optimization techniques is required.
A good grasp of CUDA kernels and GPU computing for acceleration is necessary.
Excellent communication skills and the ability to collaborate effectively with cross-functional teams are essential.
Benefits:
Nagarro offers a dynamic and non-hierarchical work culture that encourages collaboration and innovation.
Employees have the opportunity to work with a diverse team of experts from around the world.
The company provides a platform for personal and professional growth in the field of digital product engineering.
Employees can expect to be involved in exciting projects that inspire and delight clients.
Nagarro promotes a mindset of continuous improvement and supports its employees in addressing challenges effectively.