As an AI Vision Engineer at EverAI, you will help drive the development of visual intelligence capabilities that power conversational AI experiences.
You will work hands-on with cutting-edge generative models and computer vision techniques to evolve the image and video stack, improving quality, consistency, and user engagement.
This role is highly applied, focused on building and shipping production-grade features that enhance the creative potential of the platform.
Key responsibilities include collaborating closely with AI engineers, product managers, and content creators to define and execute visual AI features, fine-tuning diffusion-based models, improving existing image and video generation pipelines, and contributing to shared tools and internal libraries for dataset preparation and quality control.
Requirements:
A Master’s degree or higher in Computer Vision, Computer Science, Applied Mathematics, or a related technical field is required.
You must have 2+ years of experience performing full fine-tuning of diffusion-based models for image generation.
A proven track record of trained models being deployed in traffic-intensive production environments is necessary.
Strong software engineering skills are required, including writing clean, modular, maintainable code and familiarity with version control, testing, and team workflows.
Proficiency in Python and experience with tools and libraries such as PyTorch, Hugging Face Diffusers, Pillow, OpenCV, ComfyUI, or Automatic1111 is essential.
Experience working with cloud-based environments (e.g., GCP, AWS) for training, experimentation, and data processing is needed.
Strong communication and collaborative skills are required, with fluency in English.
A goal-oriented mindset, ownership, and commitment are essential.
A doer mindset is necessary, as the company moves fast and requires a balance between executing, planning, and strategy.
You should be humble, willing to learn, and open to feedback.
Comfort with building products based on uncensored models and content is required.
Bonus points for a background in designing intuitive tooling for content creators or technical teams and a portfolio of visual AI projects or contributions to open-source models or tools.
Benefits:
The position offers a preference for a B2B contract, with flexibility for long-term commitment.
It is a fully remote position, allowing you to work from the place that suits you best.
You will receive 4 weeks of paid time off (PTO).
An annual gathering is provided to foster team connections.
A wellbeing budget of up to $200 is available.
A learning budget is offered to support your professional development.
A company laptop will be provided for your work.
Access to GPT-4, Mistral, and Hugging Face Pro plans is included.