Please, let G2i Inc. know you found this job
on RemoteYeah.
This helps us grow π±.
Description:
The Software Engineer, AI position focuses on training large-language models (LLMs) to write production-grade code across various programming languages.
Responsibilities include comparing and ranking multiple code snippets, explaining which is best and why.
The role involves repairing and refactoring AI-generated code for correctness, efficiency, and style.
The engineer will inject feedback (ratings, edits, test results) into the RLHF pipeline to ensure it runs smoothly.
The end result is that the model learns to propose, critique, and improve code in a manner similar to the engineer's approach.
The RLHF process consists of generating code, having expert engineers rank and edit it, justifying their choices, and converting that feedback into reward signals for reinforcement learning.
Requirements:
Candidates must have 4+ years of professional software engineering experience in one or more of the following languages: Python, Java, JavaScript, TypeScript, Go, C++, PHP, COBOL, C, Ruby, or Rust. Constraint programming experience is a bonus but not required for all languages.
Strong code-review instincts are essential; candidates should be able to quickly spot logic errors, performance traps, and security issues.
Extreme attention to detail and excellent written communication skills are required, as much of the role involves explaining why one approach is better than another.
Candidates should enjoy reading documentation and language specifications and thrive in an asynchronous, low-oversight environment.
No prior RLHF or AI training experience is needed, nor is deep machine learning knowledge; the ability to review and critique code clearly is sufficient.
Benefits:
The position is fully remote, allowing candidates to work from anywhere.
Compensation is up to $30 per hour.
The role offers flexible hours, with a minimum of 15 hours per week and up to 40 hours per week available.
Engagement is structured as a 1099 contract, providing straightforward impact without unnecessary complexity.
Apply now
Please, let G2i Inc. know you found this job
on RemoteYeah
.
This helps us grow π±.