Stability logoStability is hiring a

Machine Learning Ops Engineer, RLHF

Full-Time
Worldwide
Login to Apply →See all Jobs on Stability eye icon

Please let Stability know you found this job on Remote3. It helps us get more jobs on our site. Thanks & All the best!

Important: For your security, please only use well-known video meeting platforms like Google Meet or Zoom. Never download unfamiliar software or share sensitive information like wallet addresses or ENS names with recruiters. Doing so might compromise your crypto wallet. If you encounter anything suspicious, please report it immediately to us on Twitter.

Posted on: May 2, 2023

About Stability: 

Stability AI is a community and mission driven, open-source artificial intelligence company that cares deeply about real-world implications and applications. Our most considerable advances grow from our diversity in working across multiple teams and disciplines. We are unafraid to go against established norms and explore creativity. We are motivated to generate breakthrough ideas and convert them into tangible solutions. Our vibrant communities consist of experts, leaders and partners across the globe who are developing cutting-edge open AI models for Image, Language, Audio, Video, 3D and Biology.

About the role: 

We are seeking an experienced Machine Learning Operations Engineer to join our team and help drive our Reinforcement Learning from Human Feedback (RLHF) efforts. The successful candidate will report directly to the Research Director and will be responsible for developing and iterating large-scale data collection proposals, coordinating with business development personnel, managing the implementation of RLHF projects, and documenting data quality standards.

Responsibilities:  

  • Develop and iterate large-scale data collection proposals to support RLHF projects.
  • Coordinate with business development personnel to draft proposals for partnerships and collaborations.
  • Manage the implementation of RLHF projects, including data collection, model training, and evaluation.
  • Monitor and optimize the performance of RLHF models.
  • Collaborate with researchers, engineers, and other stakeholders to ensure project success.
  • Document all kinds of data received and data quality standards.
  • Stay current with developments in the field of machine learning operations and apply best practices to our RLHF efforts. Familiar with recent literature in RLHF and RLAIF.

Qualifications: 

  • Bachelor's or Master's degree in Computer Science, Mathematics, Statistics, or a related field.
  • 5+ years of experience in machine learning operations.
  • 1+ year of experience in RLHF, or relevant experience.
  • Experience managing large-scale data collection efforts.
  • Excellent communication skills and ability to collaborate effectively with researchers, engineers, and other stakeholders.

Equal Employment Opportunity:

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.
Login to Apply →See all Jobs on Stability eye icon

Please let Stability know you found this job on Remote3. It helps us get more jobs on our site. Thanks & All the best!

Important: For your security, please only use well-known video meeting platforms like Google Meet or Zoom. Never download unfamiliar software or share sensitive information like wallet addresses or ENS names with recruiters. Doing so might compromise your crypto wallet. If you encounter anything suspicious, please report it immediately to us on Twitter.

Posted on: May 2, 2023