Scientist Jobs in Web3

421 jobs found

web3.career is now part of the Bondex Ecosystem

Company         Location                            Salary
OpenAI          San Francisco, CA, United States    $90k - $90k
Binance         Dubai, United Arab Emirates         (not listed)
Immutable       Sydney, Australia                   $36k - $70k
Immutable       Sydney, Australia                   $90k - $180k
Aptos           San Francisco, CA, United States    $45k - $75k
Shakepay        Montreal, Canada                    $98k - $110k
Ripple          London, United Kingdom              $98k - $165k
Coinmarketcap   Taipei, Taiwan                      $95k - $110k
SwissBorg       Remote                              $72k - $80k
Gemini          Remote                              $120k - $168k
TRM Labs        Remote                              $157k - $192k
Openmesh        Sydney, Australia                   $75k - $100k
Nethermind      Argentina                           $84k - $115k
BitGo           Bangalore, India                    $95k - $105k
FalconX         Remote                              $106k - $165k

Research Scientist, Post-Training - Core Algorithms

OpenAI
$90k - $90k estimated

This job is closed

About the Team

The Post-Training - Core Algorithms team is responsible for researching and developing the next generation of algorithms to power our reinforcement learning from human feedback (RLHF) stack. The algorithms we develop are used in the ChatGPT consumer product and the OpenAI API.

About the Role

As a Member of Technical Staff on our team, you will research and develop improvements to all components of our RLHF stack, including data collection, supervised fine-tuning, reward modeling, off- and on-policy learning, active learning, and evaluations. The ultimate test for our algorithms is how useful they are to our users, and we often deploy our algorithms into new ChatGPT models.

We’re looking for people who have an extensive background in reinforcement learning research, can iterate quickly, and are proficient at coding.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Come up with improvements to RLHF
  • Prototype and evaluate these ideas
  • Scale up your innovations to ChatGPT scale

You might thrive in this role if you:

  • Love being on the cutting edge of RL and language model research
  • Can iterate fast on lots of ideas
  • Like doing research that has real-world impact