Verita HR
Posted last month
✓ Verified company

RL Environments Engineer (Remote) @ Verita HR

Full-time employment (HPP)


Job description

About the position / project

Client: US-based start-up

Recruitment: phone screen with our recruiter + 2 online meetings with hiring managers

Remote work

Verita HR is an international company providing recruitment support within the #Fintech, #Finance and #Banking markets in EMEA. We connect the most innovative organizations with the best people in the market. We conduct systematic market research, which allows our Digital Teams to stay a step ahead of the competition.

About the company: US-based AI startup focused on building the next generation of training data for LLMs. The team partners with top AI labs to create realistic RL environments where models encounter research and engineering challenges, iterate, and learn from feedback, pushing AI closer to its full potential.

Project: Design and build reinforcement learning environments to teach LLMs advanced reasoning and modern ML concepts. Candidates will work on realistic feedback loops where models encounter research and engineering problems and iterate on solutions.

What's in it for you

  • Fully remote, flexible work schedule with some overlap with US time zones
  • Direct impact on how LLMs learn
  • Collaboration with top AI researchers and labs
  • Exposure to cutting-edge RL and ML projects

Responsibilities

  • Build and maintain RL/ML environments for LLM training

  • Implement robust, production-quality Python code (not just notebooks)
  • Deploy and run environments in Docker with focus on reliability and iteration speed
  • Analyze model performance and respond to feedback efficiently
  • Collaborate with research teams to translate papers and ideas into RL problems

Job offer details

  • Online recruitment
  • Start date: 2026-04-01
  • 6-month contract
  • Fully remote work
  • Fixed working hours

Work methodology

  • Agile management: Scrum, Agile

Benefits

  • Flat structure
  • Small teams
  • International projects

About the company

Verita HR is an employer actively advertising on Pracenadosah.cz.