Často hledané: prodavačka, řidič, sklad
P
pracenadosah.cz
PřihlásitVložit inzerát zdarma
DomůPráceSenior Machine Learning /AI Engineer (RL) @ Acaisoft
A
Acaisoft
Zveřejněno před 23 dny
✓ Ověřená firma

Senior Machine Learning /AI Engineer (RL) @ Acaisoft

HPP

O pozici / o projektu Původní popisek. You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.

📋

Popis pozice

O pozici / o projektu

Původní popisek. You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.

In this role, you will work on generating tasks in Reinforcement Learning environments. We create environments for producing training data that can be used to train models.

The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.

Due to the client’s time zone, we would appreciate a candidate who can work 2 p.m. - 10 p.m.

Join us and make a real impact!

If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.

Odpovědnosti

Původní popisek. • Design and implement RL environments that support large-scale agent evaluation and reinforcement learning experiments.

  • Build task generation pipelines, dynamic datasets, and scripted environments with controlled complexity and stochasticity.
  • Develop verifiers and reward models to automatically score trajectories and evaluate model reasoning.
  • Collaborate with infrastructure and systems engineers to ensure environments are scalable, reproducible, and instrumented for detailed telemetry.
  • Design APIs and orchestration frameworks for running, resetting, and evaluating agents across environments.
  • Optimize environment performance, logging, and reward reproducibility across distributed setups.

Detail pracovní nabídky

  • Online nábor
  • Ihned
  • Práce plně na dálku
  • Pevná pracovní doba
  • Především tvorba nových funkcí

Výhody

  • Sportovní balíček
  • Soukromá zdravotní péče
  • Plochá struktura
  • Malé týmy
  • Mezinárodní projekty
🏢

O firmě

Acaisoft je zaměstnavatel s aktivní inzercí v Pracenadosah.cz.