JetBrains Logo

JetBrains

Research Engineer (Agentic Models)

Posted 13 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Germany
Mid level
In-Office or Remote
Hiring Remotely in Germany
Mid level
Develop and maintain SFT and RL post-training pipelines for multi-step coding agents, train and adapt LLMs for agent workflows, build evaluation and simulation environments, design metrics and evaluation frameworks, analyze results to improve models and datasets, and collaborate with research, product, and infra teams to ship models into JetBrains IDEs.
The summary above was generated by AI

At JetBrains, code is our passion. Ever since we started, back in 2000, we’ve been striving to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in our IDEs.

We’re building multi-step coding agents that can understand large codebases, plan changes, call tools, and iterate with the user. As a Research Engineer in the Agentic Models team, you’ll be responsible for the models, training loops, and evaluation pipelines that power these agents.

You’ll work at the intersection of SFT and RL-style post-training, and product-driven evaluation, using our distributed GPU and MapReduce clusters to ship models into JetBrains products.

As part of our team, you will:
  • Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
  • Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
  • Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.
  • Design evaluation frameworks and metrics for agent behavior, analyze traces and logs, and close the loop from evaluation back into training, data, and reward design.
  • Analyze training and evaluation results to propose and implement improvements to model architectures, training recipes, and datasets.
  • Work with large-scale infrastructure, including distributed training on GPU clusters and large MapReduce-style data processing for pre-training and fine-tuning datasets.
  • Collaborate closely with research, product, and infrastructure teams to turn high-level product visions into concrete models, experiments, and shipped features. 
We’ll be happy to bring you on board if you have:
  • Extensive hands-on experience training LLMs (pre-training, fine-tuning, or post-training) in a research or production setting.
  • Deep expertise in modern deep learning frameworks such as PyTorch, and specialized LLM training stacks (e.g. Megatron, NeMo, verl, or similar).
  • Strong theoretical and practical understanding of LLM fundamentals: architectures, tokenization, data pipelines, batching, mixed precision, distributed training, and debugging unstable runs.
  • The ability to own projects end to end, starting from a high-level problem or product pain point and overseeing it through the design, experimentation, implementation, and iteration phases.
  • A product-aware mindset – you care about how developers actually use agents and can translate product needs and failure modes into modeling and evaluation work.
  • At least 3 years of Python experience writing clean, maintainable code in modern ML codebases.
Our ideal candidate would have experience with:
  • ML orchestrators and workflow tools such as Kubeflow, Dagster, Airflow, ZenML, and/or job schedulers like Kubernetes or SLURM.
  • Large-scale data and training pipelines, e.g. MapReduce-style clusters, multi-node GPU training, or workloads on the order of 1M+ CPU/GPU hours.
  • Designing and maintaining evaluation pipelines for LLMs or agents, including metrics, dashboards, experiment tracking, and automated regression checks.
  • AI agent development, such as tool-using agents, planners, or multi-step coding workflows, and familiarity with agentic frameworks or patterns.
  • Experiment tracking and observability using tools like Weights & Biases, MLflow, Langfuse, or similar.
  • Inference optimization and serving optimized models in production.

#LI-KP1

We are an equal opportunity employer
We know great ideas can come from anyone, anywhere. That’s why we do our best to create an open and inclusive workplace – one that welcomes everyone regardless of their background, identity, religion, age, accessibility needs, or orientation.

We process the data provided in your job application in accordance with the Recruitment Privacy Policy.

Similar Jobs

13 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Fintech • Greentech • Sales • Software • Travel • Hospitality
This role involves managing VIP corporate travel, providing elite service, handling disruptions, and maintaining performance metrics in a contact center environment.
Top Skills: AmadeusGalileoGdsSabre
13 Hours Ago
Remote or Hybrid
Edinburgh, City of Edinburgh, Scotland, GBR
Senior level
Senior level
Artificial Intelligence • Fintech • Greentech • Sales • Software • Travel • Hospitality
Manage high-touch Executive and VIP travel with a focus on premium service, complex itinerary management, and fostering long-term client relationships while adhering to contact centre performance metrics.
Top Skills: AmadeusContact Centre OperationsGalileoGdsSabre
13 Hours Ago
Remote or Hybrid
United Kingdom
Mid level
Mid level
Healthtech • Machine Learning • Software • Biotech • Pharmaceutical
Manage Schrodinger Platform deployment projects, ensuring customer success and aiding product development. Requires experience in drug discovery and project management.
Top Skills: CheminformaticsComputational ChemistryComputational MethodsSchrodinger Platform

What you need to know about the Edinburgh Tech Scene

From traditional pubs and centuries-old universities to sleek shopping malls and glass-paneled office buildings, Edinburgh's architecture reflects its unique blend of history and modernity. But the fusion of past and future isn't just visible in its buildings; it's also shaping the city's economy. Named the United Kingdom's leading technology ecosystem outside of London, Edinburgh plays host to major global companies like Apple and Adobe, as well as a growing number of innovative startups in fields like cybersecurity, finance and healthcare.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account