Synthesia Logo

Synthesia

Senior Machine Learning Engineer - AI & GPU Performance

Reposted 18 Days Ago
Be an Early Applicant
In-Office
6 Locations
Mid level
In-Office
6 Locations
Mid level
As a ML Performance Engineer, optimize model training and inference pipelines, develop efficient solutions, and promote best practices in computing resources.
The summary above was generated by AI
Who are we?

From your everyday PowerPoint presentations to Hollywood movies, AI will transform the way we create and consume content.

Today, people want to watch and listen, not read — both at home and at work. If you’re reading this and nodding, check out our brand video.

Despite the clear preference for video, communication and knowledge sharing in the business environment are still dominated by text, largely because high-quality video production remains complex and challenging to scale—until now….

Meet Synthesia

We're on a mission to make video easy for everyone. Born in an AI lab, our AI video communications platform simplifies the entire video production process, making it easy for everyone, regardless of skill level, to create, collaborate, and share high-quality videos. Whether it's for delivering essential training to employees and customers or marketing products and services, Synthesia enables large organizations to communicate and share knowledge through video quickly and efficiently. We’re trusted by leading brands such as Heineken, Zoom, Xerox, McDonald’s and more. Read stories from happy customers and what 1,200+ people say on G2.

In February 2024, G2 named us as the fastest growing company in the world. Today, we're at a $2.1bn valuation and we recently raised our Series D. This brings our total funding to over $330M from top-tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook.

About the role

As a ML Performance Engineer in the AI & GPU Performance team you will contribute to the design and development of high performance solutions. You will join a team of 40+ Researchers and Engineers within the R&D Department working on cutting edge challenges in the Generative AI space, with a focus on creating highly realistic, emotional and life-like Synthetic humans through text-to-video. Within the team you’ll have the opportunity to work on the applied side of our research efforts and directly impact our solutions that are used worldwide by over 60,000 businesses.
This is an opportunity to work for a company that is impacting businesses at a rapid pace across the globe.

What will you be doing?

As a ML Performance Engineer in the AI & GPU Performance team you will contribute to the design and development of high performance solutions. You will own one or more projects for computationally optimizing large-scale model training and inference pipelines. By partnering with researchers and research teams you’ll identify high-impact initiatives and push the boundaries of model performance. You will work on re-implementing models in an efficient manner by using PyTorch and underlying technologies like CUDA/Triton, Torch compilation, etc.

This would include:
  • Evaluating, profiling and optimising compute resource usage (e.g., Hopper & Blackwell GPUs) for cost and time efficiency at training and inference times
  • Developing customized efficient solutions for inference pipelines (CUDA/Triton kernels) as well as Introducing or enhancing tooling for achieving optimal computational performance (e.g. DL compilers, ONNX, TensorRT)
  • Driving the adoption of best practices for large-model training, including checkpointing, gradient accumulation, and memory optimisation among others
  • Introducing or enhancing tooling for distributed training, performance monitoring, and logging (e.g., DeepSpeed, PyTorch Distributed)
  • Designing and implement techniques for model parallelism, data parallelism, and mixed-precision training
  • Keeping updated on the latest research in model compression (e.g., quantization, pruning) and advanced optimisation methods
Who are you?
  • You are a ML engineer passionate about high performance computing
  • You have a background in Computer Science / Engineering  and 3+ years of industry experience. (PhD preferred)
  • You have worked on optimising large models for over 2 years
  • You have experience developing CUDA/Triton kernels and optimizing models with DL compilers (torch.compile)
  • You have great coding skills in Python and C++ and you care about writing clean, and efficient code
  • You have experience with optimising distributed systems and distributed tools like DDP, Deepspeed, Accelerate or similar
  • You have some experience in the video space (Diffusion models / GAN’s)
  • You are interested in doing research, trying new things and pushing the boundaries, going beyond what's already known
The good stuff...
  • Attractive compensation (salary + stock options + bonus)
  • Private Health Insurance in London
  • Hybrid work setting with an office in London
  • 25 days of annual leave + public holidays
  • Work in a great company culture with the option to join regular planning and socials at our hubs.
  • A generous referral scheme when you know people that are amazing for us
  • Strong opportunities for your career growth

You can see more about Who we are and How we work here: https://www.synthesia.io/careers
#LI-MD1


Top Skills

AI
C++
Cuda
Gpu
Onnx
Python
PyTorch
Tensorrt
Triton

Similar Jobs

10 Hours Ago
In-Office
2 Locations
Senior level
Senior level
eCommerce • Mobile
As a Marketing Manager, you will drive user acquisition and retention in the DACH market through campaigns, influencer partnerships, and events, while measuring performance and adapting strategies.
Top Skills: SQL
10 Hours Ago
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Software
The International Renewals Manager will lead a team to optimize customer retention, manage renewal strategies, and drive revenue forecasts. Collaborating cross-functionally and using product data to improve renewal processes are key responsibilities.
Top Skills: GainsightSalesforce
10 Hours Ago
Hybrid
5 Locations
Senior level
Senior level
Cloud • Enterprise Web • Software • Analytics • Design
The Solutions Architect will implement Altium's Enterprise solutions, build strong customer relationships, provide technical guidance, and create customized solutions to meet customer needs.
Top Skills: Ecad SoftwarePlm Integrations

What you need to know about the Edinburgh Tech Scene

From traditional pubs and centuries-old universities to sleek shopping malls and glass-paneled office buildings, Edinburgh's architecture reflects its unique blend of history and modernity. But the fusion of past and future isn't just visible in its buildings; it's also shaping the city's economy. Named the United Kingdom's leading technology ecosystem outside of London, Edinburgh plays host to major global companies like Apple and Adobe, as well as a growing number of innovative startups in fields like cybersecurity, finance and healthcare.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account