Staff LLM Systems Engineer

New York, NY

Location: United States (West Coast preferred, remote considered)

About the Company

We are a rapidly growing AI company delivering large language models at scale. Our mission is to ensure models not only perform well in research but also serve real-world applications reliably and efficiently. We are looking for engineers who enjoy solving high-scale inference and systems challenges.

Role Overview

We are seeking a Senior / Staff LLM Systems Engineer to lead the development, optimization, and deployment of large language model inference pipelines. This role focuses on high-throughput, low-latency serving and production reliability, bridging ML research and platform engineering.

This is not a training-focused role – the emphasis is on serving models at scale, optimizing systems, and enabling production ML reliability .

Responsibilities

  • Design, implement, and optimize inference pipelines for large language models
  • Improve throughput and latency of model serving in production environments
  • Collaborate closely with infrastructure, platform, and ML research teams to ensure smooth deployment
  • Build monitoring, observability, and alerting systems for inference performance and reliability
  • Identify and solve scaling challenges across GPUs, TPUs, or distributed environments
  • Evaluate and adopt new technologies, frameworks, and architectures to improve inference efficiency
  • Mentor other engineers and contribute to technical strategy for production ML systems

Qualifications

  • 5+ years of software engineering experience, including hands-on ML systems experience
  • Strong background in distributed systems, performance tuning, and low-latency architectures
  • Experience with model serving frameworks (e.g., Triton, vLLM, Ray, TorchServe)
  • Familiarity with GPU/TPU infrastructure, multi-node deployment, and system-level optimization
  • Understanding of ML workloads and trade-offs between accuracy, latency, and cost
  • Proven ability to deliver production-grade ML systems at scale
  • Excellent collaboration and problem-solving skills

Why You’ll Enjoy This Role

  • Work on cutting-edge LLM inference systems at scale
  • Solve technically challenging, high-impact engineering problems
  • Collaborate with top ML researchers and platform engineers
  • Competitive compensation and flexible work arrangements

Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.

Reece Waldon

Posted 2026-01-02

Recommended Jobs

Regional Airline Sales GM — Growth & Strategy Leader

Etihad Airways
New York, NY

A leading international airline is seeking a Sales Manager to implement sales strategies and lead a team across multiple markets. This role requires a Bachelor's degree or seven years of relevant exp…

View Details
Posted 2026-01-15

Licensed Practical Nurse (LPN)

WellNow Urgent Care
Herkimer, NY

WellNow Urgent Care takes pride in creating an environment filled with meaningful work and opportunities by investing in our colleagues. We offer competitive salaries and a comprehensive benefits pac…

View Details
Posted 2026-01-12

Accounts Payable Manager

Abilities First
Poughkeepsie, NY

Full-time Description   Who We Are   For over 60 years, Abilities First, Inc. has been empowering individuals with developmental disabilities to live their most vibrant, independent liv…

View Details
Posted 2026-01-16

Product Marketing Associate

talentpluto
New York, NY

Location: New York, NY Work Model: Onsite (5 days per week in office, with limited flexibility) Industry: B2B SaaS / Education Technology Compensation: Base salary range of $85,000–$130,0…

View Details
Posted 2026-01-15

Academic Scholar - Clinical Pediatrician, Ambulatory, Division of General Pediatrics

State University of New York at Buffalo
New York, NY

Academic Scholar – Clinical Pediatrician, Ambulatory, Division of General Pediatrics Position Title Academic Scholar – Clinical Pediatrician, Ambulatory, Division of General Pediatrics The Dep…

View Details
Posted 2026-01-15

CREW MEMBER

Dunkin' Cafua Management Company
New York, NY

We are looking for a Crew Member to help us deliver our mission statement – “turning moments into memories for our guests, while providing opportunities to our employees, and giving back to the com…

View Details
Posted 2026-01-09

Manager, Technical Accounting - Insurance

Uber
New York, NY

About the Role As the Accounting Manager - Insurance, you will have the opportunity to lead an accounting team that is on the forefront of the insurance industry. You will be involved in managing …

View Details
Posted 2025-12-06

Account Manager - Fine Fragrance Creation

Givaudan SA
New York, NY

Join us and celebrate the beauty of human experience. Create for happier, healthier lives, with love for nature. Together, with our customers, we deliver food innovations, craft inspired fragrances a…

View Details
Posted 2026-01-12

Center Director (Liaison) Lead Coach

RECAP
Middletown, NY

RECAP is a leader in early childhood education, dedicated to providing high-quality Head Start programs that support the growth and development of children and families in our community. As a Center …

View Details
Posted 2025-12-25

Inventory Coordinator

Julie Vos
New York, NY

About the Job Julie Vos is a fashion jewelry brand created by CEO and designer Julie Vos. Since 2006, the brand has been guided by the belief that with inspiration and discipline, we create the be…

View Details
Posted 2026-01-15