Lead Data Science Engineer

Dassault Systèmes
New York, NY

Location: New York, Hybrid

Medidata follows a hybrid office policy in which employees who are hired for an in-person position are expected to work on site a certain number of days per week in accordance with Company policy.

About our Company:

Medidata is powering smarter treatments and healthier people through digital solutions to support clinical trials. Celebrating 25 years of ground-breaking technological innovation across more than 36,000 trials and 11 million patients, Medidata offers industry-leading expertise, analytics-powered insights, and one of the largest clinical trial data sets in the industry. More than 1 million users trust Medidata's seamless, end-to-end platform to improve patient experiences, accelerate clinical breakthroughs, and bring therapies to market faster. Discover more at

Our Team:

Medidata is looking for individuals who will help us tackle some of the most complex questions facing the industry today using our proprietary platform and advanced analytics. At Medidata, we never work alone. This role will partner heavily with all of the key stakeholder functions including product, delivery, data science, engineering, partnerships, and biostatistics. Successful Medidata AI candidates will be skilled in analytical/quantitative thinking and structured communication, and excited about building the next horizon of Medidata's mission to power smarter treatments and healthier people. You will report to the Director, Data Engineering.

Responsibilities:

  • Apply advanced skills in data architecture, data science engineering, data modeling, and data quality using modern cloud-native technologies.
  • Develop ETL pipelines, working with vector databases, automation, and CI/CD using tools such as Python, SQL, and Git.
  • Develop LLM applications using Retrieval-Augmented Generation (RAG) and support fine-tuning for domain-specific tasks.
  • Analyze and manipulate both structured and unstructured data sources, ensuring high data quality and readiness for downstream consumers.
  • Document and communicate technical work clearly to stakeholders at all levels, both technical and non-technical.
  • Collaborate effectively in Agile environments and cross-functional teams, building secure, scalable data pipelines into Snowflake from both on-premise and cloud-based sources.

Qualifications:

  • Bachelor's degree in a technical or scientific field, such as Statistics, Data Science, Computer Science, or similar
  • 7+ years of experience in roles such as Data Scientist or Data Engineer with a strong foundation in Enterprise Data Architecture and Engineering
  • Hands-on experience with tools and concepts such as Airflow, CDC, batch processing, and job scheduling.
  • Hands-on experience with data curation, cleansing, and annotation to support model fine-tuning and evaluation workflows.
  • Experienced in building scalable, cloud-native data pipelines using tools and services like Streamlit, Snowflake and containerization platforms like Docker/Kubernetes.
  • Proficient in Git/GitHub, GitHub Actions for CI/CD, and managing infrastructure as code using Terraform
  • Experience with clinical trial data is not required, but an interest in learning how these data improve medical research is paramount
  • Hands-on experience building high-throughput data pipelines across cloud platforms and MCP server environments. Proficient in implementing RAG architectures, vector databases, and low-latency retrieval layers. Skilled at integrating AI/ML pipelines into production-grade data infrastructure.

The salary range posted below refers only to positions that will be physically based in New York City. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York City may differ based on the local market data in that region.

The salary range for this position physically based in the NYC/NJ Metro Area is $135,000-$180,000.

Base pay is one part of the Total Rewards that Medidata provides to compensate and recognize employees for their work. Most sales positions are eligible for a commission on the terms of applicable plan documents, and many of Medidata's non-sales positions are eligible for annual bonuses. Medidata believes that benefits should connect you to the support you need when it matters most and provides best-in-class benefits, including medical, dental, life and disability insurance; 401(k) matching; flexible paid time off; and 10 paid holidays per year.

Equal Employment Opportunity:

In order to provide equal employment and advancement opportunities to all individuals, employment decisions at Medidata are based on merit, qualifications and abilities. Medidata is committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants without regard to race, color, religion, gender, sex (including pregnancy, childbirth or medical or common conditions related to pregnancy or childbirth), sexual orientation, gender identity, gender expression, marital status, familial status, national origin, ancestry, age, disability, veteran status, military service, application for military service, genetic information, receipt of free medical care, or any other characteristic protected under applicable law. Medidata will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law.

Applications will be accepted on an ongoing basis until the position is filled.

Posted 2026-01-30
