Lead Data Science Engineer
Location: New York, Hybrid
Medidata follows a hybrid office policy in which employees who are hired for an in-person position are expected to work on site a certain number of days per week in accordance with Company policy.
About our Company:
Medidata is powering smarter treatments and healthier people through digital solutions to support clinical trials. Celebrating 25 years of ground-breaking technological innovation across more than 36,000 trials and 11 million patients, Medidata offers industry-leading expertise, analytics-powered insights, and one of the largest clinical trial data sets in the industry. More than 1 million users trust Medidata's seamless, end-to-end platform to improve patient experiences, accelerate clinical breakthroughs, and bring therapies to market faster. Discover more at
Our Team:
Medidata is looking for individuals who will help us tackle some of the most complex questions facing the industry today using our proprietary platform and advanced analytics. At Medidata, we never work alone. This role will partner heavily with all of the key stakeholder functions including product, delivery, data science, engineering, partnerships, and biostatistics. Successful Medidata AI candidates will be skilled in analytical/quantitative thinking, structured communication, and excited about building the next horizon of Medidata's mission to power smarter treatments and healthier people. You will be reporting to Director, Data Engineering.
Responsibilities:
- Apply advanced skills in data architecture, data science engineering, data modeling, and data quality using modern cloud-native technologies.
- Develop ETL pipelines, working with vector databases, automation, and CI/CD using tools such as Python, SQL, and Git.
- Develop LLM applications using Retrieval-Augmented Generation (RAG) and support fine-tuning for domain-specific tasks.
- Analyze and manipulate both structured and unstructured data sources, ensuring high data quality and readiness for downstream consumers.
- Document and communicate technical work clearly to stakeholders at all levels, both technical and non-technical.
- Collaborate effectively in Agile environments and cross-functional teams, building secure, scalable data pipelines into Snowflake from both on-premise and cloud-based sources.
Qualifications:
- Bachelor's degree in a technical or scientific field, such as Statistics, Data Science, Computer Science, or similar
- 7+ years of experience in roles such as Data Scientist or Data Engineer with a strong foundation in Enterprise Data Architecture and Engineering
- Hands-on experience with tools and concepts such as Airflow, CDC, batch processing, and job scheduling.
- Hands-on experience data curation, cleansing, and annotation to support model fine-tuning and evaluation workflows.
- Experienced in building scalable, cloud-native data pipelines using tools and services like Streamlit, Snowflake and containerization platforms like Docker/Kubernetes.
- Proficient in Git/GitHub, GitHub Actions for CI/CD, and managing infrastructure as code using Terraform
- Experience with clinical trial data is not required, but interest to learn and understand how these data improve medical research is paramount
- Hands-on experience building high-throughput data pipelines across cloud platforms and MCP server environments. Proficient in implementing RAG architectures, vector databases, and low-latency retrieval layers. Skilled at integrating AI/ML pipelines into production-grade data infrastructure.
The salary range posted below refers only to positions that will be physically based in New York City. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York City, may differ based on the local market data in that region.
The salary range range for this position physically based in NYC/ NJ Metro Area is $135,000-$180,000.
Base pay is one part of the Total Rewards that Medidata provides to compensate and recognize employees for their work. Most sales positions are eligible for a commission on the terms of applicable plan documents, and many of Medidata's non-sales positions are eligible for annual bonuses. Medidata believes that benefits should connect you to the support you need when it matters most and provides best-in-class benefits, including medical, dental, life and disability insurance; 401(k) matching; flexible paid time off; and 10 paid holidays per year.
Equal Employment Opportunity:
In order to provide equal employment and advancement opportunities to all individuals, employment decisions at Medidata are based on merit, qualifications and abilities. Medidata is committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants without regard to race, color, religion, gender, sex (including pregnancy, childbirth or medical or common conditions related to pregnancy or childbirth), sexual orientation, gender identity, gender expression, marital status, familial status, national origin, ancestry, age, disability, veteran status, military service, application for military service, genetic information, receipt of free medical care, or any other characteristic protected under applicable law. Medidata will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law.
Applications will be accepted on an ongoing basis until the position is filled.
#LI-Hybrid
#LI-MM1
Recommended Jobs
Au Pair
We look forward to welcoming a hardworking, energetic, passionate, soft spoken Au pair to our family of 5. My husband and I have a newborn named a 2 year old and a pretty self sufficient 11 year We ar…
Offer: Cheese Associate - Adams Wappinger
Do you love cheese!? Cheese Associate - Adams Wappinger At Adams Fairacre Farms, we see everyday as an opportunity to share our unique shopping practice from the backyard to the kitchen table. …
Operations Lead - New York City
&##127757; Redefining how people live. At Blueground, we believe that when your base is reliable, the world opens up. That’s why we’re building the world’s leading platform for living. Every y…
Client Advisor
Job Description About Acrisure Acrisure is a global Fintech leader that combines the best of humans and high tech to offer multiple financial products and services to millions of businesses and…
Urologist
Description : We are looking for a full time Urologist to join our Urology practice. Â Our community is thriving and we have a need for a full time Urologist to work in our growing communities in C…
Senior Research Scientist - Composites - Aerospace Research
Job Description Summary GE Aerospace is pioneering the use of Ceramic Matrix Composites (CMCs) in commercial and military aerospace applications. As a Senior Research Scientist in the Composites o…
IT GRC Analyst (Cyber Contract Management)
NBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to…
Rails and Transit Project Director
Job Description Overview We are seeking a Project Director with experience in Transit Facilities, ADA Improvements, and State-of-Good-Repair to join our Rails and Transit team in New York,…
Senior Director, Product
Who is Nexxen? Flexible advertising, unified by data. Nexxen empowers advertisers, agencies, publishers and broadcasters around the world to utilize data and advanced TV in the ways that are most m…
Automation Engineer
About the Role We are seeking an experienced Controls/Automation Engineer to design, develop, and implement automation control systems for industrial processes and warehouse distribution equipment.…