AI Data Engineer

C the Signs
New York, NY

Position Summary

The AI Data Engineer will play a crucial role in preparing and refining the data used to train and fine-tune our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data.

You will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high-quality, high-volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement.

Key Responsibilities

  • Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine-tuning.
  • Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.
  • Implement robust data cleaning, validation, and transformation processes, backed by ongoing monitoring, to ensure the quality, integrity, accuracy, and consistency of all training datasets.
  • Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
  • Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).
  • Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.
  • Document data engineering processes, data models, and data dictionaries.
  • Stay up to date with the latest advancements in data engineering, big data technologies, and machine learning.

Requirements

Required

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Proven experience as a Data Engineer, with a focus on big data technologies.
  • Strong proficiency in programming languages such as Python, Scala, or Java.
  • Extensive experience with data warehousing, ETL processes, and data modeling.
  • Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
  • Hands-on experience with big data frameworks like Apache Spark for distributed processing.
  • Excellent problem-solving skills and the ability to work independently and as part of a team.
  • Strong communication and interpersonal skills.

Preferred

  • Master's degree in a related field.
  • Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).
  • Familiarity with machine learning concepts and LLM fine-tuning processes.
  • Experience with data orchestration tools (e.g., Apache Airflow).

Work Authorization:

  • Must be a US citizen, a Green Card holder, or currently in the US with a valid H-1B visa.

Benefits

Why Join Us?

Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.

Benefits:

  • Competitive salary and benefits package.
  • Flexible working arrangements (remote or hybrid options available).
  • The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
  • Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.
Posted 2025-12-18
