Lead Spark Data Engineer

Fusemachines
New York, NY


About Fusemachines

Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, the United States, Canada, and the Dominican Republic) and more than 450 full-time employees, Fusemachines brings global AI expertise to transform companies worldwide. Founded in 2013, Fusemachines is a global provider of enterprise AI products and services, on a mission to democratize AI. Leveraging proprietary AI Studio and AI Engines, the company helps drive the clients’ AI Enterprise Transformation, regardless of where they are in their Digital AI journeys. With offices in North America, Asia, and Latin America, Fusemachines provides a suite of enterprise AI offerings and specialty services that allow organizations of any size to implement and scale AI. Fusemachines serves companies in industries such as retail, manufacturing, and government.

Fusemachines continues to actively pursue the mission of democratizing AI for the masses by providing high-quality AI education in underserved communities and helping organizations achieve their full potential with AI.

Job Description:

We are looking for an experienced Lead Data Engineer to join our team to build the "Brain" of an IoT platform, a library that allows definition and metrics, validates it against a Virtual Schema, and generates optimized execution plans for both Spark (Batch) and Flink (Stream).

Qualification / Skill Set Requirement:

  • 5+ years of hands-on data engineering experience with deep expertise in the Azure ecosystem.
  • Expert-level Java, Python and SQL.
  • Deep understanding of Apache Spark Internals (Catalyst Optimizer, Logical Plans).
  • Experience with ANTLR v4 or writing custom DSLs/Parsers.
  • Experience with Databricks and Delta Lake optimization.
  • Experience constructing Abstract Syntax Trees (ASTs).
  • Strong understanding of SDLC and Agile methodologies with hands-on experience in Azure DevOps, GitHub, CI/CD, and artifact management.
  • Skilled in data modeling, data design, and data warehousing solutions on Azure Databricks.
  • Knowledge of data quality, governance, and security best practices within Azure (AD, NSG, encryption, compliance).
  • Certifications preferred: Azure Fundamentals, Azure Data Engineer Associate, Databricks Certified Data Engineer Professional and Azure Solutions Architect Expert (nice to have).

Responsibilities

  • Architect, design, and implement scalable and efficient data solutions on Spark and Flink.
  • Implement the grammar for the IoT Query Language.
  • Build the Query Validator to enforce semantic constraints before a query is executed.
  • Develop a Spark Adapter: A translation layer that converts definition on metrics into Spark code.
  • Implement relationships logic (traversing a Graph/Ontology) within the core to avoid database bottlenecks.
  • Ensure 100% logic parity between Spark (Batch) and Flink (Stream) implementations.
  • Manage and optimize Azure and Databricks resources, for performance, reliability, and cost-efficiency.
  • Transform, clean, and prepare data using SQL, Python and Java.
  • Monitor and fine-tune workloads and pipelines for optimal performance and reliability.
  • Maintain clear documentation of solutions, configurations, and workflows.
  • Actively participate in Agile team activities and continuous improvement initiatives.
  • Promote and enforce data engineering best practices, including data governance, security, and data quality.

Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.

Posted 2026-02-22

Recommended Jobs

Executive Assistant

TD Bank
New York, NY

Work Location : New York, New York, États-Unis d'Amérique Hours 40 Line Of Business Administration Pay Detail $95,000 - $110,000 USD TD is committed to providing fair and equita…

View Details
Posted 2026-02-12

Software Engineer I (Full Stack)

Grit PPO
Croton On Hudson, NY

About Us At Grit - Pest Process Outsourcing, we are an enthusiastic BPO company located in the United States, focused on transforming the customer experience in the pest control sector. Our skille…

View Details
Posted 2026-02-10

Regional Business Director

Marmon Link
Albany, NY

Acumed LLC As a part of the global industrial organization Marmon Holdings—which is backed by Berkshire Hathaway—you’ll be doing things that matter, leading at every level, and winning a better wa…

View Details
Posted 2026-02-21

Customer Service Product Manager

Uphold
New York, NY

About Uphold Uphold is a leading fintech platform enabling users to transact in multiple asset classes—from traditional currencies to cryptocurrencies and commodities—on a single, unified interfac…

View Details
Posted 2026-02-22

Scanning Operator Coordinator, Bureau of Vital Statistics

DEPT OF HEALTHMENTAL HYGIENE
New York, NY

~ Open to those permanent in the Clerical Associate title only. The Bureau of Vital Statistics is responsible for registering and certifying all birth, deaths, spontaneous and induced terminat…

View Details
Posted 2026-02-21

Associate Director, Fellow Engagement and Communications

Braven
New York, NY

Job Title : Associate Director, Fellow Engagement and Communications Team : Product Location : In-Person in Atlanta (GA), Chicago (IL), Newark (NJ), New York (NY) Employment Type : Full…

View Details
Posted 2026-02-13

Job Offer: Clinician/Social Worker, Families Work

Schenectady, NY

Clinician/Social Worker, Families Work POSITION OVERVIEW Masters Level Clinician, Families The Clinician position will maintain a caseload of ten families and meet with each on a week…

View Details
Posted 2026-01-30

Field Sales (Outside Sales) Representative

HireLive
Yonkers, NY

UniFirst is HIRING - ~ SALES REP / TERRITORY SALES Territories Available: NEW HAVEN, CT & STRATFORD, CT Interviews ONE DAY ONLY - TUESDAY, FEBRUARY 24 - (8 am - 4:30 pm) APPLY NOW to S…

View Details
Posted 2026-02-06

Equipment Repair Tech

Sanico Inc.
Syracuse, NY

Are you handy?  Like to fix stuff?  Looking for a company you can grow with?  Sanico is a growing family-owned wholesale distributor of janitorial supplies and equipment across Upstate NY. We ar…

View Details
Posted 2026-02-07

Corporate Banking - Sr. Associate/Associate - Global Capital Management

Citi
New York, NY

The Corporate Banking Associate is an intermediate level professional responsible for relationship management and developing solutions for Corporate Banking clients in coordination with partners acro…

View Details
Posted 2026-02-15