Software Engineer, Data Infrastructure
Who we are
EvolutionaryScale’s mission is to develop artificial intelligence to understand biology for the benefit of human health and society, through open, safe, and responsible research, and in partnership with the scientific community. Over the next ten years AI will transform biological design, making molecules and entire cells programmable. We will develop the foundation models for biology that enable this.
The EvolutionaryScale team is based in San Francisco and New York. We believe in flexibility around work schedules and locations, but expect team members to work from one of our offices at least half of the days in most weeks.
What you’ll do
As a Data Infrastructure Engineer, you will work closely with bioinformatics and research teams to ensure our data jobs are reliable, efficient, and scalable. You'll implement best practices for handling large-scale data processing, select and integrate the right technologies, and drive continuous improvements in the performance and quality of our datasets.
The role
- Design, develop, and maintain large-scale batch processing pipelines for acquiring biology datasets, using tools like Spark and Ray.
- Manage data infrastructure components to ensure robust and fault-tolerant operations.
- Optimize data ingestion, storage, and retrieval processes for acquiring large and growing biology datasets, and for efficient pre- and post-training data ingestion.
- Create systems for easy and reproducible data evaluation and experiments.
- Integrate modern ML-based data curation technologies with data processing pipelines.
- Work with researchers and other engineering teams to understand data needs and create solutions that meet modeling requirements.
Preferred qualifications
Apply even if you don’t meet all of these!
- Staff-level engineers with 5+ years of experience are highly preferred.
- Proven experience with large-scale data processing systems using technologies such as Hadoop, Spark, or Ray.
- Knowledge of streaming data frameworks like Kafka Streams, Spark Streaming, or Flink.
- Understanding of data processing principles and best practices.
- Strong problem-solving skills, including the ability to research, debug, and resolve complex technical problems.
- Experience with major cloud providers (AWS, GCP, or Azure), including familiarity with data warehousing tools, is a plus.
- Knowledge of biology and biology datasets is a big plus but not required.
- Experience with large-scale distributed systems or machine learning is a plus but not required.