Applied AI Engineer - Agent
We’re hiring an Applied AI Engineer to push the boundaries of our Cofounder agent. You’ll own core backend systems and applied LLM work: advancing agent reliability and autonomy, building evaluation pipelines, and shipping techniques that measurably improve agent performance. This is a hands-on role with high ownership across research-to-production: prototyping, instrumenting, evaluating, and deploying improvements that show up directly in user outcomes.
What You’ll Do
Design and implement agent improvements end-to-end: prompting strategies, tool selection, action planning, memory usage, safety/guardrails, and recovery paths
Build robust evaluation pipelines for the agent: offline evals (golden tasks, regression suites, behavior tests), online metrics (latency, success rate, failure modes, cost efficiency), and experimentation frameworks (A/B tests, canaries, guardrail thresholds)
Productionize applied LLM techniques: function/tool-calling orchestration, self-reflection, retrieval/RAG, multi-agent handoffs, caching/embedding strategies, and hallucination reduction
Improve core backend systems: reliable job orchestration, retries/backoff, idempotency, and auditability; scalable memory and context routing; data pipelines across Gmail, Slack, Notion, Linear, Google Workspace, etc.; observability and tracing for agent actions/outcomes
Partner with product and infra to define success metrics and ship fast, safe iterations
Write clean, well-tested code; document design decisions and runbooks
What You’ll Bring
4+ years of backend engineering experience, preferably in Python (we care about impact over years)
Hands-on LLM experience: prompt engineering, function-calling, retrieval, embeddings, evaluation design; you’ve shipped LLM features to production
Track record building evaluation harnesses and using them to drive improvements (regression suites, task success metrics, cost/runtime tradeoffs)
Solid distributed systems fundamentals: concurrency, reliability, performance, data modeling, lifecycle management
Pragmatic experimentation: hypothesis → prototype → measured improvement → rollout
Excellent debugging and instrumentation skills; you enjoy finding and fixing edge cases in the wild
Nice To Have
Experience with agent frameworks, tool orchestration, and memory architectures
RAG systems in production (chunking, retrieval quality, freshness strategies)
Redis, Postgres/Supabase, queues (e.g., Celery/Arq/SQS), and event-driven designs
Observability stacks (Datadog, OpenTelemetry) and cost/latency optimization
Why Join Us
Mission: build autonomous agents that run entire businesses
Impact: ship core agent improvements that users feel immediately
Velocity: small, senior team; fast decision cycles; high ownership
Stack: modern tooling across AI orchestration, integrations, and memory systems
Compensation
Competitive salary and meaningful equity
Comprehensive benefits and flexible work setup