
Bullish

Lead Engineer, AI Platform

London · Full-time · Global
Senior · On-site
Active · Posted within the last 30 days

Job Description

[AI-summarized by JobStash]

You will design and implement production AI systems with an emphasis on reliability, observability, and continuous evaluation. You will lead development of natural-language interfaces to business data and architect multi-agent systems that coordinate across data sources. You will build evaluation harnesses and testing frameworks to measure AI quality before production deployment. You will translate complex requirements into scalable AI solutions, mentor engineers, establish coding standards, and partner with data engineering to define and enforce semantic models.

Requirements

  • 5+ years building production AI/ML systems with experience deploying LLM-based applications beyond proof-of-concept
  • Hands-on experience with agent frameworks, tool-use patterns, and multi-step reasoning systems
  • Experience with at least three of: LangChain, LangGraph, LlamaIndex, multi-agent frameworks, Model Context Protocol, DSPy, vector databases, structured output libraries, LLM inference infrastructure, cloud AI platforms, or evaluation and observability tools
  • Strong background in data engineering, semantic modeling, or analytics infrastructure
  • Proficiency in Python for AI/ML and cloud infrastructure (GCP preferred)
  • Track record with CI/CD for ML, experiment tracking, and model governance
  • Strong communication and ability to present to senior stakeholders

Responsibilities

  • Design and implement production AI systems with emphasis on reliability, observability, and continuous evaluation
  • Lead development of natural-language interfaces to business data
  • Architect multi-agent systems and agent orchestration
  • Build evaluation harnesses and testing frameworks measuring groundedness and factual consistency
  • Translate complex requirements into scalable AI solutions with clear success metrics
  • Mentor engineers and establish coding standards
  • Partner with data engineering to define and enforce semantic models

Tech Stack

Semantic modeling, evaluation, AI, Vector database, ML, Python, data engineering, LLM, model governance, agent orchestration, project:CoinDesk