BIO Protocol
AI Engineer
NEWRemoteFull-timeGlobal
š Remote
RemoteRemote work position availableActivePosted within the last 30 days
Job Description
[AI-summarized by JobStash]
You will design, build, and scale agent systems for planning, tool use, memory, and context management. You will integrate agents with internal and external tools and data sources, implement safety guardrails and sandboxed execution, and develop evaluation, telemetry, and automated scoring systems. You will instrument data pipelines for fine-tuning and reinforcement learning, profile and optimize performance and reliability, and contribute to platform services, APIs, orchestration, CI/CD, and observability.
Requirements
- āExperience building production software in Python or TypeScript
- āStrong systems and API design skills (FastAPI, gRPC, GraphQL or similar)
- āProven experience shipping LLM applications or agentic systems including tool use, function calling, retrieval/RAG, structured outputs, evaluation, or observability
- āFamiliarity with agent and orchestration frameworks (LangChain, LangGraph, AutoGen, CrewAI, MCP) and vector databases (FAISS, Weaviate, Pinecone)
- āExperience with cloud infrastructure and containers (AWS, GCP, or Azure), Docker, Kubernetes, Terraform, CI/CD, and production telemetry
- āAbility to translate research prototypes into robust, scalable systems
- āExperience with fine-tuning and reinforcement learning (RL, RLAIF, RLHF) ā nice to have
- āFamiliarity with benchmarks and evaluations and with schema, ontology, and provenance design ā nice to have
Responsibilities
- āBuild agent capabilities for planning, tool use, memory, and context management and ship them into production
- āIntegrate agents with internal and external tools and data sources using robust schemas and safeguards
- āDevelop quality and evaluation systems including unit tests, regression tests, scenario benchmarks, telemetry, and automated scoring
- āCollaborate with scientists to analyze failure modes and improve performance
- āEnsure outputs are source-traceable and compliant with provenance standards
- āImplement safety measures, guardrails, and sandboxed execution for risky operations
- āOptimize performance and reliability via profiling, idempotency, retries, rate limiting, and uptime management
- āInstrument data pipelines for supervised fine-tuning and reinforcement learning
- āContribute to agent platform services, APIs, orchestration, CI/CD, and observability
Tech Stack
AutoGPTreinforcement learningontologyCrewAIGraphQLLangGraphPythonTerraformMCPfine-tuning