TRM Labs

Agent Engineer

United StatesFull-timeGlobal

💰 USD 200,000 - 275,000/yr

🏠 Remote

Job Description

[AI-summarized by JobStash]

You will design and implement agentic frameworks that support tool use, context retrieval, memory, and planning. You will build modular agents to automate investigative tasks and augment analyst decision-making. You will extend and scale LLM infrastructure, work on prompt engineering, RAG, and evaluation loops, and optimize pipelines for latency and reliability. You will design safe, observable, and auditable agent behaviors, measure performance across metrics like reasoning and hallucination, and iterate based on telemetry and user feedback.

Requirements

●Strong engineering background with deep experience in backend or systems work (Python preferred)
●Hands-on experience building with LLMs, agents, and tooling frameworks such as LangChain, semantic caches, and vector DBs
●Comfort working with agentic pipelines and optimizing information flow into AI systems
●Thoughtful system design approach with attention to safety, scalability, and explainability
●High product empathy and concern for agent impact on end users
●Bias toward experimentation and rapid iteration
●Previous experience with knowledge graphs, task orchestration, or AI safety is a plus

Responsibilities

●Architect and implement a robust agentic framework supporting tool use, context retrieval, memory, and planning
●Build intelligent, modular agents that automate investigative tasks and augment analyst decision-making
●Extend and scale LLM infrastructure including prompt engineering, RAG, and evaluation loops
●Design safe, observable, and auditable agent behaviors to ensure reliability in high-sensitivity environments
●Evaluate performance across metrics like reasoning, latency, success rate, and hallucination and iterate based on feedback and telemetry
●Drive rapid experimentation and deliver production-ready AI systems

Benefits & Perks

●Eligibility to participate in TRM's equity plan

Tech Stack

agent orchestrationVector databaseorchestrationPythonobservabilitysystem designLLMagentTask orchestrationSemantic cache