TRM Labs
Agent Engineer
United StatesFull-timeGlobal
š° USD 200,000 - 275,000/yr
š Remote
Job Description
[AI-summarized by JobStash]
You will design and implement agentic frameworks that support tool use, context retrieval, memory, and planning. You will build modular agents to automate investigative tasks and augment analyst decision-making. You will extend and scale LLM infrastructure, work on prompt engineering, RAG, and evaluation loops, and optimize pipelines for latency and reliability. You will design safe, observable, and auditable agent behaviors, measure performance across metrics like reasoning and hallucination, and iterate based on telemetry and user feedback.
Requirements
- āStrong engineering background with deep experience in backend or systems work (Python preferred)
- āHands-on experience building with LLMs, agents, and tooling frameworks such as LangChain, semantic caches, and vector DBs
- āComfort working with agentic pipelines and optimizing information flow into AI systems
- āThoughtful system design approach with attention to safety, scalability, and explainability
- āHigh product empathy and concern for agent impact on end users
- āBias toward experimentation and rapid iteration
- āPrevious experience with knowledge graphs, task orchestration, or AI safety is a plus
Responsibilities
- āArchitect and implement a robust agentic framework supporting tool use, context retrieval, memory, and planning
- āBuild intelligent, modular agents that automate investigative tasks and augment analyst decision-making
- āExtend and scale LLM infrastructure including prompt engineering, RAG, and evaluation loops
- āDesign safe, observable, and auditable agent behaviors to ensure reliability in high-sensitivity environments
- āEvaluate performance across metrics like reasoning, latency, success rate, and hallucination and iterate based on feedback and telemetry
- āDrive rapid experimentation and deliver production-ready AI systems
Benefits & Perks
- āEligibility to participate in TRM's equity plan
Tech Stack
agent orchestrationVector databaseorchestrationPythonobservabilitysystem designLLMagentTask orchestrationSemantic cache