TRM Labs
Senior Full Stack Data Scientist, NLP
North AmericaFull-timeGlobal
š Midš Remote
ActivePosted within the last 30 days
Job Description
[AI-summarized by JobStash]
You will design, build, and productionize machine learning models focused on knowledge extraction from unstructured data (NER, entity linking), graph-based learning and inference, and entity resolution and relationship discovery. You will evaluate and leverage existing ML models to solve real world problems, and you will integrate ML models into production services and APIs. You will help design and evolve knowledge graphs and ontologies, perform exploratory data analysis to inform modeling decisions, own ML components end-to-end from experimentation through deployment, and help define best practices for applied ML.
Requirements
- ā5+ years of experience in data science, machine learning engineering, or applied ML.
- āStrong programming experience in Python.
- āHands-on experience building, training, or deploying machine learning models in production.
- āFamiliarity with NLP or information extraction techniques, such as Named Entity Recognition (NER), text classification, or embedding-based approaches.
- āExperience or strong interest in knowledge graphs, graph data, or graph-based ML.
- āSolid software engineering fundamentals, including building and maintaining APIs or services.
- āAbility to translate ambiguous problem spaces into practical ML solutions.
- āStrong communication skills and comfort collaborating with engineers across disciplines.
Responsibilities
- āDesign, build, and productionize machine learning models focused on knowledge extraction from unstructured data (NER, entity linking), graph-based learning and inference, and entity resolution and relationship discovery.
- āEvaluate and leverage existing ML models and frameworks to solve real-world problems efficiently.
- āPartner with backend and graph engineers to integrate ML models into production services and APIs.
- āContribute to the design and evolution of knowledge graphs and ontologies.
- āPerform exploratory data analysis to inform modeling decisions and system design.
- āOwn ML components end-to-end, including experimentation, evaluation, deployment, and iteration.
- āHelp shape best practices for applied ML.
Tech Stack
APIsPythonentity_resolutiondata_sciencemachine_learninggraph_mlknowledge_extractionentity_linkingrelationship_discoveryknowledge_graphs