TRM Labs
Senior Software Engineer Data Platform
North AmericaFull-timeGlobal
📊 Mid🏠 Remote
ActivePosted within the last 30 days
Job Description
[AI-summarized by JobStash]
You will build highly reliable data services that analyze blockchain transaction activity at petabyte scale and work to enable a safer financial system for billions of people. You will collaborate with data scientists, backend engineers, and product managers to create scalable data models and robust data pipelines.
Requirements
- ●A Bachelor's degree (or equivalent) in Computer Science or a related field.
- ●5+ years of hands on experience in architecting distributed system architecture, guiding projects from initial ideation through to successful production deployment.
- ●Exceptional programming skills in Python, as well as adeptness in SQL or SparkSQL.
- ●In-depth experience with data stores such as Iceberg, Trino, BigQuery, and StarRocks, and Citus.
- ●Proficiency in data pipeline and workflow orchestration tools like Airflow, DBT, etc.
- ●Expertise in data processing technologies and streaming workflows including Spark, Kafka, and Flink.
- ●Competence in deploying and monitoring infrastructure within public cloud platforms, utilizing tools such as Docker, Terraform, Kubernetes, and Datadog.
- ●Proven ability in loading, querying, and transforming extensive datasets.
Responsibilities
- ●Build highly reliable data services to integrate with dozens of blockchains.
- ●Develop complex ETL pipelines that transform and process petabytes of structured and unstructured data in real time.
- ●Design and architect intricate data models for optimal storage and retrieval to support sub second latency for querying blockchain data.
- ●Oversee the deployment and monitoring of large database clusters with an unwavering focus on performance and high availability.
- ●Collaborate across departments, partnering with data scientists, backend engineers, and product managers to design and implement novel data models that enhance TRM’s products.
Tech Stack
dbtPythonSQLETLKafkaTerraformBigQueryAirflowFlinkSparkSQL