Babylon Labs
Senior DevOps Engineer
Job Description
[AI-summarized by JobStash]
You will wear many hats across DevOps, site reliability, and platform engineering. You will own the onboarding of services to production infrastructure, produce proofs of concept for blockchain and dApp deployments, identify hosting needs, and rightsize infrastructure. You will plan and run performance benchmarks and load tests, configure CDN, DNS, and security for low latency, set up fine-grained monitoring and alerting, and enforce disaster recovery procedures. You will also lead incident response and join the on-call rotation, operate scalable Kubernetes clusters and platform offerings (Bitcoin, Cosmos SDK, Ethereum, ZK infrastructure), manage databases and queuing systems, and maintain observability stacks. Finally, you will collaborate with other teams to find and fix production bugs and promote production-ready practices.
Requirements
- Bachelor's degree in Computer Science, Computer Engineering, or related field
- 2+ years production-grade experience on Ethereum, Bitcoin, or Zero Knowledge systems
- 2+ years operating highly available containerized web application systems on Kubernetes
- Experience with cloud management (AWS preferred)
- Experience in Kubernetes application packaging (Helm) and deployment (FluxCD, GitHub Actions)
- Proficiency in Linux management and scripting
- Experience in Terraform scripting
- Experience operating observability tooling (Prometheus, Grafana, Loki)
- Experience with databases (Redis, MongoDB, MariaDB) and queuing systems (RabbitMQ)
- Ability to manage competing priorities and respond swiftly to incidents
- Fluent oral and written communication in English
- Cosmos SDK experience (nice to have)
Responsibilities
- Own the onboarding of services to production infrastructure
- Produce proofs of concept for deployment of blockchain networks and dApps
- Identify hosting needs and rightsize infrastructure for high availability
- Plan and execute performance benchmarking and load testing
- Own CDN caching, DNS, and security configuration to achieve low latency
- Set up fine-grained monitoring and alerting rules
- Enforce thoroughly tested disaster recovery procedures
- Lead incident response and participate in the DevOps on-call rotation
- Operate scalable Kubernetes clusters and in-house platform offerings
- Manage blockchain infrastructure, databases, and queuing systems
- Maintain observability stacks and tooling
- Collaborate with other teams to drive bug identification and resolution and advocate production-ready practices
Benefits & Perks
- Remote-first work