Auki Labs
Platform Engineer
NEWRemoteFull-timeGlobal
š Mid
RemoteRemote work position availableActivePosted within the last 30 days
Job Description
[AI-summarized by JobStash]
You will design, operate, and improve the infrastructure and deployment systems that run production services. You will automate processes to improve reliability and recovery, own CI/CD pipelines and infrastructure-as-code, build monitoring and alerting, manage incidents and on-call processes, maintain release artifacts and runbooks for self-hosted deployments, and contribute backend reliability and deployment work as needed.
Requirements
- āBachelor in computer science or related subject or equivalent experience
- āAt least three years of relevant working experience in DevOps, SRE, infrastructure, or cloud
- āHands-on experience operating production systems on AWS
- āHands-on experience with containerization and orchestration such as Docker and Kubernetes
- āExperience building and maintaining CI/CD pipelines and deployment automation
- āExperience with infrastructure-as-code such as Terraform
- āProficiency in one or more of Go, Python, or shell scripting
- āHands-on experience with debugging, optimizing code, and automation
- āExperience with monitoring and alerting tools such as Prometheus, Grafana, and PagerDuty or equivalents
- āStrong written and verbal communication skills and ability to write operational and deployment documentation
- āAbility to manage multiple projects in a deadline-driven environment and collaborate with a distributed team
Responsibilities
- āDevelop and integrate solutions with a bias for automation to improve and maintain reliability across the production estate and make recovery easier
- āOwn and evolve CI/CD pipelines and infrastructure-as-code to ensure consistent, high-quality deployments
- āDesign and track metrics for uptime and performance and maintain visibility through dashboards and alerting
- āOwn the incident management process and continuously improve monitoring, on-call hygiene, and post-incident follow-ups
- āOwn release engineering for customer self-hosted deployments including packaged deployment artifacts, versioning, upgrade guides, and troubleshooting runbooks
- āEnsure consistent deployments and reliable connectivity of devices by working closely with other teams
- āDrive secure-by-default infrastructure practices including IAM least privilege, secrets management, patching, and vulnerability scanning integration
- āMaintain lightweight security and compliance evidence such as architecture diagrams, control descriptions, and runbooks and support customer security questionnaires
- āContribute to backend work focused on reliability, deployments, and infrastructure-adjacent improvements (up to ~30%)
Benefits & Perks
- āHacker residency at the lab in Hong Kong
- āGrants of up to 100000 USD worth of AUKI tokens for successful applicants
- āAccess to hardware (robots, smart glasses, and other equipment) at the hacker house in Hong Kong
Tech Stack
monitoringInfrastructure-as-CodeARRedisalertingRelease engineeringGoHelmIAMmetrics