Luxor Technology
Platform Engineer Storage
NEWJob Description
[AI-summarized by JobStash]
You will build and operate the core infrastructure that powers a distributed compute platform. You will design, deploy, tune, and run production Ceph clusters from hardware selection through ongoing operations. You will build efficient, generalizable APIs and kernel-leveraging tools to enable near-zero-downtime live migrations of stateful workloads. You will design and implement storage-orchestration and control-plane services using Go, gRPC, ScyllaDB, and Temporal, and connect storage primitives to higher-level platform abstractions. You will write clear engineering requirements documents, scope work, implement changes, roll them out, and perform post-deployment evaluation. You will develop foundational storage primitives used by customer applications and internal services to enable features like streaming image pulls and movable build caches. You should be comfortable working remotely while reliably aligning with core time zones in LATAM (GMT-3) or the Philippines (GMT+8) and communicating in English.
Requirements
- āDeep experience building distributed systems
- āProduction experience with distributed block storage systems such as Ceph or equivalent storage-cluster design knowledge
- āKnowledge of modern filesystems (Ext4, ZFS, Btrfs) with bonus experience in next-gen filesystems (EROFS, bcachefs)
- āStrong intuition for system longevity and lifecycle management at large scale
- āExperience implementing reliable instrumentation, monitoring, and documentation
- āAbility to operate in ambiguous, early-stage environments and prioritize work
- āOwnership mentality and willingness to dive deep into problems
Responsibilities
- āDesign production Ceph clusters from hardware selection to configuration and tuning
- āDeploy and operate distributed storage clusters and perform ongoing operations
- āBuild APIs that leverage system and kernel capabilities for live migrations
- āDesign and implement storage-orchestration and control-plane services using Go, gRPC, ScyllaDB, and Temporal
- āWrite engineering requirements documents and take projects from conception to rollout and evaluation
- āDevelop foundational storage primitives enabling streaming image pulls and movable build caches
- āInstrument, monitor, and document systems and boundaries