Luxor Technology

Platform Engineer Storage

Worldwide | Full-time | Global
Mid | Remote
Active | Posted within the last 30 days

Job Description

[AI-summarized by JobStash]

You will build and operate the core infrastructure that powers a distributed compute platform. You will design, deploy, tune, and run production Ceph clusters, from hardware selection through ongoing operations. You will build efficient, generalizable APIs and tools that leverage kernel capabilities to enable near-zero-downtime live migration of stateful workloads. You will design and implement storage-orchestration and control-plane services using Go, gRPC, ScyllaDB, and Temporal, and connect storage primitives to higher-level platform abstractions. You will write clear engineering requirements documents, scope the work, implement and roll out changes, and evaluate them after deployment. You will develop foundational storage primitives, used by customer applications and internal services, that enable features such as streaming image pulls and movable build caches. You should be comfortable working remotely while reliably overlapping with core time zones in LATAM (GMT-3) or the Philippines (GMT+8), and communicating in English.

Requirements

  • Deep experience building distributed systems
  • Production experience with distributed block storage systems such as Ceph, or equivalent storage-cluster design knowledge
  • Knowledge of modern filesystems (Ext4, ZFS, Btrfs), with bonus experience in next-gen filesystems (EROFS, bcachefs)
  • Strong intuition for system longevity and lifecycle management at large scale
  • Experience implementing reliable instrumentation, monitoring, and documentation
  • Ability to operate in ambiguous, early-stage environments and prioritize work
  • Ownership mentality and willingness to dive deep into problems

Responsibilities

  • Design production Ceph clusters, from hardware selection to configuration and tuning
  • Deploy and operate distributed storage clusters and perform ongoing operations
  • Build APIs that leverage system and kernel capabilities for live migrations
  • Design and implement storage-orchestration and control-plane services using Go, gRPC, ScyllaDB, and Temporal
  • Write engineering requirements documents and take projects from conception to rollout and evaluation
  • Develop foundational storage primitives enabling streaming image pulls and movable build caches
  • Instrument, monitor, and document systems and boundaries

Tech Stack

monitoring, gRPC, Ceph, Temporal, control plane, distributed systems, NoSQL database, instrumentation, Go, API