Skip to main content
NEUN
Back to Careers

Explore

DevOps/SRE Engineer

NEW
49, Jalan Dungun, Bukit Damansara, ...Full-timeGlobal
šŸ“Š MidšŸ  On-site
ActivePosted within the last 30 days

Job Description

[AI-summarized by JobStash]

You will design, implement, and manage CI/CD pipelines using Jenkins and/or GitLab Runner. You will work with development teams to ensure infrastructure supports services, manage pre-production and production environments for high availability, and perform capacity planning. You will proactively monitor system performance and reliability, troubleshoot and resolve issues, create and maintain configuration and operational documentation, participate in on-call rotations and provide after-hours support, implement automation and infrastructure as code, and ensure security best practices in deployments.

Requirements

  • ā—Strong experience with CI/CD tools such as Jenkins or GitLab CI/CD
  • ā—Proficient in scripting languages such as Bash or Python
  • ā—Experience with cloud services (Alicloud preferred) and infrastructure as code (Terraform, OpenTofu, CloudFormation)
  • ā—Solid understanding of containerization and orchestration technologies such as Docker and Kubernetes
  • ā—Knowledge of system monitoring tools such as Prometheus and Grafana and log management like ELK
  • ā—Excellent troubleshooting and problem-solving skills
  • ā—Strong understanding of network fundamentals and security practices
  • ā—Effective communication skills and ability to work collaboratively in cross-functional teams
  • ā—Bachelor's degree in Computer Science, Engineering, or a related field
  • ā—3+ years of experience in a DevOps/SRE role preferred

Responsibilities

  • ā—Design and implement CI/CD pipelines using Jenkins or GitLab Runner
  • ā—Collaborate with development teams to ensure infrastructure supports service requirements
  • ā—Manage pre-production and production environments to ensure high availability and performance
  • ā—Perform infrastructure capacity planning and management
  • ā—Monitor system performance and reliability and address issues proactively
  • ā—Develop and maintain configuration, operations, and troubleshooting documentation
  • ā—Participate in on-call rotations and provide after-hours support
  • ā—Implement automation tools to reduce manual work and optimize resource utilization
  • ā—Recommend and apply improvements to systems and deployments
  • ā—Ensure compliance with security best practices in infrastructure deployment and management

Tech Stack

monitoringJenkinsInfrastructure-as-Codecapacity planningdocumentationlog managementCloudFormationDockerKubernetesGrafanaproject:Hata
Expired
Search