Explore
DevOps/SRE Engineer
NEW49, Jalan Dungun, Bukit Damansara, ...Full-timeGlobal
š Midš On-site
ActivePosted within the last 30 days
Job Description
[AI-summarized by JobStash]
You will design, implement, and manage CI/CD pipelines using Jenkins and/or GitLab Runner. You will work with development teams to ensure infrastructure supports services, manage pre-production and production environments for high availability, and perform capacity planning. You will proactively monitor system performance and reliability, troubleshoot and resolve issues, create and maintain configuration and operational documentation, participate in on-call rotations and provide after-hours support, implement automation and infrastructure as code, and ensure security best practices in deployments.
Requirements
- āStrong experience with CI/CD tools such as Jenkins or GitLab CI/CD
- āProficient in scripting languages such as Bash or Python
- āExperience with cloud services (Alicloud preferred) and infrastructure as code (Terraform, OpenTofu, CloudFormation)
- āSolid understanding of containerization and orchestration technologies such as Docker and Kubernetes
- āKnowledge of system monitoring tools such as Prometheus and Grafana and log management like ELK
- āExcellent troubleshooting and problem-solving skills
- āStrong understanding of network fundamentals and security practices
- āEffective communication skills and ability to work collaboratively in cross-functional teams
- āBachelor's degree in Computer Science, Engineering, or a related field
- ā3+ years of experience in a DevOps/SRE role preferred
Responsibilities
- āDesign and implement CI/CD pipelines using Jenkins or GitLab Runner
- āCollaborate with development teams to ensure infrastructure supports service requirements
- āManage pre-production and production environments to ensure high availability and performance
- āPerform infrastructure capacity planning and management
- āMonitor system performance and reliability and address issues proactively
- āDevelop and maintain configuration, operations, and troubleshooting documentation
- āParticipate in on-call rotations and provide after-hours support
- āImplement automation tools to reduce manual work and optimize resource utilization
- āRecommend and apply improvements to systems and deployments
- āEnsure compliance with security best practices in infrastructure deployment and management
Tech Stack
monitoringJenkinsInfrastructure-as-Codecapacity planningdocumentationlog managementCloudFormationDockerKubernetesGrafanaproject:Hata