DevOps Group Manager

StarkWare

Description

Our ideal candidate is someone with extensive experience designing, building and maintaining highly reliable and scalable infrastructure in production environments. Someone who strongly advocates for DevOps best practices and believes in automation. Someone who can envision a long-term strategic roadmap and has the passion to execute it and lead others through it by inspiration and mentorship.

What You Will Be Doing

Lead a group of DevOps teams.
Design, build and maintain cloud infrastructure that will provide a reliable and scalable platform for other teams to build on.
Implement and manage CI/CD pipelines to ensure seamless testing, deployment, and monitoring of services.
Develop and maintain monitoring and alerting systems to ensure the early detection of issues, enabling proactive problem resolution.
Monitor and maintain production and testing systems and support customers in onboarding and integrating with them.
Learn and apply industry best practices and share this knowledge with other teams through guidance, lectures and workshops.
Practice sustainable incident response and blameless postmortems.
Continuously evolve and learn new technologies that can improve our team’s workflow, accelerate the development process and make it more reliable.

Requirements

3+ years of experience leading DevOps/engineering teams.
5+ years of proven experience building and maintaining scalable and highly available systems in the cloud, preferably GCP or AWS.
Experience in building and managing microservice systems in a containerized or serverless environment.
Experience with CI/CD solutions (GitHub Actions, Argo CD, CircleCI, etc.).
Experience with declarative infrastructure (Infrastructure as Code, e.g. Terraform).
Familiarity with various DB engines, both relational and non-relational, and a good understanding of when and how to use each.
Hands-on experience working with Kubernetes.
Understanding of SRE principles, including monitoring, alerting, fault analysis, and other common reliability engineering concepts.

Preferred Requirements

B.Sc. or higher degree in Computer Science or a similar field.
3+ years of experience developing production backend systems in one or more of the following programming languages: Python, TypeScript, C++, Rust.
Experience practicing GitOps methodology.
Deep understanding of networking protocols and network-security concepts.
Good familiarity with UNIX-like operating systems and experience writing shell scripts.