Manager, SRE

Nintex

Full Time

Johannesburg, South Africa

10 months ago

About Nintex:

At Nintex, we are transforming the way people work, everywhere.

As the global standard for process intelligence and automation, we're trusted by over 10,000 public and private sector organizations across 90 countries. Our customers, including industry giants like Amazon, Coca-Cola, and Microsoft, rely on the Nintex Platform to accelerate their digital transformation journeys by managing, automating, and optimizing business processes quickly and efficiently. We improve their lives through the technology we build.

We are committed to fostering a workplace that supports amazing people in doing their very best work every day. Collaboration is constant, our workplace is fun, the environment is fast-paced, and we value our people’s curiosity, ideas, and enthusiasm. Driven by passion and accountability, we take initiative, measure progress, and deliver results. Our culture fosters innovation and problem-solving, fueled by curiosity and a commitment to thinking big. Together, we move with agility, prioritize customer needs, and build unity through empathy, leaving a positive impact wherever we go.

Working in engineering:

Working at Nintex as an engineer means building more than just software; it’s about making a tangible impact with every line of code. Our engineers are process experts, developing the industry’s most complete process and automation platform to transform the way people work. If you’re interested, curious and want to learn and do more, the sky is the limit here. We take a solutions-oriented and collaborative approach, constantly innovating our business and products.

About the role:

The SRE Manager will lead a globally distributed team of Site Reliability Engineers responsible for ensuring the performance, reliability, and scalability of a fleet of Kubernetes clusters and other distributed systems. This role involves overseeing the availability and health of critical infrastructure while supporting multiple delivery teams to enhance operational efficiency. The SRE Manager drives continuous improvements in monitoring, automation, and incident management processes, fostering collaboration between development and operations to uphold service level objectives (SLOs) across the platform.

Your contribution will be: 

You lead and mentor a team of Site Reliability Engineers to enhance system reliability and performance.
You are highly skilled and sufficiently experienced in Nintex DevOps tools and processes to own a long-term program or technology, such as Kubernetes.
You develop and implement strategies for incident response, capacity planning, and system monitoring.
You are continually on the lookout for opportunities to reduce errors by automating and standardizing processes. You bring infrastructure components into managed implementations such as Infrastructure as Code (IaC), configuration management, and containerization.
You collaborate with engineering, product, and operations teams to align on reliability goals and drive improvements.
You manage and prioritize the backlog of reliability-related tasks and projects.
You review and advise on appropriate design patterns to solve automation and infrastructure problems without creating technical debt.
You design and build complex infrastructure components for distributed systems such as Kubernetes.
You lead and contribute to post-mortems for incidents, including root cause analysis and identification of preventative and remedial actions.
You build, promote and support infrastructure patterns and SRE practices within Nintex.
You identify optimization opportunities anywhere in the development or operations functions and contribute to the implementation of proposed solutions.
You suggest and contribute improvements to the Nintex platform and to observability tools and practices.
You act as a reliability champion.
You produce regular performance and reliability metrics reports for stakeholders.

To be successful, we think you need: 

2+ years of experience leading a team
6+ years of experience in an SRE/DevOps position
Experience with Kubernetes, Helm charts, etc.
Experience with Prometheus, Grafana, Alertmanager, PagerDuty, Loki and Mimir.
You have extensive knowledge of system architecture, cloud platforms (e.g., AWS, Azure, Google Cloud), and modern DevOps and SRE practices.
You run Nintex infrastructure with IaC tools (such as Terraform) and GitHub Actions for automation, containerize our environments (Kubernetes), and leverage cloud technologies to meet our goals.

What’s in it for you?

Our people work in the way that best suits them and their teams - whether at home, in an office, or another place that sparks creativity, focus, and collaboration. Our work environment is such that our people can successfully deliver their work while adequately supporting their lifestyle and preferences.

While our offerings differ from country to country, we offer our entire global workforce an array of exciting perks and benefits, including:

Global Gratitude and Recharge Days
Flexible, paid time off policy
Employee wellness programs and counseling resources
Meaningful peer recognition and awards
Paid parental leave
Invention/patenting assistance
Community impact, paid volunteer time, and opportunities
Intercultural learning and celebration
Multiple tools through which to learn and grow, and an incredible global community

View more about our benefits here: https://www.nintex.com/wp-content/uploads/2023/01/Global-Perks-and-Benefits.pdf.

Equity Statement: Preference will be given to People Living with Disability who are members of the designated groups in line with the Employment Equity Plan and Targets of the Company.