Senior Engineering Manager - SRE

Full Time
Gdańsk, Pomeranian Voivodeship, Poland

Want to lead a global team responsible for the most important product features: availability, reliability, and security? Sumo’s SRE program focuses on continual, data-driven improvement of the reliability, security, and efficiency of our global-scale technological presence. We are looking for a great leader with a passion for site reliability, continuous technology improvement, and reducing the operational workload of our own engineers, as well as that of our customers who rely on our products for their own monitoring and reliability.

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Sumo’s services have reliability and uptime appropriate to users' needs, as well as the ability to quickly and continuously deliver value to our customers.

As a Senior Engineering Manager - SRE, you will be responsible for:

  • Cost Efficiency Program:
    • Carry out projects that actively reduce our AWS spend.
    • Manage AWS resource reservations for our whole infrastructure.
    • Observe our current spend on cloud resources and improve our cost monitoring ecosystem.
  • Reliability Program: 
    • Drive the program that maintains excellent uptime numbers for our services.
    • Manage error budgets and associated policies for key product SLOs. 
    • Promote blameless post-mortem culture combined with developer operational accountability.
    • Continuously reduce operational workload for engineers by means of infrastructure improvements and automation.  
  • Application Security Program: 
    • Help product teams develop secure applications for the Sumo Logic platform.
    • Integrate and implement solutions improving Sumo Logic’s security posture.
    • Lead security reviews and penetration tests at design and implementation stages.
    • Partner with the Security Operations Center (SOC) on our security program, vulnerability management, and threat modeling of our tech stack.
    • Educate other teams about application security.
  • Team Leadership: 
    • Lead and grow a global team of SREs adept at building extremely high-volume, fault-tolerant, efficient, and scalable backend systems.
  • Technical Vision: 
    • Partner with our technical leadership team to review technology choices on an ongoing basis, anticipating increased scale and ever-evolving technology, to meet the demands of a growing business. Leverage technical skills to analyze and improve the efficiency, scalability, and reliability of our backend systems.

What you have: 

  • B.S. in Computer Science or a related discipline (M.S. or Ph.D. is a plus).
  • 8+ years of industry experience with a proven track record of ownership, delivery, and operational excellence.
  • 3+ years in a management role.
  • Experience being responsible for key SLOs of a cloud-based SaaS: availability, uptime, performance, and security.
  • Experience in multi-threaded programming and distributed systems.
  • Object-oriented programming experience, for example in Java, Scala, Ruby, or C++.
  • Experience handling high volumes of data with technologies such as Kafka, Kubernetes, and Docker.
  • Agile software development experience (test-driven development, iterative and incremental development). Experience in big data and/or 24x7 commercial service is highly desirable.
  • Hands-on experience with public cloud Infrastructure-as-a-Service and Platform-as-a-Service offerings (Amazon Web Services, Google Cloud Platform, etc.).

Why it’s worth applying:

  • Competitive salary - employment contract with 65% authorship costs.
  • You will work with great engineers on a complex product. 
  • 4 extra days off / year (Sumo Wellness Days).
  • Private healthcare for you and your family.
  • Medical and life insurance. 
  • Sports card.
  • WFH budget. 
  • Lunch budget when you work from the office. 
  • Individual English lessons with a native speaker.
  • You can work from the office, 100% remotely or in a hybrid model. 

About Us:

Sumo Logic, Inc., empowers the people who power modern, digital business.  Through its SaaS analytics platform, Sumo Logic enables customers to deliver reliable and secure cloud-native applications. The Sumo Logic Continuous Intelligence Platform™ helps practitioners and developers ensure application reliability, secure and protect against modern security threats, and gain insights into their cloud infrastructures. Customers around the world rely on Sumo Logic to get powerful real-time analytics and insights across observability and security solutions for their cloud-native applications. For more information, visit www.sumologic.com.

More at: https://www.sumologic.com/

Technology Video & Demo: https://youtu.be/-WoseyIma8g

YouTube Channel: https://www.youtube.com/user/sumologic

LinkedIn: https://www.linkedin.com/company/sumo-logic/

How it works: https://www.sumologic.com/platform/

 
