Senior Site Reliability Engineering Manager, Observability

Full Time
Lisbon, Portugal
3 months ago
Who We Are

The name ThousandEyes was born from two big ideas: the power to see things not ordinarily possible and the ability to collect insights from a multitude of vantage points. As the world continues its digital transformation and relies more on cloud services and the Internet, the “network,” which is now both public and private, has become a black box our customers cannot see or understand.  

Our Internet and cloud intelligence platform delivers the only collectively powered real-time view of the Internet and private networks, cloud, and SaaS platforms, helping enterprises and service providers identify problems before they impact revenue, damage brand reputation, or halt employee productivity. 

In August 2020, Cisco Systems completed the acquisition of ThousandEyes, which now forms the ThousandEyes Business Unit within the Cisco Networking Business Group and is the Network Assurance solution for Cisco across the Cisco Networking Cloud and Cisco Security Cloud. ThousandEyes is also a foundational component of Cisco’s growing Full-Stack Observability (“FSO”) business. 

About The Role

This role is the Senior Site Reliability Engineering Manager for the Observability SRE team at ThousandEyes. The Observability team is responsible for providing a world-class experience for developers who need to understand and observe platform behavior. Beyond providing visibility, the team turns that visibility into action, relentlessly pursuing the goal of a platform that is resilient, fault-tolerant, and self-healing.

What You'll Do

As a senior engineering manager leading the Observability team, you will be responsible for the design, development, and operations of our internal observability platform. Working with a team of strong, mission-focused engineers, you’ll bring a user-focused perspective to delivering observability as a platform for a team running the best observability platform in the industry.

Qualifications
  • Proven site reliability engineering management experience or experience delivering an internal developer platform focused on production operations, ideally managing teams of 4+ engineers

  • Can provide strong technical vision for your team and ensure consistent delivery on objectives

  • Have experience formulating a team's technical strategy and roadmap, and have partnered effectively with several other teams to execute on shared goals

  • Extensive experience building and supporting mission-critical services with a focus on automation, observability, availability, and performance

  • Experience building infrastructure and operating production services that require high availability and reliability

  • You have worked on large-scale distributed systems including multi-tiered architecture

  • Understand how to balance tactical needs with strategic growth and quality-based initiatives that can span multiple quarters

Preferred Qualifications
  • Cloud Native Observability via Kubernetes, Prometheus, OpenTelemetry, and other industry-standard or CNCF technologies

  • Operated a cloud service at significant scale

  • Delivered an engineering-wide platform for service visibility

  • Owned incident response process, post-mortem practices, or service best practice standards

Cisco values the perspectives and skills that emerge from employees with diverse backgrounds. That's why Cisco is expanding the boundaries of discovering top talent by not only focusing on candidates with educational degrees and experience but also placing more emphasis on unlocking potential. We believe that everyone has something to offer and that diverse teams are better equipped to solve problems, innovate, and create a positive impact. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification. Research shows that people from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy. We urge you not to prematurely exclude yourself and to apply if you're interested in this work.

Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis. Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.