Site Reliability Engineer II - Observability

Full Time
Toronto, ON, Canada
8 months ago
Our Organization

HashiCorp helps solve development, operations, and security challenges in infrastructure so organizations can focus on business-critical tasks. We build products to give organizations a consistent way to manage their move to cloud-based IT infrastructures for running their applications.

We use the Tao of HashiCorp as our guiding principles for product development and operate according to a strong set of company principles for how we interact with each other. We value top-notch collaboration and communication skills, both among internal teams and in how we interact with our users. 

Our Team

The HashiCorp Observability team is responsible for providing HashiCorp engineers with observability tooling and capabilities using a software engineering-based approach. Our focus is on making it easy for HashiCorp engineers to understand the state of production in HashiCorp Cloud Platform, while also providing context and control over things like cloud and SaaS costs. Work on this team involves a mixture of infrastructure and product engineering practices and more SRE-like engagements with engineering teams who use observability tooling. There will also be some emphasis on addressing vendor costs as they relate to supporting developers in their observability needs. The team will also consult and collaborate with our other Infrastructure teams, which focus on developer tooling and platforms, when there are opportunities to improve observability or optimize the value of our tooling.

About this Role

This engineering role is on a nascent, growing engineering team. The team is responsible for products that touch many areas of the engineering organization at HashiCorp, so applicants will need to excel at collaboration, have a product-focused mindset, and be comfortable iterating in an agile manner towards solutions.

In this role, you can expect to:

  • Be responsible for and drive operational excellence through observability tooling and best practices
  • Build technical skills and relationships within a team of engineers and SREs
  • Make understanding our operational posture and resolving incidents easier for multiple engineering teams and product systems
  • Participate in crucial decision-making related to various observability tools and services, including build vs. buy
  • Deliver elegant, user-focused solutions that address the observability and cloud cost challenges we face in our cloud product

You may be a good fit for our team if you:

  • Have professional backend software development experience in cloud environments
  • Are interested or experienced in observability practices and how they can enable teams
  • Enjoy working on a variety of scopes spanning software engineering, operations, and SRE
  • Have worked to operationalize complex software at scale
  • Have worked on infrastructure teams in customer-centric and agile organizations with empathy and compassion
  • Have worked with SaaS or another type of managed software offering
  • Have expertise in one or more of the major public clouds

 
