Engineering Manager (SRE - Scale & Performance)
HashiCorp solves development, operations, and security challenges in infrastructure so organizations can focus on business-critical tasks. We build products to give organizations a consistent way to manage their move to cloud-based IT infrastructures for running their applications. Our products enable companies large and small to mix and match AWS, Microsoft Azure, Google Cloud, and other clouds as well as on-premises environments, easing their ability to deliver new applications.
At HashiCorp, we have used the Tao of HashiCorp as our guiding principles for product development and operate according to a strong set of company principles for how we interact with each other. We value top-notch collaboration and communication skills, both among internal teams and in how we interact with our users.
Our TeamThe HashiCorp Scale and Performance Engineering team plays a vital role in ensuring the scalability, performance, and reliability of HashiCorp's cloud products. Our team is central to the reliability journey, focusing on enhancing system resilience and delivering high availability across enterprise and cloud-based systems. By leveraging advanced testing strategies, innovative tools, and a commitment to quality, we ensure HashiCorp’s products meet the highest standards of scale and performance excellence.
What you’ll do (responsibilities)As an Engineering Manager for this team, you will lead a group of talented engineers responsible for improving the scalability, performance, and reliability of HashiCorp's offerings. This role involves strategic planning, hands-on management, and collaboration across teams to ensure our infrastructure not only meets current needs but also supports future growth.
You will drive efforts to identify performance bottlenecks and proactively address operational challenges before they impact customers. Your expertise in load testing, performance analysis, and system hardening will be critical to achieving optimal system resilience and reliability.
With a holistic understanding of enterprise and cloud systems, you will define and execute system-wide strategies for load and performance testing. This role provides the opportunity to work with diverse tools and explore innovative approaches to ensure HashiCorp’s products meet essential scale and performance criteria.
Additionally, you will leverage advanced troubleshooting techniques, such as system simulations with Chaos Engineering, to identify, prioritize, and advocate for creative solutions that mitigate customer impact across complex, interconnected systems. Your leadership will be pivotal in delivering exceptional reliability and performance to HashiCorp's customers while positioning our infrastructure for long-term success.
- Define and implement the vision and performance testing strategies, aligning with the technology stack, delivery approach, and strategic goals.
- Lead performance testing initiatives from kickoff to closure, ensuring consistency, efficiency, and adherence to standards and processes.
- Develop and execute testing plans by capturing non-functional requirements (NFRs), setting SLAs, defining metrics, and establishing success criteria.
- Enhance test automation levels and establish best practices to drive efficiency, standardization, and continuous improvement
- Identify and resolve performance issues while delivering actionable insights and recommendations.
- Collaborate with cross-functional teams—Business, Architects, Developers, and Support—to identify dependencies, understand requirements, and deliver effective testing solutions.
- Monitor production systems, analyze trends, and provide sign-offs for production releases to ensure system reliability and performance.
- Mentor and guide engineers, promoting professional growth and implementing best practices for testing, monitoring, and system optimization.
- Foster knowledge sharing, manage stakeholder relationships, and oversee competency development to ensure seamless project delivery.
- Continuously refine testing strategies and processes, leveraging analytical insights and lessons learned for ongoing improvement.
- You have 8+ years of experience in performance testing, including hands-on expertise with manual, automated testing, and analyzing application performance.
- You possess 1+ years of leadership experience in managing and delivering large-scale performance testing projects.
- You are experienced with performance testing and monitoring tools such as LoadRunner, JMeter, BlazeMeter, K6, Artillery, Vegeta, Locust, and Application Performance Management tools like SPLUNK, AppDynamics, or Dynatrace.
- You have a solid understanding of micro service architecture, infrastructure, public clouds (preferable aws) and technology stacks to conduct comprehensive performance testing.
- You can accurately analyze performance test results, troubleshoot issues, and communicate findings effectively in both technical and business contexts.
- You are knowledgeable in Agile software development lifecycle methodologies
- You have expertise in analyzing cross-application dependencies, identifying bottlenecks, and driving solutions for performance challenges.
- You thrive in collaborative, fast-paced environments and have a proven track record of delivering quality outcomes through strategic planning and hands-on leadership.
- You are passionate about continuous improvement, optimizing processes, and driving innovation to ensure scalability, reliability, and high performance of critical systems. #LI-Hybrid