Sr. Engineer II - Infrastructure (Hybrid)
As a Senior Site Reliability Engineer(II) focusing on Infrastructure at HashiCorp, you will be central to our mission of adopting infrastructure changes, as well as maintaining and enhancing the backbone of our cloud infrastructure. With over 8 years of experience in site reliability engineering, infrastructural engineering or a related field, you will leverage your deep understanding of core infrastructure components and technologies such as: base images, and our HashiStack to ensure our infrastructure is not only robust and scalable but also optimized for performance and cost.
What you’ll do (responsibilities)
- Collaboration and Planning: Work closely with engineering and product teams to adopt infrastructure changes. Contribute to infrastructure planning, capacity management, and architectural improvements.
- Infrastructure Optimization: Develop and implement strategies to enhance the consumption interface, performance, scalability, and reliability of HashiCorp's infrastructure.
- Software Development: Develop software solutionsDesign and refine processes for the usability of infrastructure resources, increasing efficiency and reducing manual overhead.
- Monitoring and Incident Response: Implement comprehensive monitoring solutions to proactively identify and address issues. Lead the response to infrastructure incidents, minimizing impact on service availability and performance.
- Knowledge Sharing: Serve as a subject matter expert in infrastructure technologies and practices.
What you’ll need (basic qualifications)
- Minimum 8+ years of experience in site reliability engineering, infrastructure engineering, or a closely related field, with a proven track record of managing complex, cloud-based infrastructure at scale and system administration
- Advanced technical expertise in designing and implementing large-scale systems and infrastructure solutions, with a deep understanding of cloud platforms (AWS, Azure, GCP), container orchestration (Nomad, Kubernetes), and infrastructure as code (Terraform, Ansible).
- Hands-on experience with HashiCorp tools (Terraform, Vault, Consul, Nomad) and other key technologies, including cloud services, DevOps tooling, and automation platforms.
- Proficiency in one or more programming languages (e.g. Python, Go), with experience writing production-level code and integrating it into complex systems
- Proven ability to lead high-level technical projects, drive innovation, and mentor junior engineers in the design and implementation of complex systems and infrastructure solutions.
- A strong understanding of software development principles, including agile methodologies and CI/CD pipelines
- Experience working with cloud-based services, including managed services, IaaS, and PaaS
- Excellent problem-solving skills, with a strong track record of leading incident response efforts, driving root cause analysis and resolution, and collaborating with cross-functional teams to resolve technical issues
- Strong technical leadership skills, with experience managing teams and influencing technical decisions at an executive level.
- A commitment to continuous learning and improvement, staying abreast of the latest industry trends and technologies #LI-Hybrid
“HashiCorp is an IBM subsidiary which has been acquired by IBM and will be integrated into the IBM organization. HashiCorp will be the hiring entity. By proceeding with this application you understand that HashiCorp will share your personal information with other IBM subsidiaries involved in your recruitment process, wherever these are located. More information on how IBM protects your personal information, including the safeguards in case of cross-border data transfer, are available here: link to IBM privacy statement.”