Sr. Site Reliability Engineer - Infrastructure (Hybrid)
As a Senior Site Reliability Engineer focusing on Infrastructure at HashiCorp, you will be central to our mission of maintaining and enhancing the backbone of our cloud infrastructure. With over 6 years of experience in site reliability engineering or a related field, you will leverage your deep understanding of DNS, CDN, Artifactory, base images, and HashiStack to ensure our infrastructure is not only robust and scalable but also optimized for performance and cost.
Key Responsibilities- Infrastructure Optimization: Develop and implement strategies to enhance the performance, scalability, and reliability of HashiCorp's infrastructure, focusing on DNS, CDN, Artifactory, base images, and the HashiStack.
- Automation and Tooling: Design and refine tools and processes for automating infrastructure deployment and management, increasing efficiency and reducing manual overhead.
- Monitoring and Incident Response: Implement comprehensive monitoring solutions to proactively identify and address issues. Lead the response to infrastructure incidents, minimizing impact on service availability and performance.
- Collaboration and Planning: Work closely with engineering and product teams to align infrastructure development with company objectives. Contribute to infrastructure planning, capacity management, and architectural improvements.
- Knowledge Sharing: Serve as a subject matter expert in infrastructure technologies and practices. Mentor junior team members and share insights with the broader engineering team to foster a culture of learning and improvement.
- 6+ years of experience in site reliability engineering, infrastructure engineering, or a closely related field, with a proven track record of managing large-scale, cloud-based infrastructure.
- Deep technical expertise in managing and optimizing DNS, CDN, Artifactory, and cloud infrastructure services, with hands-on experience with HashiCorp tools (Terraform, Vault, Consul, Nomad) preferred.
- Strong background in infrastructure as code, automation tooling, and cloud platforms (AWS, Azure, GCP).
- Excellent problem-solving abilities, with the capacity to lead incident response efforts and drive root cause analysis and resolution.
- Effective communication skills, capable of collaborating with cross-functional teams and articulating technical concepts to a non-technical audience.
- A commitment to continuous learning and improvement, staying abreast of the latest industry trends and technologies.
#LI-hybrid
ALERT: HashiCorp has received reports of scams where individuals purporting to represent HashiCorp conduct bogus “employment interviews” via email or text, and then request payment as a condition for receiving an offer of employment. HashiCorp and its subsidiaries do not conduct interviews by email or text, and will never request payment as a condition for applying for a position or receiving an offer of employment. These scam operators may also ask for your personal information (name, address, birthdate, social security number, etc.), which you should not provide to them. If you have been the target of such a scam, you should report it to the U.S. Federal Trade Commission (see this FTC posting for further details: https://www.consumer.ftc.gov/articles/job-scams) the office of your state Attorney General, or the government agency responsible for investigating matters such as this where you reside.