Our Team
The HashiCorp Core SRE team is responsible for enabling a great experience for our customers by providing HashiCorp engineers with the tools they need to make our products and services more reliable. Our focus is on making it easy for HashiCorp engineers to respond to incidents, learn from them, and understand the current state of reliability in HashiCorp Cloud Platform, while collaborating with them to measure reliability metrics and establish benchmarks such as SLOs. The team's work combines product engineering practices with SRE engagements with the engineering teams that operate services. We also focus on telling the story of reliability within HashiCorp on a broad scale, collaborating with engineering teams to build common goals and presenting to larger audiences within the organization. As an enablement team at our core, we consult and collaborate with nearly every engineering team in HashiCorp to achieve our objective of improving reliability.
About this Role
This is a high-level role on a team responsible for helping engineers across the organization improve reliability. We are looking for a technical leader who is highly collaborative, strategic, and motivated to advocate for and help enable data-driven decisions across a broad and varied audience. You will have the opportunity to empower HashiCorp Cloud Platform with health insights tooling to drive and support ongoing platform improvements.
In this role, you can expect to:
- Drive operational excellence through reliability tooling and best practices
- Build trust and frequently collaborate within a team of SREs and with engineers across HashiCorp Cloud Platform products
- Make understanding our operational posture and resolving incidents easier for multiple engineering teams and product systems
- Talk to engineers about their operational concerns and help them analyze data and build solutions that address their needs
- Participate in crucial decision-making related to various reliability programs
- Deliver elegant, user-focused solutions that address the reliability and uptime challenges we face in our cloud products
You may be a good fit for our team if you:
- Have worked on a team of SREs or engineers to improve reliability
- Love data: analyzing it and helping teams draw conclusions from it
- Are an effective communicator and collaborator, comfortable influencing others through data-driven advocacy
- Can identify both pragmatic and ideal solutions by focusing on customer feedback and incremental improvement
- Are a technical leader capable of crafting long-term strategy and vision
- Have expertise in one or more of the major public clouds
- Have professional backend software development experience in cloud environments
#LI-Remote