bigshyft
HHashiCorp
HashiCorp
Sr. Site Reliability Engineer II
Series E
Start-up
1001-5000 employees
8y - 12y
₹50 - ₹100 LPA
Bengaluru/ Bangalore
CI/CD, Load Balancer, Kubernetes, Terraform, Ansible

Role

Company

Job Description

What makes you a great fit:


  • 8+ years of experience in site reliability engineering, systems administration, or software engineering, with a significant focus on incident response and operational reliability.
  • 3+ years managing, coordinating, and ensuring resolution of major incidents.
  • Professional experience with incident management in cloud environments.
  • Enjoy working on a variety of scopes spanning software engineering, cloud infrastructure, and SRE.
  • Worked with SaaS or another type of managed software offering.
  • Proven track record of managing and resolving incidents in cloud-based environments, with expertise in major public cloud platforms (AWS, GCP, Azure).
  • Understanding of fundamental network technologies like DNS, Load Balancing, SSL, TCP/IP, HTTP
  • Strong understanding of monitoring and alerting systems, with the ability to develop metrics and alarms that accurately reflect system health and operational risks.
  • Experience with incident management tools and practices, including post-mortem analysis and root cause investigation.
  • Passion for consistently responding to and leading complex incidents in a 24x7x365 environment utilizing a globalized follow-the-sun model.
  • Customer-centric attitude with a focus on providing best-in-class incident response for customers and stakeholders
  • Demonstrate strong leadership skills during periods of significant business impact, remaining calm and professional during high-pressure situations
  • A strong desire to drive customer success with partner teams and management on high-profile issues critical to the long-term success of the business
  • Outstanding verbal and written communication skills with the ability to convey information in a meaningful way to both engineers and executive-level management, during and outside of incidents
  • Adaptable to a wide variety of technologies and capable of incident response and troubleshooting activities in complex interconnected environments


All about us
HashiCorp

At HashiCorp, we believe infrastructure enables innovation, and we are helping organizations to operate that infrastructure in the cloud. Our suite of multi-cloud infrastructure automation products — all with open source projects at their core — underpin the most important applications for the largest enterprises in the world. As part of the once-in-a-generation shift to the cloud, organizations of all sizes, from well-known brands to ambitious start-ups, rely on our solutions to provision, secure, connect, and run their business-critical applications so they can deliver essential services, communications tools, and entertainment platforms worldwide.

Employee count
1001-5000 employees
Employment Type
Full Time Job
Company Type
Start-up
Headquarters
San Francisco, California, United States

Apply to Similar Jobs

  • HHashiCorp
    HashiCorp
    Sr. Site Reliability Engineer
    Series E
    Start-up
    1001-5000 employees
    6y - 9y
    ₹30 - ₹60 LPA
    Bengaluru/ Bangalore
    CI/CD, Kubernetes, Terraform, Ansible, Python
  • Nnference
    nference
    Staff Site Reliability Engineer
    Series C
    Start-up
    201-500 employees
    7y - 10y
    ₹30 - ₹50 LPA
    Bengaluru/ Bangalore
    Java, Python, Linux, AWS, CI/CD