portal resources jobs companies b braze site reliability engineering manager, kubernetes platform

Site Reliability Engineering Manager, Kubernetes Platform 🔥


WHO WE ARE

Braze delivers customer experiences across email, mobile, SMS, and web. Customers, including Burger King, Delivery Hero, HBO Max, Mercari, and Venmo, use the Braze platform to facilitate real-time experiences between brands and consumers in a more authentic and human way. And we do it at scale – each month, hundreds of billions of messages are sent to a network of over 3 billion active users through Braze.

Need more proof? Braze was named a Leader in the Forrester Wave™: Cross-Channel Campaign Management (Independent Platforms), Q3 2021, and was named to the Forbes Cloud 100 list for the fourth consecutive year. The company has also been selected as one of Fortune’s Best Workplace for Millennials in 2021, and was ranked #20 on Fortune’s Best Medium Sized Workplaces in 2021. Braze is certified as a Great Place to Work in the UK and the U.S. and is recognized as one of the UK's Best Workplaces for Women.

Site Reliability at Braze is a blend of sensible system administrators and software engineers that apply sound engineering principles, operational discipline, and mature automation to the environments and infrastructure products we provide to the business. We specialize in systems–whether it be networking, the Linux kernel, or some more specific interest in scaling–algorithms, or distributed systems. Braze is looking for an experienced Site Reliability Manager to grow our newly formed Kubernetes Platform team. The Kubernetes Platform Team at Braze is be responsible for improving the Kubernetes based products that keep Braze’s applications running smoothly.

WHAT YOU'LL DO:

As a Site Reliability Engineering Manager at Braze, you will be responsible for managing and building the team and products that continuously improve the performance, availability, infrastructure, and tooling that are critical to Braze’s success.  You will help solve major engineering challenges by:

  • Guiding key architectural decisions to ensure we effectively utilize infrastructure in a scalable, reliable manner
  • Reducing operational pain and improving the day-to-day workflow of Braze’s engineering teams via increased visibility, monitoring, scalable processes, and automation
  • Perform deep retrospectives on everything that happens to turn lessons into system improvements/changes, automation, etc

As the team’s engineering leader, you and the team you build will ensure the reliability of both our existing and future infrastructure products. You will recruit, coach, and develop the team’s engineers - providing an environment where they fully own projects while helping to remove blockers including being hands-on and writing code when required.  Finally, you will have broad responsibility for the team’s vision, goals, and processes.

WHO YOU ARE:

  • 2+ years of experience managing reliability-focused infrastructure development teams
  • 3+ years of experience running Kubernetes or another compute orchestration platform at scale
  • 2+ years of experience as a Software, DevOps, or Site Reliability Engineer
  • Successful track record of recruiting, building, and growing great teams
  • Proven ability in strategic planning, change management, cost optimization, and project management
  • Effective resource management of multiple concurrent projects, some of which with competing priorities
  • Thrive on a high level of autonomy and responsibility while having the ability to dig into technical details to inform decisions
  • High level of ownership and accountability for high-quality work, both personally and at a team level

WHAT WE OFFER

  • Competitive compensation that includes equity
  • Generous time off policy to balance your work and life, including paid parental leave
  • Competitive medical, dental, and vision coverage for you and your dependents
  • Collaborative, transparent, and fun loving office culture

If you are a California resident subject to the California Consumer Privacy Act, click here [1] to understand how Braze processes your personal information and how you can exercise your rights.

If you are located in the EU or UK visit our privacy policy [2] to understand how Braze processes your personal information and how you can exercise your rights.


  1. http://info.braze.com/rs/367-GUY-242/images/CCPA%20Notice%20%28Candidates%29.pdf
  2. https://www.braze.com/privacy

Other jobs at Braze

2 jobs in the last 60 days · 2 jobs in total · avg 1 - 3 jobs/mo · 2434 job visits

Braze

Let us send you new openings similar to Site Reliability Engineering Manager, Kubernetes Platform straight to your Inbox. Weekly or Daily. 7-day free trial đź’Ś

The ability to work remotely increases employee happiness by 20 percent.