portal resources jobs companies s segment.io, inc. site reliability engineer

Site Reliability Engineer


At Segment, we believe companies should be able to send their data wherever they want, whenever they want, with no fuss. Unfortunately, most product managers, analysts, and marketers spend too much time searching for the data they need, while engineers are stuck integrating the tools they want to use. Segment standardizes and streamlines data infrastructure with a single platform that collects, unifies, and sends data to hundreds of business tools with the flip of a switch. That way, our customers can focus on building amazing products and personalized messages for their customers, letting us take care of the complexities of processing their customer data reliably at scale. We’re in the running to power the entire customer data ecosystem, and we need the best people to take the market.  The Infrastructure Engineering group is central to Segment’s Platform strategy. The ecosystem of tools that your team creates and supports are the foundation for the services built by Product teams. In order to maintain our leadership position in the customer engagement space we must continue to build innovative services that support our developers in seamlessly delivering value to customers.  You will partner with some of the brightest minds in the industry to push the boundaries of web-scale service delivery. As a member of the Site Reliability Engineering (SRE) team, you’ll help to empower our entire R&D organization. Alongside a diverse distributed Infrastructure group you’ll participate in building the next iteration of our service platform; focusing on the reliability, operability, observability, flexibility, and cost-effectiveness of our production infrastructure.

What you’ll do

  • Write software to build, maintain, automate, and introspect our production systems
  • Mentor teams to reliably and cost effectively operate and maintain their services
  • Build the next version of Segment’s Service Platform (focused on deployment and observability) to support teams in deploying hundreds of services across a multi-region cloud environment
  • Take proactive steps to improve our availability, reliability, and efficiency
  • Participate in driving Segment as a market leader in the development of Open Source Software like kafka-go [1], chamber [2], kubeapply [3], etc.
  • Participate in an on-call rotation to support our business-critical infrastructure

What you’ll bring

  • Minimum of 5 years experience as a Software Engineer, Systems Administrator, Operations Engineer, Site Reliability Engineer, or another similar role
  • A systematic problem-solving approach, coupled with good communication skills, sense of ownership, and drive
  • Experience operating large-scale, distributed systems on top of cloud infrastructure such as Amazon Web Services (AWS) or Google Compute Platform (GCP)
  • Experience programming in one or more of the following: Go, Python, Node.js, Bash, or similar languages
  • A proven grasp of Linux systems administration and programming concepts

We’re especially excited about candidates who:

  • Have hands-on experience with container orchestration frameworks (e.g. Kubernetes, EKS, ECS)
  • Have hands-on experience in operating event-based systems (e.g. Kafka) capable of processing millions of events per second and petabytes of data each month
  • Possess a broad understanding of the Linux kernel internals and networking protocols
  • Are proficient in metrics tooling such as Datadog and Prometheus
  • Have lead teams, large projects, or been the owner of an important system
 We encourage you to apply if this role excites you - even if you think you may not meet all of the qualifications. At Segment we live by four values: karma, drive, tribe and focus. We are always looking for outstanding individuals with diverse backgrounds and perspectives who embody these values. To learn more about life at Segment and our commitment to diversity, equity, and inclusion, visit our LinkedIn [4] page. We’re excited to meet you! Segment is an equal opportunity employer. We believe that everyone should receive equal consideration and treatment in all terms and conditions of employment regardless of sex, gender (including pregnancy, childbirth, breastfeeding or related medical conditions), sexual orientation, gender identity, gender expression, race, color, religion, creed, national origin, ancestry, age (over 40), physical disability, mental disability, medical condition, genetic information, marital status, domestic partner status, military or veteran status, height, weight, AIDS/HIV status, and any other protected category under federal, state or local law. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. #LI-Remote

  1. https://github.com/segmentio/kafka-go
  2. https://github.com/segmentio/chamber
  3. https://github.com/segmentio/kubeapply
  4. https://www.linkedin.com/company/segment-io/life/158aab60-769e-4b14-8505-f8063f6162f6/

Other openings you might be interested in

Site Reliability Engineer (Remote - India)

Site Reliability Engineer (Remote - India)

Sysdig is the secure DevOps company, and we’re at the forefront of the container and Kubernetes revolution. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure and operate cloud-native appl

today
Site Reliability Engineer

Site Reliability Engineer

Hearsay is looking to hire a knowledgeable, ambitious Site Reliability Engineer to join our global Site Reliability team. The Hearsay Platform is the pre-eminent omni-channel digital engagement center for the financial services advisor. It tethers th

this week
Staff Site Reliability Engineer, Service Mesh

Staff Site Reliability Engineer, Service Mesh

Staff Site Reliability Engineer, Service Mesh  Location: 100% Remote Wayfair believes everyone should live in a home they love. Through technology and innovation, Wayfair makes it possible for shoppers to quickly and easily find exactly what they w

this week
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Paubox provides secure communication for modern healthcare. Right out of the box. We are a fast growing B2B startup based in San Francisco, ranking #320 on the Inc. 5000 list of fastest growing privately owned companies. Our core solutions are HIPAA

this week
Senior Site Reliability / DevOps Engineer (Agent)

Senior Site Reliability / DevOps Engineer (Agent)

Netdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in Javascript, Go, C, Python or other scripting languages, to join our distributed (remote) engine

last week
Site Reliability Engineer

Site Reliability Engineer

Doximity is transforming the healthcare industry. Our mission is to help doctors be more productive, informed, and connected. As a software engineer, you'll work within cross-functional delivery teams alongside other engineers, designers, and product

last week
Senior Site Reliability Engineer - Kubernetes & Terraform

Senior Site Reliability Engineer - Kubernetes & Terraform

Remote - Hiring in North America  At Urbint, our mission is to create safer and more resilient communities using AI. We are passionate about taking data about our changing world – from climate, to urbanization, to infrastructure risk – and harnessing

last week
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Who we are and what we do SurveyMonkey (Nasdaq: SVMK), is a leader in agile software solutions for customer experience, market research, and survey feedback. Our mission is to power the curious and enable organizations, including 98% of the Fortune

last week
More remote jobs

Other jobs at Segment.io, Inc.

4 jobs in the last 60 days · 4 in total · avg 21.59 jobs/mo · 92 job visits

Director of Infrastructure Engineering

Director of Infrastructure Engineering

At Segment, we believe companies should be able to send their data wherever they want, whenever they want, with no fuss. Unfortunately, most product managers, analysts, and marketers spend too much time searching for the data they need, while enginee

yesterday
Corporate Infrastructure Security Engineer

Corporate Infrastructure Security Engineer

Overview   Thousands of companies send their most sensitive data through Segment daily: personal data, user actions, and sensitive revenue metrics. Those companies have thousands (even millions) of customers each. Segment, as the platform that connec

this week
Site Reliability Engineer

Site Reliability Engineer

At Segment, we believe companies should be able to send their data wherever they want, whenever they want, with no fuss. Unfortunately, most product managers, analysts, and marketers spend too much time searching for the data they need, while enginee

this week
Corporate Infrastructure Security Engineer

Corporate Infrastructure Security Engineer

Overview   Thousands of companies send their most sensitive data through Segment daily: personal data, user actions, and sensitive revenue metrics. Those companies have thousands (even millions) of customers each. Segment, as the platform that connec

this week
Segment.io, Inc.