
Associate Principal Site Reliability Engineer - Remote


We're a Little Different

Our mission is clear. We bring to life a healing ministry through our compassionate care and exceptional service.

At Mercy, we believe in careers that match the unique gifts of unique individuals - careers that not only make the most of your skills and talents, but also your heart. Join us and discover why Modern Healthcare Magazine named us in its "Top 100 Places to Work."

Overview: Associate Principal Site Reliability Engineer

Hybrid: Mostly Remote (Work from home) with ability to be onsite for weekly meetings as needed.

As we execute on our vision of transforming health in the communities we serve, we are looking for an experienced SRE to join our rapidly growing Digital Software Engineering team. As a Site Reliability Engineer (SRE), you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operational problems. Much of your support and development work focuses on optimizing new and existing systems, building infrastructure, and reducing manual work through automation. You'll interact with a diverse range of system types: public-facing web and API developer services, pipeline-based transformation systems, and internal business systems. As an SRE, you will be laser-focused on the design, reliability, and scalability of the Azure Cloud platform, giving our patients an always-available experience while keeping an eye on cost, efficiency, and security. If you are a natural leader and passionate about cloud technology and automation, this could be the career you've been looking for!

Job Description Responsibilities:
  • Design, build, and implement improvements to optimize performance, quality, and security of cloud and on-prem hosted solutions.
  • Help support and mature agile workflows and strengthen security posture by optimizing DevSecOps processes including designing, building, implementing, and optimizing CI/CD pipelines.
  • Design and code software solutions that help automate manual processes.
  • Build reporting and dashboards to provide transparency; anticipate and identify key platform performance metrics; implement monitoring and alerting for when thresholds are exceeded.
  • Create Management Groups and manage subscriptions, resource groups, and resources (creation, RBAC design, security).
  • Provide architectural and practical guidance to software development team to improve application resiliency, efficiency, performance, and costs.
  • Capacity planning and management - anticipate, create, use, and maintain a capacity model for on-prem and cloud hosting.
  • Work with the production operations team to resolve trouble tickets, develop and run scripts, and facilitate blameless postmortems.
  • Provide technical insight on development projects.
  • Develop, document, communicate, and enforce a cloud technology standards policy.
  • Conduct research on emerging technologies in support of development efforts and recommend and implement solutions that will increase cost effectiveness and infrastructure flexibility.

Qualifications:
  • Experience: 9+ years of experience as a Site Reliability Engineer or comparable role working with cloud (Azure preferred) and on-prem hosted solutions.
  • Required Education: Bachelor's degree in related field, specialized training, or equivalent work experience.
  • Other: Proven experience optimizing cloud & on-prem hosted applications and services based on key performance metrics. Ability to debug and optimize code and automate routine processes.
  • Experience configuring Azure API Management and App Service policies preferred.
  • Experience with scale testing, disaster recovery, and capacity planning.
  • Familiarity with microservices architecture and container orchestration with Kubernetes.
  • Strong understanding of, and experience with, configuring cloud & on-prem technologies, building & optimizing CI/CD, and Infrastructure-as-Code. Preferred tools: Azure Pipelines, Azure DevOps, Terraform, PowerShell, Python, Git, Jenkins, Maven, Ansible.
  • Expertise with the full Microsoft stack, including AD, DHCP, DNS, DFS Namespaces, and Windows Server, as well as Linux environments.
  • Good understanding of networking solutions, including Load Balancer, VNet, peering, etc., desired.
  • Experience growing talent and leading less senior coworkers in developing skills in this competency.
  • We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.

We Offer Great Benefits:

Day-one comprehensive health, vision and dental coverage, PTO, tuition reimbursement and employer-matched retirement funds are just a few of the great benefits offered to eligible co-workers, including those working 48 hours or more per pay period!

We're bringing to life a healing ministry through compassionate care.

At Mercy, our supportive community will be behind you every step of your day, especially the tough ones. You will have opportunities to pioneer new models of care and transform the health care experience through advanced technology and innovative procedures. We're expanding to help our communities grow. Join us and be a part of it all.

What Makes You a Good Match for Mercy?

Compassion and professionalism go hand-in-hand with us. Having a positive outlook and a strong sense of advocacy is in perfect step with our mission and vision. We're also collaborative and unafraid to do a little extra to deliver excellent care - that's just part of our commitment. If that sounds like a good fit for you, we encourage you to apply.

Mercy has determined this is a safety-sensitive position. The ability to work in a constant state of alertness and in a safe manner is an essential function of this job.
