Senior Site Reliability Engineer (Remote, Pacific Time)

Job post summary
Location Americas
Team Engineering & Development

About the role

At Shopify, we ship on quality instead of time. Our teams deploy new code many times a day, and our production scale is massive. Shopify has many critical components, and sometimes they fail. That's where the Resiliency team comes in, ensuring we can get back to green as fast as possible when that happens. Resiliency set the foundation for building and running resilient systems at Shopify.  This is a team of engineers with in-depth operational knowledge of the entire Shopify stack, and who act as first responders and leaders during an incident.  

Our job is to get to a resolution as quickly as possible, and guide teams to build a more resilient Shopify. We build whatever is necessary to quickly resolve incidents, and seek out ways to automate away the manual toil.

Commerce happens 24/7, so we are building out a globally distributed team that can respond whenever necessary. Our team hires across 4 different regions (APAC, North America West, North America East, and EMEA) in a follow-the-sun support model that provides 24/7 coverage for incident management.

We welcome remote candidates based the Americas in the Pacific time zone. Working hours skew toward Hawaii Standard Time  (UTC -10:00). Relocation to Hawaii is an option for the right candidate🏄🏾‍♀️

 

What’s in it for you:

  • Help Shopify run its planet scale systems by enabling our engineering teams to create resilient systems.
  • Work on uniquely interesting and challenging technical problems that aren't easily found elsewhere.
  • Help define what Resiliency and Site Reliability Engineering means for Shopify.
  • Have a direct impact on building continual stability for our millions of merchants to generate revenue for their livelihood, their families, and their employees, through the business they’ve built from the ground up on our platform.
  • Possibility of relocation to a region the team operates within.

What you'll do:

  • Respond to automated alerts and execute playbooks
  • Manage ongoing incidents, using your understanding of Shopify to involve the right teams and resolve as quickly as possible.
  • Clean up the noise in our signals, ensuring we can get an understanding of the system and easily debug a problem.
  • Set the standards with engineering teams across the company for building resilient, debuggable systems.
  • Ensure we never fail for the same reason twice.
  • Follow up on each incident to ensure the appropriate action items are in place and prioritized.
  • Help teams build tools to automate the toil of on call duties.

About you:

  • Based in the Americas, based in the Pacific time zone, and willing to work Hawaii Standard Time (UTC -10:00) There is also the possibility of relocation to Hawaii for the right candidate. 🏄🏾‍♀️
  • You have experience handling on call shifts for mission-critical systems.
  • You know what good observability looks like, but more importantly, how to get there.
  • You have been responsible for the tools and processes used to debug and correct failures in those systems.
  • You have strong software engineering skills, primarily in backend software development. You have experience diving into Java, Go, Python, and/or Ruby code. 
  • You are a developer comfortable navigating through multiple programming languages and digging deep in the stack..
  • You reject the idea that on call has to be a terrible, disruptive experience.
  • You’re comfortable with hands-on development using cloud infrastructure (AWS, GCE, Azure, Kubernetes, Docker).
  • You understand how to improve difficult situations through short and iterative projects.

Nice to have but not necessary:

  • You have handled multiple on call shifts, and navigated more than one incident through to the retrospective process.
  • You have experience working with a variety of open-source software including nginx, redis, Memcached, and MySQL.
  • You have familiarity with network and web protocols, from IP to HTTP.

Qualifications

We know that looking for a new role can be both exciting and time-consuming, and we truly appreciate your effort. Brad is an actual real live person (👋🏻) and is looking forward to learning more about you through your application. And remember, we want to know what you're really interested in building and why you want to build it at Shopify, so please give us as much detail on this as you'd like in the answers on the next page. 👍 📖

As there are multiple positions, this posting will remain live until all positions have been filled. Successful candidates can expect to hear back from us within 1-3weeks of application.


Our belief is that a strong commitment to diversity & inclusion enables us to truly make commerce better for everyone. We encourage applications from Indigenous peoples, racialized people, people with disabilities, people from gender and sexually diverse communities, and/or people with intersectional identities. Please take a look at our Sustainability Reports to learn more about Shopify’s commitments to our communities, and our planet.

At Shopify, we understand that experience comes in many forms. We’re dedicated to adding new perspectives to the team - so if your experience is this close to what we’re looking for, please consider applying.

Our belief is that a strong commitment to diversity & inclusion enables us to truly make commerce better for everyone. We encourage applications from Indigenous peoples, racialized people, people with disabilities, people from gender and sexually diverse communities, and/or people with intersectional identities. Please take a look at our Sustainability Reports to learn more about Shopify’s commitments to our communities, and our planet.

At Shopify, we understand that experience comes in many forms. We’re dedicated to adding new perspectives to the team - so if your experience is this close to what we’re looking for, please consider applying.

How we hire

At Shopify, we put a lot of care and time into who we hire. We believe that in order to build the best products, we need to build high impact teams. Our recruitment process centres around what we call the Life Story interview, a conversational-style interview where we get to learn more about you.
Learn more about our hiring process 

Not what you’re looking for?Check out these similar roles.

Job postings for similar
Position Team Location
Senior Backend Software Engineer (Remote, Americas) Engineering & Development Americas
Senior Software Engineer, Shop – Backend, Mobile, or Frontend Development (Remote, Americas) Engineering & Development Americas
Senior Mobile Software Engineer (Remote, Americas) Engineering & Development Americas
Tech Lead Software Engineer, Shop – Backend, Mobile, or Frontend Development (Remote, Americas) Engineering & Development Americas
Tech Lead Software Engineer – Backend, Mobile, or Frontend Development (Remote, Americas) Engineering & Development Americas
Software Engineering Manager – Backend, Mobile, or Frontend Development (Remote, Americas) Engineering & Development Americas
Senior Tech Lead Software Engineer, Shop – Backend, Mobile, or Frontend Development (Remote, Americas) Engineering & Development Americas
Senior Tech Lead Software Engineer – Backend, Mobile, or Frontend Development (Remote, Americas) Engineering & Development Americas
Senior Frontend Software Engineer, Marketing (Remote, Americas) Engineering & Development Americas
Senior Frontend Software Engineer, Product (Remote, Americas) Engineering & Development Americas
Senior Infrastructure Software Engineer (Remote, Americas) Engineering & Development Americas
Senior Software Engineering Manager (Remote, Americas) Engineering & Development Americas
Lead/Staff Production Engineer (Remote, Americas) Engineering & Development Americas
Production Engineering Manager (Remote, Americas) Engineering & Development Americas
Development Manager - Technical Infrastructure Engineering & Development Americas
Lead/Staff Frontend Software Engineer, Marketing (Remote, Americas) Engineering & Development Americas
Senior Site Reliability Engineer, Resiliency (Remote, Hawaii) Engineering & Development Americas
Lead(Staff) Site Reliability Engineer, Resiliency (Remote, Pacific Time) Engineering & Development Americas
Lead(Staff) Site Reliability Engineer, Resiliency (Remote, Eastern Time) Engineering & Development Americas
Engineering Program Manager, Data (Remote, Americas) Engineering & Development Americas
Pottery store wall with products

Don’t see the right role?

Join our Engineering community, or sign up be alerted when relevant roles are available.

Join our community