Staff Site Reliability Engineer

San Francisco, California

Share on Twitter.
Share on Facebook.
Share on LinkedIn.
Apply For This RoleBrowse open rolesLearn About Sentry

About Sentry

Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology.

With more than $217 million in funding and 85,000 customers that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney, Microsoft, and Atlassian spend less time fixing bugs and more time building products. If you like to selfishly build things that make your digital life better, come help us build the next generation of software monitoring tools.

About the role

The Engineering Operations team is responsible for the deployment, configuration, maintenance and monitoring of Sentry's hosted platform. We do this by leveraging automation tools to automatically spin up and scale services to meet the traffic demands of 1,000,000+ developers. Sentry receives over a billion events a day, and process terabytes of data to return complex aggregations with sub-second latency.

As a Staff SRE, you will lead efforts to enable our systems to scale and be fault-tolerant, handling 100x our current volume while maintaining our SLOs. You’ll do this by working across engineering to implement processes, engrain principles, and develop solutions that improve the performance and resilience of their services. You’ll contribute to our vision of Engineering Operations in a world of cloud providers and most of all, help shape the vision of SRE best practices here at Sentry.

If you're looking for a high-impact role where you move a company from processing "big data" to "really big data", this could be the job for you.

In this role you will

  • Work across Sentry to ensure the uptime and reliability of Sentry's hosted platform
  • Architect and automate services and systems to meet the demand of scale
  • Analyze and tune systems to operate at maximum efficiency
  • Collaborate with other engineering teams to deploy and scale new and existing services
  • Lead design and discussions around deliverables the team is working towards
  • Be a member of the Engineering Operations team's on-call rotation, and be available to respond and resolve critical issues

You'll love this job if you

  • Enjoy working with others to improve scalability and performance
  • Aren't afraid to dig into Linux internals during the troubleshooting process
  • Are experienced in leading the way to a solution when faced with system limitations or frailty
  • Seen networks make and break hosted solutions; and have direct experience with growing and maintaining distributed systems
  • Are familiar with the various SaaS ecosystems and have taken ownership of a service you once knew nothing about
  • Have a story (or two) of royally goofing it and can tell us why it would never happen again under your watch

Qualifications

  • 10+ years relevant experience
  • Experience with implementing good processes and solutions
  • Strong knowledge of replicated and distributed data storage systems
  • You have experience with some or all of the following tools we leverage:
    • System Administration: Debian, Docker, Kubernetes,
    • Databases: PostgreSQL, ClickHouse, Redis
    • Environment Management: Saltstack, TerraformGoogle Cloud Environment
    • TCP/HTTP Routing: HAProxy, NGINX, Envoy
    • Data Platforms: Kafka, RabbitMQ, Memcached
  • Excellent written and oral communication skills and ability to articulate technical concepts clearly and succinctly
  • Experience in Python
  • In the San Francisco Bay Area
  • Current, valid work permit for the United States

Benefits

  • Competitive salary and meaningful equity
  • 100% medical, dental, and vision coverage for employees, 75% company-paid for dependents
  • Monthly commuter subsidy
  • 401k program
  • Learning & Development stipend
  • Charitable matching program
  • Generous parental leave policy
  • Flexible working schedule and vacation policy, work from home policy, and real work/life balance
  • Catered lunches
  • Company events (Hack Weeks, All Hands, quarterly social events) and friends and family events
  • Relocation assistance - you are living in, or willing to relocate to the San Francisco Bay Area

COVID Vaccine Required - Reasonable Accommodations for Medical or Religious Reasons Considered

Sentry values diversity and inclusivity in our company and is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Apply For This Role
© 2023 • Sentry is a registered Trademark
of Functional Software, Inc.