Principal Site Reliability Engineer

Posted:
10/4/2024, 10:54:17 AM

Location(s):
Austin, Texas, United States ⋅ Texas, United States

Experience Level(s):
Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

About Care.com 

Care.com is a consumer tech company with heart. We’re on a mission to solve a human challenge we all face: finding great care for the ones we love. We’re moms and dads and pet parents. We have parents and grandparents, so we understand that everyone, at some point in their lives, could use a helping hand. Our culture and our products reflect that. 

Here, entrepreneurs, self-starters, team players, and big thinkers unite behind a common cause. Here, we’re applying data analytics, AI, and the latest technologies to solve universal problems and connect people in new ways. If you like having autonomy, if you thrive on collaboration and building new things, and if you’re all about using your talent for good, Care.com is the place for you. 

Work Environment: Hybrid (In Office - Monday, Wednesday & Friday) 
Locations: Salt Lake City | Austin | Dallas

What You’ll Be Working On
As a Principal Site Reliability Engineer (SRE), you will be responsible for ensuring the reliability, scalability, and performance of our critical systems. You’ll lead incident response, manage releases, improve observability, and collaborate across development and operations teams to drive continuous improvements.

Key Responsibilities

  • Release Management: Coordinate releases for applications, ensuring efficient deployment and smooth rollbacks.
  • Incident Response: Lead incident management, facilitate root cause analysis, and continuously update response processes.
  • Monitoring & Alerting: Implement proactive monitoring, create dashboards, and set up real-time alerts for critical services.
  • Hypercare: Ensure system stability during critical post-release periods, monitoring performance and preventing incidents.
  • Collaboration with Dev & QA: Work closely with developers and QA teams to ensure performance benchmarks and observability goals are met.
  • SLI/SLA/SLO Management: Define and measure service levels for key workflows and APIs, ensuring alignment with business expectations.
  • Observability Maturity: Continuously assess and improve observability practices across teams, driving data-driven insights.

What You’ll Need to Succeed

  • 6+ years of experience in SRE or DevOps roles with a focus on monoliths and distributed microservices in cloud environments (AWS, GCP).
  • Proficiency in CI/CD tools (Jenkins, Terraform, Ansible).
  • Strong experience with Kubernetes, Docker, and JVM-based monoliths.
  • Expertise in monitoring tools (SignalFX, Splunk, Amplitude) and production incident management.
  • Scripting skills (Python, Bash, or Groovy).
  • Strong understanding of cloud-based systems and containerization.
  • Excellent communication skills and a collaborative approach to working cross-functionally.
  • Experience optimizing large-scale, customer-facing platforms in fast-paced environments.

For a list of our Perks + Benefits, click here!

Care.com supports diverse families and communities and seeks employees who are just as diverse. As an equal opportunity employer, Care.com recognizes the power of a diverse and inclusive workforce and encourages applications from individuals with varied experiences, perspectives, and backgrounds. Care.com is committed to providing reasonable accommodations for qualified individuals with disabilities. If you need assistance or accommodation, please reach out to [email protected].

Company Overview:

Available in 21 countries, Care.com is one of the largest providers of online services for finding family care and care jobs, spanning in-home and in-center care solutions. Since 2007, families have relied on Care.com for an array of care for children, seniors, pets, and the home.  Designed to meet the evolving needs of today’s families and caregivers, the Company also offers customized corporate benefits packages to support working families, household tax and payroll services, and innovations for caregivers to find and book jobs. Care.com is an IAC company (NASDAQ: IAC).

Salary Range: $180,000 to $200,000. 

The base salary range above represents the anticipated low and high end of the national salary range for this position. Actual salaries may vary and may be above or below the range based on various factors including but not limited to work location, experience, and performance. The range listed is just one component of Care.com’s total compensation package for employees. Other rewards may include annual bonuses and short- and long-term incentives. In addition, Care.com provides a variety of benefits to employees, including health insurance coverage, life, and disability insurance, a generous 401K employer matching program, paid holidays, and paid time off (PTO).