Infrastructure Site Reliability Engineer

Posted:
9/4/2024, 4:17:07 AM

Location(s):
Connecticut, United States

Experience Level(s):
Junior ⋅ Mid Level ⋅ Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Bring your heart to CVS Health. Every one of us at CVS Health shares a single, clear purpose: Bringing our heart to every moment of your health. This purpose guides our commitment to deliver enhanced human-centric health care for a rapidly changing world. Anchored in our brand — with heart at its center — our purpose sends a personal message that how we deliver our services is just as important as what we deliver.
 
Our Heart At Work Behaviors™ support this purpose. We want everyone who works at CVS Health to feel empowered by the role they play in transforming our culture and accelerating our ability to innovate and deliver solutions to make health care more personal, convenient and affordable.

Position Summary

As an Infrastructure Site Reliability Engineer, you will be responsible for designing, implementing, and managing the infrastructure systems and tools that enable reliability and performance of our technology platforms supporting various business initiatives within CVS Health. This role requires a strong background in infrastructure engineering and a commitment to proactive monitoring, troubleshooting, and optimizing systems for maximum uptime and performance. Collaborating with diverse teams, you will prioritize high availability, scalability, and resilience to ensure our platforms and services consistently meet and exceed customer expectations.
 

Primary Responsibilities:

1. Operations: Manage and maintain various systems and infrastructure, such as servers, storage, mainframe, iSeries, backup, archive, and recovery, ensuring the platforms have high availability, scalability, and reliability to meet the business requirements. Participate in on-call rotation to ensure availability and uptime of critical systems and provide timely response and resolution to incidents. Develop and maintain best practices documentation, including system architecture diagrams, standard operating procedures, and runbooks. Perform system and application performance analysis, utilizing monitoring tools, logging systems, and other relevant metrics, to identify and resolve issues and enhance overall system performance.

2.    Process Improvement: Streamline and optimize operational processes, procedures, and documentation by implementing industry best practices. Develop, modify, and implement incident and problem management processes to increase efficiency and reduce downtime. Establish a comprehensive SRE process that encompasses the entire software team, ensuring seamless operations and prompt resolution of any escalated issues.


3.    System Support: Collaborate with development teams to participate in code reviews, performance optimization, and application deployment processes. Drive reliability engineering practices, including monitoring, alerting, incident management, capacity planning, and disaster recovery. Automate infrastructure deployments, upgrades, and maintenance tasks, utilizing configuration management tools like Ansible and infrastructure-as-code frameworks such as Terraform. Stay abreast of industry trends, emerging technologies, and best practices in infrastructure site reliability engineering and apply knowledge to continually improve CVS Health's systems and processes. Provide customer support with meticulously documented procedures, enabling them to proficiently address customer complaints and deliver optimal service.


4.    Capacity Management: Analyze historical usage patterns and growth projections to forecast future capacity requirements. Collaborate with stakeholders such as developers, product managers, and operations teams to understand the demand for resources and estimate the necessary infrastructure capacity. Establish and maintain monitoring systems to track the performance and utilization of critical resources. Identify potential bottlenecks, anomalies, or areas of improvement. Perform regular performance reviews help ensure systems meet defined service-level objectives (SLOs) and key performance indicators (KPIs).


Required Qualifications

  • 7+ years of experience in Infrastructure Engineering, System Administration, or related roles.
  • 3+ years of experience with cloud platforms (e.g., Amazon Web Services, Microsoft Azure) and infrastructure-as-code tools (e.g., Terraform, CloudFormation).
  • 2+ years of experience in at least one configuration management tool such as Ansible, Puppet, or Chef.
  • 2+ years of experience with containerization technologies such as Docker and container orchestration platforms like Kubernetes.
  • 2+ years of experience in networking principles and protocols, including TCP/IP, DNS, load balancing, and firewalls.
  • 1+ years of experience with incident management, performance monitoring, and capacity planning tools.
     

Preferred Qualifications

  • Excellent troubleshooting and problem-solving skills, with the ability to identify, communicate, and resolve technical issues swiftly.

Education

  • Bachelor’s degree or equivalent experience (High School Diploma and 4 years relevant experience)

Pay Range

The typical pay range for this role is:

$118,450.00 - $260,590.00


This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls.  The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors.  This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above.  This position also includes an award target in the company’s equity award program. 
 
In addition to your compensation, enjoy the rewards of an organization that puts our heart into caring for our colleagues and our communities.  The Company offers a full range of medical, dental, and vision benefits.  Eligible employees may enroll in the Company’s 401(k) retirement savings plan, and an Employee Stock Purchase Plan is also available for eligible employees.  The Company provides a fully-paid term life insurance plan to eligible employees, and short-term and long term disability benefits. CVS Health also offers numerous well-being programs, education assistance, free development courses, a CVS store discount, and discount programs with participating partners.  As for time off, Company employees enjoy Paid Time Off (“PTO”) or vacation pay, as well as paid holidays throughout the calendar year. Number of paid holidays, sick time and other time off are provided consistent with relevant state law and Company policies. 
 
For more detailed information on available benefits, please visit Benefits | CVS Health

We anticipate the application window for this opening will close on: 09/23/2024

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.