Posted:
3/3/2026, 7:22:17 AM
Location(s):
England, United Kingdom ⋅ Swindon, England, United Kingdom
Experience Level(s):
Mid Level ⋅ Senior
Field(s):
DevOps & Infrastructure ⋅ Software Engineering
Workplace Type:
Hybrid
Our vision for the future is based on the idea that transforming financial lives starts by giving our people the freedom to transform their own. We have a flexible work environment, and fluid career paths. We not only encourage but celebrate internal mobility. We also recognize the importance of purpose, well-being, and work-life balance. Within Empower and our communities, we work hard to create a welcoming and inclusive environment, and our associates dedicate thousands of hours to volunteering for causes that matter most to them.
Chart your own path and grow your career while helping more customers achieve financial freedom. Empower Yourself.
***Applicants must be authorized to work for any employer in the U.S. We are unable to sponsor or take over sponsorship of an employment visa at this time, including CPT/OPT.***
We are seeking a Site Reliability Engineer (SRE) to own the reliability, availability, and operational excellence of our AWS-based data platform. This role is focused on applying core SRE principles — production engineering, incident management, root cause elimination, observability, automation, and capacity planning — to large-scale data infrastructure supporting EMR, EMR Serverless, Redshift, DynamoDB, and S3.
You will treat data pipelines and analytics platforms as production systems, designing and enforcing SLAs/SLOs for uptime, performance, scalability, and data freshness. You will lead incident response, perform deep root cause analysis, implement durable fixes, and eliminate toil through automation and infrastructure-as-code.
What you will do:
Own and improve the reliability, stability, scalability, and performance of our core data platforms and services
Provide operational support for large-scale, distributed data systems, ensuring high availability and strong SLAs
Partner closely with full-stack, data, and platform engineering teams to deliver continuous improvements
Operate and support EMR and EMR Serverless (Python/Spark) workloads and data pipelines
Support and optimize Amazon Redshift and DynamoDB in high-throughput, production environments
Design, build, and evolve monitoring, alerting, and observability frameworks with a focus on symptoms, not just outages
Lead incident response, troubleshooting production issues across the full stack and coordinating with internal and external stakeholders
Perform root cause analysis (RCA) and readiness reviews; turn findings into durable fixes and automation
Create and maintain runbooks, SOPs, and operational documentation
Collaborate with engineering teams to optimize performance, reliability, and cost
Participate in an on-call rotation to respond to incidents impacting customer-facing systems
Recommend and influence the use of AWS managed services and architectural patterns
Continuously evaluate system performance, capacity, and cost to scale efficiently
What you will bring:
4–6 years of experience building or operating systems across multiple architecture domains: application, data, integration, infrastructure, and security
4+ years of hands-on AWS experience, with strong production exposure to several of the following:
Redshift, DynamoDB, EMR, EMR Serverless, EC2, S3
Lambda, Step Functions, EventBridge, RDS, IAM
Proven experience operating data platforms such as data lakes and data warehouses in production
Strong SQL skills and experience working with modern databases (e.g., Redshift, DynamoDB, Postgres, MySQL, Oracle)
4+ years of Python experience, including scripting, automation, or data workloads
Experience with CloudWatch, infrastructure monitoring, and alerting
Hands-on experience with incident management, uptime SLAs, and customer-impacting systems
Strong understanding of Git-based workflows (GitHub, Git Flow, or similar)
Experience working in Agile environments (Scrum / Kanban) using tools such as Jira and Confluence
Bachelor’s in Computer Science, Information Systems, Data/Analytics, or related; equivalent practical experience welcomed.
What will set you apart:
Experience with Terraform or other Infrastructure-as-Code tools
Exposure to Snowflake or experience supporting analytics platforms beyond Redshift
Experience in financial services or other highly regulated environments
Knowledge of DevOps and CI/CD best practices
Familiarity with observability tools such as Splunk, AppDynamics, or advanced CloudWatch usage
Comfortable working across Linux/Unix environments
Strong communication skills during incident response with both technical and non-technical stakeholders
Security-minded approach to building secure, reliable, and durable systems
Willingness to support occasional off-hours or weekend incidents as part of on-call responsibilities
Streaming/event pipelines (Kafka/Kinesis), CDC patterns, and backfill strategies.
Experience with OpenLineage/Marquez and catalog integrations (Collibra/Alation/Purview).
Prior FinOps or capacity-planning ownership for data platforms.
Familiarity with BI semantic layers and contract enforcement at consumption (Looker/Power BI/Tableau).
Work conditions
Participate in an on-call rotation; occasional change windows outside business hours to support safe releases and resiliency drills.
This job description is not intended to be an exhaustive list of all duties, responsibilities and qualifications of the job. The employer has the right to revise this job description at any time. You will be evaluated in part based on your performance of the responsibilities and/or tasks listed in this job description. You may be required perform other duties that are not included on this job description. The job description is not a contract for employment, and either you or the employer may terminate employment at any time, for any reason.
What we offer you
We offer an array of diverse and inclusive benefits regardless of where you are in your career. We believe that providing our employees with the means to lead healthy balanced lives results in the best possible work performance.
Base Salary Range
$87,400.00 - $123,400.00The salary range above shows the typical minimum to maximum base salary range for this position in the location listed. Non-sales positions have the opportunity to participate in a bonus program. Sales positions are eligible for sales incentives, and in some instances a bonus plan, whereby total compensation may far exceed base salary depending on individual performance. Actual compensation offered may vary from posted hiring range based upon geographic location, work experience, education, licensure requirements and/or skill level and will be finalized at the time of offer.
Equal opportunity employer • Drug-free workplace
We are an equal opportunity employer with a commitment to diversity. All individuals, regardless of personal characteristics, are encouraged to apply. All qualified applicants will receive consideration for employment without regard to age (40 and over), race, color, national origin, ancestry, sex, sexual orientation, gender, gender identity, gender expression, marital status, pregnancy, religion, physical or mental disability, military or veteran status, genetic information, or any other status protected by applicable state or local law.
***For remote and hybrid positions you will be required to provide reliable high-speed internet with a wired connection as well as a place in your home to work with limited disruption. You must have reliable connectivity from an internet service provider that is fiber, cable or DSL internet. Other necessary computer equipment, will be provided. You may be required to work in the office if you do not have an adequate home work environment and the required internet connection.***
Job Posting End Date at 12:01 am on:
03-05-2026Want the latest money news and views shaping how we live, work and play? Sign up for Empower’s free newsletter and check out The Currency.
Website: https://www.empower.com/
Headquarter Location: Greenwood Village, Colorado, United States
Employee Count: 5001-10000
Year Founded: 2014
IPO Status: Private
Industries: Employee Benefits ⋅ Non Profit ⋅ Social