Posted:
6/15/2026, 2:24:18 AM
Location(s):
Tamil Nadu, India ⋅ Chennai, Tamil Nadu, India
Experience Level(s):
Senior
Field(s):
Software Engineering
At U.S. Bancorp India, we’re on a journey to do our best. We believe it takes all of us to bring our shared ambition to life, and each person is unique in their potential. A career with U.S. Bancorp India gives you a wide, ever-growing range of opportunities to discover what makes you thrive at every stage of your career. Try new things, learn new skills and discover what you excel at—all from Day One.
Key Responsibilities
Reliability Engineering & Service Operations
Own reliability outcomes for distributed systems, APIs, microservices, data pipelines, and critical production platforms, with accountability for availability, latency, throughput, and saturation
Define and operationalize SLOs, SLIs, error budgets, alert thresholds, and service health indicators to improve customer experience and engineering accountability
Lead production readiness reviews, capacity planning, performance testing, failover validation, chaos/resiliency testing, and disaster recovery preparedness
Drive standardization of monitoring, telemetry, distributed tracing, logging, synthetic checks, and runbook practices across services and platforms
Partner with engineering teams to reduce operational toil through automation, self-healing workflows, auto-remediation, and reliability-focused platform improvements
Incident Management & Operational Excellence
Lead major incident response, escalation management, and technical triage for high-severity production events, ensuring rapid mitigation, stakeholder communication, and service recovery in high-pressure, time-critical environments
Establish strong practices for root cause analysis, problem management, failure mode analysis, and durable corrective/preventive actions
Drive operational governance using metrics such as MTTR, MTTD, change failure rate, incident recurrence, alert noise, and service error budget consumption
Partner with infrastructure, application, database, and network teams to proactively identify scaling risks, dependency bottlenecks, and single points of failure
People & Team Management
Directly manage multiple SRE leads and senior reliability engineers, providing technical coaching, operational guidance, and performance leadership across teams
Build and scale high-performing SRE teams in the GCC environment with strong focus on production ownership, operational engineering, and systems thinking
Drive team capability in on-call operations, debugging, incident command, automation development, and platform diagnostics
Foster a culture of blameless incident analysis, engineering accountability, and continuous reliability improvement
Manage staffing, on-call coverage, skill distribution, and hiring aligned to platform complexity, production demand, and business criticality
Cross-Functional Collaboration
Partner with software engineering, infrastructure, database, network, security, and platform teams to improve system stability, deployment safety, and operational readiness
Translate business criticality and customer impact into technical reliability priorities, architecture guardrails, recovery objectives, and measurable engineering outcomes
Work effectively in a distributed/global operating model, ensuring seamless coordination with onshore engineers, command centers, platform owners, and leadership teams during both steady-state and incident scenarios
Automation & Platform Enablement
Promote and govern infrastructure as code, configuration management, CI/CD reliability, release guardrails, policy enforcement, and automated rollback/recovery patterns
Enable engineering teams with standardized tooling for observability, deployment validation, incident response, debugging, performance diagnostics, and service dependency analysis
Drive platform modernization through reusable automation, reliability frameworks, production diagnostics, and engineering patterns that improve resiliency and reduce mean time to recovery
Basic Qualifications
Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience
12+ years of experience in software engineering, site reliability engineering, production operations, platform engineering, or infrastructure engineering
5+ years of experience leading reliability or production engineering teams, including managing leads or senior engineers in technically complex environments
Preferred Skills & Experience
Proven experience managing multiple SRE, production engineering, or platform operations teams in a matrix/global setup
Strong experience working in offshore/onshore operating models, preferably within Banking and Financial Services
Hands-on knowledge of cloud platforms, Kubernetes, Linux systems, networking fundamentals, distributed systems, and infrastructure automation
Experience with observability platforms, telemetry pipelines, incident tooling, configuration management, and service governance
Strong understanding of CI/CD pipelines, scripting languages, infrastructure as code, release engineering, and automated operational workflows
Ability to review architecture and operational designs for scalability, fault tolerance, recovery, and performance bottlenecks
Strong stakeholder management and communication skills across global engineering teams and senior leadership
Leadership Competencies
Ability to balance technical depth, operational discipline, architecture awareness, and people leadership
Strong decision-making skills with a focus on business continuity, failure risk, service reliability, and engineering trade-offs, with the ability to remain calm and effective in high-pressure operational situations
Proven ability to influence engineering design and operational practices without direct authority across platform and application teams
High ownership mindset with focus on resilience engineering, operational excellence, and predictable service behavior in production
If there’s anything we can do to accommodate a disability during any portion of the application or hiring process, please refer to our disability accommodations for applicants.
Posting may be closed earlier due to high volume of applicants.
This is an U.S. Bancorp India posting. U.S. Bancorp India is a part of the U.S. Bank family.
Website: https://www.usbank.com/
Headquarter Location: Minneapolis, Minnesota, United States
Employee Count: 10001+
Year Founded: 1863
Industries: Financial Services