Software Engineering Manager

Posted:
6/15/2026, 2:24:18 AM

Location(s):
Tamil Nadu, India ⋅ Chennai, Tamil Nadu, India

Experience Level(s):
Senior

Field(s):
Software Engineering

At U.S. Bancorp India, we’re on a journey to do our best. We believe it takes all of us to bring our shared ambition to life, and each person is unique in their potential. A career with U.S. Bancorp India gives you a wide, ever-growing range of opportunities to discover what makes you thrive at every stage of your career. Try new things, learn new skills and discover what you excel at—all from Day One.

Job Description

Key Responsibilities 

Reliability Engineering & Service Operations 

  • Own reliability outcomes for distributed systems, APIs, microservices, data pipelines, and critical production platforms, with accountability for availability, latency, throughput, and saturation 

  • Define and operationalize SLOs, SLIs, error budgets, alert thresholds, and service health indicators to improve customer experience and engineering accountability 

  • Lead production readiness reviews, capacity planning, performance testing, failover validation, chaos/resiliency testing, and disaster recovery preparedness 

  • Drive standardization of monitoring, telemetry, distributed tracing, logging, synthetic checks, and runbook practices across services and platforms 

  • Partner with engineering teams to reduce operational toil through automation, self-healing workflows, auto-remediation, and reliability-focused platform improvements 

Incident Management & Operational Excellence 

  • Lead major incident response, escalation management, and technical triage for high-severity production events, ensuring rapid mitigation, stakeholder communication, and service recovery in high-pressure, time-critical environments 

  • Establish strong practices for root cause analysis, problem management, failure mode analysis, and durable corrective/preventive actions 

  • Drive operational governance using metrics such as MTTR, MTTD, change failure rate, incident recurrence, alert noise, and service error budget consumption 

  • Partner with infrastructure, application, database, and network teams to proactively identify scaling risks, dependency bottlenecks, and single points of failure 

People & Team Management 

  • Directly manage multiple SRE leads and senior reliability engineers, providing technical coaching, operational guidance, and performance leadership across teams 

  • Build and scale high-performing SRE teams in the GCC environment with strong focus on production ownership, operational engineering, and systems thinking 

  • Drive team capability in on-call operations, debugging, incident command, automation development, and platform diagnostics 

  • Foster a culture of blameless incident analysis, engineering accountability, and continuous reliability improvement 

  • Manage staffing, on-call coverage, skill distribution, and hiring aligned to platform complexity, production demand, and business criticality 

Cross-Functional Collaboration 

  • Partner with software engineering, infrastructure, database, network, security, and platform teams to improve system stability, deployment safety, and operational readiness 

  • Translate business criticality and customer impact into technical reliability priorities, architecture guardrails, recovery objectives, and measurable engineering outcomes 

  • Work effectively in a distributed/global operating model, ensuring seamless coordination with onshore engineers, command centers, platform owners, and leadership teams during both steady-state and incident scenarios 

Automation & Platform Enablement 

  • Promote and govern infrastructure as code, configuration management, CI/CD reliability, release guardrails, policy enforcement, and automated rollback/recovery patterns 

  • Enable engineering teams with standardized tooling for observability, deployment validation, incident response, debugging, performance diagnostics, and service dependency analysis 

  • Drive platform modernization through reusable automation, reliability frameworks, production diagnostics, and engineering patterns that improve resiliency and reduce mean time to recovery 

 

Basic Qualifications 

  • Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience 

  • 12+ years of experience in software engineering, site reliability engineering, production operations, platform engineering, or infrastructure engineering 

  • 5+ years of experience leading reliability or production engineering teams, including managing leads or senior engineers in technically complex environments 

 

Preferred Skills & Experience 

  • Proven experience managing multiple SRE, production engineering, or platform operations teams in a matrix/global setup 

  • Strong experience working in offshore/onshore operating models, preferably within Banking and Financial Services 

  • Hands-on knowledge of cloud platforms, Kubernetes, Linux systems, networking fundamentals, distributed systems, and infrastructure automation 

  • Experience with observability platforms, telemetry pipelines, incident tooling, configuration management, and service governance 

  • Strong understanding of CI/CD pipelines, scripting languages, infrastructure as code, release engineering, and automated operational workflows 

  • Ability to review architecture and operational designs for scalability, fault tolerance, recovery, and performance bottlenecks 

  • Strong stakeholder management and communication skills across global engineering teams and senior leadership 

 

Leadership Competencies 

  • Ability to balance technical depth, operational discipline, architecture awareness, and people leadership 

  • Strong decision-making skills with a focus on business continuity, failure risk, service reliability, and engineering trade-offs, with the ability to remain calm and effective in high-pressure operational situations 

  • Proven ability to influence engineering design and operational practices without direct authority across platform and application teams 

  • High ownership mindset with focus on resilience engineering, operational excellence, and predictable service behavior in production 

If there’s anything we can do to accommodate a disability during any portion of the application or hiring process, please refer to our disability accommodations for applicants.

Posting may be closed earlier due to high volume of applicants.

This is an U.S. Bancorp India posting. U.S. Bancorp India is a part of the U.S. Bank family.