Staff Software Engineer

Posted:
4/29/2026, 9:10:57 PM

Location(s):
Karnataka, India ⋅ Bengaluru, Karnataka, India

Experience Level(s):
Expert or higher ⋅ Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Job Description Summary

Lead Cloud Operations Engineer / Cloud Operations Lead
We are seeking an experienced Lead Cloud Operations Engineer to drive cloud operations excellence across our AWS-based environments. This role requires a strong technical leader with deep expertise in cloud infrastructure, automation, monitoring, security, production operations, and team mentorship. The ideal candidate will lead cloud operations initiatives, guide engineers on best practices, oversee mission-critical production environments, and collaborate with cross-functional stakeholders across multiple time zones. This position is ideal for someone who combines hands-on cloud engineering capability with leadership, architectural thinking, and operational ownership.




Infrastructure as Code (IaC):
============================
Manage and enhance IaC templates using Terraform or CloudFormation.
Ensure infrastructure deployments are automated, repeatable, and compliant with best practices.



Automation and Scripting:
============================
Automate system processes using Python or Golang.
Identify opportunities for automation to improve efficiency and reduce manual effort.



FinOps and Cost Management:
============================
Implement a FinOps mindset to identify cost leakages and optimize cloud spending.
Review and balance cost vs. performance tradeoffs.



CI/CD Pipelines:
============================
Design, implement, and maintain CI/CD pipelines using Jenkins and GitOps practices.
Ensure seamless integration and deployment processes.



Security Operations (SecOps):
============================
Identify and analyze cybersecurity vulnerabilities.
Actively resolve security issues and implement best practices for secure development.



Containerization and Orchestration:
============================
Manage Kubernetes clusters, including EKS and ECR.
Ensure containerized applications are deployed, scaled, and monitored effectively.



Monitoring and Logging:
============================
Utilize external monitoring and logging tools such as Splunk, DataDog, DynaTrace, Grafana, and Prometheus.
Ensure system health and performance are continuously monitored and optimized.



Advanced Technologies:
============================
Explore and implement AI/Agentic AI automation where applicable.
Design systems using modern tools like ArgoCD and CrossPlane.



Cross-Team Collaboration:
======================
Work effectively in a cross-team, multi-time-zone environment.
Foster a culture of collaboration, knowledge sharing, and continuous improvement.

Technical Skills:
======================
In-depth knowledge of AWS services including EC2, RDS, ElastiCache, Lambda, S3, Amazon MQ, EKS, ECR, ALBs, CloudWatch, CloudTrail, VPCs, Subnets, IAM Roles and Users, Cost Explorer, Compute Optimizer, Trusted Advisor, and AWS Config.
Strong working knowledge of IaC tools like Terraform or CloudFormation.
Demonstrated ability to automate systems using Python or Golang.
Experience with Jenkins CI/CD pipelines, GitOps, Kubernetes, Docker, and containerization technologies.
Familiarity with external security and monitoring tools.



Soft Skills:
======================
Excellent communication skills.
Ability to work independently and lead a team.
Strong problem-solving and analytical skills."

Job Description

Key Responsibilities:

Must Have:

  • Lead and mentor the cloud operations team by providing technical guidance, support, and direction.
  • Manage and oversee AWS cloud operations including patching, migrations, upgrades, backups, releases, and production support.
  • Work with key AWS services such as EC2, RDS, Elastic cache, Lambda, S3, Amazon MQ, EKS, ECR, ALB, CloudWatch, VPC, IAM, AWS Config, and others.
  • Drive automation initiatives using Python or Golang to improve efficiency and reduce manual effort.
  • Build and maintain Jenkins CI/CD pipelines and support GitOps practices.
  • Oversee security operations including vulnerability identification, analysis, remediation, and compliance support.
  • Contribute to SecOps initiatives by identifying, analyzing, and resolving cybersecurity vulnerabilities.
  • Perform cloud operational activities including VM patching, migrations, cluster/database upgrades, AMI patching and releases, backups, and live production support.
  • Use monitoring and logging tools such as Splunk, DataDog, Dynatrace, Grafana, and Prometheus.
  • Collaborate with product teams, engineering teams, and stakeholders across multiple geographies and time zones. Encourage a culture of collaboration, operational excellence, knowledge sharing, and continuous improvement.

Good to have:

  • Familiarity with advanced deployment and platform engineering tools such as ArgoCD or similar tools.
  • Exposure to AI / Agentic AI-based automation for cloud operations and process optimization.
  • Ability to drive architectural improvements and standardization across cloud environments.
  • Experience working with external security tools such as: Wiz.io, CrowdStrike Falcon and Qualys.
  • Knowledge of AWS cost and governance tools such as: Cost Explorer, Compute Optimizer and Trusted Advisor.

Required Skills & Qualifications:

  • Minimum 7+ years of experience in cloud environments.
  • Strong practical experience in AWSAzure knowledge is a plus.
  • Hands-on experience with JenkinsGitOps, and Docker.
  • Good understanding of cloud cost optimizationsecurity operations, and production support activities.
  • Strong communication and collaboration skills.

Education Qualification

  • Bachelor’s Degree in Computer ScienceSoftware Engineering, or a related field.
  • Minimum 7+ years of relevant professional experience in cloud engineering, cloud operations, or DevOps environments.

Additional Information

Relocation Assistance Provided: Yes