Cloud Operations Engineer Infrastructure is responsible for leading shift and supporting implementation of core cloud infrastructure components. Utilizes advanced technical skills to coordinate design, enhancement and deployment efforts and provide insight and recommendations for operating enterprise cloud infrastructure solutions. Works closely with cloud application and infrastructure support teams, project managers, network and system engineers, and other technology support teams. Documents critical design and configuration details required to support the delivery of enterprise cloud services.
ESSENTIAL DUTIES AND RESPONSIBILITIES
- Responsible for reliability and support of Cloud Platform including Public Cloud (Azure /AWS /Google) services.
- Migration Hands on from on prem to cloud
- Handon experience of DFS , File servers and File server Migrations
- Sound knowledge and experience of Windows and Linux OS administration
- Monitor and troubleshoot Azure/AWS /Google environment performance issues, connectivity issues, security issues, etc.
- Perform deep dives into systemic and latent reliability issues, incident management, problem management
- Identifying, analyzing, and resolving infrastructure vulnerabilities and application deployment issues.
- Perform RCA, partner with engineering and operation teams across the organization to roll out fixes.
- Identify and drive opportunities to improve automation for the cloud services; scope and create automation for deployment, management, and visibility of our services.
- Evaluating and automating the scaling and capacity requirements within Azure environments
- Engage with engineering teams throughout the full lifecycle from design, engineering, deployment, & operations.
- Partner with risk and compliance teams to bring visibility and implement right controls and policies in the Cloud Platform
- Ensure resiliency during implementation and identify/fix resiliency problems by collaborating with engineering teams
- Be a key stakeholder in the design of cloud services and work with Architecture, engineering, product teams
- Participate in 24x7 on-call coverage follow the sun model
- Identify the cloud optimization opportunities, design solutions and implement
- Support deployment templates or patterns as requested by Customer.
- Automating the deployment of templates into the environment to continually reduce provisioning and deployment times.
- Manage cloud brokerage and orchestration software to monitor and modify infrastructure solutions to address planned and ad hoc demands for cloud Services.
- Manage Virtual Networks (VPCs) and Subnets
- Patch management (System updates assessment and updates)
- Endpoint Management, Native Load-Balancer, NSG, IP address Management, management of virtual networks in cloud
- Support DR set up & restore environment after disaster recovery
- 3rd party vendor coordination for troubleshooting
QUALIFICATIONS
EDUCATION: Bachelor’s degree in computer science or Higher in similar field preferred.
REQUIRED EXPERIENCE
- Minimum 3+ years of hands-on experience maintaining cloud platforms on a major cloud service provider.
- Experience working on Azure/Google/AWS/OCI operations and Administration.
- Handon experience of DFS , File servers and File server Migrations
- Azure /Terraform /AWS /Google certifications are a plus
- Strong experience in implementing, monitoring, and maintaining Microsoft Azure solutions, including major services related to Compute, Storage, Network and Security
- Experience with monitoring tools such as cloud native tools like Azure Monitor and Log Analytics
- Understanding of cost management, inventory management, FinOps model
- Strong understanding and background of working with a complex IAM infrastructure, including Active Directory, Azure AD and other SSO solutions.
- Advanced knowledge of DNS, DHCP, Kerberos and Windows Authentication
- Experience with IaC with Terraform
- Python, Ansible and shell scripting
- Experience with CI/CD tools such as git andJenkins, familiarity with using a GitOps model
- Excellent understanding of Linux /Windows operating systems administration
- Systematic problem-solving approach, sense of ownership and drive
- Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
PREFERRED EXPERIENCE
- 6 or more years with Virtualization including Virtualization Server, Storage, Desktop, Network
- 6 or more years with Infrastructure-Based Processes such as Monitoring, Capacity Planning, Performance Tuning, Asset Management, Disaster Recovery
- 2 or more years with Hyper-V, Virtual Infrastructure, Platform Sizing
- Experience in Terraform, Ansible
- Experience working in a highly available multi-datacenter environment
- Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities.
- Ability to juggle competing priorities and adapt to changes in project scope.