Technical Lead, CloudOps

Posted:
10/16/2024, 8:27:30 PM

Location(s):
Bengaluru, Karnataka, India ⋅ Karnataka, India

Experience Level(s):
Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Workplace Type:
Hybrid

Technical Lead, CloudOps
Lead and mentor Platform Operations team that is responsible for deploying, managing and maintaining Saviynt’s SAAS version of Solution platform across AWS and Azure cloud environment. Duty Details:Architect the public Cloud Infrastructure for day-to-day operations applications with emphasis on performance, security, and scalability.Drive and collaborate on the strategic roadmap of infrastructure decisions for large-scale complex projects across multiple customers and create enterprise-grade platform capabilities considering resiliency, scalability, and performance infrastructure.Collaborate with Cloud Product Management by providing Operations’ perspective to drive improvement to the product and services with a focus on availability, reliability, scalability etc.Conducting workshops for scoping, questionnaire and solution discussions and collaboratively work with all internal and external stakeholders in designing a solution. Detail and articulation of solutions developed as part of the discussions.Build world-class team for monitoring and incident response, including automated alert handling and integration with tracking and ticketing tools.Define and document system architectures, systems configurations, and technical operational processes and policies.Participate in customer meetings and bring proactive solutions to customer problems.Automate the deployment of software on cloud environment in coordination with DevOps engineers.Build and maintain the platform operations related Procedures and policies, define Key Performance Indicators (KPIs), and contribute to and enhance security policies and procedures for Cloud Services.Manage the relationship with the Cloud Hosting providers/3rd party solution providers used to deploy and manage the application e.g., AWS, Azure, Qualys, Datadog to clearly define the business needs and expectations, roles and responsibility model, operational support process and SLAs, Closely work with information Security team, cloud providers to implement appropriate data security and governance infrastructure and policies while deploying the solution to ensure compliance with customer data sovereignty requirements.Create and implement Disaster Recovery strategy that consists of the steps and precautions to minimize the effects of cloud Services outage so the Saviynt’s solution can continue to operate or quickly resume normal functions with minimal effort.

Infrastructure Maintenance, Monitoring, Performance Management and Troubleshooting
Duty Details: Design and automate Routine Infrastructure maintenance including patching and upgrade of application servers. Any new patches released for any vulnerability or kernel/software version upgrade needs to be pushed to all the workloads.Troubleshoot and resolve infra issues across public cloud platforms like AWS, Azure. These can be internal connectivity issues within private network or connectivity issues with Customer systems, Issue with cloud services and follow up with cloud providers on resolution status, Issues with any provisioned workloads (memory, CPU, storage, etc..) and troubleshoot the cause for the same along with resolving any issue with it. Ensures management and monitoring tools (Datadog, CloudWatch Alarms, lambda Functions, WAF, Shield) are integrated with application stack and have rules / alerts for routine and exceptional operations conditions.Work with project, QA team to troubleshoot system level performance and connectivity issues.Develop and Build lambda functions, system scripts to perform routine tasks, patching, backup and cleanup of unused resources.Monitor system stability and performance and ensure system availability, reliability, and usability using cloud provider monitoring tools like cloud watch alarms or third-party tools like DataDog.Interact with customer’s security team for risk assessment exercises, architecture review of the application integration and respond to customers questions related to the controls in place to ensure high availability of application as well as controls and policies in place to protect customer data.Develop and implement business continuity protocols to minimize disruption to business operations (Disaster Recovery). Ensure to achieve the best possible RTO & RPO.Create standards, cost-optimized delivery models, and proactively designs infrastructure solutions to support change and drive innovation. This will involve regular upgrading of AWS/Azure services to cost effective solutions recommended by AWS/Azure cloud platforms and analyzing Cost Explorer, Billing Reports, Budgets provided by AWS/Azure Native services.Collaborate in containerization, orchestration, and cloud scale solutions for the application deployment.Maintain a highly secure system through proper security practices to meet security and regulatory compliance requirements.Define, design, and implement comprehensive monitoring strategy to detect and isolate issues before impacting system availability.

AWS and Azure infrastructure management (IAM Security, infrastructure security, patching resource utilization and Cost optimization etc.)

Duty Details:
Coordinate technical architecture and design discussions with Customers’ Network and Security team to finalize Infrastructure requirements, Saviynt’s product setup requirements, SLAs, Network security and Firewall rules.Provide mentorship and guidance to internal operations staff regarding security, identity, compliance, and risk management best practices on public cloud to build a secure infrastructure.Interact with customer’s security team for risk assessment exercises, architecture review of the application integration and respond to customers questions related to the controls in place to ensure high availability of application as well as controls and policies in place to protect customer data.Create a modernization roadmap and architect infra set up solutions to meet customer’s optimization requirement to build a robust system.Create and share Operational requirements, provide guidance and deployment options to customer on infrastructure creation and network connectivity integration.Determine the system requirements and perform capacity planning for Server/Database set up that will strengthen a robust and highly available implementation of Saviynt’s product. Build infrastructure design document, data flow diagrams, and network architecture for establishing connectivity between customer and Saviynt’s Network.Finalize the business requirements document and send the document to clients for validation and agreement of the requirements and get a sign off from each stakeholder involved in client meeting.Work with cloud team to optimize AWS cloud spending, deliver usage and expenditure analysis to leadership for monthly and quarterly reviews, assist in quarterly and annual financial planning as well as execute cost optimization drives as needed.Create standards, cost-optimized delivery models, and proactively designs infrastructure solutions to support change and drive innovation. This will involve regular upgrading of AWS/Azure services to cost effective solutions recommended by AWS/Azure cloud platforms and analyzing Cost Explorer, Billing Reports, Budgets provided by AWS/Azure Native services.Provide technical recommendations to internal teams and customers on various approaches and options of cloud system administration, and installation processes with cost-optimized suggestions of Saviynt’s Tool implementation as a SaaS model on AWS and AzureWork directly with executive leadership to review the deployment architecture, get their insights on controls and policies put in place to ensure disaster recovery, data protection, data security, application high availability to enhance overall operational efficiencies and ensure operational process and procedure comply with company’s business SLAs, security and audit requirements.Routine Infrastructure maintenance including patching and upgrade of application servers. Any new patches released for any vulnerability or kernel/software version upgrade needs to be pushed to all the workloads.Writing shell scripts, code snippets and use of APIs to connect to the Cloud systems, execute enhanced monitoring and build out a resilient and real-time alerting Cloud Network architecture. This will involve POC’s on latest security tools, alerting and monitoring tools and integrating with our application set ups.

  
Architecting Infrastructure, planning and research, and Proof of Concepts to build a highly reliable, secure and robust application platform.

Duty Details: 
Requirements gathering as per Project’s System Requirement Specifications. The infra-architecture design will be done for SaaS set up on Azure and AWS Cloud.Writing Optimized DB Queries for integration with application platform and skilled with database tuning for different customer requirements to build a resilient and a highly available set-up.Explore resources supported by Cloud (AWS, Azure etc.) that would satisfy the requirements and come up with a design for our product integration as a SaaS model. This includes regular analysis and proof of concepts on all AWS, Azure services, and other third-party tools (SendGrid, Qualys, Datadog, TrendMicro, Sophos UTM) needed for any specific functionality not yet available on AWS or Azure cloud.Design and Configuration of infrastructure and Cloud Private Networks for Customers and setting up product. This also involves setting up of IPsec VPN, SSL VPN, VPC peering and VNET Peering to establish private connectivity with customer network and SaaS set up on Saviynt Network.Writing shell scripts, code snippets and use of APIs to connect to the Cloud systems, execute enhanced monitoring and build out a resilient and real-time alerting Cloud Network architecture. This will involve POC’s on latest security tools, alerting and monitoring tools and integrating with our application set ups.Continuous enhancements and design for Continuous Integration and Continuous Deployment process on cloud will be required including creation of Machine images using Packer, Source Code versioning of the automation scripts and templates and automated deployments on instances using Jenkins pipeline.Research and evaluate new cloud technologies or emerging cloud service (e.g., Google Cloud, Bluemix, etc.) to increase the reach of the Saviynt’s product installation as SaaS model on different cloud technologies.

Saviynt

Website: http://www.saviynt.com/

Headquarter Location: El Segundo, California, United States

Employee Count: 501-1000

Year Founded: 2010

IPO Status: Private

Last Funding Type: Debt Financing

Industries: Cloud Data Services ⋅ Cyber Security ⋅ Enterprise Software ⋅ Information Technology