Creating Peace of Mind by Pioneering Safety and Security
At Allegion, we help keep the people you know and love safe and secure where they live, work and visit. With more than 30 brands, 12,000+ employees globally and products sold in 130 countries, we specialize in security around the doorway and beyond. Additionally, in 2024 we were awarded the Gallup Exceptional Workplace Award, which recognizes the most engaged workplace cultures in the world.
Job Description:
- Design, implement, and maintain highly available and scalable infrastructure systems, ensuring maximum uptime and performance.
- Collaborate with software engineering teams to build and deploy applications using best practices in reliability, scalability, and security.
- Develop and implement automation tools and frameworks to streamline operational processes, reduce manual intervention, and improve efficiency.
- Monitor and analyze system performance, identify bottlenecks, and implement solutions to optimize performance and scalability.
- Implement and maintain effective monitoring, alerting, and logging systems to proactively identify and resolve issues before they impact users.
- Lead incident response and root cause analysis efforts, driving continuous improvement and preventing future incidents.
- Collaborate with cross-functional teams to define and enforce best practices, standards, and guidelines for system reliability and performance.
- Participate in on-call rotations and respond to incidents, ensuring timely resolution and minimal impact to users so that SLAs are met.
- Plan and devise Disaster Recovery (DR) strategies and implement DR Plans.
- Mentor and provide guidance to junior team members, fostering a culture of learning and growth.
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Build software and systems to manage platform infrastructure and applications.
- Improve reliability, quality, and time-to-market of our suite of software solutions.
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement.
- Provide primary operational support and engineering for multiple large-scale distributed software applications.
Required Knowledge, Skills and Abilities:
- Proven experience as a Site Reliability Engineer or in a similar role, with a focus on designing and maintaining highly available and scalable systems.
- Strong programming and scripting skills (Python, Bash, etc.) to automate operational tasks and develop tooling.
- Experience with cloud platforms (AWS) and containerization technologies (Docker, EKS).
- Proficient in configuration management tools like Ansible and infrastructure-as-code frameworks such as Terraform and CloudFormation.
- Experience with monitoring and logging tools (Prometheus, Grafana, Loki, Sentry.io, CloudWatch, etc.) for proactive system monitoring and troubleshooting.
- Ability to program (structured and OOP) using one or more high-level languages, such as Java or JavaScript.
- Solid understanding of networking principles, protocols, and security best practices.
- Strong problem-solving skills and the ability to work effectively in a fast-paced, dynamic environment.
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
- Experience with distributed storage technologies such as NFS and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, YARN).
- Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
- Experience with Agile methodologies.
- Strong skills in software design and design patterns.
- Experience with different architecture patterns, such as client-server and serverless computing.
- Effective written, verbal and presentation skills with the ability to clearly articulate ideas and concepts.
- Self-directed and able to direct others.
Desired Skills & Abilities:
- Experience with setting up performance/load test environments.
- Familiarity with SOC 2 audit processes.
Required Education and/or Experience:
- BE/B Tech/M Tech/MCA/MSc in Computer Science Engineering
- 7 to 11 years of experience in Software Application Development/CloudOps/SRE
Allegion is a diverse and inclusive environment. We are an equal opportunity employer and are dedicated to hiring qualified protected veterans and individuals with disabilities. If for any reason you cannot apply through the job center, please contact HR, Allegion India for special accommodation.
We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.
We Celebrate Who We Are!
Allegion is committed to building and maintaining a diverse and inclusive workplace. Together, we embrace all differences and similarities among colleagues, as well as the differences and similarities within the relationships that we foster with customers, suppliers and the communities where we live and work. Whatever your background, experience, race, color, national origin, religion, age, gender, gender identity, disability status, sexual orientation, protected veteran status, or any other characteristic protected by law, we will make sure that you have every opportunity to impress us in your application and the opportunity to give your best at work, not because we’re required to, but because it’s the right thing to do. We are also committed to providing accommodations for persons with disabilities. If for any reason you cannot apply through our career site and require an accommodation or assistance, please contact our Talent Acquisition Team.
© Allegion plc, 2023 | Block D, Iveagh Court, Harcourt Road, Dublin 2, Co. Dublin, Ireland
REGISTERED IN IRELAND WITH LIMITED LIABILITY REGISTERED NUMBER 527370
Allegion is an equal opportunity and affirmative action employer