Senior/Systems Engineer

Posted:
11/6/2024, 4:00:00 PM

Location(s):
Singapore, Singapore

Experience Level(s):
Senior

Field(s):
IT & Security

Join High-Performance Computing Centre (HPCC) as a Senior/Systems Engineer and be part of our team that will help bridge development and operations. Your role will be pivotal in ensuring the efficient and reliable delivery of software and services to support NTU’s staff and students.

Key Responsibilities:

  • Design, deploy and manage virtual infrastructure platforms such as VMware, Hyper-V and Nutanix.

  • Monitor and optimise the virtual environment for performance, availability and resource allocation, addressing performance bottlenecks and capacity challenges.

  • Automate repetitive tasks through the use of scripts and automation tools.

  • Secure the infrastructure environment and data integrity by Implementing access controls, patches and updates in alignment with industry best practices and regulatory requirements.

  • Develop and implement disaster recovery and backup management strategies to ensure data protection and business continuity.

  • Configure and optimize storage systems for performance and scalability.

  • Monitor and manage data storage to ensure performance and availability.

  • Collaborate with security team to manage vulnerabilities.

  • Monitor network performance, troubleshoot issues, and ensure high availability and reliability of network infrastructure.

  • Maintain detailed network diagrams, configurations, and operational procedures, providing regular updates on network performance.

  • Implement and maintain network security measures, including firewalls and encryption, to safeguard data and prevent unauthorized access.

  • Implement and manage infrastructure using IaC tools such as Terraform, Ansible or CloudFormation.

  • Set up and maintain monitoring tools to track system performance and ensure high availability.

  • Implement alerting systems for quick detection and resolution of infrastructure issues.

  • Manage code repositories and workflows using version control systems such as Git, GitLab, or GitHub.

  • Continuously monitor systems and proactively troubleshoot to improve system uptime and minimize downtime.

  • Identify and resolve performance bottlenecks promptly, ensuring smooth application performance even during peak usage.

Requirements:

  • A degree in Computer Engineering or a related field.

  • At least 6 years of relevant experience, preferably in IT, higher education or enterprise environments, with a minimum of 3 years of hands-on experience managing Linux/UNIX systems and 5 years in managing large scale, high- performance storage environments.

  • Candidate with at least 2 years of hands-on experience with virtualization platforms such as VMware, Hyper-V, or KVM, certification in VMware Administration or Hyper-V is preferred.

  • Proven ability to optimize storage performance, troubleshoot and ensure data integrity, with in-depth knowledge of backup software, snapshot technologies, and data replication methods

  • Strong teamwork and communication skills and is able to collaborate effectively with developers, operations personnel, and other stakeholders.

  • Possess good analytical skill to efficiently identify and resolve issues and address the root causes of problems.

  • Is passionate to learn and stay updated on emerging technologies and industry trends.

Hiring Institution: NTU