LoBoS System Administrator

Posted:
10/15/2024, 8:22:48 AM

Location(s):
Maryland, United States ⋅ Rockville, Maryland, United States

Experience Level(s):
Mid Level ⋅ Senior

Field(s):
IT & Security

Workplace Type:
On-site

Job Family:
Systems Engineering (Digital)

Travel Required:
Up to 10%

Clearance Required:
Ability to Obtain Public Trust

What You Will Do:
We are currently searching for a LoBoS System Administrator to provide day-to-day management of the Linux-based LoBoS HPC compute nodes, storage systems, and desktops. The position involves working as part of a small team (at least two people) whose primary responsibilities are to keep the cluster running in good order and to ensure that it follows security best practices as determined by the NIH and the Department of Health and Human Services. It also involves maintaining the usability of the LoBoS cluster through the yearly purchase and installation of hardware to replace aging components.

Researchers within the National Heart, Lung, and Blood Institute’s (NHLBI) Laboratory of Computational Biophysics (LCB) employ computational simulation methods to investigate problems in biophysics and chemistry using the Linux-based LoBoS high-performance computing (HPC) cluster. LoBoS consists of several hundred CPU/GPU computational nodes, three tiers of storage (home directories, scratch space, and archive), associated network infrastructure (both InfiniBand and Ethernet), and Linux desktops for users.

This is a full-time, on-site opportunity in Rockville, MD.  

  • Ensure that the various components of the LoBoS cluster remain in good working order, including network configuration, firewall management (Palo Alto), file system management (ZFS, VAST), security, batch queuing systems (SLURM), database administration, distributed computing, file transfer services, web servers, and electronic mailing lists.
  • May occasionally require work outside normal 9-5 hours to address emergency situations with the cluster (e.g. significant numbers of down nodes or storage outages) or cybersecurity incidents (FISMA).
  • Ensure that the LoBoS cluster has sufficient capabilities to run the scientific software needed by the LCB scientists.
  • Evaluate the existing system to determine when updates/upgrades to hardware and/or software are necessary.
  • Manage the budget used to procure new hardware/software for LoBoS.
  • Oversee configuration and installation of virtual and physical servers and manage upgrades to existing hardware.
  • Ensure that patches, security updates, and configuration changes to software systems are applied to enhance reliability and to meet security needs.
  • Collaborate with Office of the Chief Information Officer (OCIO), Center for Information Technology (CIT), and NHLBI security teams to ensure adherence to compliance policies.
  • Assist in maintaining the LoBoS Assessment & Authorization package based on National Institute of Standards and Technology (NIST) SP 800-53 security controls under guidance from NHLBI's Information System Security Officers (ISSO).
  • Serve as a technical resource for HPC, LCB, NHLBI, and other NIH personnel in areas such as the Linux operating system, networking, database system administration, and distributed computing. May serve on technical evaluation panels for institute-wide initiatives.
  • Stay informed regarding new developments in hardware/software and evaluate their potential usability for LoBoS/LCB.
  • Participate in conferences and meetings of professional groups concerned with the application of HPC, AI/machine learning, and other emerging computer technologies. Some travel to professional meetings (e.g. the Supercomputing Conference) may occasionally be required.
  • Prepare software documentation and technical reports related to assigned projects.


What You Will Need:

  • Bachelor’s Degree or equivalent experience in lieu of a degree.
  • At least FIVE (5) years of experience in Linux HPC systems administration; less experienced candidates with outstanding qualifications will also be considered.
  • Comprehensive knowledge of shell scripting.
  • Broad knowledge of systems administration tools (e.g. Puppet, Ansible) along with detailed knowledge of tools used in a particular area such as file system management, usage accounting, mail configuration, database system administration, file transfer, or security.
  • Knowledge of high-level computer languages such as C, C++, FORTRAN, Ruby, Perl, or Python.
  • Experience implementing and managing SLURM batch queuing software preferred.


What Would Be Nice To Have:

  • Experience with government computer security rules and standards is desirable.
  • Extensive knowledge of at least two high-level computer languages such as C, C++, FORTRAN, Ruby, Perl, or Python is desirable.
  • Solid interpersonal, leadership, and critical thinking skills.
  • Excellent written and oral communication skills.

The annual salary range for this position is $100,200.00-$150,200.00. Compensation decisions depend on a wide range of factors, including but not limited to skill sets, experience and training, security clearances, licensure and certifications, and other business and organizational needs.


What We Offer:

Guidehouse offers a comprehensive, total rewards package that includes competitive compensation and a flexible benefits package that reflects our commitment to creating a diverse and supportive workplace.

Benefits include:

  • Medical, Rx, Dental & Vision Insurance
  • Personal and Family Sick Time & Company Paid Holidays
  • Parental Leave
  • 401(k) Retirement Plan
  • Group Term Life and Travel Assistance
  • Voluntary Life and AD&D Insurance
  • Health Savings Account, Health Care & Dependent Care Flexible Spending Accounts
  • Transit and Parking Commuter Benefits
  • Short-Term & Long-Term Disability
  • Tuition Reimbursement, Personal Development, Certifications & Learning Opportunities
  • Employee Referral Program
  • Corporate Sponsored Events & Community Outreach
  • Care.com annual membership
  • Employee Assistance Program
  • Supplemental Benefits via Corestream (Critical Care, Hospital Indemnity, Accident Insurance, Legal Assistance and ID theft protection, etc.)
  • Position may be eligible for a discretionary variable incentive bonus

About Guidehouse
Guidehouse is an Equal Employment Opportunity / Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, national origin, ancestry, citizenship status, military status, protected veteran status, religion, creed, physical or mental disability, medical condition, marital status, sex, sexual orientation, gender, gender identity or expression, age, genetic information, or any other basis protected by law, ordinance, or regulation.


Guidehouse will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of applicable law or ordinance including the Fair Chance Ordinance of Los Angeles and San Francisco.


If you have visited our website for information about employment opportunities, or to apply for a position, and you require an accommodation, please contact Guidehouse Recruiting at 1-571-633-1711 or via email at [email protected]. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodation.


Guidehouse does not accept unsolicited resumes through or from search firms or staffing agencies. All unsolicited resumes will be considered the property of Guidehouse and Guidehouse will not be obligated to pay a placement fee.