Principal, High Performance Compute Engineering

Posted:
8/19/2024, 5:00:00 PM

Location(s):
Massachusetts, United States ⋅ Boston, Massachusetts, United States

Experience Level(s):
Mid Level ⋅ Senior

Field(s):
Software Engineering

Job Overview

The Principal High Performance Compute Engineer will be a thought leader on architecture and development within the development team responsible for the compute platform for the Research group. As a systematic asset manager, Arrowstreet must identify investable trading strategies and implement them quickly and with the highest quality. Having a robust, scalable, and performant general compute platform is thus of critical importance. 

The main responsibility of the role is developing the compute platform for HPC workloads in the cloud. A current focus is improving observability, so the ideal candidate has experience in designing and implementing large scale observability capabilities. The expectation of the role is to be hands-on, working across multiple teams to define requirements, create the design, develop, test, build, deploy and support the functionality. The work includes developing the automated build and deploy pipelines with unit and integrated tests to ensure high quality and efficient operations.


 

Responsibilities

  • Work closely with members of the Research group to review and define requirements for the compute platform and observability systems.

  • Provide expert level design that ensures the solution to be scalable, cost effective and to have low maintenance.

  • Lead technical design discussions within the team to gather feedback, discuss the merits and risks of different approaches, and reach consensus on the target architecture.

  • Develop high quality solutions for the compute platform and observability systems in both on premise environments and AWS using Python.

  • Develop the automated build and deploy pipelines with unit and integrated tests to ensure high quality and efficient operations.

  • Provide guidance to other team members on development tasks.

  • Promote high quality code via code reviews.

  • Provide production support for the platform to prevent disruptions to investment processes.

Qualifications

  • Bachelor’s degree in Computer Science, Computer Engineering or a related discipline

  • 8+ years of professional software development experience using Python or another object-oriented language, financial services exposure is a plus.

  • 4+ years of experience leading design or architecture of large-scale production systems

  • 3+ years of experience developing large, high-performance, distributed systems.

  • 3+ years of experience building high-performance cloud native solutions on public cloud (AWS preferred)

  • 3+ years of experience in container technologies like Kubernetes and Docker.

  • 3+ years of experience in large scale observability systems like Elasticsearch and Prometheus

  • 2+ years of experience in building resilient CI/CD pipelines, strong knowledge of Git, and familiarity with a DevOps platform like GitLab.

  • 1+ years of experience with Helm and Infrastructure as Code tools (Terraform preferred)

  • Strong in computer science fundamentals like data structures, algorithm design and complexity analysis

  • Ability to write elegant code, and comfortable with picking up new technologies independently.

  • Self-motivated and self-directed, ability to translate technical direction into functional solutions.

We maintain a friendly, team-oriented environment and place a high value on professionalism, attitude and initiative.