Production Reliability Engineer, Trade Desk

Posted:
6/26/2024, 10:47:03 AM

Location(s):
Sydney, New South Wales, Australia ⋅ New South Wales, Australia

Experience Level(s):
Junior ⋅ Mid Level ⋅ Senior

Field(s):
Operations & Logistics

Jump Trading Group is committed to world-class research. We empower exceptional talents in Mathematics, Physics, and Computer Science to seek scientific boundaries, push through them, and apply cutting-edge research to global financial markets. Our culture is unique. Constant innovation requires fearlessness, creativity, intellectual honesty, and a relentless competitive streak. We believe in winning together and unlocking unique individual talent by incenting collaboration and mutual respect. At Jump, research outcomes drive more than superior risk-adjusted returns. We design, develop, and deploy technologies that change our world, fund start-ups across industries, and partner with leading global research organizations and universities to solve problems.

Core Development is a global team of technologists who architect, build, and maintain our world-class trading platform. From optimizing our core trading engine to building custom hardware, we leverage software and hardware engineering, data science, and research to deliver the infrastructure and tools that drive our trading and business needs.

Trade Desk is part of the larger Core Development team and plays a critical role in the management and oversight of the real-time production trading environment. The team runs a global operation to monitor and troubleshoot the production trading system, reliably deploy changes to our production environment, and build the orchestration, configuration management, and monitoring automation that support it. This role requires deep technical and operational knowledge across all areas of the trading platform in order to proactively monitor and troubleshoot our trading system, deploy changes to our production environment while minimizing operational risk, and implement tools and processes to drive continuous improvement. The team works with traders, operations, exchanges, and developers to optimize the trading environment and to investigate and solve system issues.

What You'll Do:

  • Own the production environment, driving performance, reliability, and operability through continuous improvement
  • Proactively monitor and troubleshoot large-scale trading systems and exchange connectivity
  • Build and maintain the DevOps toolkit for the production trading system, including configuration management, process management, deployment, monitoring, data collection, and analysis
  • Leverage firm-wide metrics to improve scalability and system performance
  • Collaborate across the technology organization to analyze and troubleshoot complex system problems
  • Work closely with Risk Management and Operational Trading Support teams to coordinate changes and manage incidents
  • Interact directly with traders to communicate and drive technology changes, manage incidents, and troubleshoot problems
  • Work with the Clearing team to reconcile trades and resolve position breaks
  • Assess and manage the operational risk of changes to the production environment
  • Define and document processes and procedures
  • Provide mentorship and cross-training to other technical operations SREs
  • Other duties as assigned or needed

Skills You'll Need:

  • Degree in Computer Science, a related field, or equivalent professional experience
  • At least 5 years of relevant work experience in an IT ops role, such as DevOps, SRE, Linux Systems Engineering, or Network Engineering
  • At least 3 years of experience in Python and shell scripting
  • Familiarity with C++ is helpful but not required
  • A rigorous, detail-oriented approach to operations
  • Strong understanding of the Linux operating system, including network and system configuration, kernel internals, scheduling, and performance tuning
  • Strong understanding of networking concepts such as routing, multicast, LLDP, VLAN tagging, and Ethernet
  • A deep sense of ownership and urgency
  • Ability to handle shared operational and periodic on-call duties
  • Reliable and predictable availability

If you are currently a student or recent graduate, please see our Campus postings which offer both Summer and Full-Time opportunities.