Site Reliability Engineer

Posted:
8/21/2024, 5:00:00 PM

Location(s):
Atlanta, Georgia, United States ⋅ Georgia, United States

Experience Level(s):
Junior ⋅ Mid Level ⋅ Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

You know the moment. It’s the first notes of that song you love, the intro to your favorite movie, or simply the sound of someone you love saying “hello.” It’s in these moments that sound matters most. 

At Bose, we believe sound is the most powerful force on earth. We’ve dedicated ourselves to improving it for nearly 60 years. And we’re passionate down to our bones about making whatever you’re listening to a little more magical. 

The Information Technology team at Bose exists to deliver valuable and reliable business and technology solutions with an innovative, engaged, and collaborative team focused on contributing to our corporate vision.

Job Description

Specific Responsibilities: 

  • Design, implement, and manage systems to ensure high availability and performance of production services. 

  • Develop and maintain monitoring, alerting, and logging systems to proactively identify and address issues. 

  • Create and enforce Service Level Objectives (SLOs), Service Level Agreements (SLAs), and Key Performance Indicators (KPIs). 

  • Lead the response to production incidents, including troubleshooting, resolution, and post-incident analysis. 

  • Develop and maintain incident response procedures and runbooks. 

  • Conduct root cause analysis and implement corrective actions to prevent recurrence. 

  • Automate repetitive tasks and processes to improve efficiency and reduce human error. 

  • Develop and maintain tools for deployment, configuration management, and system monitoring. 

  • Collaborate with development teams to integrate automation into the software delivery pipeline. 

  • Perform capacity planning to ensure systems can handle current and future workloads. 

  • Design and implement scaling strategies to accommodate changes in demand. 

  • Monitor resource utilization and optimize infrastructure to achieve cost efficiency. 

  • Collaborate with development teams to design scalable and reliable system architectures. 

  • Participate in architectural reviews and provide guidance on reliability and performance considerations. 

  • Evaluate and recommend new technologies and approaches to enhance system reliability and performance. 

  • Document system configurations, processes, and procedures. 

  • Create and maintain operational runbooks and knowledge base articles. 

  • Provide training and mentorship to team members and other stakeholders on reliability best practices. 

  • Work closely with software engineers, operations teams, R&D, automotive, and other stakeholders to ensure smooth deployment and operation of services. 

  • Communicate effectively about system status, incident responses, and reliability improvements. 

  • Participate in on-call rotations and be available to respond to incidents as needed. 

 

Required Competencies: 

  • Proficiency in scripting and programming languages (e.g., Python, Go, JSON, Java) 

  • Experience with monitoring and observability tools (e.g., Logic Monitor, Prometheus, New Relic, Grafana, Datadog) preferred. 

  • Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes). 

  • Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud). 

  • Experience with configuration management and infrastructure-as-code tools (e.g., Terraform, Ansible) preferred. 

  • Excellent problem-solving and analytical skills. 

  • Strong communication and collaboration abilities. 

 

Experience Requirements: 

  • Experience: 3+ years of experience in a similar role, with a strong background in systems engineering, software development, or operations. 

 

Education/Certification Requirements: 

  • Education: Bachelor’s degree in Computer Science, Information Technology, or a related field. Advanced degree or relevant certifications (e.g., AWS Certified DevOps Engineer, Google Professional DevOps Engineer) preferred. 

Bose is an equal opportunity employer that is committed to inclusion and diversity. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, veteran status, or any other legally protected characteristics. For additional information, please review: (1) the EEO is the Law Poster (http://www.dol.gov/ofccp/regs/compliance/posters/pdf/OFCCP_EEO_Supplement_Final_JRF_QA_508c.pdf); and (2) its Supplements (http://www.dol.gov/ofccp/regs/compliance/posters/ofccpost.htm). Please note, the company's pay transparency is available at http://www.dol.gov/ofccp/pdf/EO13665_PrescribedNondiscriminationPostingLanguage_JRFQA508c.pdf. Bose is committed to working with and providing reasonable accommodations to individuals with disabilities. If you need a reasonable accommodation because of a disability for any part of the application or employment process, please send an e-mail to [email protected] and let us know the nature of your request and your contact information.

Our goal is to create an atmosphere where every candidate feels supported and empowered in the interviewing process. Diversity and inclusion are integral to our success, and we believe that providing reasonable accommodation is not only a legal obligation but also a fundamental aspect of our commitment to being an employer of choice. We recognize that individuals may have different needs and requirements based on their abilities, and we provide reasonable accommodations to ensure ideal conditions are met during the application process.

If you believe you need a reasonable accommodation, please send a note to [email protected]