Senior Site Reliability Engineer (Kubernetes)

Posted:
12/19/2024, 5:20:06 AM

Experience Level(s):
Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Workplace Type:
Remote

We're looking for a Senior Site Reliability Engineer (Kubernetes) to join our Infrastructure team in Supermetrics.

Location: Canada (fully remote)

Role: Permanent, full-time. This role consists of on-call rotation.

Onboarding: As part of your onboarding, we expect the candidate to spend 2-3 weeks at our HQ in Helsinki (we organize the travel arrangements).

In this role, you'll:

Raise the team's bar in Kubernetes expertise, getting to mentor, guide and support your direct colleagues as well as other members of our Engineering organization in working with managed Kubernetes clusters across providers
Operate the platform that enables our SaaS products to be used by thousands of businesses from around the world, defining SLAs and SLOs and driving the automation that will ensure we meet them
In a nutshell, you will use your expertise in containers, Kubernetes, databases, and automation to streamline our operations and improve our infrastructure.

Your day-to-day work and responsibilities will include:

Write Terraform configuration and modules that bootstrap a Kubernetes cluster, or review PRs with contributions from other members, making sure that our modules are truly reusable and well-defined, improving how we test and release them.
Write (using Golang, for example) and maintain or improve our tooling, ensuring it facilitates platform utilization by engineering teams.
Develop and maintain Helm charts for internal deployments and third-party software.
Respond to an incident with our production environment.
Support our pre-sales team and help them answer potential customers' questions on our architecture and how we guarantee data security or consistency or ensure uptime.
Review an architecture change involving a new database and take part in the meetings discussing the pros and cons of such an approach.
Rewrite a Github Action to improve how we deploy to Kubernetes using GitOps.
Troubleshoot and resolve technical issues as they arise.
Participate in our on-call rotations to provide support, respond to incidents, or handle internal users' questions.

Technologies you'll be working with:

Kubernetes
ArgoCD, Helmfile, Helm, External Secrets, Cert-manager, Nginx, Contour
Terraform
Cloud providers: AWS/GCP (Queues, Compute, Object Storage, Networking, IAM, etc.)
Other providers: Cloudflare (CDN, DNS), Aiven, Redis Co.
Github Cloud and Github Enterprise
OpenSearch, Redis, PostgreSQL, ClickHouse, MySQL
PHP, Golang

Requirements:

4+ years of experience in Site Reliability Engineering, Platform Engineering, or related roles
Strong understanding of containers and experience operating Kubernetes clusters at scale.
Experience operating databases in production
Proficient in database concepts with hands-on experience in both relational and NoSQL databases.
In-depth knowledge of Linux systems and Terraform.
In-depth experience and understanding of AWS and/or GCP
Solid understanding of modern observability practices and tools
Automation mindset with the ability to automate repetitive tasks using scripting languages such as Python or Bash.
Team player spirit
Willing to take on-call rotations during non-business hours
Good communication skills, in particular in writing (documentation, but able to write good PRs too)
Strong problem-solving skills with a passion for the tools, technologies and problems in this space
Automation mindset with the ability to reduce toil by codifying repetitive tasks using scripting languages such as Python or Bash

Nice to have:

A developer background and the ability to write CLIs and other tools in Go, Python or Rust is highly desirable.
Security mindset with experience implementing security best practices in platform and operational contexts.
Experience in creating and managing Helm charts.
Expert knowledge of continuous integration and continuous deployment (CI/CD) systems and processes and experience developing and maintaining GitHub Actions.

Recruitment Process:

Screening call with the recruiter
Hiring Manager Interview
Tech Assignment + Presentation
Team Interview
Final chat with CIO

Benefits we offer:

Competitive compensation package, including equity
Excellent work equipment and home office allowance for those working in our fully remote locations
Health care benefits and leisure time insurance
Annual 1000 euros of personal learning budget
Sports and wellbeing allowance

Does this sound like your next adventure? Apply now! We'll fill the role as soon as we find the right person.

Hear why our team likes it here at supermetrics.com/careers/life-at-supermetrics.

Get to know our Engineering team at supermetrics.com/careers/engineering.

#LI-Remote #LI-FullTime #LI-MiddleToSeniorLevel

Join us on our mission to make data a marketing superpower

Supermetrics is a frontrunner in data integration technology, with 15% of global advertising spend reported through our products.

Our technology streamlines marketing data for over 200,000 businesses through a network of agencies and customers like Shopify, HubSpot, and Nestlé. We help marketers master their data and turn it into insights that improve business results and predict the best next step. Since our founding in 2013, we've grown profitably to reach 750K+ users and over 50M€ in annual recurring revenue.

We're a team of 360+ growth-minded people from diverse backgrounds. Together, we make a multicultural, resourceful, and collaborative team.

Supermetrics operates on trust, transparency, and a keen customer focus. Forward-looking and action-oriented, we work hard to be the leader in our industry. As team players, we help each other and win together.

We're hiring for a diverse, competent, and collaborative team and building an inclusive workplace where everyone is treated fairly and respectfully.

It all started with a Google t-shirt... Read the rest of our growth story at supermetrics.com/about.

Supermetrics

Website: https://supermetrics.com/

Headquarter Location: Helsinki, Southern Finland, Finland

Employee Count: 101-250

Year Founded: 2013

IPO Status: Private

Last Funding Type: Series B

Industries: Analytics ⋅ B2B ⋅ Enterprise Software ⋅ Marketing ⋅ SaaS ⋅ Software

Senior Software Engineer (Multiple Openings) in Irving, Texas

US Bank • 8/18/2024 ⋅ United States

Staff Software Engineer, Build & Release (R3007)

Shield AI • 11/13/2024 ⋅ United States

Sr. Software Engineer

OpenGov • 12/5/2024 ⋅ Argentina

Automated Software Testing Developer

Leidos • 6/5/2024 ⋅ United States

Senior Product Security Engineer

Toast • 12/10/2024 ⋅ India

Notify

postings

pricing

login