DevOps Lead

Posted:
11/18/2024, 5:35:37 PM

Location(s):
Old Toronto, Ontario, Canada ⋅ Ontario, Canada

Experience Level(s):
Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Ideogram’s mission is to help people become more creative. Our thesis is that everyone has an innate desire to create. We are developing state-of-the-art AI tools that will make creative expression more accessible and efficient. We are pushing the limits of what’s possible with AI, with a focus on creativity and a high standard for trust and safety. Our headquarters is in downtown Toronto, and we have a small presence in NYC. Read Ideogram 2.0 blog post and check out our text to image product at ideogram.ai to get a glimpse of what we're building.

About The Role

The DevOps Lead will manage Ideogram's cloud infrastructure for product, ML training, and ML inference. This includes a tight orchestration of resources on GCP and Cloudflare, in particular the design of Ideogram's networking infrastructure to maximize security.

What We’re Looking For

  • Expertise with Terraform, Kubernetes, cloud infrastructure, and management of ML accelerator fleets

  • Expertise in cloud security and organization security, and identity and access management

  • Expertise with Google Cloud Platform

  • Python, shell scripting

  • Strong infrastructure and security fundamentals

  • Experience in maintaining scalable and reliable systems (e.g., experience with PubSub, service accounts, identity management & RBAC and CI/CD).

  • Experience with setting up and maintaining observability for deployed applications

  • Strong analytical and problem-solving skills

  • Self-starter - able to come up to speed on complex, difficult concepts with minimal assistance

  • Experience communicating complex technical issues to technical and non technical team members

  • Excitement about generative AI technology and its impact on the creative economy.

  • Bachelor's degree in Computer Science, Engineering or related field

Nice to Have

  • Managing web service infrastructure (e.g. load balancers, reverse proxies, CDNs)

  • Monitoring and improving cloud resource usage and costs

  • Management of training and inference workloads

  • Management of CI infrastructure for internal development

  • Have prior experience working closely with technical teams in creating and deploying infrastructure.

  • Passion for building robust and secure systems

  • Possess a track record of helping companies properly manage their infrastructure through IaC

  • Experience with Helm and Kustomize

Founding team

Our founding team consists of world-renowned AI experts including Mohammad Norouzi, Jonathan Ho, William Chan, and Chitwan Saharia. This team has previously led transformative AI projects at Google Brain, UC Berkeley, CMU, and the University of Toronto. Our fundamental work in AI includes: Imagen: Google’s text-to-image system, Imagen Video for video synthesis, Denoising Diffusion Models, which is the foundation of the recent generative media transformation.

Company Culture

We're a single flat team that transcends engineering, research, product, and operation roles. Everyone is willing to do whatever is necessary to make our company and customers successful. We believe that a small, dedicated team with a collaborative culture can move faster and build better and more coherent products than large hierarchical organizations. At Ideogram, we provide mentorship and support to help our employees grow with the company and achieve their ambitious career goals.

Ideogram is committed to welcoming everyone, regardless of gender identity, orientation, or expression. Our mission is to remove exclusivity and barriers and encourage new thinking and perceptions, in a space of belonging. It is not about race, gender, or age, it is about the people.