Posted:
7/29/2024, 5:00:00 PM
Location(s):
Irving, Texas, United States ⋅ The Woodlands, Texas, United States ⋅ Georgia, United States ⋅ Florida, United States ⋅ Ohio, United States ⋅ Texas, United States ⋅ Jacksonville, Florida, United States ⋅ Richmond, Virginia, United States ⋅ Virginia, United States ⋅ Columbus, Ohio, United States ⋅ Alpharetta, Georgia, United States
Experience Level(s):
Senior
Field(s):
AI & Machine Learning
Workplace Type:
Remote
McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve – we care.
What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patients, our communities, and our people. If you want to be part of tomorrow’s health today, we want to hear from you.
Job title: Sr. Machine Learning Ops Engineer
Job Description:
As a ML OPS Engineer , you'll be part of a lean software team dedicated to productionizing machine learning applications and systems at scale. You’ll participate in the detailed technical design, development, and implementation of machine learning applications using existing and emerging technology platforms. You’ll focus on machine learning architectural design, develop and review model and application code, and ensure high availability and performance of our machine learning applications. You'll have the opportunity to continuously learn and apply the latest innovations and best practices in machine learning engineering.
As a Senior ML OPS Engineer, you will be joining a team of experienced Machine Learning Engineers that support, build, and enable Machine Learning capabilities across the organization. You will work closely with internal customers and infrastructure teams to build our next generation data science workbench and ML platform and products. You will be able to further expand your knowledge and develop your expertise in modern Machine Learning frameworks, libraries and technologies while working closely with internal stakeholders to understand the evolving business needs. If you have a penchant for creative solutions and enjoy working in a hands-on, collaborative environment, then this role is for you.
What you’ll do in the role:
Implement scalable and reliable systems leveraging cloud-based architectures, technologies and platforms to handle model inference at scale.
Deploy and manage machine learning & data pipelines in production environments.
Work on containerization and orchestration solutions for model deployment.
Participate in fast iteration cycles, adapting to evolving project requirements.
Collaborate as part of a cross-functional Agile team to create and enhance software that enables state-of-the-art big data and ML applications.
Leverage CICD best practices, including test automation and monitoring, to ensure successful deployment of ML models and application code.
Ensure all code is well-managed to reduce vulnerabilities, models are well-governed from a risk perspective, and the ML follows best practices in Responsible and Explainable AI.
Collaborate with Data scientists, software engineers, data engineers, and other stakeholders to develop and implement best practices for MLOps, including CI/CD pipelines, version control, model versioning, monitoring, alerting and automated model deployment.
Manage and monitor machine learning infrastructure, ensuring high availability and performance.
Implement robust monitoring and logging solutions for tracking model performance and system health.
Monitor real-time performance of deployed models, analyze performance data, and proactively identify and address performance issues to ensure optimal model performance.
Troubleshoot and resolve production issues related to ML model deployment, performance, and scalability in a timely and efficient manner.
Implement security best practices for machine learning systems and ensure compliance with data protection and privacy regulations.
Collaborate with platform engineers to effectively manage cloud compute resources for ML model deployment, monitoring, and performance optimization.
Develop and maintain documentation, standard operating procedures, and guidelines related to MLOps processes, tools, and best practices.
Basic Qualifications:
Master’s or doctoral degree in computer science, electrical engineering, mathematics, or a similar field.
Typically requires 7+ years of hands-on work experience developing and applying advanced analytics solutions in a corporate environment with at least 4 years of experience programming with Python.
At least 3 years of experience designing and building data-intensive solutions using distributed computing.
At least 3 years of experience productionizing, monitoring, and maintaining models
Must have skills:
Understanding of Azure stack like Azure Machine Learning, Azure Data Factory, Azure Databricks, Azure Kubernetes Service, Azure Monitor, etc.
Demonstrated expertise in building and deploying AI/Machine Learning solutions at scale leveraging cloud such as AWS, Azure, or Google Cloud Platform.
Experience in developing and maintaining APIs (e.g.: REST).
Experience specifying infrastructure and Infrastructure as a code (e.g.: Ansible, Terraform).
Experience in designing, developing & scaling complex data & feature pipelines feeding ML models and evaluating their performance.
Ability to work across the full stack and move fluidly between programming languages and MLOps technologies (e.g.: Python, Spark, DataBricks, Github, MLFlow, Airflow).
Expertise in Unix Shell scripting and dependency-driven job schedulers.
Understanding of security and compliance requirements in ML infrastructure.
Experience with visualization technologies (e.g.: RShiny, Streamlit, Python DASH, Tableau, PowerBI).
Familiarity with data privacy standards, methodologies, and best practices.
Physical Requirements: General Office Demands
Candidate must be authorized to work in the U.S, now or in the future, without the support from McKesson.
Relocation is NOT budgeted for this position.
#LI-TV
We are proud to offer a competitive compensation package at McKesson as part of our Total Rewards. This is determined by several factors, including performance, experience and skills, equity, regular job market evaluations, and geographical markets. In addition to base pay, other compensation, such as an annual bonus or long-term incentive opportunities may be offered. For more information regarding benefits at McKesson, please click here.
Our Base Pay Range for this position
$130,400 - $217,400McKesson is an Equal Opportunity Employer
McKesson provides equal employment opportunities to applicants and employees and is committed to a diverse and inclusive environment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, age or genetic information. For additional information on McKesson’s full Equal Employment Opportunity policies, visit our Equal Employment Opportunity page.
Join us at McKesson!
Website: https://macrohelix.com/
Headquarter Location: Atlanta, Georgia, United States
Employee Count: 101-250
Year Founded: 2009
IPO Status: Private
Industries: Information Technology ⋅ Software