Lead Data Engineer - ML/GCP

Posted:
7/28/2024, 5:00:00 PM

Location(s):
Irving, Texas, United States ⋅ Texas, United States

Experience Level(s):
Senior

Field(s):
AI & Machine Learning ⋅ Data & Analytics

Bring your heart to CVS Health. Every one of us at CVS Health shares a single, clear purpose: Bringing our heart to every moment of your health. This purpose guides our commitment to deliver enhanced human-centric health care for a rapidly changing world. Anchored in our brand — with heart at its center — our purpose sends a personal message that how we deliver our services is just as important as what we deliver.
 
Our Heart At Work Behaviors™ support this purpose. We want everyone who works at CVS Health to feel empowered by the role they play in transforming our culture and accelerating our ability to innovate and deliver solutions to make health care more personal, convenient and affordable.

Position Summary

  • As a technical leader, provide guidance to a team of data engineers and collaborate closely with data scientists and analysts to support data-driven decision-making.
  • Analyzes complex data structures from disparate data sources and design large scale data engineering pipeline.
  • Develops large scale data structures and pipelines to organize, collect and standardize data that helps generate insights and addresses reporting needs.
  • Implement data ingestion pipeline using APIs, third party tools, or create custom codes to ingest high volume data into Cloud environment.
  • Writes processes, designs database systems and develops tools for real-time and offline analytic processing.
  • Collaborates with product business and data science team to collect user stories, translate into technical specifications, and implement data transformation, algorithms and models into automated processes.
  • Uses strong programming skills in PySpark, Python/Java or any of the major languages to build robust data pipelines and dynamic systems.
  • Builds highly scalable and extensible data marts and data models to support Data Science and other internal customers on Cloud. Integrates data from a variety of sources, assuring that they adhere to data quality and accessibility standards.
  • Analyzes current information technology environments to identify and assess critical capabilities and recommend solutions.
  • Build and facilitate machine learning across large scale systems and campaigns and ensure deployment and updates in partnership with data engineering.
  • Develop and participate in presentations and consultations to existing and prospective constituents on analytics results and solutions.
  • Interact with internal and external peers and managers to exchange complex information related to areas of specialization.
  • The ideal candidate should be detailed oriented with the ability to quickly understand complex situations, manage multiple urgent tasks at the same time and have a proven track record of communicating in an open and honest way that quickly builds trust and respect.
  • The ideal candidate should work with data related to a wide range of customer interactions and analytics. This role will be responsible for designing and deploying large scale ML models with support from data engineering and Product Team.
  • Experiments with available tools and advice on new tools to determine optimal solution given the requirements dictated by the model/use cases.


Required Qualifications

  • 7+ years of progressively complex related experience in cloud data engineering and data analysis.
  • 7+ Years of experience in Data Engineering, Analytics and Machine Learning Systems.
  • 3+ years of building cloud native analytical products in GCP, Azure or AWS
  • Sound knowledge in any of cloud Technology is must preferably Google cloud Platform (GCP).
  • Deep Knowledge of Large Scale distributed Data Architecture and Performance Optimization Techniques.
  • Proficiency in developing Complex Data Pipelines, ETLs and Workflows on Cloud Platform optimized for High Volume of Health Care data.
  • Proficiency in using Cloud Platforms such as GCP/Azure/AWS. Knowledge of tools like Composer, Kafka, PySpark and SQL.
  • Knowledge of Programming Language Java/Python.
  • Proficiency with CI/CD tooling like Jenkins, GitHub to enable robust development pipelines for data and ML.
  • Strong knowledge of large-scale search applications and building high volume data pipelines, preferably using PySpark on GCP and It’s native tools such as BigQuery, Airflow, Composer, DataProc, PUB/SUB, DataFlow and Vertex AI.
  • Strong Foundational Knowledge in Agile Methodologies.


Preferred Qualifications

  • Experience with Healthcare domain is highly desirable.
  • Deep understanding of data warehousing, data architecture, and data modeling methods & best practices.
  • Understanding of AI/ML technology Stack.
  • Comfortable working experience with large scale LLMs.
  • Exposure in implementing Gen AI and/or NLP based solutions using LLMs.
  • Sound experience in Google Cloud Data Services: Big Query, Data Proc, PubSub, Cloud Functions, Cloud Storage, Dataflow, Composer.


Education

  • Bachelor's degree or equivalent work experience in Mathematics, Statistics, Computer Science, Business Analytics, Data Science, Engineering, or related discipline
  • Master’s degree Preferred in Computer Science/ML

Pay Range

The typical pay range for this role is:

$118,450.00 - $236,900.00


This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls.  The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors.  This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above.  This position also includes an award target in the company’s equity award program. 
 
In addition to your compensation, enjoy the rewards of an organization that puts our heart into caring for our colleagues and our communities.  The Company offers a full range of medical, dental, and vision benefits.  Eligible employees may enroll in the Company’s 401(k) retirement savings plan, and an Employee Stock Purchase Plan is also available for eligible employees.  The Company provides a fully-paid term life insurance plan to eligible employees, and short-term and long term disability benefits. CVS Health also offers numerous well-being programs, education assistance, free development courses, a CVS store discount, and discount programs with participating partners.  As for time off, Company employees enjoy Paid Time Off (“PTO”) or vacation pay, as well as paid holidays throughout the calendar year. Number of paid holidays, sick time and other time off are provided consistent with relevant state law and Company policies. 
 
For more detailed information on available benefits, please visit jobs.CVSHealth.com/benefits

We anticipate the application window for this opening will close on: 10/28/2024

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.