Distinguished, Data Scientist

Posted:
12/6/2024, 10:45:02 PM

Location(s):
Indiana, United States ⋅ Karnataka, India ⋅ Indianapolis, Indiana, United States ⋅ Bengaluru, Karnataka, India

Experience Level(s):
Senior

Field(s):
AI & Machine Learning ⋅ Data & Analytics

Position Summary...

Provides overall direction by analyzing business objectives and customer needs; developing, communicating, building support for, and implementing business strategies, plans, and practices; analyzing costs and forecasts and incorporating them into business plans; determining and supporting resource requirements; evaluating operational processes; measuring outcomes to ensure desired results; identifying and capitalizing on improvement opportunities; promoting a customer environment; and demonstrating adaptability and sponsoring continuous learning. Develops and implements strategies to attract and maintain a highly skilled and engaged workforce by diagnosing capability gaps; recruiting, selecting, and developing talent; supporting mentorship, workforce development, and succession planning; and leveraging the capabilities of new and existing talent. Cultivates an environment where associates respect and adhere to company standards of integrity and ethics by integrating these values into all programs and practices; developing consequences for violations or non-compliance; and supporting the Open Door Policy. Develops and leverages internal and external partnerships and networks to maximize the achievement of business goals by sponsoring and leading key community outreach and involvement initiatives; engaging key stakeholders in the development, execution, and evaluation of appropriate business plans and initiatives; and supporting associate efforts in these areas.

What you'll do...

Your Opportunity 

As a Distinguished Data Scientist in the Catalog-data sciences team, you'll have the opportunity to  

  • Drive innovative strategic solutions of improving catalog content utilizing advanced SOTA AI and ML solutions at large scale and at accelerated pace. 

  • Drive data-derived insights by developing advanced statistical models, machine learning algorithms and computational algorithms based on business initiatives  

  • Work closely with Directors, Sr. Managers of Data Science, and leaders of Architecture, Engineering, Product & business teams to drive the Organizational strategy around AI driven Catalog and Content management. 

  • Direct the gathering of data, assess data validity and synthesize data into large analytics datasets to support project goals  

  • Lead a team of Data Scientists, Data Analysts, Data Engineers to improve the Catalog Content Quality using advanced Machine Learning and Gen AI. 

  • Utilize big data analytics and advanced data science techniques to identify trends, patterns, and discrepancies in data. Determine additional data needed to support insights  

  • Build and train AI/ML models for replication for future projects  

  • Deploy and maintain the data science solutions  

  • Communicate recommendations to business partners and influence future plans based on insights  

 

 

Your Responsibility 

 

  • Consult with product and business stakeholders regarding algorithm-based recommendations and be a thought-leader to develop these into business actions. 

  • Guides data scientists, senior data scientists and staff data scientists across the domain DS team to ensure on-time delivery of ML products. 

  • Proactive collaboration with cross-functional teams on driving the end-to-end content improvement. 

  • Research oriented development to find innovative solutions on content improvement, with a strong grasp of multi-lingual and multi-modal foundational models. 

  • Guides the data science team members to ensure on-time delivery of ML products 

  • Proactive identification of complex business problems that can be solved using advanced ML and Computer Vision, finding opportunities and gaps in the current business domain 

  • Evaluates proposed business cases for projects and initiatives 

  • Translates business requirements into strategies, initiatives, and projects and aligns them to business strategy and objectives, and drives the execution of deliverables 

  • Sets relevant deliverables based on the established success criteria and define key metrics to measure progress and effectiveness of the solution 

  • Quantifies business impact and ensures regular impact measurement of all ML products in the domain. 

  • Identifies and reviews model evaluation metrics based on analytical requirements 

  • Ensures testing information is documented and maintained by the team 

  • Guide teams to perform model deployment following MLOps guidelines. 

  • Play a key role to solve complex problems, pivotal to Walmart's business and drive actionable insights 

  • Utilize product mindset to build, scale and deploy holistic data science products after successful prototyping 

  • Demonstrate incremental solution approach with agile and flexible ability to overcome practical problems 

  • Articulate and present recommendations to business partners and influence plans based on insights 

  • Partner and engage with associates in other regions for delivering the best services to customers around the globe 

  • Work with the customer-centric mindset to deliver high-quality business-driven analytic solutions. 

  • Drive innovation in approach, method, practices, process, outcome, delivery, or any component of end-to-end problem solving 

  • Proactively engages in the external community to build Walmart's brand and learn more about industry practices 

  • Promote and support company policies, procedures, mission, values, and standards of ethics and integrity 

 

Our Ideal Candidate 

You are a technically strong and high-performing individual with excellent communication skills, a proven analytical skillset, and customer focus. A big plus for this role is to have proven experience in leading data science initiatives in areas like Catalog content improvement, attribute and image enhancements, item classifications and grouping, matching, content scoring and cleaning. You stay updated with the latest research and technology ideas and have a passion for utilizing innovative ways to solve problems. You stay updated with the latest research and technology ideas and have a passion for utilizing innovative ways to solve problems. You are a good storyteller and able to articulate the intricacies of your models and explain your results clearly to executive stakeholders. In addition, you have industry knowledge of the retail space, with a keen interest in keeping up to date on the latest happenings in this space. 

 

Your Qualifications 

  • Bachelor's with > 20 years, Masters > 17 years OR Ph.D. in Comp Science/Statistics/Mathematics with > 12 years of relevant experience. Educational qualifications should be Computer Science/Statistics/Mathematics or a related area. 

  • Experience of acting as a tech lead for > 10 years. 

  • Ability to lead data science projects end to end. 

  • Deep experience in building data science solutions with a focus on catalog content such as attribute and image enhancements, item classifications and grouping, matching, content scoring and cleaning at the scale of billions of items across many countries. 

  • Strong experience in machine learning, supervised and unsupervised: NLP, Classification, Data/Text Mining, Multi-modal supervised and unsupervised models, Neural Networks, Deep Learning Algorithms, Generative AI models. 

  • Experience in analyzing complex problems and translating them into analytical solutions. 

  • Deep experience in analyzing complex problems and translating them into analytical solutions. 

  • Strong experience in Computer Vision, image processing, object detection, Deep learning. Exposure to image features, edge detection, blur, sharpness, corner detection, face detection, knowledge of how image data is represented. 

  • Knowledge of Deep learning, CNNs, and components of CNNs, loss functions for classification, object detection, segmentation. 

  • Experience in machine learning: Classification models, regression models, NLP, Forecasting, Unsupervised models, Optimization, Recommendation models, deep-learning, Graph ML, Causal inference, Causal ML, Statistical Learning, experimentation 

  • Ability to utilize data science solutions outcomes to drive predictive and prescriptive analytics. 

  • Experience with big data analytics - identifying trends, patterns, and outliers in large volumes of data 

  • Ability to scale and deploy data science solutions. 

  • Experience in LLMs, VLMs, embedding generation from multimodal data, storage and retrieval from Vector Databases, set-up and provisioning of managed LLM gateways, development of Retrieval augmented generation based LLM agents, model selection, iterative prompt engineering and finetuning based on accuracy and user-feedback, monitoring and governance. 

  • Strong Experience in Python, PySpark, OpenCV, Python, C++,  Keras, tensorflow, pytorch, big data platforms like Hadoop 

  • Google Cloud platform, Vertex AI, Kubeflow, model deployment, Kafka streaming, API development & deployment, CI/CD, MLOPs 

 

Additional Preferred Qualifications: 

  • Domain Knowledge of Catalog and Content Management. 

  • Published papers or given talks in leading academic and research journals 

  • Published papers or given talks in Data Science Forums 

  • Holds data science related patents 

  • Knowledge and experience of working on Spanish language. 

  • Experience with GPU/CUDA for computational efficiency 

Minimum Qualifications...

Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications.

Minimum Qualifications:Option 1: Bachelor's degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field and 6 years' experience in an analytics related field. Option 2: Master's degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field and 4 years' experience in an analytics related field. Option 3: 8 years' experience in an analytics or related field.

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

Primary Location...

Pardhanani Wilshire Ii, Cessna Business Park, Kadubeesanahalli Village, Varthur Hobli , India

Walmart

Website: http://www.walmart.com/

Headquarter Location: Bentonville, Arkansas, United States

Employee Count: 10001+

Year Founded: 1962

IPO Status: Public

Last Funding Type: Post-IPO Debt

Industries: E-Commerce ⋅ Grocery ⋅ Retail ⋅ Retail Technology ⋅ Shopping