Data Science Internship

Posted:
1/7/2025, 12:02:11 AM

Location(s):
Amsterdam, Noord-Holland, Netherlands ⋅ Noord-Holland, Netherlands

Experience Level(s):
Internship

Field(s):
Data & Analytics

Data Science Internship


Background

Academic researchers frequently utilize Elsevier's products like ScienceDirect and Scopus to find scientific research that will help them understand the background of a field, learn how to conduct an experiment in an appropriate way, or plan and execute their own project. Oftentimes, researchers encounter a common challenge where they have an idea or research question but lack clarity on how to initiate or perform the study effectively. They struggle to identify the required materials, resources, techniques, and steps essential for producing high quality research outcomes. As a leader in providing scientific, technical, and medical information and solutions, Elsevier is in a great position to help solve this problem and optimize the research process.

Research protocols are detailed plans that outline the procedures, methods, and steps to be followed in a research study. They serve as a roadmap or guide for conducting scientific investigations and experiments in a systematic and structured manner. Research protocols typically include information on the research question or hypothesis, study design, data collection methods, analysis techniques, ethical considerations, and other aspects relevant to the study. Many protocols are published in journals such as Elsevier’s own STAR Protocols, an open access journal from Cell Press. However, an individual researcher may need to adapt a protocol or combine information from many sources to be able to execute their own project.

Responsibilities

This project aims to leverage LLM agents to develop an innovative solution for generating research protocols for academic researchers using Elsevier's products. We are looking for an intern that will design and implement GenAI based tooling that can analyze a researcher's specific research question, retrieve relevant information from various sources, and generate a detailed protocol outlining the necessary steps to conduct the research study effectively.

Steps include:

· Literature review and competitor analysis

· Characterization of data sources and protocol structure

· Implementation of GenAI tooling

· Creation of gold dataset and evaluation

· Prototype development

· Presentation to stakeholders

Requirements

  • Knowledge of NLP, experience with GenAI (prompt engineering, evaluation);
  • Proficient in writing Python to collect data, run experiments, create simple prototypes;
  • Ability to work with various data sources via API calls;
  • Good communication and presentation skills;
  • The duration for this internship is 4 months.

This role is based in the Amsterdam office (1x per week). A valid Visa or EU citizenship is mandatory. We do not sponsor Visas for this role. 

Non EU candidates need to receive credits from their university for this internship. You need to be registered as a student in the Netherlands during the whole internship period.

-----------------------------------------------------------------------

Elsevier is an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form: https://forms.office.com/r/eVgFxjLmAK , or please contact 1-855-833-5120.

Please read our Candidate Privacy Policy.

Elsevier

Website: https://elsevier.com/

Headquarter Location: Amsterdam, Noord-Holland, The Netherlands

Employee Count: 5001-10000

Year Founded: 1880

IPO Status: Private

Last Funding Type: Private Equity

Industries: Content ⋅ Content Discovery ⋅ Delivery ⋅ Health Care ⋅ Information Services ⋅ Information Technology ⋅ Publishing