Posted:
10/9/2024, 12:03:10 AM
Location(s):
Catalonia, Spain ⋅ Barcelona, Catalonia, Spain
Experience Level(s):
Senior
Field(s):
AI & Machine Learning ⋅ Data & Analytics
About Our Organization:
Dow Jones is a global provider of news and business information, delivering content to
consumers and organizations around the world across multiple formats, including print,
digital, mobile and live events. Dow Jones has produced unrivaled quality content for
more than 130 years and today has one of the world’s largest news-gathering
operations globally. It is home to leading publications and products including the
flagship Wall Street Journal, America’s largest newspaper by paid circulation; Barron’s,
MarketWatch, Mansion Global, Financial News, Investor’s Business Daily, Factiva, Dow
Jones Risk & Compliance, Dow Jones Newswires, OPIS and Chemical Market
Analytics. Dow Jones is a division of News Corp (Nasdaq: NWS, NWSA; ASX: NWS,
NWSLV).
About the Team:
Our Technology team drives the evolution of our Technology, Engineering, Data,
Product and User Experience functions. With a keen focus on delivering cutting-edge
solutions, we shape the digital landscape for our customers, readers, and users. From
revolutionizing visuals to optimizing tools and harnessing the power of data, mobile,
video, and social platforms, our team is committed to providing a seamless and
immersive experience across all touchpoints. Collaborating closely with our newsrooms
and strategic partners, we spearhead the development of groundbreaking products and
technologies.
About the Role:
Dow Jones is seeking an experienced Data Engineer to join our AI Engineering Team. You will be responsible for designing, developing, and maintaining robust data pipelines for data scraping, processing, extraction, transformation, loading, and storage. You will collaborate within our team to ensure the efficient and reliable retrieval of data, enabling seamless integration with downstream systems for analysis and decision-making.
As a key team member, you will play a crucial role in operationalizing data solutions to meet our organization's needs and deliver tangible value. You will leverage your strong data engineering skills to develop robust, secure, and scalable data pipelines, utilizing your expertise in data retrieval and processing techniques. This position will report to the Associate Director, Data Science.
You Will:
Collaborate with data scientists and ML engineers to design, develop, and maintain end-to-end data pipelines for extraction, transformation, loading (ETL), and storage.
Clean, transform, and structure data using industry-standard techniques, ensuring quality and consistency.
Work with APIs to retrieve data from external sources or integrate with third-party services, adhering to best practices.
Manage and optimize SQL and NoSQL database systems for data storage, ensuring integrity and performance.
Automate data fetching, processing, and storage by implementing data pipelines for ML/AI use cases, leveraging ETL principles.
Identify and troubleshoot issues related to data quality and pipeline performance, applying problem-solving skills.
Communicate effectively with stakeholders and data providers to gather requirements and ensure project alignment.
Stay updated with industry trends, emerging technologies, and best practices in data engineering for Machine Learning
You Have:
At least 3 years of industrial experience in a data engineering role
Experience with cloud-based infrastructure and services (AWS, GCP preferred)
Experience in designing and implementing end-to-end data pipelines for ML/AI use cases. Preferably in Airflow or GCP Cloud Composer.
Ability to work with APIs to retrieve data from external sources or integrate with third-party services.
Familiarity with NLP and Machine Learning frameworks and libraries (e.g., PyTorch, HuggingFace, LangChain, spaCy, NLTK, scikit-learn, etc.)
Experience with working on Jenkins and/or Docker.
Familiarity with database systems, including SQL and NoSQL databases.
Bachelor's Degree or higher in Computer Science, Computer Engineering, Data Science or related STEM field preferred
Strong communication and collaboration skills to work effectively with team members and stakeholders.
Continuous learning mindset with a willingness to stay updated with industry trends and best practices.
Our Benefits
Comprehensive Insurance Plans
Paid Time Off
Family Care Benefits
Access to Dow Jones Products
Subscription Discounts
Employee Referral Program
Employee Well-being Support & Fitness Programs
Business Area:
Dow Jones - TechnologyJob Category:
Data Analytics/Warehousing & Business IntelligenceUnion Status:
Non-Union roleWebsite: https://www.dowjones.com/
Headquarter Location: New York, New York, United States
Employee Count: 5001-10000
Year Founded: 1882
IPO Status: Private
Industries: Digital Media ⋅ News ⋅ Publishing ⋅ Social Media