Senior Data Scientist, Ingest

Posted:
9/11/2024, 4:29:58 AM

Location(s):
Austin, Texas, United States ⋅ Texas, United States ⋅ New York, United States ⋅ North Carolina, United States ⋅ England, United Kingdom ⋅ New York, New York, United States ⋅ Florida, United States ⋅ Tarrytown, New York, United States ⋅ Saint Petersburg, Saint Petersburg, Russia ⋅ Saint Petersburg, Russia ⋅ California, United States ⋅ Rhode Island, United States ⋅ London, England, United Kingdom ⋅ Los Angeles, California, United States ⋅ Minneapolis, North Carolina, United States

Experience Level(s):
Senior

Field(s):
Data & Analytics

Workplace Type:
Remote

Exactera has offices in New York City, Tarrytown NY, St. Petersburg FL, London, and Argentina. 

Compensation: $180,000- $200,000 depending on experience

We are a remote-friendly company, with offices in New York City, Tarrytown NY, St. Petersburg FL, London, and Argentina. For this role, we are happy to consider US-based candidates in the following states: AZ, CO, CT, DC, FL, GA, HI, IL, IN, LA, MA, MD, MI, MN, NC, NJ, NY, OH, PA, RI, SC, TN, TX, UT, VA, WA and WI, with preference to Eastern time zone residency.

Role Overview:

We are seeking an exceptional Senior Data Scientist with a focus on data ingestion to join our innovative team. The ideal candidate will possess a powerful combination of mathematical and statistical expertise, natural curiosity, and creative problem-solving skills. In this role, you will be the driving force behind uncovering hidden opportunities and realizing the full potential of our data.

Collaborating closely with our engineering and product teams, you will apply scientific methodologies to tackle complex business challenges and derive actionable insights. Your ability to synthesize information across diverse datasets and ask incisive questions will be fundamental in shaping our data-driven strategies. Importantly, your emphasis on leveraging advanced data collection methods, sophisticated scraping tools, and accessing third-party data will play a key role in not just replacing but significantly expanding our data ingestion capabilities.

 

Key Responsibilities:

  • Design strategies for gathering data from internal and external sources (e.g., scraping business descriptions, pulling financial data). Ensure data quality, accuracy, and reliability.
  • Research and implement methods to collect and enrich data without relying on third-party sources, ensuring that the internal database provides comprehensive coverage.
  • Develop complex algorithms to process and enrich raw data, transforming qualitative text (e.g., business descriptions) into quantitative features.
  • Set up and execute A/B testing comparing the model’s performance with and without enrichments.
  • Evaluate and track the impact of data enrichments using key performance metrics. Optimize models for better performance based on test results.
  • Continuously explore new methods or sources to enrich company data, such as alternative datasets or external APIs.
  • Create compelling data visualizations and narratives to effectively communicate findings to both technical and non-technical stakeholders.
  • Collaborate with cross-functional teams, including business leaders and engineering, to translate data insights into actionable strategies.
  • Design and conduct experiments to evaluate and improve model performance.
  • Develop and maintain documentation for data processes, models, and methodologies. 

 

Qualifications:

  • 5+ years of experience in data science, with a focus on data ingestion.
  • Strong programming skills in Python, R, or similar languages.
  • Strong expertise in building data models for large datasets, including natural language processing (NLP).
  • 3+ years of experience with query languages and database technologies.
  • 2+ years of experience working with cloud computing platforms such as AWS.
  • Proficiency in data visualization tools (e.g., Tableau, PowerBI, matplotlib).
  • Hands-on experience with A/B testing and optimizing machine learning models.
  • Strong knowledge of data engineering principles and big data technologies.
  • Excellent communication skills, with the ability to explain complex technical concepts to non-technical audiences.
  • Experience in presenting data-driven insights to senior management and influencing decision-making.
  • Demonstrated ability to work collaboratively with engineering teams to implement data science solutions.


Nice To Haves:

  • Proven track record of delivering measurable value to stakeholders and driving positive business outcomes through data-driven initiatives.
  • Experience applying design of experiments (DOE) in a professional setting.
  • Master's degree or Ph.D. in Computer Science, Data Science, Statistics, or a related field.
  • Experience in a fast-paced, startup environment.
  • Fintech background is a bonus

What We Offer:

Great team culture, career development, great compensation package and benefits, and an unparalleled experience as part of one of the most advanced teams in the world. With a company culture that provides ample opportunities to be recognized, build valuable skills, and grow your career.

About Us:

Exactera, a pre-IPO SaaS company, is the global leader in technology driven tax solutions. Founded in 2016, we stand at the intersection of human and machine intelligence. We have offices in New York, Florida, London and remote employees across the United States and Argentina.  Through our AI and cloud-based technologies, we deliver superior solutions to our corporate tax customers. As a result, Savant Venture Fund and Insight Partners have invested over $100 million in funding to support our growth.

We are committed to promoting the values of diversity and inclusion throughout the business. Whether it is through recruitment, retention, career progression or training and development, we are committed to improving opportunities for our people regardless of their background or circumstances.