Data Scientist

Posted:
5/1/2026, 10:35:14 AM

Location(s):
San Francisco, California, United States ⋅ California, United States

Experience Level(s):
Mid Level

Field(s):
AI & Machine Learning ⋅ Data & Analytics

About Middesk

Middesk makes it easier for businesses to work together. Since 2018, we’ve been transforming business identity verification, replacing slow, manual processes with seamless access to complete, up-to-date data. Our platform helps companies across industries confidently verify business identities, onboard customers faster, and reduce risk at every stage of the customer lifecycle.

Middesk came out of Y Combinator, is backed by Sequoia Capital and Accel Partners, and was recently named to Forbes Fintech 50 List.

The Role

We’re building AI-driven applications that simplify customer workflows, starting with business onboarding. With our proprietary identity data and deep domain expertise, we’re in a strong position to expand into a broader set of intelligent, risk-aware products.

We’re looking for a hands-on engineer to help build the foundation for these systems. This role is less about inventing new ML algorithms and more about applying the right techniques to messy, real-world problems. You’ve worked in fraud, risk, or trust domains, and you understand how bad actors behave, how data breaks, and how to still ship reliable systems anyway.

This is a highly technical, hands-on role with broad influence over how we design, build, and scale data-driven systems at Middesk.

What You’ll Do

Build fraud & risk systems
Design and ship production systems that detect and prevent fraud across KYB, trust & safety, and compliance workflows.
Work with messy, real-world data
Tackle problems with extreme class imbalance, sparse signals, evolving adversarial behavior, and limited ground truth.
Leverage relationships in data
Apply graph-based approaches and entity resolution techniques to uncover hidden connections and improve risk detection.
Improve signal & labeling
Use a mix of heuristics, weak supervision, and modern AI tools (including LLMs where appropriate) to generate better features and labels.
Help scale our infrastructure
Partner with engineering to build and evolve systems for feature generation, model training, and production deployment across multiple use cases.

What We’re Looking For

4+ years of experience in fraud, risk, or trust & safety
You’ve worked on real-world fraud or abuse problems and understand the domain deeply.
Experience building and shipping production systems
You’ve deployed models or data-driven systems that power external-facing products.
Strong foundation in applied ML or data systems
Comfortable working on classification problems with real-world constraints like imbalanced data, sparse signals, and changing patterns.
Experience with graph or relational data approaches
Familiarity with knowledge graphs, network analysis, or entity linking is strongly preferred.
Hands-on and pragmatic
You focus on impact over perfection and know how to balance speed, accuracy, and maintainability.

Middesk

Website: https://www.middesk.com/

Headquarter Location: San Francisco, California, United States

Employee Count: 51-100

Year Founded: 2018

IPO Status: Private

Last Funding Type: Series B

Industries: Business Development ⋅ Enterprise Resource Planning (ERP) ⋅ Enterprise Software ⋅ FinTech ⋅ Information Technology ⋅ Risk Management

Data Engineer

Booz Allen • 5/13/2026 ⋅ United States

AI Chip Design Engineer - New College Grad 2026

NVIDIA • 3/30/2026 ⋅ United States

Manager II, Machine Learning-Search

Pinterest • 1/6/2026 ⋅ United States

Data Engineer, Smart Factory Solutions

Magna • 2/2/2026 ⋅ United States

AI Technical Training Content Developer

Nasdaq • 6/1/2026 ⋅ Lithuania

Notify

postings

pricing

login