Data Scientist

Posted:
5/1/2026, 10:35:14 AM

Location(s):
San Francisco, California, United States ⋅ California, United States

Experience Level(s):
Mid Level

Field(s):
AI & Machine Learning ⋅ Data & Analytics

About Middesk

Middesk makes it easier for businesses to work together. Since 2018, we’ve been transforming business identity verification, replacing slow, manual processes with seamless access to complete, up-to-date data. Our platform helps companies across industries confidently verify business identities, onboard customers faster, and reduce risk at every stage of the customer lifecycle.

Middesk came out of Y Combinator, is backed by Sequoia Capital and Accel Partners, and was recently named to Forbes Fintech 50 List.

The Role

We’re building AI-driven applications that simplify customer workflows, starting with business onboarding. With our proprietary identity data and deep domain expertise, we’re in a strong position to expand into a broader set of intelligent, risk-aware products.

We’re looking for a hands-on engineer to help build the foundation for these systems. This role is less about inventing new ML algorithms and more about applying the right techniques to messy, real-world problems. You’ve worked in fraud, risk, or trust domains, and you understand how bad actors behave, how data breaks, and how to still ship reliable systems anyway.

This is a highly technical, hands-on role with broad influence over how we design, build, and scale data-driven systems at Middesk.

What You’ll Do

  • Build fraud & risk systems
    Design and ship production systems that detect and prevent fraud across KYB, trust & safety, and compliance workflows.

  • Work with messy, real-world data
    Tackle problems with extreme class imbalance, sparse signals, evolving adversarial behavior, and limited ground truth.

  • Leverage relationships in data
    Apply graph-based approaches and entity resolution techniques to uncover hidden connections and improve risk detection.

  • Improve signal & labeling
    Use a mix of heuristics, weak supervision, and modern AI tools (including LLMs where appropriate) to generate better features and labels.

  • Help scale our infrastructure
    Partner with engineering to build and evolve systems for feature generation, model training, and production deployment across multiple use cases.

What We’re Looking For

  • 4+ years of experience in fraud, risk, or trust & safety
    You’ve worked on real-world fraud or abuse problems and understand the domain deeply.

  • Experience building and shipping production systems
    You’ve deployed models or data-driven systems that power external-facing products.

  • Strong foundation in applied ML or data systems
    Comfortable working on classification problems with real-world constraints like imbalanced data, sparse signals, and changing patterns.

  • Experience with graph or relational data approaches
    Familiarity with knowledge graphs, network analysis, or entity linking is strongly preferred.

  • Hands-on and pragmatic
    You focus on impact over perfection and know how to balance speed, accuracy, and maintainability.

Middesk

Website: https://www.middesk.com/

Headquarter Location: San Francisco, California, United States

Employee Count: 51-100

Year Founded: 2018

IPO Status: Private

Last Funding Type: Series B

Industries: Business Development ⋅ Enterprise Resource Planning (ERP) ⋅ Enterprise Software ⋅ FinTech ⋅ Information Technology ⋅ Risk Management