Data Engineer (Spark/Python)

Posted:
9/26/2024, 1:38:37 PM

Location(s):
San Francisco, California, United States ⋅ California, United States

Experience Level(s):
Junior ⋅ Mid Level ⋅ Senior

Field(s):
Data & Analytics

Workplace Type:
On-site

About

Fintool is an AI Equity Research Copilot for institutional investors. It’s a LLM on top of financial documents, starting with SEC filings. Fintool is engineered to discover financial insights beyond the reach of timely human analysis. We are building the AI Warren Buffett.

We are on the fastest growing LLM vertical applications . Thousands of investors signed up for Fintool. Fintool is backed by Y Combinator and entrepreneurs such as the co-founders of Datadog, Vercel, HuggingFace, or domain experts from OpenAI to Deepmind.

Team

Nicolas Bustamante: spent 7 years building one of the largest AI-driven legal search engines (Bloomberg for lawyers). Nicolas hired nearly 200 people, secured millions of dollars in debt and equity funding, and the profitable business was successfully acquired by Summit Partners, a $43B billion growth equity fund, for $x00M+

Edouard Godfrey: worked for 9 years at Apple, leading teams of data scientists and engineers. He worked on Apple Search (Spotlight) and Apple Pay, maintaining big data pipelines and deploying cutting-edge AI models. He received the 2019 Apple Pay Innovation Award for outstanding contributions and fresh insights.

Our philosophy

Small team: small in-person teams outperform large and well-funded companies. When people visit our office, they should be surprised by how few people we are.

Ship code: we avoid meetings, PM jargon to release early, release often, and listen to customers.

In-person: we believe high-performing teams do their best work, build long-term relationships, and have the most fun in person.

Company Values

Clone and improve the best: we're not about reinventing the wheel but about enhancing proven success. We are shameless cloners who stand on the shoulders of giants. We draw inspiration and then create differentiation because distinctiveness drives dominance.

Release early, release often, and listen to your customers: speed matters in business, so we push better-than-perfect updates for customers asap. Mastery comes from repeated experiments and learning from mistakes rather than putting in a set number of hours. It’s 10,000 iterations, not 10,000 hours.

Warren Buffett: We model our personal and professional ethos on the principles he exemplifies. Upholding integrity, valuing honesty, practicing frugality, championing lifelong learning, embracing humility, extending generosity, applying rationality, and demonstrating patience. Every day, we strive to mirror these Buffett-inspired virtues.

Job Desc

We are building real-time data pipelines for millions of unstructured financial documents to feed our financial LLM. You will build real-time data pipelines to process millions of financial docs, and build ML-based parsers to chunk and tag intelligently to index into our Elastic that will then feed our LLM. It’s cutting-edge data engineering at the AI frontier.

Requirements: Spark/Databricks, Python, Postgres and LLM. Knowing Elastic, Next.js, and TypeScript, web crawling is a plus.

Experience: 3+ years of deploying production code at a company with a large infrastructure.

Location: San Francisco (no remote)

Contract: Full-time

Apply via the form below 👇

Fintool

Website: https://www.fintool.com/

Headquarter Location: San Francisco, California, United States

Employee Count: 1-10

Year Founded: 2023

IPO Status: Private

Last Funding Type: Pre-Seed

Industries: Analytics ⋅ Artificial Intelligence (AI) ⋅ Finance ⋅ SaaS ⋅ Search Engine