About Pachyderm:
Pachyderm is the leader in data versioning and pipelines for MLOps. We’re building the data foundation that allows data science teams to automate and scale their machine learning lifecycle while guaranteeing reproducibility. With over $40 million in three rounds of funding from leading investors like Benchmark, Microsoft M12, and Y Combinator, Pachyderm is committed to building industrial-strength capabilities for Data Centric AI. Pachyderm offers a commercial Enterprise Edition and an open source Community Edition. Pachyderm helps customers get their ML and AI projects to market faster, lower data processing and storage costs, and support strict data governance requirements.
Pachyderm is growing fast but is still small, so joining means getting in on the ground floor and having an enormous impact on the success and direction of the company and product. Pachyderm has always embraced, and will always embrace, a “Remote-first” approach to growing our team. This allows us to hire a diverse group of individuals across the country (and world!) while giving our team members the flexibility to work from anywhere. Being a member of The Pach means joining a supportive team that cares about you, values kindness, and works hard to create an open and transparent workplace.
The Role
On the Integrations team, we focus on connecting Pachyderm to other tools in the ML/Data Science/Data Engineering ecosystem. We've built:
Connectors to other pipeline systems, so that you can put your data in Pachyderm and re-use your existing pipelines, but still get the benefits of reproducibility, provenance, and incrementality
Tools for automatically deploying ML models, as well as tools for explainability (why did my model produce this inference?) and drift detection. Pachyderm's versioning can ensure that all production services derived from a single training data set run together, with everything based on the same version
Tools for mounting Pachyderm data into JupyterLab, exploring it, and then pushing the resulting code back into Pachyderm as a pipeline, all to dramatically reduce data scientists' idea-to-production latency and boost their productivity
On our team, you'll be building extensions and plugins for tools like Jupyter and VS Code, which means you'll get to explore the wide variety of frameworks and libraries used by these partner products. You'll also be building UIs for new ML tools, sometimes with little precedent in the industry.
While your primary focus will be building the product, you’ll also have direct exposure to users and enterprise customers via our open source support channels. At Pachyderm, open source user and customer feedback is a major driver of our product roadmap and we believe that everyone within the company should experience that first-hand.
The long and short of it is, if you're looking to make a big impact on a small team that works on open source software and delivers an enterprise-grade product, then this role is for you. You can check out our product on GitHub.
We offer significant equity, full benefits, and all the usual startup perks.
Qualifications
Benefits:
We can’t wait to meet you and hope you’ll join our PACH!