Dedupe.io was shut down Jan 31, 2023.
The Dedupe.io team has decided to dedicate our focus to our consulting practice at DataMade and work on projects more aligned with our mission to support our clients in working toward democracy, justice, and equity.
We are continuing our consulting practice around the open source dedupe library and would be happy to consult with you on setting up a solution based on it. Contact us to get started >
To get familiar with Dedupe.io's features and advanced capabilities, our tutorials and documentation.
Dedupe.io is a software as a service platform for quickly and accurately identifying clusters of similar records across one or more files or databases. In this tutorial, we will go over how to de-duplicate your first dataset using Dedupe.io.
In this tutorial, we will go over how to merge or find matches across multiple datasets using Dedupe.io.
Deep dives into how Dedupe.io works and advanced settings.
Using advanced machine learning and statistics, Dedupe.io learns the best way to identify similar records in any dataset. Learn the specifics of our research-driven approach to record matching and entity resolution.
Instructions and tips on formatting and processing files for upload.
Dedupe.io can compare your fields in different ways depending on the makeup of the data.
While you can use either Dedupe.io or the dedupe library to de-duplicate or link your data, there are some important differences to note when choosing which one to use.
Frequently asked questions (and answers) from Dedupe.io users.
Guides for how to download and use your results from Dedupe.io.