Tutorials

To get familiar with Dedupe.io's features and advanced capabilities, our tutorials and documentation.


Intro to Dedupe.io

Intro to Dedupe.io

30 minutes

Dedupe.io is a a software as a service platform for quickly and accurately identifying clusters of similar records across one or more files or databases. In this tutorial, we will go over how to de-duplicate your first dataset using Dedupe.io.

Merging and matching multiple datasets

Merging and matching multiple datasets

20 minutes

In this tutorial, we will go over how to merge or find matches across multiple datasets using Dedupe.io.


Documentation

Deep dives into how Dedupe.io works and advanced settings.

How it works

How it works

Using advanced machine learning and statistics, Dedupe.io learns the best way to identify similar records in any dataset. Learn the specifics of our research-driven approach to record matching and entity resolution.

Formatting files for upload

Formatting files for upload

Instructions and tips on formatting and processing files for upload.

Field comparators

Field comparators

Dedupe.io can compare your fields in different ways depending on the makeup of the data.