Working with results


Guides to downloading and using your results from Dedupe.io.

Downloaded results from Dedupe.io

Dedupe.io provides data downloads as a ZIP archive containing one CSV file for each dataset in your project. Each CSV contains your original dataset plus an additional cluster_id column; rows that share a cluster_id value belong to the same cluster.
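As a rough illustration of this layout, the following Python sketch reads every CSV in an archive and groups its rows by cluster_id. The file name customers.csv, the name and city columns, and the helper read_clusters are hypothetical; only the cluster_id column comes from the description above. The sample archive is built in memory so the sketch runs on its own, and it assumes the files are UTF-8 encoded.

```python
import csv
import io
import zipfile
from collections import defaultdict


def read_clusters(zip_bytes):
    """Return {csv_name: {cluster_id: [row, ...]}} for every CSV in the archive."""
    result = {}
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in archive.namelist():
            if not name.lower().endswith(".csv"):
                continue  # skip anything in the archive that is not a CSV
            clusters = defaultdict(list)
            with archive.open(name) as raw:
                # Assumption: the CSV files are UTF-8 encoded.
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                for row in csv.DictReader(text):
                    clusters[row["cluster_id"]].append(row)
            result[name] = dict(clusters)
    return result


# Build a tiny in-memory archive standing in for a real download (made-up data).
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr(
        "customers.csv",
        "name,city,cluster_id\n"
        "Acme Inc,Chicago,1\n"
        "ACME Incorporated,Chicago,1\n"
        "Bolt LLC,Denver,2\n",
    )

clusters_by_file = read_clusters(buffer.getvalue())
for cluster_id, rows in clusters_by_file["customers.csv"].items():
    print(cluster_id, [row["name"] for row in rows])
```

With a real download, you would pass the bytes of the ZIP file you saved instead of the in-memory sample.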

The following are a few common operations that users perform after downloading their results.

Create a pivot table

Users of spreadsheet apps like Microsoft Excel and Google Sheets can create a pivot table on the cluster_id column and choose which columns they'd like to view for each cluster.

Read more about pivot tables in Microsoft Excel »

Read more about pivot tables in Google Sheets »
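If you prefer a script to a spreadsheet, a pivot-style summary can be approximated in a few lines of Python. This is only a sketch: the sample data, the name column, and the helper pivot_by_cluster are hypothetical, and it reports just a record count and the distinct values of one chosen column per cluster.

```python
import csv
import io
from collections import defaultdict

# Made-up sample standing in for one downloaded CSV.
SAMPLE_CSV = """name,city,cluster_id
Acme Inc,Chicago,1
ACME Incorporated,Chicago,1
Bolt LLC,Denver,2
"""


def pivot_by_cluster(rows, value_column):
    """Group rows by cluster_id; report the record count and distinct values of one column."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["cluster_id"]].append(row[value_column])
    return {
        cluster_id: {"count": len(values), "values": sorted(set(values))}
        for cluster_id, values in groups.items()
    }


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
summary = pivot_by_cluster(rows, "name")
for cluster_id, info in summary.items():
    print(cluster_id, info["count"], info["values"])
```

Clusters with a count greater than one are the ones that contain more than one record.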

Merge tables in a spreadsheet app

Users of spreadsheet apps can merge multiple tables on their common cluster_id column. Microsoft Excel users can use merge queries to join multiple tables, while Google Sheets users can install the third-party add-on Merge Sheets.

Read more about merge queries in Excel »

Read more about the Merge Sheets add-on in Google Sheets »
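The same kind of merge can be sketched outside a spreadsheet. The Python example below joins two tables on cluster_id and keeps only the rows whose cluster_id appears in both. The two sample datasets, their column names, and the helper merge_on_cluster_id are hypothetical.

```python
import csv
import io

# Made-up samples standing in for two downloaded CSV files.
CUSTOMERS_CSV = "name,cluster_id\nAcme Inc,1\nBolt LLC,2\n"
VENDORS_CSV = "vendor_name,cluster_id\nACME Incorporated,1\nCrane Co,3\n"


def merge_on_cluster_id(left_rows, right_rows):
    """Inner-join two lists of dict rows on their shared cluster_id value."""
    # Index the right-hand table by cluster_id for quick lookups.
    right_by_cluster = {}
    for row in right_rows:
        right_by_cluster.setdefault(row["cluster_id"], []).append(row)

    merged = []
    for left in left_rows:
        for right in right_by_cluster.get(left["cluster_id"], []):
            # Combine the columns of both rows into one record.
            merged.append({**left, **right})
    return merged


customers = list(csv.DictReader(io.StringIO(CUSTOMERS_CSV)))
vendors = list(csv.DictReader(io.StringIO(VENDORS_CSV)))
merged = merge_on_cluster_id(customers, vendors)
for row in merged:
    print(row)
```

Rows without a match in the other table are dropped here. If two tables share a column name other than cluster_id, the right-hand value overwrites the left-hand one, so you may want to rename such columns first.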

Join tables in a SQL database

Users of SQL databases like PostgreSQL and MySQL can import the files into tables and join them with a SQL JOIN operation on the cluster_id column.

Read more about SQL joins »
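To show the shape of such a join, here is a small sketch that uses Python's built-in sqlite3 module as a stand-in for PostgreSQL or MySQL. The table names, sample data, and the helper load_csv are hypothetical, and every column is imported as text. The SELECT statement itself is the kind of query you would run in your own database.

```python
import csv
import io
import sqlite3

# Made-up samples standing in for two downloaded CSV files.
CUSTOMERS_CSV = "name,cluster_id\nAcme Inc,1\nBolt LLC,2\n"
VENDORS_CSV = "vendor_name,cluster_id\nACME Incorporated,1\nCrane Co,3\n"


def load_csv(connection, table, csv_text):
    """Create a table with one TEXT column per CSV header and insert every row."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    columns = list(rows[0].keys())
    column_sql = ", ".join(f'"{column}" TEXT' for column in columns)
    connection.execute(f'CREATE TABLE "{table}" ({column_sql})')
    placeholders = ", ".join("?" for _ in columns)
    connection.executemany(
        f'INSERT INTO "{table}" VALUES ({placeholders})',
        [tuple(row[column] for column in columns) for row in rows],
    )


connection = sqlite3.connect(":memory:")
load_csv(connection, "customers", CUSTOMERS_CSV)
load_csv(connection, "vendors", VENDORS_CSV)

# Join the two tables on the shared cluster_id column.
matches = connection.execute(
    """
    SELECT c.cluster_id, c.name, v.vendor_name
    FROM customers AS c
    JOIN vendors AS v ON v.cluster_id = c.cluster_id
    ORDER BY c.cluster_id
    """
).fetchall()

for match in matches:
    print(match)
connection.close()
```

Production databases usually have their own bulk import tools for CSV files, which are likely a better fit than row-by-row inserts for large datasets.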

Still not sure how to work with your data? Contact us for help.