Dedupe.io was shut down Jan 31, 2023.
The Dedupe.io team has decided to focus on our consulting practice at DataMade and on projects more closely aligned with our mission of supporting our clients in working toward democracy, justice, and equity.
We are continuing our consulting practice around the open-source dedupe library and would be happy to consult with you on setting up a solution based on it. Contact us to get started.
This supplementary release, from the summer of 2019, provides new results from linking UMETRICS employee transaction records to ProQuest dissertation data, with a focus on dissertation subjects.
Using the Python package dedupe, the 244,023 unique publications were condensed to one row per author, combining thesis title and subject information, for a final n of 242,316.
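The condensing step described above can be illustrated with a minimal sketch. It is not the release's actual code: it assumes a deduplication step has already assigned each publication record a cluster ID, and the field names (`cluster_id`, `author`, `title`, `subjects`) and the sample records are hypothetical. It shows only how records in the same cluster could be merged into one row per author with combined title and subject information.

```python
from collections import defaultdict


def condense_by_author(records):
    """Collapse records that share a cluster_id into one row per author.

    Assumes each record is a dict with the hypothetical keys
    'cluster_id', 'author', 'title', and 'subjects' (a list of strings).
    """
    # Group records by the cluster ID assigned during deduplication.
    grouped = defaultdict(list)
    for rec in records:
        grouped[rec["cluster_id"]].append(rec)

    rows = []
    for cluster_id, members in grouped.items():
        titles = []
        subjects = []
        for member in members:
            # Keep each distinct title and subject once, in first-seen order.
            if member["title"] not in titles:
                titles.append(member["title"])
            for subject in member["subjects"]:
                if subject not in subjects:
                    subjects.append(subject)
        rows.append({
            "cluster_id": cluster_id,
            # Use the first author spelling seen as the representative name.
            "author": members[0]["author"],
            "titles": "; ".join(titles),
            "subjects": "; ".join(subjects),
        })
    return rows


# Made-up sample data for illustration only.
sample = [
    {"cluster_id": 0, "author": "J. Smith", "title": "Thesis A",
     "subjects": ["Chemistry"]},
    {"cluster_id": 0, "author": "John Smith", "title": "Thesis A (revised)",
     "subjects": ["Chemistry", "Physics"]},
    {"cluster_id": 1, "author": "M. Lee", "title": "Thesis B",
     "subjects": ["Biology"]},
]

condensed = condense_by_author(sample)
for row in condensed:
    print(row)
```

In this sketch the three sample records collapse to two rows, which mirrors how the row count drops once publications attributed to the same author are merged.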