On Jan 31, 2023, Dedupe.io will be shut down. After this date, users will not be able to log in or use our service, and all uploaded user data will be deleted. If you have active projects or project data on the Dedupe.io platform, please download it ahead of this date.
The Dedupe.io team has decided to dedicate our focus to our consulting practice at DataMade and work on projects more aligned with our mission to support our clients in working toward democracy, justice, and equity.
We are continuing our consulting practice around the open source dedupe library and would be happy to consult with you on setting up a solution based on it. Contact us to get started.
This supplementary release, from the summer of 2019, provides new results from linking UMETRICS employee transaction records to ProQuest dissertation data, with a focus on dissertation subjects.
Using the Python package dedupe, the 244,023 unique publications were condensed to one row per author, each row combining thesis title and subject information, for a final n of 242,316.
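The condensing step described above can be illustrated with a minimal sketch. It assumes a clustering step (for example, one built on the dedupe package) has already assigned each publication a `cluster_id` identifying its author. The field names, the sample records, and the helper `collapse_to_author_rows` are hypothetical and are not taken from the release itself; the sketch only shows how rows sharing a cluster might be merged into one row with combined titles and subjects.

```python
from collections import defaultdict


def collapse_to_author_rows(records):
    """Collapse publication records into one row per author cluster.

    Assumes each record carries a hypothetical `cluster_id` field that was
    assigned by an earlier deduplication/clustering step.
    """
    grouped = defaultdict(lambda: {"author": None, "titles": [], "subjects": []})
    for rec in records:
        row = grouped[rec["cluster_id"]]
        # Keep the first author spelling seen for the cluster.
        if row["author"] is None:
            row["author"] = rec["author"]
        # Combine titles and subjects, skipping exact repeats.
        if rec["title"] not in row["titles"]:
            row["titles"].append(rec["title"])
        if rec["subject"] not in row["subjects"]:
            row["subjects"].append(rec["subject"])
    return [
        {
            "cluster_id": cluster_id,
            "author": row["author"],
            "titles": "; ".join(row["titles"]),
            "subjects": "; ".join(row["subjects"]),
        }
        for cluster_id, row in grouped.items()
    ]


# Made-up sample data: two records for one author, one for another.
publications = [
    {"cluster_id": 0, "author": "Jane Q. Doe", "title": "Thesis A", "subject": "Chemistry"},
    {"cluster_id": 0, "author": "Jane Doe", "title": "Thesis A (revised)", "subject": "Chemistry"},
    {"cluster_id": 1, "author": "John Roe", "title": "Thesis B", "subject": "Economics"},
]
author_rows = collapse_to_author_rows(publications)
```

In this sketch the row count drops from three publications to two author rows, which mirrors in miniature how the number of rows shrinks when publications are grouped by author.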