« Dedupe in action

Project
A Scaleable Approach To Emissions-Innovation Record Linkage
By
Brugel Working Paper
Author(s)
Mark Huberty, Amma Serwaah, Georg Zachmann
Link
http://bruegel.org/wp-content/uploads/imported/publications/WP_2014_10ii.pdf
Published
June 2013
Tool used
dedupe python library
A Scaleable Approach To Emissions-Innovation Record Linkage

A paper that reports an approach to linking data on European emitters to data on their innovation practices. They illustrate a straightforward approach to record linkage between the European Union Community Integrated Transaction Log (CITL) and the PATSTAT international patent database. We show how that record linkage can be maintained with relatively minimal human input.