Results from projects looking to enhance use of integrated data, including data (if any), programs developed, linked out to publications where possible. This could include programming for some data sets that are deposited elsewhere for secondary use (e.g., IPUMS, NAPP).
Data for experimentation and testing, particularly data with personal identifying information is the other core of the repository. Having real datasets available for comparisons of linkage methodology will be very valuable for refining methodology and assessing the tradeoffs between quality, usefulness and privacy.
Bibliography of data linkage articles encourages researchers to include related publications and will support the addition of related publications from third party users. These entries will be included in ICPSR’s Bibliography of Related Literature.
Engagement will improve methodology by bringing together disciplines and categories of data and linkage strategies so that researchers and students can learn from each other. The functionality for this will be built by ICPSR’s technology team, leveraging the Archonnex platform that supports OpenICPSR. Participants will be able to add comments and ask questions, allowing for engagement between data custodians, providers, producers, holders and data users, as well as between various groups of data users. In addition, data users will be able to upload and share code snippets related to the data, allowing for knowledge sharing and better reproducibility. Data users will also be able to link related publications and citations via DOI import or by manually entering citation information, providing feedback to data providers and other data users regarding how these data were used. In order to participate in the repository community, researchers will register their ICPSR user id (known as a MyData account) as a verified participant in this linkage repository. This will allow them to contribute their own study materials to LinkageLibrary, if they wish, as well as contribute to a conversation (commenting, sharing documents, citations, data) on other linkage studies, share their code for other studies, etc. This will encourage trust and responsibility in the management of functionality such as crowdsourcing comments and code improvements.
Funding Source: NSF
Social Science Domains