Using the tool Observable to ensure data quality in institutional repository​
Conference poster, 2023

We have managed to speed up data cleaning 400 % by developing a tool in the platform Observable that compares data via API and finds deviations between databases. We’ve looked at one metadata field at a time, mapping what types of deviations that occurred and how they could be handled most efficiently. Most corrections can be automated, some need human verification. The tool gives us a better overview of how many deviations we have of each type; they are so much easier to correct, and it is also designed based on gamification so that we constantly can see the progress, it very motivating to see the numbers decrease and to know that we are delivering higher quality data to our users and the national Swedish library. Its transparent and open for all.

CRIS

processes

Observable

data quality

Open Repositories

Author

Jessica Byström

Chalmers, Communication and Learning in Science, Information Resources and Scientific Publishing

Cecilia Granell

Chalmers, Communication and Learning in Science, Research support, bibliometrics and ranking

18th International Conference on Open Repositories 2023
Stellenbosch, South Africa,

Subject Categories

Other Computer and Information Science

More information

Latest update

1/29/2024