Curation module in action - preliminary findings on VLO metadata quality
Numerous problems and suggestions have been reported on the issues of metadata aggregation for VLO (Virtual Language Observatory), one of the core services of CLARIN, over the last years. In response to them, we have developed a metadata curation module which is capable of assembling and reporting a wide range of statistics about CMD (Component Metadata) records, collections, and profiles in the aim of monitoring the issues of metadata quality in VLO. In this paper, we present its on-going development and preliminary findings. With an easy-to-use interactive interface and scoring system, the module has successfully demonstrated to visualise the current state of the VLO. Our first set of analysis outlines unprecedented views on the quality of CMD metadata. We have also identified future works including the user interface, usability, input methods, and the calibration of scoring algorithm. We strongly believe that the curation module has a potential to openly and collectively check and improve the metadata, fostering the comprehensive analysis and assessment of metadata quality to support CMDI and VLO in the long run.