RDF Ontology (Re-)Engineering through Large-scale Data Mining
en-de
en-es
en-fr
en-pt
en-sl
en
en-zh
0.25
0.5
0.75
1.25
1.5
1.75
2
As Linked Open Data originates from various sources, lever- aging well-defined ontologies aids integration. However, oftentimes the utilization of RDF vocabularies by data publishers differs from the in- tended application envisioned by ontology engineers. Especially in large- scale datasets as presented in the Billion Triple Challenge a significant divergence between vocabulary specification and usage patterns can be observed. This may impede the goals of the Web of Data in terms of dis- covering domain-specific information in the Semantic Web. In this work, we identify common misusage patterns by employing frequency analysis and rule mining and propose re-engineering suggestions.