Visualization of the ontology, following the notation of the Chowlk Visual Notation [Chávez-Feria, et al., 2021].
The classes and properties that are from external ontologies, are highlighted with different colors.
This part introduces the anatomy of knowledge cleaning, outlining and connecting key aspects relevant to cleaning knowledge graphs. It addresses the first research question (see Section 1.1).
These aspects include cleaning techniques, which define the specific strategies employed by different approaches. The techniques depend on various forms of background knowledge for effective processing, and this section introduces the different types involved.
In addition, four sets of dimensions are discussed—each set characterizes particular properties of cleaning approaches and techniques.
This section introduces various dimensions, i.e., characteristics, of knowledge graph cleaning approaches. Each dimension has one or more variations, and individual approaches are not limited to a single variation.
For instance, hybrid approaches often combine different techniques and dimensions to achieve a more robust and adaptable cleaning workflow. Such approaches may, for example, incorporate both internal and external methods.
Error detection and correction rely on background knowledge, which provides essential context for identifying or resolving errors. This knowledge can come from different parts or features of a knowledge graph itself (i.e., internal background knowledge) or from external supplementary sources (i.e., external background knowledge).
We distinguish seven types of background knowledge:
Approaches use different cleaning techniques to detect and correct errors. Each technique is characterized by a methodology focusing on specific insights and requirements. It leverages background knowledge to address the cleaning task. Each technique and related approaches are described, using an example knowledge graph, figures, and tables to visualize key insights.
We distinguish between eleven cleaning techniques:
akco | <https://purl.archive.org/akco> |
cube | <http://purl.org/linked-data/cube#> |
dc | <http://purl.org/dc/elements/1.1/> |
dqv | <http://www.w3.org/ns/dqv#> |
owl | <http://www.w3.org/2002/07/owl#> |
rdf | <http://www.w3.org/1999/02/22-rdf-syntax-ns#> |
rdfs | <http://www.w3.org/2000/01/rdf-schema#> |
skos | <http://www.w3.org/2004/02/skos/core#> |
terms | <http://purl.org/dc/terms/> |
vann | <http://purl.org/vocab/vann/> |
xml | <http://www.w3.org/XML/1998/namespace> |
xsd | <http://www.w3.org/2001/XMLSchema#> |
Visualization of the ontology, following the notation of the Chowlk Visual Notation [Chávez-Feria, et al., 2021].
The classes and properties that are from external ontologies, are highlighted with different colors.
This section describes how the defined concepts and relations are used to (1) represent relevant aspects, and to (2) answer relevant questions to knowledge cleaning.
Select an aspect to represent, using the AKCO
Select an example from the left.
IRI: https://purl.archive.org/akco#Approach
IRI: https://purl.archive.org/akco#BackgroundKnowledge
IRI: http://www.w3.org/2004/02/skos/core#Concept
IRI: http://purl.org/linked-data/cube#DataSet
IRI: https://purl.archive.org/akco#Dimension
IRI: https://purl.archive.org/akco#Error
IRI: https://purl.archive.org/akco#ExpertKnowledge
IRI: https://purl.archive.org/akco#ExternalSource
IRI: https://purl.archive.org/akco#KnowledgeGraph
IRI: http://www.w3.org/ns/dqv#QualityMeasurement
IRI: https://purl.archive.org/akco#Technique
IRI: http://www.w3.org/2004/02/skos/core#broader
IRI: http://www.w3.org/2004/02/skos/core#narrower
IRI: http://www.w3.org/ns/dqv#hasQualityMeasurement
IRI: https://purl.archive.org/akco#hasDimension
IRI: https://purl.archive.org/akco#hasError
IRI: https://purl.archive.org/akco#isPublished
IRI: https://purl.archive.org/akco#targetsError
IRI: https://purl.archive.org/akco#usesBackgroundKnowledge
IRI: https://purl.archive.org/akco#usesTechnique
IRI: https://purl.archive.org/akco#errorNature
IRI: https://purl.archive.org/akco#errorSource
IRI: https://purl.archive.org/akco#errorType
IRI: http://purl.org/dc/terms/bibliographicCitation
IRI: http://purl.org/dc/elements/1.1/creator
IRI: http://purl.org/dc/elements/1.1/date
IRI: http://purl.org/dc/terms/issued
IRI: http://www.w3.org/2004/02/skos/core#definition
IRI: http://purl.org/dc/terms/description
IRI: http://purl.org/dc/elements/1.1/language
IRI: http://purl.org/dc/terms/language
IRI: http://purl.org/dc/terms/license
IRI: http://purl.org/dc/terms/modified
IRI: http://www.w3.org/2004/02/skos/core#prefLabel
IRI: http://purl.org/vocab/vann/preferredNamespacePrefix
IRI: http://purl.org/vocab/vann/preferredNamespaceUri
IRI: http://www.w3.org/2004/02/skos/core#scopeNote
IRI: http://purl.org/dc/elements/1.1/source
IRI: http://purl.org/dc/elements/1.1/title
IRI: http://purl.org/dc/terms/title
IRI: http://purl.org/dc/elements/1.1/type
IRI: http://purl.org/dc/terms/type
IRI: https://purl.archive.org/akco#appADianReFrfoDLKB
IRI: https://purl.archive.org/akco#appAPE
IRI: https://purl.archive.org/akco#appAPrInToImLiDaQuUsDiOuDe
IRI: https://purl.archive.org/akco#appAttributE
IRI: https://purl.archive.org/akco#appBioGRER
IRI: https://purl.archive.org/akco#appCEDAL
IRI: https://purl.archive.org/akco#appChanHaInofDB
IRI: https://purl.archive.org/akco#appCoCKG
IRI: https://purl.archive.org/akco#appCOLIBRI
IRI: https://purl.archive.org/akco#appCOPAAL
IRI: https://purl.archive.org/akco#appCorhist
IRI: https://purl.archive.org/akco#appDBOnEnfoInDe
IRI: https://purl.archive.org/akco#appDeanCoTyErinOpKnGrUsSeReofEn
IRI: https://purl.archive.org/akco#appDecefitogr
IRI: https://purl.archive.org/akco#appDECIDE
IRI: https://purl.archive.org/akco#appDeerinnulidauscroude
IRI: https://purl.archive.org/akco#appDeFacto
IRI: https://purl.archive.org/akco#appDeInNuDainDB
IRI: https://purl.archive.org/akco#appDojuanenbyitnaEntyuslamo
IRI: https://purl.archive.org/akco#appExfakt
IRI: https://purl.archive.org/akco#appFachvievpa
IRI: https://purl.archive.org/akco#appFactCheck
IRI: https://purl.archive.org/akco#appFacTify
IRI: https://purl.archive.org/akco#appFACTY
IRI: https://purl.archive.org/akco#appFaVawiKnGrEm
IRI: https://purl.archive.org/akco#appFEA
IRI: https://purl.archive.org/akco#appGAT
IRI: https://purl.archive.org/akco#appGeneralCorrectionFramework
IRI: https://purl.archive.org/akco#appGltyfoliinrdtrwiaptoensu
IRI: https://purl.archive.org/akco#appGRR
IRI: https://purl.archive.org/akco#appHMGCN
IRI: https://purl.archive.org/akco#appHybridFC
IRI: https://purl.archive.org/akco#appIDLabTurtleValidator
IRI: https://purl.archive.org/akco#appIdwrlibedabymuoude
IRI: https://purl.archive.org/akco#appIncoinOW
IRI: https://purl.archive.org/akco#appIO
IRI: https://purl.archive.org/akco#appIOTW
IRI: https://purl.archive.org/akco#appJSAE
IRI: https://purl.archive.org/akco#appKD2R
IRI: https://purl.archive.org/akco#appKefogr
IRI: https://purl.archive.org/akco#appKGClean
IRI: https://purl.archive.org/akco#appKGTtm
IRI: https://purl.archive.org/akco#appKS
IRI: https://purl.archive.org/akco#appKV-rule
IRI: https://purl.archive.org/akco#appLediaxwiasrumianitaptoindeoflida
IRI: https://purl.archive.org/akco#appLeRiXt
IRI: https://purl.archive.org/akco#appLOD-Community-Detection
IRI: https://purl.archive.org/akco#appLodeofinSastinRDda
IRI: https://purl.archive.org/akco#appLODLaundromat
IRI: https://purl.archive.org/akco#appMTM
IRI: https://purl.archive.org/akco#appNotquitethesame
IRI: https://purl.archive.org/akco#appOGFC
IRI: https://purl.archive.org/akco#appOptimalABoxRepairw.r.t.StaticELTBoxes
IRI: https://purl.archive.org/akco#appPaTyBRED
IRI: https://purl.archive.org/akco#appPredPath
IRI: https://purl.archive.org/akco#appPrerdemofokngrre
IRI: https://purl.archive.org/akco#appRDD-Checker
IRI: https://purl.archive.org/akco#appRDFDoctor
IRI: https://purl.archive.org/akco#appRDFUnit
IRI: https://purl.archive.org/akco#apprdfvalidator
IRI: https://purl.archive.org/akco#appRDvase
IRI: https://purl.archive.org/akco#appReRaViinDB
IRI: https://purl.archive.org/akco#appRUGE
IRI: https://purl.archive.org/akco#appS3K
IRI: https://purl.archive.org/akco#appSDType
IRI: https://purl.archive.org/akco#appSDValidate
IRI: https://purl.archive.org/akco#appServDBpediaDOLCE
IRI: https://purl.archive.org/akco#appSiApCofoRDMo
IRI: https://purl.archive.org/akco#appSLCN
IRI: https://purl.archive.org/akco#appTISCO
IRI: https://purl.archive.org/akco#appToDeSePrfoRDusSPasInLa
IRI: https://purl.archive.org/akco#appTripleNet
IRI: https://purl.archive.org/akco#appTypingErrorsinFactualKnowledgeGraphs
IRI: https://purl.archive.org/akco#appTyPrfoEfCoReinHeSeGr
IRI: https://purl.archive.org/akco#appUsReofInKnBa
IRI: https://purl.archive.org/akco#appValidata
IRI: https://purl.archive.org/akco#appValidatrr
IRI: https://purl.archive.org/akco#appVeriGraph
IRI: https://purl.archive.org/akco#appVGFD
IRI: https://purl.archive.org/akco#appVRP
IRI: https://purl.archive.org/akco#appWhenowl
IRI: https://purl.archive.org/akco#appWhEvlidahewiaquthclupDB
IRI: https://purl.archive.org/akco#dimCorrection
IRI: https://purl.archive.org/akco#dimData-driven
IRI: https://purl.archive.org/akco#dimDetection
IRI: https://purl.archive.org/akco#dimExternal
IRI: https://purl.archive.org/akco#dimInternal
IRI: https://purl.archive.org/akco#dimKnowledge-driven
IRI: https://purl.archive.org/akco#errESemSemWrong
IRI: https://purl.archive.org/akco#errESynNotPIdentID
IRI: https://purl.archive.org/akco#errESynNotPIdentIR
IRI: https://purl.archive.org/akco#errISemSemWrong
IRI: https://purl.archive.org/akco#errISemSynNotEType
IRI: https://purl.archive.org/akco#errISynNotPIdent
IRI: https://purl.archive.org/akco#errPVSemNotADomain
IRI: https://purl.archive.org/akco#errPVSemNotARange
IRI: https://purl.archive.org/akco#errPVSemSemWrong
IRI: https://purl.archive.org/akco#errPVSemSynNotEProperty
IRI: https://purl.archive.org/akco#errPVSynNotPIdentID
IRI: https://purl.archive.org/akco#errPVSynNotPIdentIR
IRI: https://purl.archive.org/akco#errSystematic
IRI: https://purl.archive.org/akco#errTailVertical
IRI: https://purl.archive.org/akco#extSrcAny
IRI: https://purl.archive.org/akco#extSrcDBpedia
IRI: https://purl.archive.org/akco#extSrcDomaInExpertise
IRI: https://purl.archive.org/akco#extSrcExternalSources
IRI: https://purl.archive.org/akco#extSrcFormalAndSyntaxSpecifications
IRI: https://purl.archive.org/akco#extSrcHornClauses
IRI: https://purl.archive.org/akco#extSrcKnowledgeGraph
IRI: https://purl.archive.org/akco#extSrcQueryLogs
IRI: https://purl.archive.org/akco#extSrcTextCorpus
IRI: https://purl.archive.org/akco#extSrcWeb
IRI: https://purl.archive.org/akco#extSrcWikipedia
IRI: https://purl.archive.org/akco#kGContextualGraph
IRI: https://purl.archive.org/akco#kGGraphStructure
IRI: https://purl.archive.org/akco#kGValuesOfPV
IRI: https://purl.archive.org/akco#publADianReFrfoDLKB
IRI: https://purl.archive.org/akco#publAPE
IRI: https://purl.archive.org/akco#publAPrInToImLiDaQuUsDiOuDe
IRI: https://purl.archive.org/akco#publAttributE
IRI: https://purl.archive.org/akco#publBioGRER
IRI: https://purl.archive.org/akco#publCEDAL
IRI: https://purl.archive.org/akco#publChanHaInofDB
IRI: https://purl.archive.org/akco#publCoCKG
IRI: https://purl.archive.org/akco#publCOLIBRI
IRI: https://purl.archive.org/akco#publCOPAAL
IRI: https://purl.archive.org/akco#publCorhist
IRI: https://purl.archive.org/akco#publDBOnEnfoInDe
IRI: https://purl.archive.org/akco#publDeanCoTyErinOpKnGrUsSeReofEn
IRI: https://purl.archive.org/akco#publDecefitogr
IRI: https://purl.archive.org/akco#publDECIDE
IRI: https://purl.archive.org/akco#publDeerinnulidauscroude
IRI: https://purl.archive.org/akco#publDeFacto
IRI: https://purl.archive.org/akco#publDeInNuDainDB
IRI: https://purl.archive.org/akco#publDojuanenbyitnaEntyuslamo
IRI: https://purl.archive.org/akco#publExfakt
IRI: https://purl.archive.org/akco#publFachvievpa
IRI: https://purl.archive.org/akco#publFactCheck
IRI: https://purl.archive.org/akco#publFacTify
IRI: https://purl.archive.org/akco#publFACTY
IRI: https://purl.archive.org/akco#publFaVawiKnGrEm
IRI: https://purl.archive.org/akco#publFEA
IRI: https://purl.archive.org/akco#publGAT
IRI: https://purl.archive.org/akco#publGeneralCorrectionFramework
IRI: https://purl.archive.org/akco#publGltyfoliinrdtrwiaptoensu
IRI: https://purl.archive.org/akco#publGRR
IRI: https://purl.archive.org/akco#publHMGCN
IRI: https://purl.archive.org/akco#publHybridFC
IRI: https://purl.archive.org/akco#publIDLabTurtleValidator
IRI: https://purl.archive.org/akco#publIdwrlibedabymuoude
IRI: https://purl.archive.org/akco#publIncoinOW
IRI: https://purl.archive.org/akco#publIO
IRI: https://purl.archive.org/akco#publIOTW
IRI: https://purl.archive.org/akco#publJSAE
IRI: https://purl.archive.org/akco#publKD2R
IRI: https://purl.archive.org/akco#publKefogr
IRI: https://purl.archive.org/akco#publKGClean
IRI: https://purl.archive.org/akco#publKGTtm
IRI: https://purl.archive.org/akco#publKS
IRI: https://purl.archive.org/akco#publKV-rule
IRI: https://purl.archive.org/akco#publLediaxwiasrumianitaptoindeoflida
IRI: https://purl.archive.org/akco#publLeRiXt
IRI: https://purl.archive.org/akco#publLOD-Community-Detection
IRI: https://purl.archive.org/akco#publLodeofinSastinRDda
IRI: https://purl.archive.org/akco#publLODLaundromat
IRI: https://purl.archive.org/akco#publMTM
IRI: https://purl.archive.org/akco#publNotquitethesame
IRI: https://purl.archive.org/akco#publOGFC
IRI: https://purl.archive.org/akco#publOptimalABoxRepairw.r.t.StaticELTBoxes
IRI: https://purl.archive.org/akco#publPredPath
IRI: https://purl.archive.org/akco#publPrerdemofokngrre
IRI: https://purl.archive.org/akco#publRDD-Checker
IRI: https://purl.archive.org/akco#publRDFDoctor
IRI: https://purl.archive.org/akco#publRDFUnit
IRI: https://purl.archive.org/akco#publrdfvalidator
IRI: https://purl.archive.org/akco#publRDvase
IRI: https://purl.archive.org/akco#publReRaViinDB
IRI: https://purl.archive.org/akco#publRUGE
IRI: https://purl.archive.org/akco#publS3K
IRI: https://purl.archive.org/akco#publSDType
IRI: https://purl.archive.org/akco#publSDValidate
IRI: https://purl.archive.org/akco#publServDBpediaDOLCE
IRI: https://purl.archive.org/akco#publSiApCofoRDMo
IRI: https://purl.archive.org/akco#publSLCN
IRI: https://purl.archive.org/akco#publTISCO
IRI: https://purl.archive.org/akco#publToDeSePrfoRDusSPasInLa
IRI: https://purl.archive.org/akco#publTripleNet
IRI: https://purl.archive.org/akco#publTypingErrorsinFactualKnowledgeGraphs
IRI: https://purl.archive.org/akco#publTyPrfoEfCoReinHeSeGr
IRI: https://purl.archive.org/akco#publUsReofInKnBa
IRI: https://purl.archive.org/akco#publValidata
IRI: https://purl.archive.org/akco#publValidatrr
IRI: https://purl.archive.org/akco#publVeriGraph
IRI: https://purl.archive.org/akco#publVGFD
IRI: https://purl.archive.org/akco#publVRP
IRI: https://purl.archive.org/akco#publWhenowl
IRI: https://purl.archive.org/akco#publWhEvlidahewiaquthclupDB
IRI: https://purl.archive.org/akco#tecCrowdsourcing-based
IRI: https://purl.archive.org/akco#tecEmbeddingAndNeuralNetwork-based
IRI: https://purl.archive.org/akco#tecHybrid
IRI: https://purl.archive.org/akco#tecIntegrityConstraint-based
IRI: https://purl.archive.org/akco#tecOntology-based
IRI: https://purl.archive.org/akco#tecPath-based
IRI: https://purl.archive.org/akco#tecRuleMining-based
IRI: https://purl.archive.org/akco#tecStatistical
IRI: https://purl.archive.org/akco#tecSyntactic
IRI: https://purl.archive.org/akco#tecVerbalization-based
IRI: http://purl.org/dc/terms/
[Chávez-Feria, et al., 2021] Chávez-Feria, S., García-Castro, R., & Poveda-Villalón, M. (2021). Converting UML-based ontology conceptualizations to OWL with Chowlk. In European Semantic Web Conference (pp. 44–48). Springer.
[BERT] Devlin, J., et al. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.
[TransE] Bordes, A., et al. (2013). Translating Embeddings for Modeling Multi-relational Data.
[Paulheim, 2017] Heiko Paulheim. Knowledge graph refinement: A survey of approaches and evaluation methods. Semantic web, 8(3):489–508, 2017.
[Fensel et al., 2020] Dieter Fensel, Umutcan Şimşek, Kevin Angele, Elwin Huaman, Elias Kärle, Oleksandra Panasiuk, Ioan Toma, Jürgen Umbrich, and Alexander Wahler. Knowledge Graphs: Methodology, Tools and Selected Use Cases. Springer Nature, 2020.
[Acosta, et al., 2013] Maribel Acosta, Amrapali Zaveri, Elena Simperl, Dimitris Kontokostas, Sören Auer, and Jens Lehmann. Crowdsourcing linked data quality assessment. In International semantic web conference, pages 260–276. Springer, 2013.
The authors would like to thank Silvio Peroni for developing LODE, a Live OWL Documentation Environment, which is used for representing the Cross Referencing Section of this document and Daniel Garijo for developing Widoco, the program used to create the template used in this documentation.