Romanian Journal of Information Science and Technology (ROMJIST)

An open – access publication

  |  HOME  |   GENERAL INFORMATION  |   ROMJIST ON-LINE  |  KEY INFORMATION FOR AUTHORS  |   COMMITTEES  |  

ROMJIST is a publication of Romanian Academy,
Section for Information Science and Technology

Editor – in – Chief:
Radu-Emil Precup

Honorary Co-Editors-in-Chief:
Horia-Nicolai Teodorescu
Gheorghe Stefan

Secretariate (office):
Adriana Apostol
Adress for correspondence: romjist@nano-link.net (after 1st of January, 2019)

Founding Editor-in-Chief
(until 10th of February, 2021):
Dan Dascalu

Editing of the printed version: Mihaela Marian (Publishing House of the Romanian Academy, Bucharest)

Technical editor
of the on-line version:
Lucian Milea (University POLITEHNICA of Bucharest)

Sponsor:
• National Institute for R & D
in Microtechnologies
(IMT Bucharest), www.imt.ro

ROMJIST Volume 23, No. 1, 2020, pp. 55-68
 

Rajesh MAHULE, Ranjana VYAS, Om Prakash VYAS
Towards Knowledge Discovery in interlinked heterogeneous datasets of LOD cloud

ABSTRACT: In the last years, a huge volume of data was published on the web as Linked Open Data (LOD). Consuming and using this interlinked collection of heterogeneous data in classical data mining methods is a substantial challenge as it requires input in propositional Feature Vector Table (FVT) form. To overcome this consumption hurdle, this paper proposes a framework inspired by Link Traversal Based Query Execution (LTBQE) paradigm. The framework is designed to dynamically extract relevant features and build an FVT from a set of interlinked RDF datasets in a local environment. This article introduces a Content-Based similarity measure to evaluate generated FVT. Also, two representative data mining tasks are performed to evaluate the framework empirically which shows that the generated FVT assists in learning from heterogeneous LOD datasets. The evaluation work revealed some interesting patterns and also suggests an appropriate distance measure to handle dimensionality of the set-valued data attribute.

KEYWORDS: Linked Open Data, Semantic Web, Data mining, Knowledge discovery, Feature vector generation

Read full text (pdf)






  |  HOME  |   GENERAL INFORMATION  |   ROMJIST ON-LINE  |  KEY INFORMATION FOR AUTHORS  |   COMMITTEES  |