Springer Nature
Browse
1/1
3 files

Entity Relatedness Test Data

dataset
posted on 2017-09-08, 15:31 authored by J. Herrera, M.A. Casanova, B.P. Nunes, G.R. Lopes, L.A. Leme
The entity relatedness problem refers to the question of computing the relationship paths that better describe the connectivity between a given entity pair.

This dataset supports the evaluation of approaches that address the entity relatedness problem. It covers two familiar domains, music and movies, and uses data available in IMDb and last.fm, which are popular reference datasets in these domains.

The dataset contains 20 entity pairs from each of these domains and, for each entity pair, a ranked list with 50 relationship paths. It also contains entity ratings and property relevance scores for the entities and properties used in the paths.

The data is compressed in .zip format and can be uncompressed by standard compression utilities. The data are split into three archives:

EntityRelatednessTestData to RDF.zip: contains raw (.txt) and rdf test data along with test scripts (.java) and java class (.class) files.

ontology.zip: contains the .rdf ontology for the entity relatedness test dataset

dataset.zip: contains the entity relatedness test dataset in .rdf, .ttl and .nt formats

The underlying data and code can be accessed through standard text edit software.

History

Research Data Support

Research data support provided by Springer Nature.