Expressive Scalable Querying over Integrated Linked Open Data

Description:

Linked Open Data (LOD) is rapidly developing into an open data movement that connects a large variety of data across the World Wide Web using standards adopted by the World Wide Web Consortium (W3C). Driven by researchers, government agencies and companies, the resulting Web of Data has grown to over 1000 datasets and is showing exponential growth. However, simply putting collections of data on the Web is of very limited value. The key to unlocking that value for more powerful search, browsing, exploration and analysis is to richly interlink, or semantically integrate, the components of LOD. Given the size, growth rate, heterogeneity and expanding coverage of LOD, manual semantic integration or interlinking is not practical. Furthermore, current techniques focus on a single construct, owl:sameAs, whose limited expressiveness leads to its abuse, so they are ineffective or yield poor-quality integration. What is needed is the ability to represent and identify richer and more explicit relationships between entities, so that the richness of the real world is not crammed inaccurately and inappropriately into a very limited set of relationship types. At the same time, the exponential growth of LOD in size and diversity makes it challenging to identify and analyze datasets for both human and application consumption. Even though popular datasets such as DBpedia, Freebase and MusicBrainz are well known and widely used in the community, there may be other hidden gems that are useful for specialized applications.

To address these challenges, this project developed exploratory techniques to richly interlink components of LOD, tackled the problem of querying the LOD cloud, and proposed approaches to discover datasets, compress linked data, and create entity summaries.

Funding Agency:

National Science Foundation

From:

September, 2011

To:

August, 2014

Publications

A. Krishna Joshi, Hitzler, P., and Dong, G., “Logical linked data compression”, in The Semantic Web: Semantics and Big Data. 10th Extended Semantic Web Conference, ESWC 2013, Montpellier, France, May 26-30, 2013, pp. 170–184.
S. Lalithsena, Hitzler, P., Sheth, A., and Jain, P., “Automatic Domain Identification for Linked Open Data”, in 2013 IEEE/WIC/ACM International Conferences on Web Intelligence, WI 2013, Atlanta, GA, USA, 2013, pp. 205–212.
DomainIdentLOD-WI13.pdf (1.16 MB)
A. Krishna Joshi, Jain, P., Hitzler, P., Yeh, P. Z., Verma, K., Sheth, A., and Damova, M., “Alignment-based querying of linked open data”, in On the Move to Meaningful Internet Systems: OTM 2012, Springer, 2012, pp. 807–824.
A. Krishna Joshi, Hitzler, P., and Dong, G., “Towards logical linked data compression”, in Proceedings of the Joint Workshop on Large and Heterogeneous Data and Quantitative Formalization in the Semantic Web, LHD+ SemQuant2012, at the 11th International Semantic Web Conference, ISWC2012, 2012.
P. Jain, Hitzler, P., Verma, K., Yeh, P. Z., and Sheth, A., “Moving beyond SameAs with PLATO: Partonomy detection for Linked Data”, in 23rd ACM Conference on Hypertext and Social Media, HT '12, Milwaukee, WI, USA, 2012, pp. 33–42.
plato-ht2012.pdf (314.03 KB)
