Computer Science: Faculty Publications and Other Works

A Distributed Graph Approach for Pre-processing Linked RDF Data Using Supercomputers

Michael J. Lewis, The University of Illinois at Chicago
George K. Thiruvathukal, Loyola University Chicago
Venkatram Vishwanath, Argonne National Laboratory
Michael E. Papka, Argonne National Laboratory and Northern Illinois University
Andrew Johnson, The University of Illinois at Chicago

Document Type

Conference Proceeding

Publication Date

5-19-2017

Publication Title

International Workshop on Semantic Big Data 2017 (SBD 2017)

Publisher Name

ACM

Abstract

Efficient RDF graph-based queries are becoming more pertinent as interest grows in data analytics and its intersection with large, unstructured but connected data. Many commercial systems have adopted distributed RDF graph systems in order to handle increasing dataset sizes and complex queries. This paper introduces a distributed graph approach to pre-processing linked data. Instead of traversing the memory graph, our system indexes pre-processed join elements that are organized in a graph structure. We analyze the DBpedia dataset (derived from the Wikipedia corpus) and compare our access method to a graph traversal access approach, which we also devise. Our experimental results show that the distributed, pre-processed graph approach to accessing linked data is faster than the traversal approach over a specific range of linked queries.

Identifier

978-1-4503-4987-1

Recommended Citation

Michael J. Lewis, George K. Thiruvathukal, Venkatram Vishwanath, Michael E. Papka, and Andrew Johnson, A Distributed Graph Approach for Pre-Processing Linked Data Using Supercomputers, In Proceedings of International Workshop on Semantic Big Data 2017 (SBD 2017) at ACM SIGMOD 2017.

Included in

Computer Sciences Commons
