Querying Distributed RDF Data Sources with SPARQL

From semanticweb.org
Jump to: navigation, search

A paper written by Bastian Quilitz and Ulf Leser. It was presented at the ESWC2008. It is about Federated Queries, Query Processing, SPARQL, Query Distribution, and Query Optimization

See on Revyu.com.

[edit] Abstract

Integrated access to multiple distributed and autonomous RDF data sources is a key challenge for many semantic web applications. As a reaction to this challenge, SPARQL, the current W3C Proposed Recommendation for an RDF query language, supports querying of multiple RDF graphs. However, the current standard does not provide transparent query federation, which makes query formulation hard and lengthy. Furthermore, current implementations of SPARQL load all RDF graphs mentioned in a query to the local machine. This usually incurs a large overhead in network traffic, and sometimes is simply impossible for technical or legal reasons. To overcome these problems we present DARQ, an engine for federated SPARQL queries. DARQ provides transparent query access to multiple SPARQL services, i.e., it gives the user the impression to query one single RDF graph despite the real data being distributed on the web. A service description language enables the query engine to decompose a query into sub-queries, each of which can be answered by an individual service. DARQ also uses query rewriting and cost-based query optimization to speed-up query execution. Experiments show that these optimizations significantly improve query performance even when only a very limited amount of statistical information is available. DARQ is available under GPL License at http://darq.sf.net/.

This data has been imported from the ESWC2008 data

Personal tools
Namespaces

Variants
Actions
Navigation
services
Toolbox