KWTR: Outreach to Industry

From semanticweb.org
Jump to: navigation, search

Main Contributors: Alain Leger, Lyndon J B Nixon. See the list of contributors


  • 1. CURRENT TRENDS IN SEMANTIC WEB (In the following part we intend to identify the state of the art of Semantic Web based theories, methods, applications and tools in your research field.)
    • 1.1. One or more examples (case studies) in which semantic web has been used.
  Name of the institutions: WorldWideJobs GmbH
  Industry / sector: Human Resources
  Business activities improved by the SW solutions: Search and
retrieval 
  Research Needs: Ontology matching, efficiency of storage and
retrieval
  Name of the project: Wissensnetze (German funded national
project) 
  Tools and applications implemented in the project: Prototype
with Web UI - Semantic Matching Engine
  Other (specify) OWL-DL ontologies developed based on existing
industry specifications 
    • 1.2. The first 4 Semantic Web based tools used in your research fields.
Name: Protege
Website: http://protege.stanford.edu
Main characteristics: GUI based ontology editor 
Open problems: Ontology visualisation -- high relevance -- will
be solved in the medium term
                     Modularisation (for large ontologies) -- high
relevance -- will be solved in the medium term
Name: Jena
Website: http://jena.sourceforge.net
White page http://jena.sourceforge.net/documentation.html
Main characteristics: Java API for Semantic Web applications
Open problems: 
Name: Sesame
Website: http://www.openrdf.org
Main characteristics: RDF storage based on RDBMS
Open problems: Scalability -- very high relevance -- will be
solved in the medium term
                     Standardized query support  -- normal relevance
-- will be solved in the short term
Name: RACER
Website: http://www.sts.tu-harburg.de/~r.f.moeller/racer/
Main characteristics: Description Logic reasoner
Open problems: Efficiency when reasoning over large datasets --
high relevance -- will be solved in the medium term
                                                             
    • 1.3. A short summary of the first 3 best papers in the field.
Reference:  DFrom SHIQ and RDF to OWL: the making of a Web
Ontology Language.
Short abstract: The OWL Web Ontology Language is a new formal
language for representing ontologies in the Semantic Web. OWL has 
features from several families of representation languages, including
primarily Description Logics and frames. OWL also shares many 
characteristics with RDF, the W3C base of the Semantic Web. In this
paper we discuss how the philosophy and features of OWL can be traced
back to these older formalisms, with modifications driven by several
other constraints on OWL.. Several interesting problems have arisen
where these influences on OWL have clashed.
Reference:  Learning to Map between Ontologies on the Semantic
Web.
Short abstract: Ontologies play a prominent role on the Semantic 
Web. They make possible the widespread publication of machine
understandable data, opening myriad opportunities for automated
information processing. However, because of the Semantic Web's
distributed nature, data on it will inevitably come from many
different ontologies. Information processing across ontologies is not
possible without knowing the semantic mappings between their elements.
Manually finding such mappings is tedious, error-prone, and clearly
not possible at the Web scale. Hence, the development of tools to
assist in the ontology mapping process is crucial to the success of
the Semantic Web.We describe glue, a system that employs machine
learning techniques to find such mappings. Given two ontologies, for
each concept in one ontology glue finds the most similar concept in
the other ontology. We give well-founded probabilistic definitions to
several practical similarity measures, and show that glue can work
with all of them. This is in contrast to most existing approaches,
which deal with a single similarity measure. Another key feature of
glue is that it uses multiple learning strategies, each of which
exploits a different type of information either in the data instances
or in the taxonomic structure of the ontologies. To further improve
matching accuracy, we extend glue to incorporate commonsense knowledge
and domain constraints into the matching process. For this purpose, we
show that relaxation labeling, a well-known constraint optimization
technique used in computer vision and other fields, can be adapted to
work efficiently in our context. Our approach is thus distinguished in
that it works with a variety of well-defined similarity notions and
that it efficiently incorporates multiple types of knowledge. We
describe a set of experiments on several real-world domains, and show
that glue proposes highly accurate semantic mappings.
Reference: RDF Primer. 
Short abstract: The Resource Description Framework (RDF) is a
language for representing information about resources in the World
Wide Web. This Primer is designed to provide the reader with the basic
knowledge required to effectively use RDF. It introduces the basic
concepts of RDF and describes its XML syntax. It describes how to
define RDF vocabularies using the RDF Vocabulary Description Language,
and gives an overview of some deployed RDF applications. It also
describes the content and purpose of other RDF specification
documents.
    • 1.4. A short list of open problems in theories and methods.
Choice of logical model on which to build KR (DL, FOL, F-Lofic,
LP): normal relevance -- will be solved in the long term
Ontology engineering methodologies: high relevance -- will be
solved in the medium term
Ontology alignment for mapping, merging etc. : normal relevance --
will be solved in the medium term
  • 2. TRENDS ON THEORIES AND METHODS, SERVICES AND APPLICATIONS
    • 2.1. Research projects in which contributors are involved, along
with a general description. Moreover, suggest for each project the
possible future uses and applications related to the Semantic Web, the
acceptance and diffusion in each period considered, the benefits, and
the problems that will be probably occur. 
Name of the project: Knowledge Web
Type: EU Network of Excellence
Duration: 4 years (2004-7)
Partners: 
Research Institution: FU Berlin, Uni Trento, VUA, VUB, Uni Karlsruhe,
INRIA among others
Industrial Partners: France Telecom
Core activities: 
Technological transfer to industry -- very high relevance -- will be
solved in the medium/long term
Semantic Web research -- high relevance -- will be solved in the
short/medium term
Education for Semantic Web -- very high relevance -- will be solved in
the medium term
Market opportunities:
Semantic search -- normal acceptance and diffusion in the short term
Semantic social networking -- high acceptance and diffusion in the
medium term
Semantic Web Services -- normal acceptance and diffusion in the long
term
Benefits for industry and practitioners
Industry-capable tools and methods -- very high relevance
Best practices and guidelines for ontology development and use -- high
relevance
Ontology evaluation and recommendation -- normal relevance
Technological Problems (missing theories and methods)
Ontology best practices -- high relevance -- will be solved in the
medium term  
Semantic Web tool benchmarks -- high relevance -- will be solved in
the short term
Ontology evaluation and cost modelling -- high relevance -- will be
solved in the short term
Problems and missing tools: 
Ontology engineering tools -- high relevance -- will be solved in the
medium term
Ontology re-use support -- high relevance -- will be solved in the
medium term
Ontology alignment support -- normal relevance -- will be solved in
the medium term
Problems – Semantic Web culture is missing: 
Introduction of metadata generation in content creation processes --
high relevance -- will be solved in the medium term 
Industrial use of ontologies (business people are not logicians) --
very high relevance -- will be solved in the long term
Other problems
Ontology recommendation -- normal relevance -- will be solved in the
long term
On-the-fly ontology alignment -- low relevance -- will be solved in
the long term
Concept mapping -- normal relevance -- will be solved in the long term
Name of the project: TripCom
Type:     EU STREP
Duration: 3 years (April 2006-2009)
Partners: 
Research Institution: FU Berlin, TU Vienna, U Stuttgart, U Innsbruck,
NUIG Galway
Industrial Partners: Profium, Cefriel, Telefonica, Ontotext
Core activities: 
Tuplespace computing -- normal relevance -- will be solved in the
medium term
Semantic Web Services -- high relevance -- will be solved in the long
term
Market opportunities:
Communication middleware for semantic SOA applications -- normal
acceptance and diffusion in the long term
Web scale service-oriented computing -- high accceptance and diffusion
in the medium term
Support infrastructure for “intelligent” services -- high accceptance
and diffusion in the long term
Benefits for industry and practitioners
Coordination of Web Service communication -- normal relevance
Mediation between different business vocabularies -- high relevance
Technological Problems (missing theories and methods)
Semantic coordination of data -- normal relevance -- will be solved in
the short term
Representation of RDF in tuplespace -- normal relevance -- will be
solved in the short term
Problems and missing tools: 
Semantic tuplespace platform -- high relevance -- will be solved in
the medium term
Web-scale RDF storage -- very high relevance -- will be solved in the
medium term
Problems – Semantic Web culture is missing: 
Use of semantic models in business communication -- high relevance --
will be solved in the long term 
Use of semantics in Web Service development and application -- high
relevance -- will be solved in the long term
Name of the project: Wissensnetze
Type: German national project (BMBF)
Duration: 2004-7 (4 years)
Partners: 
Research Institution: FU Berlin, HU Berlin
Industrial Partners: WorldWideJobs GmbH
Core activities: 
Job portal prototype -- high relevance -- will be solved in the short
term
Implication of Sem Web technologies on e-business -- very high
relevance -- will be solved in the medium term
Market opportunities: 
Improved market transparency -- normal acceptance and diffusion in the
medium term
Strengthening job portals position as middlemen between job seekers
and business -- high acceptance and diffusion in the medium term 
Benefits for industry and practitioners
Business reaches more potential applicants -- very high relevance
Relevance of suitable applicants optimized -- high relevance
Technological Problems (missing theories and methods):
Ontology matching methodologies -- high relevance -- will be solved in
the medium term
Problems – Missing tools:
Ontology reuse support -- very high relevance -- will be solved in the
medium term
Problems – Semantic Web culture is missing: 
Real test data sets do not exist -- high relevance -- will be solved
in the medium term


    • 2.2. Some topics that will not be solved in short and medium term,
for each of them there is a short explanation of the main reasons and
(if possible) some references.
Topics: Automatic high level annotation of multimedia
Reason: requires non-trivial inference of concepts from a
combination of low level media features and associated natural
language text. While much progress is made on identifying single
concepts, a complete annotation requires modelling sets of concepts
and relationships between them as a representation of what a
multimedia content contains.
References: none
Topics: On-the-fly knowledge extraction and inference across
heterogeneous knowledge
Reason: Again, system-based extraction of semantics depends on
complex natural language understanding as well as awareness of context
to be able to formalize data correctly as knowledge. These formalisms
will likely draw on different ontologies and name their concepts
differently, so means to automatically align both schema and instances
are required.
References: none
Topics: A Web-scale Semantic Web
Reason: using the Semantic Web means making use of reasoning to
infer knowledge, and other approaches to have a common view on all
knowledge (mapping, merging). This requires a very resource heavy
logic-based approach which does not scale to the Web. 


  • 3. TRENDS ON TOOLS
    • 3.1. A list of the most relevant semantic based demos in the area.
Name: Job Recruitment Prototype
Website: Not yet public
Description: Job portal with semantic search 
References: http://wissensnetze.ag-nbi.de
Main features:
RDF based input of job offers or applicants -- high relevance 
Ontology based matching between offers and applicants  -- very high
relevance
Open problems:
Performance of matching for large data sets -- very high relevance --
will be solved in the medium term
Precision of matching -- high relevance -- will be solved in the short
term
Name: Hotel Search Prototype
Website: Not yet public
Description: Hotel search personalized to user profile 
References: http://reisewissen.ag-nbi.de 
Main features:
Mapping of RDF and non-RDF vocabularies -- high relevance 
Ontology based ranking of hotels based on inference from user profile
data  -- very high relevance
Open problems:
Performance of reasoning -- very high relevance -- will be solved in
the medium term
Datatype queries -- high relevance -- will be solved in the short term
Rules -- high relevance -- will be solved in the short term
    • 3.2. A short description of tools that are still missing. A description of business activities and problems they should solve, will be provided.
Name Ontology engineering
Description Tool based support for the creation and maintenance
of an ontology
Related business activities All ontology-based applications
require the creation of an ontology
Problems that the tool will solve
Ontology creation -- high relevance to the business -- will be solved
in the medium term
Ontology re-use -- very high relevance to the business -- will be
solved in the medium term
Ontology evolution -- high relevance to the business -- will be solved
in the medium term


  • 4.Please fell free to add any comment or suggestion.
Personal tools
Namespaces

Variants
Actions
Navigation
services
Toolbox