KWTR: Blogs

From semanticweb.org
Jump to: navigation, search

Main Contributors: Knud Möller. See the list of contributors


  • 1. CURRENT TRENDS IN SEMANTIC WEB (In the following part we intend to identify the state of the art of Semantic Web based theories, methods, applications and tools in your research field.)
    • 1.1. One or more examples (case studies) in which semantic web has been used.
  Name of the institutions: National Institute of Informatics, Japan 
  (provider of technology), Senshu University, Japan (user of technology)
  Industry / sector: Education
  Business activities improved by the SW solutions: The “semblog” and RNA 
  tools developed by the group around Hideaki Takeda were adapted and used by 
  Senshu University to provide class support. Each teacher and student had 
  their own semantic blog, which were connected using the tools. 
  Research Needs: 
  Name of the project:
  Tools and applications implemented in the project: semblog and RNA. For a description see the paper in 1.3.2
  Note: semblog is not the same as semiBlog (below), although the names are almost identical.
    • 1.2. The first 4 Semantic Web based tools used in your research fields.
Name: semiBlog
Website: http://semiblog.semanticweb.org
White paper: http://sw.deri.org/~knud/papers/semiBlogSemDesk2005/semiBlogSemDesk2005.pdf
Main characteristics: semiBlog is an authoring tool for Semantic Blogs. It is 
desktop-based and allows the user to create    semantic metadata from desktop 
objects like calendar events, address book contacts, etc.. The main focus is on 
usability and easy-of-use, which I consider important for the acceptance of SW technology in the mainstream.
Open problems: 
Platform independence -- medium relevance -- will be solved in the short term
Stability -- high relevance -- will be solved in the short term
                     
Name: HP Labs semantic blogging demonstrator
Website: http://www.semanticblogging.org/semblog/blog/default/
White paper: http://www.idealliance.org/papers/dx_xmle04/index/author/7691905a7307fb7f2731ee67e8.html
Main characteristics: The first implementation of what Semantic Blogging 
could be and how it could look like. The demonstrator is web-based and restricted to the bibliographic domain. 
The main goal was not to provide a complete solution, but to give an impression 
– a demonstrator. The “open problems” mentioned below are therefore not really problems 
of this particular tool, which is why I don’t specify a time when they will be solved.
Open problems: 
“Web-based”-ness (see below) -- high relevance 
Restricted Domain -- high relevance 

Name: semblog and RNA
Website: http://www.semblog.org
White paper: http://www-kasm.nii.ac.jp/papers/takeda/05/takeda05www.pdf
Main characteristics: Another implementation of the Semantic Blogging idea. 
This one focuses on connecting semantic blogs from various users in a FOAF-based 
network, allowing automatic recommendations from one user to another based on the network. 
Open problems: no one has been identify
                                            
Name: YARS
Website: http://sw.deri.org/2004/06/yars/
White paper: http://sw.deri.org/2005/02/dexa/yars.pdf
Main characteristics: YARS is a high-performance RDF store, and as such 
potentially an important building block in the emerging Semantic Web. For Semantic 
Blogging, it can be a solution to host, index and give access to semantic metadata 
in the blogosphere. YARS implements quads instead of triples, which allows to model 
context and provenance, an important aspect in Semantic Blogging.
Open problems: 
Complete SPARQL support -- high relevance -- will be solved in the short term
Stability -- high relevance -- will be solved in the short term 
    • 1.3. A short summary of the first 3 best papers in the field.
Reference:  S. Cayzer. Semantic Blogging: Spreading the Semantic Web 
Meme. In XML Europe 2004, Amsterdam, Netherlands, Proceedings, April 2004.
Short abstract: This paper first introduced the term “Semantic 
Blogging”. It describes the general idea and presents a web-based 
implementation of Semantic Blogging for the bibliographic domain, showing – 
in a very limited domain – how to author, host and consume a semantic blog.
Reference:  H. Takeda and I. Ohmukai. Semblog Project. 
In Activities on Semantic Web Technologies in Japan, A WWW2005 Workshop, 2005.
Short abstract: Short abstract..This paper gives a good overview 
of the “semblog” and “RNA” tools for Semantic Blogging. While these tools 
are not very powerful with respect to annotation of blog posts, this paper 
shows well how blogs can be interlinked with the FOAF network, and how the 
resulting structure can be used to facilitate automatic referral of bookmarks 
between neighbour nodes in the network. 
Reference: K. Möller, U. Bojars and J. G. Breslin. Using Semantics to 
Enhance the Blogging Experience. In 3rd European Semantic Web Conference 
(ESWC2006), Budva, Montenegro, June 2006.
Short abstract: The paper investigates two complementing aspects 
of semantic metadata in the blogosphere - structural and content-related 
metadata. The nature of these two kinds of metadata is discussed in detail, 
as well as ways of creating such metadata in a convenient and non-obstrusive 
way for the user, how to publish such metadata on the web, and how to best make 
use of such metadata from the point of view of a blog consumer. 
The paper is based on the SIOC project and the semiBlog tool.


    • 1.4. A short list of open problems in theories and methods.
* ”URI crisis” – the problem of how to coin URIs for arbitrary objects 
and how to interprete a given URI is highly relevant for Semantic Blogging 
-- high relevance -- will be solved in the medium term
* Representing RDF in blogs – As of yet, there is no standard for representing RDF 
(the semantic metadata) in web pages (the blog post) – 
by linking, inline (e.g. RDF/A), etc: high relevance -- will be solved in the short term
* Desktop- vs. Web-based – Semantic Blogging (as many other domains) can be implemented 
in both ways. Both have advantages and disadvantages. Not really a problem, 
but it will be interesting to see which way will be more prominent: 
medium relevance -- will be solved in the short term
* Other: Today users find conventional blog content using generic 
search engines like Google, or specialized ones like Technorati (and, 
of course, links within the blogosphere). As of yet, there is not search 
engine that crawls and indexes SW data and allows conceptual queries (e.g. SPARQL). 
Without such a SW-Google, the improved possibilities of Semantic Blogging in terms 
of searching and finding relevant posts cannot be implemented. This, of course, 
is a general SW problem and not restricted to Semantic Blogging.
  • 2. TRENDS ON THEORIES AND METHODS, SERVICES AND APPLICATIONS
    • 2.1. Research projects in which contributors are involved, along with a general description. Moreover, suggest for each project the possible future uses and applications related to the Semantic Web, the acceptance and diffusion in each period considered, the benefits, and the problems that will be probably occur.
Name of the project: Nepomuk
Type: FP6
Duration: 2006-2008
Partners: 
Research Institution: DFKI Kaiserslautern, AIFB Karlsruhe, NUIG Galway, 
EPFL Lausanne, L3S Hannover, ICCS Athens, KTH Stockholm, USI Lugano 
Industrial Partners: HP, IBM, SAP, PRC Group, EDGE-IT, 
Cognium Systems SA, Irion Management Consulting GmbH, Thales SA
Core activities: 
Developing an architecture standard for interlinked semantic desktops 
-- very high relevance -- will be solved in the short term
Developing a reference implementation of the standard
-- high relevance -- will be solved in the short term
Market opportunities: Developing standards compliant Semantic Desktop 
applications. Thanks to the standard, they will be interoperable. 
This will increase market chances.
Benefits for industry and practitioners: 
Industry partners can be early adopters and influence the standard -- high relevance
Bringing the SW to the desktop allows to try out many things in controlled, 
manageable environment -- medium relevance
Technological Problems (missing theories and methods):"
Open World vs. Closed World – which one will SemDesk data be? 
If closed world – how do we integrate open world SW data?
-- high relevance -- will be solved in the medium term
“URI Crisis” – just as relevant here as in Semantic Blogging (see 1.4)
-- very high relevance -- will be solved in the medium term
Problems – Missing tools:
Visualizing RDF – there is still no good solution for visualizing complex RDF graphs for ordinary users (i.e. our target)
-- high relevance -- will be solved in the medium term
SW peer2peer infrastructure – Nepomuk communication is to be based on P2P. There is no sufficient solution yet
-- high relevance -- will be solved in the short term
Problems – Semantic Web culture is missing: 
Digitize your life – in order for the SemDesk to be useful, users have to be 
willing to organize a lot of their lives in terms of the Semantc Desktop. 
-- medium relevance -- will be solved in the medium term 
Privacy – Linking my personal data to other people opens up huge privacy 
questions. Also, the web needs to stay as decentralized and out of reach of 
government institutions as possible. -- high relevance -- will be solved in the medium term 


    • 2.2. Some topics that will not be solved in short and medium term, for each of them there is a short explanation of the main reasons and (if possible) some references.

I would consider authoring in the SW in general and Semantic Blogging in particular to be more about engineering than fundamental science. As such, I think all the necessary building blocks, techniques and theories are really in place, and there are no fundamental questions that have to be solved in order to allow Semantic Blogging to work and be accepted in the mainstream, at least none that could not be solved in the short or medium term. The problems mentioned here are therefore more of a general nature.

Topics: URI crisis – I have mentioned this before, but I think this 
really is a fundamental question for all the SW to work.
Reason: In the SW, we want to make statements about things or concepts. 
To do that, we need to refer to that thing (e.g. a person) using a URI. 
Now, the question is: given e.g. some person, how do we assign a URI to it? 
There is no  central agency to do that, and such an agency is probably not 
desirable either. For specific domains like people, some sort of registry might 
work. As a workaround, users sometimes use e.g. the URL of the home page of a 
person to represent that person, but this creates new problems – not everybody 
has a home page, and it is now impossible to distinguish between statements about 
the person and the home page. Semantics like inverse functional properties can solve 
this in part (we do NOT assign URI to a person, but say that each person object with 
e.g. the same email address represents the same real life person), but they don’t 
work in all cases and are not widely implemented. Conventions such as the one 
outlined in [2] can help to cure the problem, but only if everyone adheres to them, 
and only from the point when they are introduced. There is no general solution yet, 
and even though a lot of people have been discussing this for a long time, 
nothing has come up. This might or might not be solved in the short or medium term, I’m not sure. 
References: [1] S. Pepper and S. Schwab. Curing the web’s identity crisis.   
http://www.ontopia.net/topicmaps/materials/identitycrisis.html 
[2] E. Oren, R. Delbru, K. Möller, M. Völkel, and S. Handschuh. Annotation 
and navigation in semantic wikis. In SemWiki2006 - From Wiki to Semantics, at ESWC2006, Budva, Montenegro, June 2006.
Topics: Natural language understanding
Reason: This will show up again in the WP 3.2. The power of the SW lies 
in its complex graph structures, and the concise queries they allow. 
However, these queries require the use of complex query languages (e.g. SPARQL), 
and this will keep mainstream users from using it. One solution could be interfaces 
that translate natural language into such queries. However, this holy grail of 
computational linguistics has been pursued for decades now, and I’m not aware 
that we are anywhere near a robust and reliable solution yet. 
References: none
  • 3. TRENDS ON TOOLS
    • 3.1. A list of the most relevant semantic based demos in the area.

See tools described above

    • 3.2. A short description of tools that are still missing. A description of business activities and problems they should solve, will be provided.
Name: Conceptual query interfaces for my mother
Description: The power of rich graph representations of information 
over “simple” textual representations is – among other things – 
the precision with which they can be queried. However, to make use 
of these powerful queries, users currently have to use complicated 
query languages. Not an option for my mother (or my father, for that matter). 
So, we need tools that allow to formulate powerful queries in a simple way. 
I wonder if it can be done at all. Alternative solutions like 
faceted browsing might turn out to be more useful. 
Related business activities End-user targeted application development. UI development.
  • 4.Please fell free to add any comment or suggestion.

At the moment, developments in this area seem to be hampered by the fact that people either focus on distributed solutions with very little semantics or at semantic solutions with almost no decentralization.

Personal tools
Namespaces

Variants
Actions
Navigation
services
Toolbox