| PID |
Title |
Authors |
Abstract |
| 5 |
Semantic Telecommunications Network Capability Services |
- Xiuquan Qiao*, BUPT
- Xiaofeng Li, BUPT
|
The providing of pervasive services in the B3G/4G network presents a great
challenge in light of well known problems like context-awareness, semantic
service description, accurate service query and dynamic service composition.
To eliminate the semantic gap of Telecommunications Network and Internet
in the service layer, we propose the semantic Telecommunications Network
Capability Services (TNCS). By comparing the differences between TNCS and
the plain web services, we present a semantic description approach for TNCS
by extending OWL-S. Then using this approach, we demonstrate a service profile
description case for semantic Parlay X. In this way, the ontology-based
accurate query, matching, automatic composition and invocation for telecommunication
network capability services can be supported. This will facilitate the semantic
convergence of telecommunications network and Internet in the service layer. |
| 12 |
Integrating Lightweight Reasoning into Class-Based Query Refinement for
Object Search |
- Gong Cheng*, Southeast University
- Yuzhong Qu, SouthEast University, China
|
More and more RDF data have been published online to be consumed. Ordinary
Web users also expect to experience more intelligent services promised by
the Semantic Web, such as object search based on structured data. We implemented
the Falcons search engine to meet the challenge. To enable keyword search,
for each object, we construct and index a virtual document that includes
textual descriptions of its neighboring resources. Typing information is
used to serve class-based query refinement, and lightweight reasoning is
performed to discover implicit types of objects. A method of recommending
subclasses is implemented to enable navigating class hierarchies for incremental
query refinement. We also report on lessons learned from Web-scale experiments. |
| 14 |
A Modularization-based Approach to Finding All Justifications for OWL
DL Entailments |
- Boontawee Suntisrivaraporn*, TU Dresden
- Guilin Qi, University of Karlsruhe
- Qiu Ji, University of Karlsruhe
- Peter Haase, University of Karlsruhe
|
Finding the justifications for an entailment (i.e., minimal sets of axioms
responsible for it) is a prominent reasoning service in ontology engineering,
as justifications facilitate important tasks like debugging inconsistencies
or undesired subsumption. Though several algorithms for finding all justifications
exist, issues concerning efficiency and scalability remain a challenge due
to the sheer size of real-life ontologies. In this paper, we propose a novel
method for finding all justifications in OWL DL ontologies by limiting the
search space to smaller modules. To this end, we show that so-called locality-based
modules cover all axioms in the justifications. We present empirical results
that demonstrate an improvement of several orders of magnitude in the efficiency
and scalability of finding all justifications in OWL DL ontologies. |
| 21 |
DL-Lite with role inclusions |
- Roman Kontchakov*, Birkbeck College, London
- Michael Zakharyaschev, Birkbeck College, London
|
We analyse DL-Lite logics with role inclusions and present a complete
classification of the trade-off between their expressiveness and computational
complexity. In particular, we show that in logics with role inclusions the
data complexity of instance checking becomes P-hard in the presence of functionality
constraints, and coNP-hard if arbitrary number restrictions are allowed,
even with a very primitive form of concept inclusions. Moreover, the combined
complexity of satisfiability in this case jumps to ExpTime. On the positive
side, it turns out that the combined complexity for the logics without number
restrictions depends only on the form of concept inclusions and can range
from NLogSpace and P to NP; the data complexity for such logics stays in
LogSpace. |
| 22 |
Catriple: Extracting Triples from Wikipedia Categories |
- Qiaoling Liu*, Shanghai Jiao Tong University
- Kaifeng Xu, Shanghai Jiao Tong University
- Lei Zhang, IBM China Research Lab Haofen Wang, Shanghai Jiao Tong
University
- Yong Yu, Shanghai Jiao Tong University
- Yue Pan, IBM Research Lab, China
|
As an important step towards bootstrapping the Semantic Web, many efforts
have been made to extract triples from Wikipedia because of its wide coverage,
good organization and rich knowledge. One kind of important triples is about
Wikipedia articles and their non-isa properties, e.g. (Beijing, country,
China). Previous work has tried to extract such triples from Wikipedia infoboxes,
article text and categories. The infobox-based and text-based extraction
methods depend on the infoboxes and suffer from a low article coverage.
In contrast, the category-based extraction methods exploit the widespread
categories. However, they rely on predefined relations and specific regular
expressions, which is too effort-consuming and explores only very limited
knowledge in the categories. This paper automatically extracts triples from
the less explored Wikipedia categories so as to achieve a wider article
coverage with less manual effort. We manage to realize this goal by utilizing
the syntax and semantics brought by super-sub category pairs in Wikipedia.
Our prototype implementation outputs about 10M triples with a 12-level confidence
ranging from 47.0\% to 96.4\%, which cover 78.2\% of Wikipedia articles.
Among them, 1.27M triples have confidence of 96.4\%. Applications can on
demand use the triples with suitable confidence. |
| 28 |
Semantically Conceptualizing and Annotating Tables |
- David Embley*, Brigham Young University
- Stephen Lynn, Brigham Young University
|
Enabling a system to automatically conceptualize and annotate a human-readable
table is one way to create interesting semantic-web content. But exactly
“how?” is not clear. With conceptualization and annotation in
mind, we investigate a semantic-enrichment procedure as a way to turn syntactically
observed table layout into semantically coherent ontological concepts, relationships,
and constraints. Our semantic-enrichment procedure shows how to make use
of auxiliary world knowledge to construct rich ontological structures and
to populate these ontological structures with instance data. The system
uses auxiliary knowledge (1) to recognize concepts and which data values
belong to which concepts, (2) to discover relationships among concepts and
which data-value combinations represent relationship instances, and (3)
to discover constraints over the concepts and relationships that the data
values and data-value combinations should satisfy. Experimental evaluations
indicate that the automatic conceptualization and annotation processes perform
well, yielding F-measures of 90% for concept recognition, 77% for relationship
discovery, and 90% for constraint discovery in web tables selected from
the geopolitical domain. |
| 29 |
Snippet Generation for Semantic Web Search Engines |
- Thomas PENIN*, Shanghai Jiao Tong University
- Haofen Wang, Shanghai Jiao Tong University
- Thanh Tran, Institute AIFB, University Karlsruhe
- Yong Yu, Shanghai Jiao Tong University
|
With the development of the Semantic Web, more and more ontologies are
available for exploitation by semantic search engines. However, while semantic
search engines support the retrieval of candidate ontologies, the final
selection of the most appropriate ontology is still difficult for the end
users. In this paper, we extend existing work on ontology summarization
to support the presentation of ontology snippets. The proposed solution
leverages a new semantic similarity measure to generate snippets that are
based on the given query. Experimental results have shown the potential
of our solution in this problem domain that is largely unexplored so far. |
| 32 |
The Art of Tagging: Measuring the Quality of Tags |
- Ralf Krestel*, Research Center L3S
- Ling Chen, Research Center L3S
|
Collaborative tagging, supported by many social networking websites, is
currently enjoying an increasing popularity. The usefulness of this largely
available tag data has been explored in many applications including emergent
semantics deriving, web resources categorization, and web search etc. However,
since tags are supplied by users freely, not all of them are useful and
reliable, especially when they are generated by spammers with malicious
intent. Identifying tags of high quality, therefore, is critical in improving
the performance of applications based on tags. In this paper, we propose
TRP-Rank (Tag-Resource Pair Rank), an algorithm to measure the quality of
tags by employing a quality propagating technique. The three dimensional
relationship among users, tags and web resources is firstly represented
by a graph structure. A set of seed nodes, where each node represents a
tag annotating a resource, are then selected and their quality is assessed.
The quality of the remaining nodes is calculated by propagating the known
quality of the seeds through the graph structure. We evaluate our approach
on a public data set where bad tags generated by suspicious spammers are
manually labelled. The experimental results demonstrate the effectiveness
of this approach in measuring the quality of tags. |
| 38 |
An Integrated Approach for Automatic Construction of Bilingual Chinese-English
WordNet |
- Renjie Xu*, Southeast University
- Zhiqiang Gao, Southeast University
- Yingji Pan, Southeast University
- Yuzhong Qu, SouthEast University, China
- Zhisheng Huang, Vrije Universiteit Amsterdam
|
This paper compares various approaches for constructing Chinese-English
bilingual WordNet. First, we implement three independent approaches that
translate English WordNet to Chinese WordNet automatically, including Minimum
Distance (MDA), Intersection (IA) and Words Co-occurrence (WCA). Minimum
Distance compares the gloss of synset with the explanations of words from
dictionaries. Intersection chooses the intersection part of Chinese in a
synset. Words Co-occurrence counts the results of Chinese and English words
from Google. Then, we integrate these three approaches into an integrated
one, which is named MIWA. Experimental results show that the integrated
approach MIWA has better performance: F-measure reaches 0.615, which is
higher than that of each independent one. |
| 42 |
Efficient Index Maintenance for Frequently Updated Semantic Data |
- Yan Liang*, Shanghai Jiao Tong University
- Haofen Wang, Shanghai Jiao Tong University
- Qiaoling Liu, Shanghai Jiao Tong University
- Thanh Tran, Institute AIFB, University Karlsruhe
- Thomas PENIN, Shanghai Jiao Tong University
- Yong Yu, Shanghai Jiao Tong University
|
Nowadays, the demand on querying and searching the Semantic Web is increasing.
Some systems have adopted IR (Information Retrieval) approaches to index
and search the Semantic Web data due to its capability to handle the web-scale
data and efficiency on query answering. Additionally, the huge volumes of
data on the Semantic Web are frequently updated. Thus, it further requires
effective update mechanisms for these systems to handle the data change.
However, the existing update approaches only focus on document. It still
remains a big challenge to update IR index specially designed for semantic
data in the form of finer grained structured objects rather than unstructured
documents. In this paper, we present a well-designed update mechanism on
the IR index for triples. Our approach provides flexible and effective update
mechanism by dividing the index into blocks. It reduces the number of update
operations during the insertion of triples. At the same time, it preserves
the efficiency on query processing and the capability to handle large scale
semantic data. Experimental results show that the index update time is a
fraction of that by complete reconstruction w.r.t the portion of the inserted
triples. Moreover, the query response time is not notably affected. Thus,
it is capable to make newly arrived semantic data immediately searchable
for users. |
| 47 |
Predicting Category Additions in a Topic Hierarchy |
- Dunja Mladenic*, J. Stefan Institute, SL
- Janez Brank,
- Marko Grobelnik,J. Stefan Institute, SL
|
This paper discusses the problem of predicting the structural changes
in an ontology. It addresses ontologies that contain instances in addition
to concepts. The focus is on an ontology where the instances are textual
documents, but the approach presented in this document is general enough
to also work with other kinds of instances, as long as a similarity measure
can be defined over them. We examine the changes in the Open Directory Project
ontology of Web pages over a period of several years and analyze the most
common types of structural changes that took place during that time. We
then present an approach for predicting one of the more common types of
structural changes, namely the addition of a new concept that becomes the
subconcept of an existing parent concept and adopts a few instances of this
existing parent concept. We describe how this task can be formulated as
a machine-learning problem and present an experimental evaluation of this
approach that shows promising results of the proposed approach. |
| 60 |
ROC: a method for proto-ontology construction by domain experts |
- Nicole Koenderink*, WUR -- A&F
- Mark Van Assem, Vrije Universiteit Amsterdam
- J. Lars Hulzebos, Wageningen UR -- A&F
- Jeen Broekstra, Wageningen UR -- A&F
- Jan Top, Wageningen UR -- A&F
|
Ontology construction is a labour-intensive and costly process. Even though
many formal and semi-formal vocabularies are available, creating an ontology
for a specific application is hindered in a number of ways. Firstly, the
process of elicitating concepts is a time consuming and strenuous process.
Secondly, it is difficult to keep focus. Thirdly, technical modelling constructs
are hard to understand for the uninitiated. We propose ROC as a method to
cope with these problems. ROC builds on well-known approaches for ontology
construction. However, we reuse existing sources to generate a repository
of proposed associations. ROC assists in efficiently putting forward all
relevant concepts and relations by providing a large set of potential candidate
associations. Secondly, rather than using intermediate representations of
formal constructs we confront the domain expert with `natural-language-like'
statements generated from RDF-based triples. Moreover, we strictly separate
the roles of problem owner, domain expert and knowledge engineer, each having
his own responsibilities and skills. The domain expert and problem owner
keep focus by monitoring a well-defined application purpose. We have implemented
an initial set of tools to support ROC. This paper describes the ROC method
and two application cases in which we evaluate the overall approach. |
| 62 |
Towards a component-based framework for developing Semantic Web Applications |
- Raúl García Castro, Ontology Engineering Group
- Asun Gomez-Perez, Universidad Politecnica de Madrid,
- Óscar Muñoz-García*, Ontology Engineering Group
- Lyndon Nixon, AG Netzbasierte Informationssysteme
|
For people outside of the research community, starting Semantic Web application
development is difficult. While Semantic Web tools reach industrial maturity,
there is still a lack of usable approaches for the planning of Semantic
Web solutions. We propose the Semantic Web Framework, a component-based
framework for the quick analysis of the required components, the dependencies
between them, and selection of existing solutions. This approach has been
tested with a number of industrial partners, justifying further effort
in this direction. |
| 67 |
A Pattern Based Approach for Re-engineering Non-Ontological Resources
into Ontologies |
- Boris Villazón-Terrazas*, OEG - DIA - FI - UPM
- Asun Gomez-Perez, Universidad Politecnica de Madrid,
- ES Mari Carmen Suarez-Figueroa, OEG - DIA - FI - UPM
- Andres Garcia-Silva, OEG - DIA - FI - UPM
|
With the goal of speeding up the ontology development process, ontology
engineers are starting to reuse as much as possible other ontologies as
well as knowledge-aware resources such as classification schemes, thesauri,
lexicons and folksonomies built by others that already have some degree
of consensus. The reuse of such resources involves necessarily their re-engineering
into ontologies. Non-ontological resources are highly heterogeneous; they
encode different types of knowledge, and also they can be modeled and implemented
in different ways. In this paper we present a typology for non-ontological
resources, a pattern based approach for re-engineering non-ontological resources
into ontologies, and a use case of the proposed approach. |
| 68 |
Deriving concept mappings through instance mappings |
- Shenghui Wang*, Vrije Universiteit Amsterdam
- Balthasar Schopman, Vrije Universiteit Amsterdam
- Stefan Schlobach, Vrije Universiteit Amsterdam, NL
|
Ontology matching is a promising step towards the solution to the interoperability
problem of the Semantic Web. Instance-based methods have the advantage of
focusing on the most active parts of the ontologies and reflect concept
semantics as they are actually being used. Previous instance-based mapping
techniques were only applicable to cases where a substantial set of instances
shared by both ontologies. In this paper, we propose to use a lexical search
engine to map instances from different ontologies. By exchanging concept
classification information between these mapped instances, an artificial
set of common instances is built, on which existing instance-based methods
can apply. Our experiment results demonstrate the effectiveness and applicability
of this method to a broad mapping context. |
| 72 |
Temporal Ontology Language for Representing and Reasoning Interval-based
Temporal Knowledge |
- Sang-Kyun Kim*, KIOM
- Kyu-Chul Lee,
- Mi-Young Song,
|
Recently, W3C Web Ontology working group has developed OWL as an ontology
language for the Semantic Web. However, because OWL does not have the full-fledged
semantics for temporal information, it cannot perform reasoning about temporal
knowledge. Entities in the real world are changing according to the passage
of time and new facts are occurring due to events. Thus, if knowledge in
the KBs does not have the temporal information, it becomes incomplete and
incorrect. Therefore, in this thesis we propose an ontology language TL-OWL,
which extends OWL to have the temporal semantics in order to represent and
reason the temporal information in the Semantic Web. |
| 75 |
A Formal Semantics-Preserving Translation from Fuzzy Relational Database
Schema to Fuzzy OWL DL Ontology |
- Fu Zhang*, Northeastern University
- Z.M Ma, , Northeastern University
- Hailong Wang, , Northeastern University
- Xiangfu Meng, , Northeastern University
|
How to construct Web ontologies has become a key technology to enable
the Semantic Web, especially how to construct ontologies by extracting domain
knowledge from database models such as the relational database model. But
in real-world applications, information is often imprecise and uncertain,
thus the formal approach to translation from Fuzzy Relational Database Schema
(FRDS) to fuzzy ontology is helpful for extracting domain knowledge from
database, which can profitably support fuzzy ontology development and developing
data-intensive Semantic Web applications. In this paper, we first give the
formal definition of FRDS. Then, the formal definition and Model-Theoretic
semantics of a kind of new fuzzy OWL DL ontology are given in more detail.
What’s more, we realize the formal translation from FRDS to fuzzy
OWL DL ontology by means of reverse engineering technique. Of course, the
correctness of translation is also proved. With an example, it shows that
the translation method is semantics-preserving and effective. Finally, the
reasoning problem of satisfiability, subsumption, and redundancy of FRDS
may reason automatically through reasoning mechanism of the corresponding
fuzzy description logic f-SHOIN(D) of fuzzy OWL DL ontology is also investigated,
which can further contribute to constructing fuzzy OWL DL ontologies exactly
that meet application’s needs well. |
| 76 |
STAN: Social, Trusted Annotation Network |
- Hyun Namgoong*, Seoul National University
- Kyoung-Mo Yang, SunMoon University
- Sung-Kwon Yang, Seoul National University
- Charles Borchert, Seoul National University
- Hong-Gee Kim, Seoul National University
|
Annotated data play an important role in enhancing the usability of information
resources. Single users can be easily frustrated by the task of annotating.
Collaborative approaches to annotation have been applied to web resources,
but have not yet been applied to the task of local documents, due in part
to the lack of a uniform identification method. In this paper, we use hash-based
virtual URIs for identifying documents, and introduce the concept of a STAN
(Social, Trusted Annotation Network), which enables collaborative annotation
of documents through their URIs. STAN also incorporates quantitative trust
rates between users in social networks based on their interactions with
each other. The STAN framework is described, demonstrating how these trust
networks are constructed through collaborative annotation. Finally, we evaluate
the usefulness of collaborative annotation and the feasibility of the resulting
trust rates through empirical experiment. |
| 77 |
Consolidating User-defined Concepts with StYLiD |
- Aman Shakya*, NII, Japan
- Hideaki Takeda, National Institute of Informatics, JP
- Vilas Wuwongse, Asian Institute of Technology
|
Information sharing can be effective with structured data. However, there
are several challenges for having structured data on the web. Creating structured
concept definitions is difficult and multiple conceptualizations may exist
due to different user requirements and preferences. We propose consolidating
multiple concept definitions into a unified virtual concept and formalize
our approach. We have implemented a system called StYLiD to realize this.
StYLiD is a social software for sharing a wide variety of structured data.
Users can freely define their own structured concepts. The system consolidates
multiple definitions for the same concept by different users. Attributes
of the multiple concept versions are aligned semi-automatically to provide
a unified view. It provides a flexible interface for easy concept definition
and data contribution. Popular concepts gradually emerge from the cloud
of concepts while concepts evolve incrementally. StYLiD supports linked
data by interlinking data instances including external resources like Wikipedia. |
| 78 |
Refining instance coreferencing results using belief propagation |
- Andriy Nikolov*, Open University
- Victoria Uren, KMi The Open University
- Enrico Motta, KMi, the Open University
- Anne De Roeck, Open University
|
The problem of coreference resolution (finding individuals, which describe
the same entity but have different URIs) is crucial when dealing with semantic
data coming from different sources. Specific features of Semantic Web data
(ontological constraints, data sparseness, varying quality of sources) are
all significant for coreference resolution and must be exploited. In this
paper we present a framework, which uses Dempster-Shafer belief propagation
to capture these features and refine coreference resolution results produced
by simpler string similarity techniques. |
| 80 |
Exposing heterogeneous data sources as SPARQL endpoints through an object-oriented
abstraction |
- Walter Corno, CEFRIEL
- Francesco Corcoglioniti, CEFRIEL
- Irene Celino*, Cefriel
- Emanuele Della Valle, CEFRIEL
|
The Web of Data vision raises the problem of how to expose existing data
sources on the Web without requiring heavy manual work. In this paper, we
present our approach to facilitate SPARQL queries over heterogeneous data
sources. We propose the use of an object-oriented abstraction which can
be automatically mapped and translated into an ontological one; this approach,
on the one hand, helps data managers to disclose their sources without the
need of a deep understanding of Semantic Web technologies and standards
and, on the other hand, takes advantage of object-relational mapping (ORM)
technologies and tools to deal with different types of data sources (relational
DBs, but also XML sources, object-oriented DBs, LDAP, etc.). We introduce
both the theoretical foundations of our solution, with the analysis of the
relation and mapping between SPARQL algebra and monoid comprehension calculus
(the formalism behind object queries), and the implementation we are using
to prove the feasibility and the benefits of our approach and to compare
it with alternative methods. |
| 87 |
A Segmentation-based Approach for Approximate Query over Distributed Ontologies |
- Yimin Wang*, Lilly Singapore
- Peter Haase, University of Karlsruhe
- Guilin Qi, University of Karlsruhe
|
With the popularity of semantic information systems distributed on the
Web, there is an arising challenge to provide efficient query answering
support for these systems. However, common approaches for distributed query
answering either exhibit performance disadvantages or loss of completeness
in an unbalanced way. In this paper, we introduce a novel approach for segment-based
conjunctive query answering over distributed ontologies. Our approach balances
the trade-off between performance and completeness by introducing segmentation-based
distributed ontology integration. We define the notions of segment and approximate
conjunctive query answering. Corresponding algorithms are designed, implemented
and evaluated. The evaluation results show that our approach is very promising
in processing ontologies in modern semantic information systems. |
| 89 |
Identifying key concepts in an ontology, through the integration of cognitive
principles with statistical and topological measures |
- Silvio Peroni, KMi, the Open University
- Enrico Motta, KMi, the Open University
- Mathieu D'Aquin*, KMi, the Open University
|
In this paper we address the issue of identifying the concepts in an ontology
which best summarize what the ontology is about. Our approach combines a
number of criteria, drawn from cognitive science, network topology and lexical
statistics. In the paper we show two versions of our algorithm, which have
been evaluated against the results produced by human experts. We report
that the second version of the algorithm performed well, exhibiting a good
correlation with the choices of the experts. While the generation of automatic
methods for ontology summarization is an interesting research issue in itself,
the work described here also provides a basis for novel approaches to a
variety of ontology engineering tasks, including ontology matching, automatic
classification, ontology modularization, and ontology evaluation. |
| 91 |
Deep Semantic Mapping between Functional Taxonomies for Interoperable
Semantic Search |
- Yoshinobu Kitamura*, Osaka University
- Sho Segawa, Osaka University
- Munehiko Sasajima, Osaka University
- Riichiro Mizoguchi, Osaka University, JP
|
This paper discusses ontology mapping between two taxonomies of functions
of artifacts and its use in semantic search for the engineering knowledge
management. The mapping is of two ways and has been manually established
with deep semantic analysis based on a reference ontology of function for
bridging the ontological gaps between the taxonomies. We report on the successful
results thanks to such deep analysis not at the lexical level but at the
ontological level. Using the mapping knowledge, we developed a semantic
search system which can provide engineers with interoperable access to technical
documents by searching for functional metadata based on either of functional
taxonomies. Such function-oriented engineering knowledge management is very
useful in engineering design, for example, by finding previous design cases
for the same required function, by finding related patents, or by getting
ideas how to solve a problem in a current design. The developed document
search system based on semantic annotation representing functionality can
solve the difficulty of the conventional document management systems based
on the lexical keyword search by searching for technical documents in terms
of generic types of function defined in the functional taxonomies as metadata
schemata independently of the lexical words in the documents. |
| 92 |
A Robust Ontology-Based Method for Translating Natural Language Queries
to Conceptual Graphs |
- TRU CAO*, Ho Chi Minh City Uni. Of Tech.
|
A natural language interface is always desirable for a search system.
While performance of machine translation for general texts with acceptable
computational costs seems to reach a limit, narrowing down the domain to
one of queries reduces the complexity and enables better translation correctness.
This paper proposes a query translation method that is robust to ill-formed
questions and exploits knowledge of an ontology for semantic search. It
uses conceptual graphs as the target language for the translation. As a
logical interlingua with smooth mapping to and from natural language, conceptual
graphs simplify translation rules and can be easily converted to other formal
query languages. Experiment results of the method on the TREC 2002 and TREC
2007 data sets are also presented and discussed. |
| 104 |
Bounded Ontological Consistency for Scalable Dynamic Knowledge Infrastructures |
- Maciej Zurawski*, CISA, School of informatics
- Alan Smaill, CISA, School of Informatics, University of Edinburgh
- Dave Robertson, CISA, School of Informatics, University of Edinburgh
|
Both semantic web applications and individuals are in need of knowledge
infrastructures that can be used in dynamic and distributed environments
where different autonomous entities create knowledge and build their own
view of a domain. Our framework represents this using evolving simple contextual
ontologies and mappings between them, at the same time as incremental logical
coherence is maintained. The definition of semantic autonomy includes these
aspects. Our earlier research has shown that a knowledge infrastructure
can have semantic autonomy that maintains global consistency, if the knowledge
representation is kept simple. We generalize that research by investigating
what happens if the consistency of a knowledge infrastructure is bounded
1) within certain regions called spheres of consistency, and 2) by allowing
a limited variable degree of inconsistency. Our experiments show that a
phase transition can occur in this kind of system, beyond which constant-time
and constant-memory complexity is approached. |
| 107 |
Understanding Semantic Web Applications |
- Kouji Kozaki*, Osaka University Yusuke
- Hayashi, I.S.I.R.,Osaka University Munehiko
- Sasajima, Osaka University Riichiro
- Mizoguchi, Osaka University, JP
|
Ten years have passed since the concept of the semantic web was proposed
by Tim Berners-Lee. For these years, basic technologies for them such as
RDF(S) and OWL were published. As a result, many systems using semantic
technologies have been developed. Some of them are not prototype systems
for researches but real systems for practical use. The authors analyzed
semantic web applications published in the semantic web conferences (ISWC,
ESWC, ASWC) and classified them based on ontological engineering. This article
discusses a trend and the future view of them using the results. |
| 109 |
A Formal Model for Classifying Trusted Semantic Web Services |
- Stefania Galizia*, Open University, UK
- Alessio Gugliotta, Open University, UK
|
Semantic Web Services (SWS) aim to alleviate Web service limitations,
by combining Web service technologies with the potentiality of Semantic
Web. Several open issues have to be tackled yet, in order to enable a safe
and efficient Web services selection. One of them is represented by trust.
In this paper, we introduce a trust definition and formalize a model for
managing trust in SWS. The model embeds the trusted Web service selection
in a classification problem, and it is realized by an ontology, which extends
WSMO. A prototype is deployed, in order to give a proof of concept of our
approach. |
| 113 |
An editorial work๏ฌow approach for collaborative ontology
development |
- Raul Palma*, UPM
- Peter Haase, University of Karlsruhe
- Oscar Corcho, UPM
- Asun Gomez-Perez, Universidad Politecnica de Madrid, ES
|
The widespread use of ontologies in the last years has raised new challenges
for their development and maintenance. Ontology development has transformed
from a process normally performed by one ontology engineer into a process
performed collaboratively by a team of ontology engineers, who may be geographically
distributed and play different roles. For example, editors may propose changes,
while authoritative users approve or reject them following a well de?ned
process. This process, however, has only been partially addressed by existing
ontology development methods, methodologies, and tool support. Furthermore,
in a distributed environment where ontology editors may be working on local
copies of the same ontology, strategies should be in place to ensure that
changes in one copy are re?ected in all of them. In this paper, we propose
a work?ow-based model for the collaborative development of ontologies in
distributed environments and describe the components required to support
them. We illustrate our model with a test case in the ?shery domain from
the United Nations Food and Agriculture Organisation (FAO). |
| 114 |
Versatile Semantic Modeling of Frame Logic Programs under Answer Set Semantics |
- Mario Alviano, Universita Della Calabria
- Giovambattista Ianni*, University of Calabria
- Marco Marano, Universita Della Calabria
- Alessandra Martello, Universita Della Calabria
|
This work introduces the framework of Frame Answer Set programs (FAS).
FAS programs are a frame logic-like language working under answer set semantics
augmented with higher order constructs. The syntax of the language includes
the possibility to manipulate nested molecules, class hierarchies, basic
method signatures and contexts (called framespaces). Semantics is defined
in terms of a corresponding stable model semantics, paving the way to model
object ontologies and their semantics under this well known paradigm. The
language is purposely designed so that inheritance behavior and other features
of the language can be easily customized by the introduction of specialized
axiomatic modules, which can be modeled on purpose by advanced developers
of ontology languages. Also, contexts allow to model hybrid systems integrating
multiple data sources working under different entailment regimes. Properties
and relationship with original F-logic semantics of some of the presented
axiomatizations are given. A system prototype has been implemented and is
available for evaluation. |
| 115 |
Semantic Assistants -- User-Centric Natural Language Processing Services
for Desktop Clients |
- René Witte*, Concordia University
- Thomas Gitzinger, University of Karlsruhe
|
Today's knowledge workers have to spend a large amount of time and manual
effort on creating, analyzing, and modifying textual content. While more
advanced semantically-oriented analysis techniques have been developed in
recent years, they have not yet found their way into commonly used desktop
clients, be they generic (e.g., word processors, email clients) or domain-specific
(e.g., software IDEs, biological tools). Instead of forcing the user to
leave his current context and use an external application, we propose a
``Semantic Assistants'' approach, where semantic analysis services relevant
for the user's current task are offered directly within a desktop application.
Our approach relies on an OWL ontology model for context and service information
and integrates external natural language processing (NLP) pipelines through
W3C Web services. |
| 120 |
Named Entity Disambiguation: A Hybrid Statistical and Rule-based Incremental
Approach |
- Hien Nguyen*, Ton Duc Thang University
- TRU CAO, Ho Chi Minh City Uni. Of Tech.
|
The rapidly increasing use of large-scale data on the Web makes named
entity disambiguation become one of the main challenges to research in Information
Extraction and development of Semantic Web. This paper presents a novel
method for detecting proper names in a text and linking them to the right
entities in Wikipedia. The method is hybrid, containing two phases of which
the first one utilizes some heuristics and patterns to narrow down the candidates,
and the second one employs the vector space model to rank the ambiguous
cases to choose the right candidate. The novelty is that the disambiguation
process is incremental and includes several rounds that filter the candidates,
by exploiting previously identified entities and extending the text by those
entity attributes every time they are successfully resolved in a round.
We test the performance of the proposed method in disambiguation of names
of people, locations and organizations in texts of the news domain. The
experiment results show that our approach achieves high accuracy and can
be used to construct a robust named entity disambiguation system. |
| 122 |
Exploiting Gene Ontology to Conceptualize Biomedical Document Collections |
- Hai-Tao Zheng*, Seoul National University
- Charles Borchert, Seoul National University
- Hong-Gee Kim, Seoul National University
|
As biomedical science progresses, ontologies play an increasingly important
role in easing the understanding of biomedical information. Although much
research, such as Gene Ontology annotation, has been proposed to utilize
ontologies to help users understand biomedical information easily, most
of the research does not focus on capturing gene-related terms and their
relationships within biomedical document collections. Understanding key
gene-related terms as well as their semantic relationships is essential
for comprehending the conceptual structure of biomedical document collections
and avoiding information overload for users. To address this issue, we propose
a novel approach called `GOClonto' to automatically generate ontologies
for conceptualization of biomedical document collections. Based on GO (Gene
Ontology), GOClonto extracts gene-related terms from biomedical text, applies
latent semantic analysis to identify key gene-related terms, allocates documents
based on the key gene-related terms, and utilizes GO to automatically generate
a corpus-related gene ontology. The experimental results show that GOClonto
is able to identify key gene-related terms. For a test biomedical document
collection, GOClonto shows better performance than the other clustering
algorithms in terms of F-measure. Moreover, the ontology generated by GOClonto
shows a significant informative conceptual structure. |
| 123 |
SAOR: Authoritative Reasoning for the Web |
- Aidan Hogan*, DERI Galway
- Axel Polleres, DERI Galway
- Andreas Harth, DERI Galway
|
In this paper we discuss the challenges of performing reasoning on large
scale RDF datasets from the Web. We discuss issues and practical solutions
relating to reasoning over web data using a rule-based approach to forward-chaining;
in particular, we identify the problem of ontology hijacking: new ontologies
published on the Web re-defining the semantics of existing concepts resident
in other ontologies. Our solution introduces consideration of authoritative
sources. Our system is designed to scale, comprising file-scans and selected
lightweight on-disk indices. We evaluate our methods on a dataset in the
order of a hundred million statements collected from real-world Web sources. |
| 125 |
Scalable Distributed Ontology Reasoning Using DHT-based Partitioning |
- Qiming Fang*, Tsinghua University
- Ying Zhao,
- Guangwen Yang,
- Weimin Zheng,
|
Ontology reasoning is an indispensable step to fully exploit the implicit
semantics of Semantic Web data. The inherent distribution characteristic
of Semantic Web and huge amount of ontology instance data necessitates efficient
and scalable distributed ontology reasoning. Current researches on distributed
ontology reasoning mainly focus on dealing with the heterogeneity of different
ontologies but pay little attention to the performance of distributed reasoning
and have not presented practical approaches and systems. Our goal is to
propose an efficient and scalable distributed ontology reasoning approach,
making it practical in real semantic applications. We propose an approach
in this paper, in which Description Logic reasoners for TBox reasoning are
com-bined with rule engines for ABox reasoning to support both expressive
ontolo-gies and large amount of instance data. The published data from each
node is distributed using a DHT-based partitioning and stored in well-designed
rela-tional databases to support convenient and efficient reasoning through
coopera-tion of the distributed nodes. A practical distributed ontology
reasoning and querying system called DORS is developed based on our proposed
approach. Our experiments both in LANs and on PlanetLab using University
Ontology Benchmark show high efficiency of DORS compared with the centralized
OWL ontology reasoning system Minerva as well as good scalability with respect
to the number of nodes and volume of data in the system. |
| 126 |
Extracting Semantic Frames from Thai Medical-Symptom Phrases with Unknown
Boundaries |
- Peerasak Intarapaiboon,
- Ekawit Nantajeewarawat*, Sirindhorn International Institute of Technology,
Thailand
- Thanaruk Theeramunkong,
|
Due to the limitations of language-processing tools for the Thai language,
pattern-based information extraction from Thai documents requires supplementary
techniques. Based on sliding-window rule application and extraction filtering
components, we present a framework for extracting semantic information from
medical-symptom phrases with unknown boundaries in Thai free-text information
entries. A supervised rule learning algorithm is employed for automatic
creation of information extraction rules from manually prepared training
data. Classification models are constructed for prediction of rule application
across a symptom-phrase boundary. Overlapping extracted frames are removed
based on extraction distances observed during rule learning. In our experimental
study, we focus our attention on two basic types of symptom phrasal descriptions:
one is concerned with abnormal characteristics of some observable entities
and the other with human-body locations at which symptoms appear. The experimental
results show that the filtering components improve precision satisfactorily
while preserving high recall, overcoming the classical trade-off between
them. |
| 127 |
A Tableau Algorithm for Possibilistic Description Logic ALC |
- Guilin Qi*, University of Karlsruhe
- Jeff Pan, University of Aberdeen, UK
|
Uncertainty reasoning and inconsistency handling are two important problems
that often occur in the applications of the Semantic Web. Possibilistic
description logics provide a flexible framework for representing and reasoning
with ontologies where uncertain and/or inconsistent information is available.
Although possibilistic logic has become a popular logical framework for
uncertainty reasoning and inconsistency handling, its role in the Semantic
Web is underestimated. One of the challenging problems is to provide a practical
algorithm for reasoning in possibilistic description logics. In this paper,
we propose a tableau algorithm for possibilistic description logic ALC.
We show how inference services in possibilistic ALC can be reduced to the
problem of computing the inconsistency degree of the knowledge base. We
then give tableau expansion rules for computing the inconsistency degree
of a possibilistic ALC knowledge. We show that our algorithm is sound and
complete. The computational complexity of our algorithm is analyzed. Since
our tableau algorithm is an extension of a tableau algorithm for ALC, we
can reuse many optimization techniques for tableau algorithms of ALC to
improve the performance of our algorithm so that it can be applied in practice. |