The Semantic Web
Period 3: November 2003 - January 2004

The World Wide Web has the potential to become the primary source for storing and accessing data. However, its content is marked up in such a way that it is accessible only to humans.

Current Web search engines have serious difficulties in processing search queries. Even though they return impressive results, their level of precision and recall clearly shows their limitations.

An interesting alternative is the creation of a Semantic Web in which meaning is made explicit, allowing machines to process and integrate Web resources intelligently. This technology might allow for quick and accurate Web search and facilitate communication among heterogeneous Web-accessible devices.

The aim of the course is to examine this area of research by reading and discussing papers. We will also extend the techniques developed within the Semantic Web to deal with the integration of heterogeneous linguistic data encoded in various language resources such as corpora and databases.

Searching the world wide web


Semantic web

Suggested books

  • G. Antoniou and F. van Harmelen (2004). A Semantic Web Primer. The MIT Press.
  • T. Passin (2004). Explorer's Guide to the Semantic Web. Manning Publications.


  1. T. Berners-Lee et al. (2001). The Semantic Web. Scientific American.
  2. The semantic web, taking form.
  3. The semantic web: an introduction.
  4. Creation of the semantic web: semantic markup.
  5. The semantic web: a primer.
  6. J. Hendler (2001). Agents and the Semantic Web. IEEE Intelligent Systems, Vol. 16, No. 2, March/April 2001, pp. 30-37.




  1. B.C. Vickery (1997). Ontologies. Journal of Information Science, 23 (4), pp. 277-286.


Ontology software

Linguistic Ontology


  1. Farrar, S., D. T. Langendoen (to appear 2003). A Linguistic Ontology for the Semantic Web. GLOT International.
  2. Farrar, Scott (2003). New Ways of Thinking about Lexical Resources: A Proposal for the Semantic Web. Presented at the ISO Preparation Workshop on Lexicons, Nicoletta Calzolari, Peter Wittenburg (Organizers), Feb. 27, 2003, Munich, Germany.
  3. Langendoen, D. T., S. Farrar, W. D. Lewis (2002). Bridging the Markup Gap: Smart Search Engines for Language Researchers. International Workshop on Resources and Tools in Field Linguistics, prior to the 3rd annual Language Resources and Evaluation Conference, May 26-27, Las Palmas, Canary Islands, Spain.
  4. Lewis, William, Farrar, Scott, and Langendoen, Terry (2001). Building a Knowledge Base of Morphosyntactic Terminology. In S. Bird, P. Buneman, and M. Liberman (Eds.), Proceedings of the IRCS Workshop on Linguistic Databases, 11-13 December 2001, pp. 150-156.
  5. A. Dimitriadis, P. Monachesi (2002). Integrating Different Data Types in a Typological Database System. In: Proceedings of the Conference on Language Resources and Evaluation (LREC 2002). ELRA, Paris.


Web Ontology Languages



Agents, ontology communication and wrap up


  1. J. Hendler (1999). Is There an Intelligent Agent in Your Future? Nature.
  2. H. Stuckenschmidt and I. Timm (2002). Adapting Communication Vocabularies using Shared Ontologies. Proceedings of the Second International Workshop on Ontologies in Agent Systems, pp. 6-12.
  3. H. Wache, T. Voegele, U. Visser, H. Stuckenschmidt, G. Schuster, H. Neumann and S. Huebner (2001). Ontology-Based Integration of Information - A Survey of Existing Approaches. Proceedings of the IJCAI-01 Workshop on Ontologies and Information Sharing, Seattle, USA, August 4-5, 2001, pp. 108-118.
  4. Michel Klein (2001). Combining and Relating Ontologies: An Analysis of Problems and Solutions. Proceedings of the IJCAI-01 Workshop on Ontologies and Information Sharing, Seattle, USA, August 4-5, 2001, pp. 53-62.
  5. J. Heflin and J. Hendler (2001). A Portrait of the Semantic Web in Action. IEEE Intelligent Systems, Vol. 16, No. 2, March/April 2001, pp. 60-71.
  6. Noy et al. (2001). Creating Semantic Web Contents with Protege-2000. IEEE Intelligent Systems, Vol. 16, No. 2, March/April 2001, pp. 60-71.

Semantic Web and eLearning


  1. L. Stojanovic et al. (2001) eLearning based on the Semantic Web ??.


Automatic development of ontologies


Slides of the talks

Day:                                   Location:             Time:
Thursday 17/2                          KNG80, Room 108       11.00 - 13.00
Thursday 24/2                          KNG80, Room 108       12.00 - 14.00
Thursday 3/3                           KNG80, Room 108       11.00 - 13.00
Thursday 3/3                           Uithof, Room 195, BG  14.00 - 16.00
Thursday 10/3, 7/4, 28/4               KNG80, Room 108       11.00 - 13.00
Thursday 17/3, 24/3, 31/3, 14/4, 21/4  KNG80, Room 108       10.00 - 12.00


Week 1: Introduction
Week 2: Readings about search engines and the Semantic Web. Write an overview and prepare discussion points via the forum. Discussion 24/2
Week 3: Readings about ontologies. Write an overview and prepare discussion points via the forum. Design your own ontology on paper. Discussion 3/3
Week 4: Install Protégé and do the tutorial. Implement your own ontology in Protégé. Write a short report on the methodology used. Read papers 1 and 5 of the Linguistic Ontology list. Discussion 10/3
Week 5: Readings about linguistic ontologies. Write a summary. Write a proposal for the project, including a division of work and a planning. Discussion 17/3
Week 6: Readings about Semantic Web languages. Write an overview and prepare discussion points via the forum. Discussion 24/3
Week 7: Readings about automatic development of ontologies, ontology integration and ontology communication. Write an overview and prepare discussion points. Discussion 31/3
Week 8: The Semantic Web and its applications. Presentations of thesis projects by Jayme, Carel and Vincent. Read the article about the Semantic Web and eLearning. Write a summary. Discussion 11/4
Week 9: Feasibility study on the use of GATE and OntoLT for the automatic development of ontologies. Presentation by Job and Trude about their thesis. Discussion 14/4
Week 10: Opportunity for students to discuss problems with the final project. Discussion 21/4
Week 11: Presentation of the final projects and wrap-up on 28/4

Final Project: building an ontology for linguistic concepts (Week 1-11)

Language resources such as corpora, databases, electronic dictionaries and grammars are being developed and made available to the linguistic community via the World Wide Web. These resources have often been built independently, which results in heterogeneous systems and severely limits their appropriate and efficient use. Heterogeneity is found at two levels: the technical level (different hardware platforms, operating systems, etc.) and the conceptual level (different data representations and data models for similar objects, as well as semantic differences and ambiguities).

An interesting way to overcome the heterogeneity at the conceptual level makes extensive use of an ontology of linguistic notions. We will develop (fragments of) such an ontology and we will explore whether available language tools can help in the automatic development of ontologies.
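
To make the idea concrete, the following is a minimal sketch of how a shared ontology of linguistic notions could bridge two resources that use different labels for the same concept. Everything in it is an illustrative assumption: the resource names (corpus_a, database_b), the labels, the concept names and the helper functions are hypothetical and are not taken from any of the resources or papers listed above.

```python
# Hypothetical fragment of a shared ontology: concept -> parent concept.
ONTOLOGY = {
    "Noun": "PartOfSpeech",
    "Verb": "PartOfSpeech",
    "PastTense": "Tense",
}

# Hypothetical resource-specific labels, each mapped to a shared concept.
MAPPINGS = {
    "corpus_a": {"NN": "Noun", "VB": "Verb", "PST": "PastTense"},
    "database_b": {"n": "Noun", "v": "Verb", "past": "PastTense"},
}


def to_concept(resource, label):
    """Return the shared concept for a resource-specific label, or None."""
    return MAPPINGS.get(resource, {}).get(label)


def same_concept(resource_1, label_1, resource_2, label_2):
    """True if both labels are mapped to the same shared concept."""
    concept_1 = to_concept(resource_1, label_1)
    concept_2 = to_concept(resource_2, label_2)
    return concept_1 is not None and concept_1 == concept_2


def is_a(concept, ancestor):
    """True if concept equals ancestor or lies below it in the ontology."""
    while concept is not None:
        if concept == ancestor:
            return True
        concept = ONTOLOGY.get(concept)
    return False


def labels_for(concept):
    """Return all (resource, label) pairs mapped to the given concept."""
    return sorted(
        (resource, label)
        for resource, table in MAPPINGS.items()
        for label, mapped in table.items()
        if mapped == concept
    )
```

With such a mapping, a query for "Verb" can be answered across both resources through labels_for("Verb"), even though one resource tags verbs as VB and the other as v. The conceptual work lies in agreeing on the shared concepts and on the mappings, which is the subject of the project.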


  • Project = 40%
  • Summaries + discussion points = 40%
  • Class participation = 20%

To pass the course you need a grade of at least 5.5.
You can find the results of the course here.
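
As an illustration of the weighting above, here is a small sketch of how a final grade could be computed. It assumes that each component is graded on a 1-10 scale, that the 5.5 threshold applies to the weighted total, and that no rounding is applied; none of these details is stated on this page, and the names used are hypothetical.

```python
# Weights in percent, as listed above.
WEIGHTS = {"project": 40, "summaries": 40, "participation": 20}

# Assumption: the pass mark applies to the weighted total.
PASS_MARK = 5.5


def final_grade(scores):
    """Weighted average of the component scores (assumed 1-10 scale)."""
    return sum(WEIGHTS[part] * scores[part] for part in WEIGHTS) / 100


def passed(scores):
    """True if the weighted total reaches the pass mark."""
    return final_grade(scores) >= PASS_MARK
```

For example, scores of 7 for the project, 6 for the summaries and discussion points, and 8 for class participation give 0.4 × 7 + 0.4 × 6 + 0.2 × 8 = 6.8 under these assumptions.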