Publications

  • Stealing Anchors to Link the Wiki
    Philipp Dopichaj, Andre Skusa, Andreas Heß
    Advances in Focused Retrieval, 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2008, Dagstuhl Castle, Germany, December 15-18, 2008. Revised and Selected Papers
    © Springer Verlag, Lecture Notes in Computer Science

  • Der Markt für Internet-Suchmaschinen
    Christian Maaß, Andre Skusa, Andreas Heß, Gotthard Pietsch
    In: Handbuch Internet-Suchmaschinen, Dirk Lewandowski (Hrsg.), AKA Verlag Heidelberg
    Catalog entry: German National Library

  • Multi-Value Classification of Very Short Texts
    Andreas Heß, Philipp Dopichaj, Christian Maaß
    31st Annual German Conference on Artificial Intelligence (KI 2008), Kaiserslautern, Germany
    © Springer Verlag, Lecture Notes in Computer Science

    Abstract:
    We introduce a new stacking-like approach for multi-value classification. We apply this classification scheme using Naive Bayes and Rocchio classifiers on the well-known Reuters dataset. We use part-of-speech tagging for stopword removal. Our setup performs as well as other approaches using full article text. Finally, we apply a Rocchio classifier on a Web 2.0 dataset suitable for semi-automated labelling of short texts.

  • Playful Validation of Automatically Extracted Data
    Francis Dierick, Philipp Dopichaj, Uwe Fleischer, Andreas Heß, Andre Skusa, Christian Maaß
    Workshop Nutzerinteraktion im Social Semantic Web bei der Tagung Mensch & Computer, Lübeck, Germany

  • From Web 2.0 to Semantic Web: A Semi-Automated Approach
    Andreas Heß, Christian Maaß, Francis Dierick
    ESWC 2008 Workshop on Collective Semantics: Collective Intelligence and the Semantic Web (CISWeb 2008), Tenerife, Spain
    Full Paper:
    Presentation slides

    Abstract:
    Web 2.0 and the Semantic Web are complementary paradigms. We propose five approaches to merge them, improve annotation quality via (semi-)automated tagging, and enhance tag quality using duplicate detection techniques. Verified on a large-scale dataset from Lycos iQ.