BIS 2009

 

Tutorial on SMILA


tutorial at
12th International Conference on Business Information Systems (BIS 2009)

Poznan, Poland




The amount and diversity of information is growing exponentially, mainly in the area of unstructured data, like emails, text files, blogs, images etc. Poor data accessibility, user rights integration and the lack of semantic meta data are constraining factors for building next generation enterprise search and other document centric applications. Missing standards result in proprietary solutions with huge short and long term cost. Overcoming these problems is a key issue for gaining agility in an organization.

SMILA is an extensible framework for processing unstructured information in the enterprise. Besides providing essential infrastructure components and services, SMILA also delivers ready-to-use add-on components, like connectors to most relevant data sources. Using the framework as their basis will enable developers to concentrate on the creation of higher value solutions, like semantic search applications, information extraction and the like.

SMILA is an open source project under the umbrella of the eclipse foundation. It is also a part of the German research programme THESEUS. Further information can be found at [here] and [here].

This one day SMILA Tutorial will introduce the concepts and approach behind the framework, how to use it to build an application and how to integrate new components into it. Topics that will be addressed are:


  • How to handle unstructured data: The principles behind SMILA

  • SMILA explained: The architecture

  • Installation, crawler and service configuration, and building a search application with SMILA and Lucene

  • Using Web Services as SMILA components (Open Calais or Freebase as exercise)

  • Creating a native SMILA component using e.g. an image classifier

  • Exercise: Using GATE-ANNIE as a SMILA component for Named Entity Recognition



Participants should have a basic understanding of JAVA and programming. For the practical exercises, a laptop running Windows or Linux is required.

The tutorial is presented by Ralph Traphöner and Igor Novakovic, empolis GmbH, Germany

Presenter(s)



  • Ralph Traphöner is Deputy Director Research & Consulting at empolis GmbH. After studying computer science and business administration at the University of Kaiserslautern, focusing on practical applications, in particular in AI and Case-Based Reasoning, Ralph Traphöner co-founded the company TECINNO, today a part of empolis, in 1991. On behalf of empolis he managed INRECA I and II (Induction and Reasoning from Cases), was co-ordinating project manager of WEBSELL (Intelligent Sales Assistants for the World-Wide-Web), participation and project management in SMARTSELL (Intelligent Sales Assistant for Electro-Mechanical Parts), ENRICH (Enriching Representations of Work to Support Organisational Learning), SEKT (Semantic Enabled Knowledge Technologies), INKASS (Intelligent Knowledge Asset Sharing and Trading) and FM-ULTRANET (On the job training platform and network for foetal malformation ultra-sonography). He also participated and managed other research projects on behalf of empolis funded by national institutions. In particular INVITE (Intuitive Human-Machine Interaction) has to be named, which is a German "Lead Project" similar to an IP.


  • Ralph Traphöner also acts as advisor, evaluator and reviewer for the Fifth and Sixth Framework Programme for Research of the EU and has contributed as an expert to the EP2010 study. Expertise: Case-Based Reasoning, Intelligent Information Retrieval, Electronic Commerce, Knowledge Management and Artificial Intelligence. He has given many tutorials in AI and has presented courses on knowledge management for IIR on a regular basis.

  • Igor Novakovic is Deputy Director Development at empolis GmbH. After joining empolis in 2000, he was at first responsible for the development of some server-side components written in C + + and Java. Later on, beside designing and developing J2EE applications, he successfully introduced company-wide the application lifecycle management based on open source tools. From 2006 he led the development of the solution "empolis Service Lifecycle Suite". Since late 2007 he is the co-lead of the SeMantic Information Logistics Architecture (SMILA) project.


Organizers



12th International Conference on Business Information Systems (BIS 2009), Poznan, Poland 27-29 April 2009
Department of Information Systems, Poznan University of Economics, al. Niepodleglosci 10, 60-967 Poznan, Poland
phone: +48618543632 , fax: +48618543633 , Web: http://bis.kie.ae.poznan.pl/