Complete bibliography
The publications are, for each period, ordered by the name of first author and then by the descending year of publication.
"Post-Rainbow" publications (since 2006)
Brand-new additions are marked with in the list.
- AIME07
-
Stamatakis K., Metsis V., Karkaletsis V., Ruzicka M., Svatek V., Amigo Cabrera E., Polla E., Spyropoulos C.:
Content Collection for the Labelling of Health-related Web Content.
In: 11th Conference on Artificial Intelligence in Medicine (AIME 07), 7-11 July 2007, Amsterdam, The Netherlands.
Draft paper (RTF) (final version will be available via SpringerLink).
- AIME07
-
Stamatakis K., Metsis V., Karkaletsis V., Ruzicka M., Svatek V., Amigo Cabrera E., Polla E., Spyropoulos C.:
Content Collection for the Labelling of Health-related Web Content.
In: 11th Conference on Artificial Intelligence in Medicine (AIME 07), 7-11 July 2007, Amsterdam, The Netherlands.
Draft paper (RTF) (final version will be available via SpringerLink).
Late Rainbow publications (2004-2005)
- An04a
- Andrt M., Kratky M., Svatek V., Snasel V.: AmphoraWS – webova sluzba pro vyhledavani ve strukturovanych dokumentech.
[AmphoraWS – Web service for querying semi-structured data.]
In: Datakon'04, Brno 2004.
Full paper.
- Kr05a
-
Kratky M., Andrt M., Svatek V.:
XML Query Support for Web Information Extraction: A Study on HTML Element Depth Distribution.
In: First International Workshop on Representation and Analysis of Web Space (RAWS-05).
Full paper.
- La05a
-
Labsky M., Svatek V., Praks P., Svab O.:
Information extraction from HTML product catalogues: coupling quantitative and knowledge-based approaches.
In: Dagstuhl Seminar on Machine Learning for the Semantic Web, 2005.
Full paper.
- La05b
-
Labsky M., Praks P., Svatek V., Svab O.:
Multimedia information extraction from HTML product catalogues.
In: Workshop on Databases, Texts, Specifications and Objects (DATESO'05), Ostrava 2005.
Full paper.
- La05c
-
Labsky M., Vacura M., Praks P.:
Web Image Classification for Information Extraction.
In: First International Workshop on Representation and Analysis of Web Space (RAWS-05).
Full paper.
- La05d
-
Labsky M., Svatek V., Svab O., Praks P., Kratky M., Snasel V.:
Information Extraction from HTML Product Catalogues: from Source Code and Images to RDF.
In: 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), IEEE Computer Science, 2005.
Full paper.
- La04
- Labsky M.:
Product information extraction from semistructured documents using HMMs.
In: Poster papers of Znalosti 2004, Brno, February 2004.
Full paper.
- La04b
- Labsky M., Svatek V.:
Information Extraction from Web Product Catalogues.
Working paper.
Full paper.
- La04c
- Labsky M., Svatek V., Svab O.: Types and Roles of Ontologies in Web Information Extraction.
In: ECML/PKDD04 Workshop on Knowledge Discovery and Ontologies, Pisa.
Full paper.
- La04d
-
Labsky M.: Extrakce informaci ze semi-strukturovanych textu pomoci statistickych metod.
[Statistical Information Extraction from Semi-structured Texts.] In: Acta Oeconomica Pragensia, 5/2004.
- Ne05a
-
Nemrava J., Svatek V.:
Text mining tool for ontology engineering based on use of product taxonomy and web directory.
In: Workshop on Databases, Texts, Specifications and Objects (DATESO'05), Ostrava 2005.
Full paper.
- Ne05b
-
Nemrava J.:
Product taxonomy and web directory as support for ontology engineers.
In: ICML Workshop on Learning and Extending Lexical Ontologies by using Machine Learning Methods, Bonn 2005.
Full paper.
- Sv05b
-
Svab O., Svatek V.:
Proceduralni propojeni nastroju pro extrakci informaci z webovych sidel.
[Procedural combination of tools for information extraction from web sites.]
In: Poster Papers of Znalosti 2005, High Tatras 2005.
Full paper.
- Sv04d
-
Svab O., Labsky M., Svatek V.:
RDF-Based Retrieval of Information Extracted from Web Product Catalogues.
In: Semantic Web workshop at ACM SIGIR 2004, Sheffield, 2004.
Full paper.
- Sv04b
-
Svab O., Svatek V., Kavalec M., Labsky M.:
Querying the RDF: Small Case Study in the Bicycle Sale Domain.
In: Workshop on Databases, Texts, Specifications and Objects (DATESO'04),
also at http://www.ceur-ws.org/Vol-98.
Full paper.
- Sv06a
-
Svatek V.:
The Rainbow Project: Multiway Analysis of Website Content and Structure.
In: Znalosti 2006, Hradec Kralove, February 2006.
Full paper.
- Sv05a
-
Svatek V., ten Teije A., Vacura M.:
Web Service Composition for Deductive Web Mining: A Knowledge Modelling Approach.
In: Znalosti 2005, High Tatras 2005.
Full paper.
- Sv05c
-
Svatek V., Vacura M.:
Automatic Composition of Web Analysis Tools: Simulation on Classification Templates.
In: First International Workshop on Representation and Analysis of Web Space (RAWS-05).
Full paper.
- Sv05d
-
Svatek V.:
Automated Analysis of the WWW Based on Reusable Resources.
Habilitation Thesis, University of Economics, Prague, 2005.
Full text.
- Sv04e
-
Svatek V., Snasel V.:
Formal Model of Meta-Information Acquisition from Information Resources.
In: Workshop on Information Technology - Applications and Theory (ITAT2004), High Tatras 2004.
Full paper.
- Sv04a
-
Svatek V., Vavra V.: Semanticka integrace webovych sluzeb.
[Semantic integration of web services.] In: Systemova integrace'04,
Praha 2004.
Full paper.
- Sv04c
-
Svatek V., Labsky M., Vacura M.: Knowledge Modelling for Deductive Web Mining.
In: EKAW 2004, Whittlebury Hall, UK, Springer LNCS, to appear.
Draft paper
(final version available via SpringerLink).
- Vo04
- Volavka F., Svatek V.:
Identifikace navigační struktury webové prezentace na základě topologie odkazů.
[Identification of navigation structure of website based on link topology.]
In: Znalosti 2004, Brno 2004.
Full paper.
Older publications (2001-2003)
- Ka02
-
Kavalec M., Svatek V.:
Information Extraction and Ontology Learning Guided by Web Directory.
In: ECAI Workshop on NLP and ML for Ontology engineering (OLT-02). Lyon, 2002.
Full paper.
- Ka01
- Kavalec M., Svatek V., Strossa P.:
Web Directories as Training Data for Automated Metadata Extraction.
In: Semantic Web Mining, Workshop at ECML/PKDD-2001, Freiburg 2001.
Full paper.
- La03
- Labsky M., Svatek V.:
Ontology Merging in Context of Web Analysis.
In: Workshop on Databases, Texts, Specifications and Objects (DATESO'03), Ostrava 2003.
Full paper (ZIP).
- St01
- Strossa P., Svatek V., Kavalec M.: Towards Intelligent Indexing of Web Pages
Using Important Information Indicators. LISP-2001-1 Technical Report, 2001.
- Sv01a
- Svatek V.:
RAINBOW - navrh modularni architektury pro analyzu a zpristupnovani WWW.
[RAINBOW - proposal for modular architecture for WWW analysis and information access.]
In: Rauch J., Stepankova O. (eds.). Znalosti 2001. Praha
2001, 209-216.
- Sv01b
- Svatek V., Strossa P., Kavalec M.:
Analysis of text on WWW pages using important information indicators.
In: (M. Bielikova, ed.) DATAKON, Database Conference, Brno 2001, 359-362.
- Sv02a
-
Svatek V., Kosek J., Braza J., Kavalec M., Klemperer J., Berka P.:
Framework and Tools for Multiway Extraction of Web Metadata.
In: Information Systems Modelling, Roznov 2002. Full paper.
- Sv02b
-
Svatek V., Kavalec M., Klemperer J.:
Towards the Discovery of Implicit Metadata in Commercial Web Pages.
In: (Malyankar R., ed.) Collected Posters, ISWC - First International Semantic Web Conference.
Sardinia, Italy, June 2002, p.57. Poster summary.
- Sv02c
-
Svatek V., Kosek J., Vacura M.:
Ontology Engineering for Multiway Acquisition of Web Metadata.
LISP-2002-1 Technical Report, 2002. Full paper.
- Sv03a
-
Svatek V., Berka P., Kavalec M., Kosek J., Vavra V.:
Discovering company descriptions on the web by multiway analysis.
In: New Trends in Intelligent Information Processing and
Web Mining (IIPWM'03), Zakopane 2003. Springer-Verlag, 'Advances in Soft Computing' series, 2003.
Full paper.
- Sv03b
-
Svatek V., Vacura M.:
Problem-Solving Models of Website Analysis.
In: Poster Track of the Twelfth International World Wide Web Conference
(WWW2003), Budapest 2003. Extended abstract.
- Sv03c
-
Svatek V., Braza J., Sklenak V.:
Towards Triple-Based Information Extraction from Visually-Structured HTML Pages.
In: Poster Track of the Twelfth International World Wide Web Conference
(WWW2003), Budapest 2003. Extended abstract.
- Sv03d
-
Svatek V., Kosek J., Labsky M., Braza J., Kavalec M., Vacura M., Vavra V., Snasel V.:
Rainbow - Multiway Semantic Analysis of Websites.
In: 2nd International DEXA Workshop on Web Semantics (WebS03), Prague 2003,
IEEE Computer Society Press 2003.
Full paper.
- Va02
-
Vacura M.:
Multiway Approach to Content Recognition on Internet.
LISP-2002-2 Technical Report, 2002. Full paper.
- Vo03
- Volavka F., Sajal M., Svatek V.:
Topology-based discovery of navigation structure within websites.
In: Datakon'03, Brno 2003.
Full paper.
Very old publications (1999-2000, some related to pre-cursor projects)
- Be99a
- Berka P., Sochorova M., Svatek V., Sramek D.: The VSEved System for
Intelligent WWW Metasearch. In: (Rudas I. J., Madarasz L.,
eds.:) INES'99 - IEEE Intl. Conf. on Intelligent Engineering Systems, Stara
Lesna 1999, 317-321.
- Be99b
- Berka P., Sochorova M., Svatek V.: Metavyhledavani na WWW s naslednym
zpracovanim vysledku.
[WWW metasearch with post-processing] In: (Richta, K., ed.:) Datasem'99,
Brno 1999.
- Ko00
- Kosek J., Svatek V.:
XML a ontologie jako integracni nastroje pro analyzu a zpristupnovani WWW.
[XML and ontologies as integration tools for WWW analysis and information access.]
In: (Valenta J., ed.) Datasem'00, Brno 2000.
- Sr00
- Sramek D., Berka P., Kosek J., Svatek V.: Improving WWW Access - from
Single-Purpose Systems to Agent Architectures? In: Cerri
S. A., Dochev D. (ed.). Artificial Intelligence:
Methodology, Systems, and Application. Berlin : Springer Verlag,
2000, 167-178. Full paper.
- Sv00a
- Svatek V., Berka P.: URL as starting point for WWW
document categorisation. In: (Mariani J., Harman D.:) RIAO'2000 -
Content-Based Multimedia Information Access, CID, Paris, 2000, 1693-1702. Full paper.
- Sv00b
- Svatek, V., and Kavalec, M. Supporting Case Acquisition and Labelling in the
Context of Web Mining, in (Zighed D., Komorowski J., Zytkow J.:) Principles of Data Mining and Knowledge Discovery -
PKDD2000. Springer, 2000, pp. 626-631. Full paper.
|