MetaExtractor: A system for metadata extraction from structured data sources

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

The extraction of metadata used during the planning phase in mediation systems assumes the existence of a metadata repository that, in most cases, must be created with high human involvement. This dependency raises the complexity of maintaining the system and therefore undermines the reliability of the metadata itself. This article presents MetaExtractor, a system that extracts structure, quality, capability and content metadata from the structured data sources available in a mediation system. MetaExtractor is designed as a Multi-Agent System (MAS) in which each agent specializes in extracting a particular type of metadata. Cooperation among the agents allows the metadata repository to be created and maintained. MetaExtractor helps reduce the number of data sources selected during query planning in large-scale mediation systems, because it can prioritize the data sources that contribute most to answering a query. This paper presents the general architecture of MetaExtractor and emphasizes the extraction logic for content metadata and the strategy used to prioritize data sources according to a given query.

Original language: English
Title of host publication: Availability, Reliability, and Security in Information Systems and HCI - IFIP WG 8.4, 8.9, TC 5 International Cross-Domain Conference, CD-ARES 2013, Proceedings
Pages: 84-99
Number of pages: 16
State: Published - 2013
Event: IFIP WG 8.4, 8.9, TC 5 International Cross-Domain Conference on Availability, Reliability, and Security in Information Systems and HCI, CD-ARES 2013 - Regensburg, Germany
Duration: 02 Sep 2013 to 06 Sep 2013

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 8127 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: IFIP WG 8.4, 8.9, TC 5 International Cross-Domain Conference on Availability, Reliability, and Security in Information Systems and HCI, CD-ARES 2013
Country/Territory: Germany
City: Regensburg
Period: 02/09/13 to 06/09/13

Keywords

  • Large Scale Data Mediation
  • Mediation Systems
  • Metadata Extraction
  • Multi-Agent System
  • Source Selection
