TY - GEN
T1 - Improving source selection in large scale mediation systems through combinatorial optimization techniques
AU - Pomares, Alexandra
AU - Roncancio, Claudia
AU - Cung, Van Dat
AU - Villamil, María Del Pilar
PY - 2011
Y1 - 2011
N2 - This paper concerns querying in large-scale virtual organizations. Such organizations are characterized by a challenging data context involving a large number of distributed data sources with strong heterogeneity and uncontrolled data overlap. In that context, data source selection during query evaluation is particularly important and complex. To cope with this task, we propose OptiSource, an original strategy for source selection that combines combinatorial optimization techniques with organizational knowledge of the virtual organization. Numerical experiments show that OptiSource is a robust strategy that improves the precision and the recall of the source selection process. This paper presents the data and knowledge models, the definition of OptiSource, the related mathematical model, the prototype, and an extensive experimental study.
KW - Combinatorial Optimization
KW - Large Scale Data Mediation
KW - Source Selection
UR - http://www.scopus.com/inward/record.url?scp=80051700294&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-23074-5_6
DO - 10.1007/978-3-642-23074-5_6
M3 - Conference contribution
AN - SCOPUS:80051700294
SN - 9783642230738
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 138
EP - 166
BT - Transactions on Large-Scale Data- and Knowledge-Centered Systems III - Special Issue on Data and Knowledge Management in Grid and P2P Systems
T2 - 3rd International Conference on Data Management in Grid and Peer-to-Peer Systems, Globe 2010
Y2 - 1 September 2010 through 2 September 2010
ER -