Skip to main content
placeholder image

Dynamic query scheduling for online integration of semistructured data

Conference Paper


Abstract


  • In data integration systems a user request issued at a central site is decomposed into a number of sub-requests, which later on are processed at the remote sites. The results are sent back to a central site for data integration and the results of integration are returned to a user. Data integration systems often failed to show its best performance due to unpredictable data arrival rate. Traditionally, data integration requires the complete results from the remote sites to be available at a central site before final computations begin. An online integration system starts and continues the computations at the central site shortly after every piece of data is received from the remote sites. Execution of online integration plan in static scheduling strategy causes poor performance of data integration system as unnecessary computations are executed in some circumstances. This paper proposes a dynamic scheduling for online integration plans, which employs data increment monitoring system which is able to dynamically change the data integration plans whenever it is necessary.

Publication Date


  • 2015

Citation


  • Handoko, & Getta, J. R. (2015). Dynamic query scheduling for online integration of semistructured data. In S. Ahamed, C. Chang, W. Chu, I. Crnkovic, P. Hsiung, G. Huang & J. Yang (Eds.), 39th Annual Computer Software and Applications Conference Workshops (COMPSACW) (pp. 375-380). Piscataway, New Jersey: IEEE.

Scopus Eid


  • 2-s2.0-84962090238

Ro Metadata Url


  • http://ro.uow.edu.au/eispapers/5282

Has Global Citation Frequency


Start Page


  • 375

End Page


  • 380

Place Of Publication


  • Piscataway, New Jersey

Abstract


  • In data integration systems a user request issued at a central site is decomposed into a number of sub-requests, which later on are processed at the remote sites. The results are sent back to a central site for data integration and the results of integration are returned to a user. Data integration systems often failed to show its best performance due to unpredictable data arrival rate. Traditionally, data integration requires the complete results from the remote sites to be available at a central site before final computations begin. An online integration system starts and continues the computations at the central site shortly after every piece of data is received from the remote sites. Execution of online integration plan in static scheduling strategy causes poor performance of data integration system as unnecessary computations are executed in some circumstances. This paper proposes a dynamic scheduling for online integration plans, which employs data increment monitoring system which is able to dynamically change the data integration plans whenever it is necessary.

Publication Date


  • 2015

Citation


  • Handoko, & Getta, J. R. (2015). Dynamic query scheduling for online integration of semistructured data. In S. Ahamed, C. Chang, W. Chu, I. Crnkovic, P. Hsiung, G. Huang & J. Yang (Eds.), 39th Annual Computer Software and Applications Conference Workshops (COMPSACW) (pp. 375-380). Piscataway, New Jersey: IEEE.

Scopus Eid


  • 2-s2.0-84962090238

Ro Metadata Url


  • http://ro.uow.edu.au/eispapers/5282

Has Global Citation Frequency


Start Page


  • 375

End Page


  • 380

Place Of Publication


  • Piscataway, New Jersey