Data Integration Using Machine Learning
Paper i proceeding, 2016

Today, enterprise integration and cross-enterprise collaboration is becoming evermore important. The Internet of things, digitization and globalization are pushing continuous growth in the integration market. However, setting up integration systems today is still largely a manual endeavor. Most probably, future integration will need to leverage more automation in order to keep up with demand. This paper presents a first version of a system that uses tools from artificial intelligence and machine learning to ease the integration of information systems, aiming to automate parts of it. Three models are presented and evaluated for precision and recall using data from real, past, integration projects. The results show that it is possible to obtain Fo.5 scores in the order of 80% for models trained on a particular kind of data, and in the order of 60%-70% for less specific models trained on a several kinds of data. Such models would be valuable enablers for integration brokers to keep up with demand, and obtain a competitive advantage. Future work includes fusing the results from the different models, and enabling continuous learning from an operational production system.

Data integration

Machine Learning

Enterprise interoperability

Författare

Marcus Birgersson

Chalmers, Data- och informationsteknik

Gustav Hansson

Chalmers, Data- och informationsteknik

U. Franke

Swedish Institute of Computer Science

Proceedings - IEEE International Enterprise Distributed Object Computing Workshop, EDOCW

1541-7719 (ISSN)

Vol. 2016-September 313-322

DOI

10.1109/EDOCW.2016.7584357

ISBN

978-1-4673-9933-3