Lecture 17. Using Model Management for Data Integration

 

Readings:

No paper review required for this lecture.

 

Speaker:

Prof. Phil Bernstein from Microsoft Research and the University of Washington

Philip A. Bernstein is a Principal Researcher at Microsoft and Affiliate Professor at University of Washington. For the past fifteen years his research focus has been metadata management. He is an ACM Fellow, a winner of the ACM SIGMOD Innovations Award, and a member of the U.S. National Academy of Engineering.

 

Abstract:

About 40% of the work in enterprise IT departments is centered on data-integration applications, such as data warehouses, application integration, and B2B e-commerce. In this presentation, I'll argue that the core technical problems of data integration are the implementation of a small number of operators that manipulate schemas (a.k.a. models) and mappings between them. These operators include Match, Merge, Diff, ModelGen, TransGen, and Compose. I'll start by explaining the scope of data integration problems and then delve into the details of as many operators as we have time for.

 

Lecture notes:

lecture17.pdf