Matching Objects Across Sources
•How do I know that A. Halevy in source 1 is the same as Alon Halevy in source 2?
•If there are uniform keys across sources, no problem.
•If not:
–Domain specific solutions (e.g., maybe look at the address, ssn).
–Use Information retrieval techniques (Cohen, 98). Judge similarity as you would between documents.
–Use concordance tables. These are time-consuming to build, but you can then sell them for lots of money.