•How do I know that A. Halevy in source 1 is the same as Alon Halevy in source 2?
•If there are uniform keys across sources, no problem.
•If not:
–Domain-specific solutions (e.g., compare the address or SSN).
–Use information retrieval techniques (Cohen, 98). Judge similarity between records as you would between documents.
–Use concordance tables. These are time-consuming to build, but you can then sell them for lots of money.
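
The information retrieval option can be sketched by treating each record as a tiny document and comparing records with TF-IDF cosine similarity. This is a generic illustration, not necessarily the method of (Cohen, 98). The function names (`tokenize`, `tfidf_vectors`, `cosine`, `match_records`) are hypothetical, and the sample records use invented addresses.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a record and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(records):
    """Return one unit-length TF-IDF vector (as a dict) per record."""
    token_lists = [tokenize(r) for r in records]
    n = len(token_lists)
    # Document frequency: in how many records does each token appear?
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        # Log-scaled term frequency times a smoothed inverse document
        # frequency, so rare tokens (surnames) outweigh common ones.
        vec = {
            t: (1 + math.log(c)) * math.log(1 + n / doc_freq[t])
            for t, c in counts.items()
        }
        norm = math.sqrt(sum(w * w for w in vec.values()))
        vectors.append({t: w / norm for t, w in vec.items()} if norm else {})
    return vectors


def cosine(u, v):
    """Cosine similarity of two unit-length sparse vectors."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())


def match_records(source1, source2):
    """For each record in source1, return (i, j, score) for its best match in source2."""
    vectors = tfidf_vectors(source1 + source2)
    v1, v2 = vectors[:len(source1)], vectors[len(source1):]
    matches = []
    for i, u in enumerate(v1):
        j = max(range(len(v2)), key=lambda k: cosine(u, v2[k]))
        matches.append((i, j, cosine(u, v2[j])))
    return matches


# Invented sample records, for illustration only.
source1 = ["A. Halevy, 12 Lake St, Seattle", "W. Cohen, 7 Hill Rd, Pittsburgh"]
source2 = ["William Cohen, 7 Hill Road, Pittsburgh PA",
           "Alon Halevy, 12 Lake Street, Seattle WA"]

matches = match_records(source1, source2)
for i, j, score in matches:
    print(f"{source1[i]!r} ~ {source2[j]!r} (score {score:.2f})")
```

In practice a pair would probably be accepted only when its score clears a threshold tuned on known matches, since the best candidate is not always a true match.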