Use a set of base learners:
Name learner, Naοve Bayes, Whirl, XML learner
And a set of recognizers:
County name, zip code, phone numbers.
Each base learner produces a prediction weighted by confidence score.
Combine base learners with a meta-learner, using stacking.