Minimizing the costs of the training data for learning Web wrappers