When writing a text classification system you need to train it. Typically you have a corpus of good data that has been accurately pre-classified and this is what you throw at the system while it is learning the classification. I came up with what I thought was a good analogy for an untrained text classifier...
Continue reading "Training a text classifier"