C# ASP.NET SQL SERVER
  • Wed
    04
    Feb 09

    Training a text classifier

    When writing a text classification system you need to train it. Typically you have a corpus of good data that has been accurately pre-classified and this is what you throw at the system while it is learning the classification. I came up with what I thought was a good analogy for an untrained text classifier...

    Continue reading "Training a text classifier"

  • Tue
    03
    Feb 09

    Text Classification References

    I'm currently working on a text classification system. This is requiring a fair amount of research and background reading so I'm going to create a list of references that I'm using: A Plan for Spam by Paul Graham Better Bayesian Filtering by Paul Graham Bayesian Filtering: Beyond Binary Classification...

    Continue reading "Text Classification References"

  • Fri
    25
    Jul 08

    Are there enough credit card numbers?

    I was wondering if the 16 digits that most credit cards have (VISA and Mastercard et al) if you don't include American Express' 13 digit cards were enough for the world or if we'd run out of digits at some point. If I've done my calculations correctly and based on my assumption that there are 7 billion...

    Continue reading "Are there enough credit card numbers?"

  • Fri
    25
    Jul 08

    Casting Out Nines

    I'm working on a project at the moment that involves the verification of credit card numbers. This has led to running checksums on the numbers to validate them including Luhn's algorithm. Simply stated, Luhn's algorithm doubles every second digit (starting from the right) and then adds together all of...

    Continue reading "Casting Out Nines"