You talk like a delinquent
(Posted by cwalsh)
This is interesting. Not sure how robust the finding is, but according to an analysis of LendingClub data on all past loans, including descriptions of the use for the money, applicants using certain words in their descriptions are much more likely to default.
For our purposes define a Delinquency as either being late in your payments or having defaulted completely. The 10 words with the greatest p-values are below. [...]"Words and Credit Scores", Social Science Statistics Blog
Word Loans With P(Delinquency|No word) P(Delinquency|Word) p-value also 215 0.067 0.140 0.0004need 608 0.062 0.105 0.0015business 233 0.069 0.116 0.0038live 91 0.070 0.154 0.0057already 64 0.071 0.156 0.0059other 285 0.068 0.112 0.0081bills 223 0.067 0.135 0.0082bill 279 0.066 0.125 0.0117interest 660 0.081 0.053 0.0136
Not something I've studied, but I wonder if a neural network could successfully classify these loans?











Comments
You should be able to use an off-the-shelf Baysian spam classifier for this. That said, there are certain spam words with much higher P-values than any of these!
Posted by: Nicko | November 3, 2008 1:16 PM
"I need the loan to refinance my mortgage and buy Viagra"
:^)
Posted by: chris | November 3, 2008 1:53 PM
You have obviously been using my patent p-value enlargement products :-)
Posted by: Nicko | November 5, 2008 5:54 AM