Giving Vehicle Analysis
There was an interesting question on PRSPCT-L a few weeks ago:
Do you know of or have experience with demographic modeling and data appends that help predict a donor’s preferred giving method (i.e., event, direct mail, online, etc.)?
I thought I would take a minute to look into this. I actually couldn’t initially think of which variables to include aside from age. My starting point was looking at those that made gift by check through the mail, online, over the phone or in person during this fiscal year. I was evaluating the data at the household level so gifts from married households only count once.
I decided to not use prior giving at all though I think that is likely the best predictor. That is, those that gave online last year will likely do so again this year. I also think other giving stats like length of giving, average amount and total amount would also be helpful in making predictions.
However, my interpretation of the original question was that this should just be an analysis of demographic data and so that is the path that I took.
I eventually choose:
- age
- constituent type: company, individual, etc.
- spouse constituent type: for those married (with an ‘N’ for those not married)
- a variable called type2 which is maybe not the greatest variable ever
- type2 = 1 means married
- type2 = 2 means not married and constituent is male
- type2 = 3 means not married and constituent is female
- type2 = 4 means company
- type2 = 5 means foundation
- type2 = 6 means missing information or other
- phone: do they have a phone number in our system or not
- email: do they have an email in our system or not
- mail: do they have a good mailing address in our system or not
- assigned: is this constituent assigned to a gift officer
- I added a 1 or 0 if they had any of the following no contact flags:
- no contact at all
- no contact by phone
- no contact by email
- no contact by mail
- no personal contact
- no solicitations
- I also added a code for geographic distance from the university:
- 1 = in our tri-county area
- 2 = in our state
- 3 = everyone else
- Lastly, I added a count of the number of solicitations they received by type
My approach to problems like this is to get a decent sized set of data points together based on intuition like the ones described and then create a really quick model. I will skip the modelling code here for now but it is in my fundraising analytics folder along with the data set that I used.
This next part is just the report out from the model. I cannot seem to hide it so just scroll through this to get to the plots.
Once the model is done, I like to look at the importance of the features included:
We can see that the various solicitation counts and age seem to have the biggest impact so we can start there.
I should note that at present this model is not performing too well but it is just a quick one for now. Here is the logloss score which is a measure of error. The closer to 0 the better.
We are a long way off.
Here is a quick break down for each group. What the model does is assign a probability for each id as to which giving method they used so each row should sum to 1. I labeled the first column as intercept which may not be quite right. It might be more like the residual or the probability that it belongs to no class. In any case, that probability is always very small and can be ignored.
We can see that the model struggles to identify those that give in person.
It does fairly well with those that give by sending a check through the mail.
It also is decent at guessing donors who give over the phone.
However, it does not do as well predicting who gives online.
With all of that out of the way, to start the conversation and add a few ideas on what factors contribute to which method of giving, here are a few graphs.
First here is the break down of the number of donors in each category. It’s not even but it is not overly dominated by any one category either which is good.
Next we can look at age which I think is the most revealing. I was very surprised that so many young people chose to give over the phone. The popular narrative that I hear is that young people do not answer their phones but this appears to tell a different story. I also feel that every age bracket has a method so this seems useful.
To make each category clear:
- Lockbox is check in the mail
- Online Giving is online giving
- Staff is giving in person to a staff member
- TOP Gift is giving over the phone through our Telephone Outreach Program
Looking at that variable on whether a constituent is married, not married and male or not married and female, it’s a little interesting that men seem to give more online and women give more over the phone but it is not a very significant difference. The fact that married couples give more through the mail I believe is using the age data since older people are more likely to be married and we already saw that older people prefer to give in this way.
Again, these codes are:
- 1 means married
- 2 means not married and constituent is male
- 3 means not married and constituent is female
- 4 means company
- 5 means foundation
- 6 means missing information or other
A quick note on these plots. The y-axis is density so it is basically answering the question: of those that fall into a particular category what is the distribution in giving method choice; what is the percentage that choose to give via a particular method by category type.
The chart on ask count was also quite insightful. It appears that people give based on how you ask them. If you call someone numerous times they will give that way and if you email numerous times then online will be their choice and so on.
This actually reminds me of a small nonprofit in Detroit. I don’t remember the name but they won an award at a Blackbaud conference in 2012 and their trick for increasing gifts was to just ask more often which I think may fly in the face of some of the conventional thought that seems to suggest that this type of behavior would be detrimental. In any case, this is obviously just the trends in our data and it doesn’t mean it would work everywhere but it seems like a possible factor to consider.
Those with two calls gave over the phone more often than those that just received one.
Mail is interesting as it almost seems to be a reminder to give online at first but after four mailings we finally wear down the holdouts and they relent and send us money perhaps just to stop the mailings.
Email also shows this trend where the more we ask via email for a gift, the more we see constituents choosing to give online.
The next plot looks at the constituent type which is a little busy and for me there are not many insights to glean.
I will provide a legend for a few of the types:
- C is company
- F is foundation
- rs is retired staff
- S is staff
- I is any individual who is not staff
Most companies give directly to a staff member which makes sense. Retired staff send in checks which again references the trends seen in the age data earlier. Current staff give online since that is probably the easiest way for them to give.
The last plot is on geographic distance which does not offer very much additional insight.
This is just a post and some code to get the conversation started. It will be interesting to see if anyone has anything additional to add. For me these are the takeaways at this stage in researching this question:
- Age seems to matter and I think experimenting with targeting the way we ask by age may be beneficial
- Ask frequency seems to help. These days it is so easy to ignore anything that we do not want to deal with immediately that these types of reminders may actually be helpful and keep your organization top of mind as well as increase your chance of connecting with someone when they have the time to give right at the moment.