Solved – Confused about the basic maths in Incremental/Uplift modelling

descriptive statisticsgroup-differencesmarketingmathematical-statisticstreatment-effect

I have recently gotten to study Incremental modelling, which is used in Marketing to study the incremental impact of a certain action on a treatment group against a control group.

However, I am very confused about the very basics of this form of modelling: specifically about the formulae used in its calculations. Consider the following example, please:

|CampaignName | Group Type | TotalPeople | Respondents | AverageSpend |
|-------------|------------|-------------|-------------|--------------|
|ToysRus      | Control    | 80          | 10          | 30           |
|ToysRus      | Treatment  | 800         | 130         | 39           |

Supposing that we were to find out:

The response rate for the treatment group
The incremental response rate
The incremental spend per customer
Total incremental spend due to targetting

I understand that for 1., we simply use the formula:

No. of respondents in the treatment group/Total no. of people in treatment group

And this will be the rate of response in treatment group – the percentage of people from the treatment group that responded to our 'treatment'. Likewise we can find the rate of response for the control group.

For 2., I am confused. I understand that the formula is:

Response rate from treatment group - response rate from the control group

But I personally used the following formula in my practice:

No. of respondents in the treatment group - no. of respondents in control group / Total no. of people in the treatment group

I know that the answer I get due to this is certainly wrong, but I am unable to convince myself as to why the formula is wrong. It seems intuitive to me on first glance. But I think there is basic Math here at play, which very unfortunately I am unable to see presently.
I think perhaps it has to do with qualitative and quantitative differences between the groups? That is, one is a control and the other a treatment group, and both are of different sizes, so it makes less sense to subtract them straight away? Like comparing apples and banana?
Even if we do get to apply my formula, I don't see how its interpretation would be possible. Yet, I am unable to convince myself!

For 3., I think the following formula would apply:

Average Spend per customer in the treatment group - average spend per customer in the control group.

For 4., I am unsure, but I think we would use:

The incremental response * the total no. of people in the control group

But how to understand this? Are we extrapolating to the control group with our incremental spend, to project how much the 'untreated' group or the control group would spend if we direct our offers/advertisement/etc. at them?

Lastly, I would be grateful if people answering this question could also suggest further reading for me. Perhaps you feel I need to study the basics of proportion, or percentages, in order to get myself comfortable with such basic calculation in Incremental modelling? Or perhaps you feel that there are certain books in Marketing or Incremental modelling that would help me?

Thanks a lot!

Best Answer

This formula doesn't take into account impact control respondents have on the control total population make doesn't account for the population in the control group fully. Think about having a test population with great response but population less than 1000 vs control with significantly slightly higher response with 1 million population.

Check this link out that has some simple functions broken out to do these sorts of things.

http://excelmasterseries.com/D-_Loads/New_Manuals/Confidence_Intervals_in_Excel.pdf

This would be correct but you still want to use right formulas to account for population sizes of test and control.

Multiple ways to look at this one. A total sales number can be used if the population sizes are considered to be similar for test and control. Otherwise same process as above. Your formula actually would be used in more of a hypothetical, "what-if" analysis.

If anyone wants to learn more about this there are some stat MOOC courses out there that would be great intros.

Best Answer

Related Solutions

Exponential Distribution – Calculating Average Days Between Customer Orders

Related Question