R Function : Gain and Lift Table

This tutorial demonstrates how to calculate gain and lift chart with R. Gain and Lift charts are used to measure the performance of a predictive classification model. They measure how much better results one can expect with the predictive classification model comparing without a model. The lift metrics is a very popular metrics to evaluate the customer and marketing analytics model. The 'lift' term comes from marketing analytics domain.

R Function : Gain and Lift Chart

lift <- function(depvar, predcol, groups=10) {
if(!require(dplyr)){
install.packages("dplyr")
library(dplyr)}
if(is.factor(depvar)) depvar <- as.integer(as.character(depvar))
if(is.factor(predcol)) predcol <- as.integer(as.character(predcol))
helper = data.frame(cbind(depvar, predcol))
helper[,"bucket"] = ntile(-helper[,"predcol"], groups)
gaintable = helper %>% group_by(bucket) %>%
summarise_at(vars(depvar), funs(total = n(),
totalresp=sum(., na.rm = TRUE))) %>%
mutate(Cumresp = cumsum(totalresp),
Gain=Cumresp/sum(totalresp)*100,
Cumlift=Gain/(bucket*(100/groups)))
return(gaintable)
}

Run Function

dt = lift(churn$target , churn$prediction, groups = 10)

In the first parameter of the lift function, you need to define a column wherein binary target variable (dependent variable) is stored. In this case, churn$target : churn is the data frame and target is the dependent variable column.

In the second parameter, you need to specify the predicted probability variable.

How to plot Cumulative Lift

Run the following code to create lift chart.

graphics::plot(dt$bucket, dt$Cumlift, type="l", ylab="Cumulative lift", xlab="Bucket")

Lift Chart

Interpretation of Cumulative Lift

The Cumulative Lift of 3.4 for top two deciles, means that when selecting 20% of the records based on the model, one can expect 3.4 times the total number of targets (events) found by randomly selecting 20%-of-records without a model. In terms of customer attrition (churn) model, we can say we can cover 3.4 times the number of attritors by selecting only 20% of the customers based on the model as compared to 20% customer selection randomly.

Output

The output of gain and lift table is shown in the image below -

Gain and Lift Table

Interpretation of Gain

% of targets (events) covered at a given decile level. For example, 67.9% of events covered in top 20% of data based on model.

Export gain and lift table

You can export gain table in CSV format by submitting the following code -

# Export gain table
write.csv(dt, "D:\\lift.csv", row.names = FALSE)

Related Tutorial - Understand Gain and Lift Chart

About Author:
Deepanshu Bhalla

Deepanshu founded ListenData with a simple objective - Make analytics easy to understand and follow. He has over 10 years of experience in data science. During his tenure, he worked with global clients in various domains like Banking, Insurance, Private Equity, Telecom and HR.

While I love having friends who agree, I only learn from those who don't
Let's Get Connected Email LinkedIn

Post Comment 11 Responses to "R Function : Gain and Lift Table"

UnknownNovember 18, 2015 at 6:45 PM
excelente
SanjoyDecember 5, 2016 at 8:07 AM
Thanks for the code. However, after running it, the gain table that I am getting, its not dividing the data into 10 equal groups, each group is having different observation number, ideally which should not be the case. Need help on this.
UnknownOctober 26, 2017 at 4:52 PM
I keep getting this error:
Error in `[.data.frame`(helper, , "predcol") : undefined columns selected
Calls: lift -> ntile -> [ -> [.data.frame

I'm not able to understand why.

PS I love this post!
UnknownAugust 4, 2018 at 1:42 PM
Wt is depvar and predcol pls explain
AnonymousAugust 12, 2018 at 3:29 AM
Hi, I am keep getting this error when trying to create a table with lift function: "error in lift.default: 'x' should be a formula". Would you be able to help me with it?
Prasanth SreeramaNovember 9, 2018 at 2:39 AM
How to get which customers are in top 3 deciles from gains plot? I mean how to retrieve those observations from top deciles?
swaroopJuly 11, 2019 at 6:45 AM
found it very useful. thanks a lot for posting.
Nethra SambamoorthiSeptember 24, 2019 at 1:54 PM
Thanks.
AbhinavBOctober 7, 2019 at 2:23 AM
Thanks for this lovely function Deepanshu ,you made my job very easy.
Are you full time a listen data person now ? We can collaborate on few projects on datascience if you are interested