Calculating Variable Importance with Random Forest

In random forest, you can calculate important variables with IMPORTANCE= TRUE parameter.

R Code : Variable Importance
library(caret)
rfTune <- train(dev[, -1], dev[,1], method = "rf", ntree = 100, importance = TRUE)

MeanDecreaseAccuracy table represents how much removing each variable reduces the accuracy of the model.

Selecting top 10 variables
ImportanceOrder <- order(rfTune$finalModel$importance[,1],decreasing = TRUE)
top10 <- rownames(rfTune$finalModel$importance[ImportanceOrder,])[1:10]
subsetimp <- subset(training, select = top10)

R Tutorials : 75 Free R Tutorials

About Author:

Deepanshu founded ListenData with a simple objective - Make analytics easy to understand and follow. He has over 7 years of experience in data science and predictive modeling. During his tenure, he has worked with global clients in various domains like banking, Telecom, HR and Health Insurance.

While I love having friends who agree, I only learn from those who don't.

Let's Get Connected: Email | LinkedIn

Get Free Email Updates :
*Please confirm your email address by clicking on the link sent to your Email*
Related Posts:
0 Response to "Calculating Variable Importance with Random Forest"

Post a Comment

Next → ← Prev