Calculating Variable Importance with Random Forest

In random forest, you can calculate important variables with IMPORTANCE= TRUE parameter.

R Code : Variable Importance
library(caret)
rfTune <- train(dev[, -1], dev[,1], method = "rf", ntree = 100, importance = TRUE)

MeanDecreaseAccuracy table represents how much removing each variable reduces the accuracy of the model.

Selecting top 10 variables
ImportanceOrder <- order(rfTune$finalModel$importance[,1],decreasing = TRUE)
top10 <- rownames(rfTune$finalModel$importance[ImportanceOrder,])[1:10]
subsetimp <- subset(training, select = top10)

Best Online Course : Practical Data Science using R

- Explain Advanced Algorithms in Simple English
- Live Projects & Case Studies
- Domain Knowledge
- Job Placement Assistance
- Money Back Guarantee


R Tutorials : 75 Free R Tutorials

About Author:

Deepanshu founded ListenData with a simple objective - Make analytics easy to understand and follow. He has close to 7 years of experience in data science and predictive modeling. During his tenure, he has worked with global clients in various domains like retail and commercial banking, Telecom, HR and Automotive.


While I love having friends who agree, I only learn from those who don't.

Let's Get Connected: Email | LinkedIn

Get Free Email Updates :
*Please confirm your email address by clicking on the link sent to your Email*

Related Posts:

0 Response to "Calculating Variable Importance with Random Forest"

Post a Comment

Next → ← Prev