First, need to split the dataset into training set 60%, validation set 20% and test set 20%.
Then start with 1 most important predictor (predictor 5) model logistic regression, use, validation set to measure error, also do 10 k-fold cross validation to measure error , and use test set to get model accuracy.
I know utilize all predictor in this dataset should get most accuracy and least error. I just need someone to do 1 model, and then I can just use same method to do the rest models.
Be aware the response variable are binary 0 and 1, it is a classification problem.
The dataset is attached
you need to first use
mydata <- [url removed, login to view](mydata)
mydata <- mydata[[url removed, login to view](mydata), ]
to make this data work
12 freelancers are bidding on average $160 for this job
I have done my PHD in Statistics and now I am a lecturer.I can surely help [url removed, login to view] can check my reviews if you [url removed, login to view] let me know the [url removed, login to view] work with me once and you will surely hire me [url removed, login to view] we have a More
Hi, I have done similar assignments in the past, please connect with me i am keen to do this project for you. i am online we can chat and take this forward. regards, Puneet