Statistic r code | Statistics homework help


You are requested to create an India credit risk(default) model, using the data provided in the spreadsheet raw-data.xlsx, and validate it on validation_data.xlsx. Please use the logistic regression framework to develop the credit default model.

Hints :

Data description – Default Risk Prediction. After removing variables for multicollinearity, we should try to take at least one variable for creating the model from each of the 4 factors namely – 1) Profitability2) Leverage3) Liquidity4) Company’s sizeclearly bifurcate all the variables in different buckets.Creation of new variables – This is an important step in the project as the company which is the biggest in size, will also have bigger asset size, cash flows etc. (Hint: We need to think in terms of ratios – Equity to asset ratio, debt to equity ratio etc)Dependent variable – We need to create a default variable which should take the value of 1 when net worth is negative & 0 when net worth is positive.Validation Dataset –  We need to build the model on raw dataset and check the model performance measures on validation dataset.

Please find attached the files to be referred. 





Outlier Treatment


Missing Value Treatment


New Variables Creation (One ration for profitability, leverage, liquidity and company’s size each )


Check for multicollinearity


Univariate & bivariate analysis




Build Logistic Regression Model on most important variables


Analyze coefficient & their signs


Model Performance Measures


Predict accuracy of model on dev and validation datasets


Sort the data in descending order based on probability of default and then divide into 10 dociles based on probability & check how well the model has performed


Please note the following:

  • You have to submit 2 files : 
    1. Business Report not exceeding 3000 words. In this you need to submit all the answers to all the questions in a sequential manner. Your answer should include detailed explanations & inferences to all the questions. Your report should not be filled with codes. You will be evaluated based on the business report.
    2. R code file : This is a must and will be used for reference while evaluating.
  • Any assignment found copied/ plagiarized with other submission(s) will not be graded and marked as zero.
  • Please ensure timely submission as post deadline assignment will not be accepted.
Calculate your essay price
(550 words)

Approximate price: $22

How it Works


It only takes a couple of minutes to fill in your details, select the type of paper you need (essay, term paper, etc.), give us all necessary information regarding your assignment.


Once we receive your request, one of our customer support representatives will contact you within 24 hours with more specific information about how much it'll cost for this particular project.


After receiving payment confirmation via PayPal or credit card – we begin working on your detailed outline, which is based on the requirements given by yourself upon ordering.


Once approved, your order is complete and will be emailed directly to the email address provided before payment was made!