multiple regression using sas

Download Multiple Regression using SAS

Post on 14-Apr-2017



Data & Analytics

1 download

Embed Size (px)


  • Predicting Academic Performance of an Elementary SchoolSubmitted ByAnkush RoyAshitha VSKoushik RakshitKrishna BRoma Agrawal*(Multiple Regression using SAS)

  • Data Description*Dataset contains performance data of 400 elementary schools from the California Department of Education.

    Attributes like average class size, parents education, students opting for free meal etc.

    Many attributes have missing values, outliers which needs to be fixed before using this in analysis.


  • *ObjectiveTo find important attributes that majorly affect the academic performance

    Using this attribute, build a model that can predict the overall academic performance of school

    Verify the model before accepting that.

    Create an automated excel sheet using above regression model, which will give the performance taking attributes as an input.

  • *Finding Important Attributes, Building Model and its Verification

  • *Attributes and their ImpactAfter statistical analysis, we found below 5 attributes to be most important and on which performance depends.Regression Equation


  • *Model VerificationMAPE (Mean Absolute Percentage Error)MAPE=Mean(ABS((Actual value Predicted value)/actual value) )*100(It should be
  • *Residuals are normality distributed (as per graph and also with Anderson-Darling test)

  • *Attribute-wise Interpretation and Recommendation

  • *English Language LearnerELLs have negative impact on overall performance of school means decreasing this will improve performance.ELLs more means students are coming from different geographies where English speaking is less and they are poor as they havent learnt English speaking.There will be a challenge in communicating to these students.Schools can work on this, to improve its performance by providing special coaching to ELLs to ensure students master in English.Recruit teachers who can better understand the need of these ELLs.

  • *Percentage first year in school(Mobility)%age of students who are not completing their studies in school.It is negatively affecting the performance, as students are studying for some time and then they move. Decreasing mobility will improve the performance.It affects students also, as they will not be able to concentrate on studies.One of the reason for mobility can be residence change of family.School should consult with parents and make them understand to keep their child in same school.

  • *Average Class Size 4-6It is positively associated with performance of school means increase in this will increase the overall performance also.It can be improved by decreasing mobilityIncreasing promotion of students from one grade to another gradeIncrease in students taking admission each year.

    This is the direct indicator of overall performance of a school. But without controlling other variables, only increase in this will not help school in increasing performance.

  • *Parents Grad SchoolIt has positive impact on increasing performance of a school.If parents will be educated they can take care of childrens studies at home as well. They will take interest in their homework, they will ask for marks etc. which will help child a lot.Children can learn many things from their parents apart from school.School can start some initiative for parents involvement like monthly parents-teachers meeting in which teachers can discuss with parents about their childs studies.

  • *Percentage Full CredentialThis is percentage of full time teachers in school.It has positive impact on performance of school as they are the source of knowledge for the students. In case of absence of temporary teachers, students would be sitting idle.School should recruit more full time teachers and provide them good working environment so that they can give their 100% in sharing their knowledge with students and help them in learning better.Shortage of teacher can also be a reason for students mobility and low performance.

  • *Dashboard Creation

  • *Regression Equation**By entering values for 5 attributes, api00 (performance) can be calculated

  • *Increasing ELL, decreases the performance value(API00)Impact of English Language Learner

  • *Increasing Mobility, decreases the performance value (API00)Impact of Percentage first year in school(Mobility)

  • *Increasing acs_46, increases the performance value (API00)

    **But this should be increased by controlling other factors. Simply increasing class size will not help in improving performance.Impact of Average Class Size 4-6

  • *Increasing grad_sch, increases the performance value (API00)Impact of Parents Grad School

  • *Increasing full, increases the performance value(API00)

    **This is the attribute which is totally in control of school. So increasing this will help a school in increasing performance.Impact of Percentage Full Credential

  • *