ieor 115: industrial and commercial data systems university of california, berkeley uc berkeley...

21
IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang • Eric Follis • Justin Hsu • Jason Tan • James Tong Project Review 3 – December 5, 2008

Post on 19-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley

UC Berkeley UndergraduateOrientation Database

Group 5:Peter Chang • Eric Follis • Justin Hsu • Jason Tan • James Tong

Project Review 3 – December 5, 2008

Page 2: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Mission Statement

"New students' initial encounters with the institution may have profound effects on subsequent levels of involvement, and these encounters should be carefully designed to socialize students to the institution's highest educational values and goals."

(How College Affects Students, 1991)

Page 3: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

CalSO, New Student Orientation

Design a comprehensive, all-encompassing database to facilitate the operation of CalSO

Student Information

Counselor Information

Event Information

Page 4: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

UC Berkeley New Student Services

People

Counselor

DirectorNew StudentCoordinator

Advisor

SpeakerParent

O,P

CalSO Event

Major

Dormitory

Training

Campus Location

Special Need

Tour

Advises(0,N)

(1,1)

ReceivesSupervises

Supervises

Lives_in

Enrolls_in

Enrolls_in(0,N)

Creates(1,N)

Located_at

(1,N)

(0,N)

Speaks_at(0,N)

(0,N)

Declares

(0,N)

(0,N)

Accommodates_for

Attends

Requires

(0,N)

(0,N)

Is_Assigned_To

(0,N)

Attends

Conducts

(1,N)

(0,N)(0,N)

Liaison_to

(1,N)

(1,N)

(0,N)

(1,N)

Lives_in(0,N)(0,1)

(0,1)

Lives_in

(0,N)

(0,N)

(0,N)

(0,N)

(0,N)

(1,N)

(0,N)

(0,N)

(0,1)

(0,N)

(0,N)

(0,N)

(0,N)Requires

(0,N)

(0,N)

(0,N)

Financial Aid

(0,N)

(0,N)

(0,1)

(0,N)

Reserves

(0,N)

Is_Prerequisite_for

(0,N)

(0,N)

Is_Accessible_For

(0,N)

(0,N)

Page 5: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Relational Schema

Page 6: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Relationship View – MS Access

Page 7: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Add New People

Page 8: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Add New Event

Page 9: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Report: Performance Ratings

Page 10: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Normalization Analysis: 1NF

R is in 1NF if all attribute domains include only values that are atomic (indivisible) and single-valued.

1NF: Training(Training_ID, Name, Hours, Prerequisites, Required)

TrainingName(Training_ID, Name, Hours, Required)

TrainingPrerequisites(Training_ID, Prerequisite)

Page 11: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Normalization Analysis: 2NF

R is in 2NF if it is in 1NF, and every non-prime attribute is fully functionally dependent on the Primary Key

2NF:

TourBuilding(Counselor_ID, CFname, CLname, Tour_ID, TourName, Location_ID, Building)

CounselorName (Counselor_ID, CFname, CLname)

TourID (Tour_ID, TourName)

Location (Location_ID, Building)

TourLocation (Counselor_ID, Tour_ID, Location_ID, Building)

Page 12: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Normalization Analysis: 3NF

R is in 3NF if R is in 2NF and non-prime attributes of R are transitively dependent on the primary key

3NF:TourLoc (Counselor_ID, Tour_ID, Location_ID, Building)

LocationID(Location_ID, Building)

TourLocation (Counselor_ID, Tour_ID, Location_ID)

Page 13: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Query 1: Absence Analysis

In order to improve student outreach effectiveness, find the economic & geographical demographic information of students who did not attend CalSO

Assumptions:•NewStudent.Attended = 1 if student attended CalSO, 0 if not.•Economic & geographical demographic information is fully described by county of residence and financial aid status.

SELECT P.FName, P.MName, P.LName, P.Email, P.Phone, P.Major, NS.Financial_Aid, P.Class_StandingFROM NewStudent as NS, People as PWHERE P.PID = NS.PID, NS.Attended = 0GROUP BY P.County, NS.Financial_Aid;

Page 14: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Report: Non Attendees

Page 15: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Query 2: Event Effectiveness

Find the interest level associated with each optional event in CalSO.

Assumptions:•Students are required to attend at least one event of their choice.•Interest level is determined based on a weighted function of the CalSO event attendance level, event survey score, and the counselor performance score.•InterestLevel = 4*AttendanceRate + 6*P.Score + 3*S.Score.

SELECT CE.Event_ID, PP.Year, SUM(CE.No_of_Attendee) / COUNT(S.NewStudent) as AttendanceRate, P.Score, 4*AttendanceRate + 6*P.Score +

3*S.Score as InterestLevelFROM CalSO_Event as CE, Survey_Rating as S, Counselor as C, NewStudent

as NS, Performance_Rating as P, People as PPWHERE CE.Event_ID = S.Event_ID, C.Counselor_ID = CE.Counselor_ID,

C.Counselor_ID = NS.Counselor_ID, P.Counselor_ID = C.Counselor_ID GROUP BY CE.Event_ID, PP.Year;

Page 16: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Query 3: Parents Attendance Forecast

Forecast parent attendance for CalSO events using regression analysis in order to optimize the resource allocation for future years.

Assumptions:•There exists a relationship between the attendance levels of students and the attendance level of parents.

•Regression Formula: y=Xβ+ε•X = # students in attendance•Y = # students in attendance•SQL code below extracts necessary inputs to perform the proposed regression analysis. All calculations will be executed in MS Excel using macros written in Visual Basic for Applications (VBA).

SELECT COUNT(Pa.PID), COUNT(NS.PID), NS.Sem_Admit, PP.Year

FROM People as PP, Parent as Pa, NewStudent as NS

WHERE PP.PID = Pa.PID, Pa.Student_SID = NS.Student_SID

GROUP BY PP.Year;

Page 17: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Query 4: Correlation of Training & Performance Review

What is the correlation between the amount of optional training received and counselor performance?

Assumptions:•Correlation Equation:•X = # optional training hours received by counselor•Y = performance score received by counselor•SQL code below extracts necessary inputs to perform the proposed correlation analysis. All calculations will be executed in MS Excel using macros written in Visual Basic for Applications (VBA). SELECT C.Counselor_ID, T.Training_ID, SUM(T.Hours), P.Score, P.Year

FROM Peformance as PF, Training as T, Counselor as C, People P

WHERE P.PID = S.PID, PF.Counselor_ID = C.Counselor_ID, T.Training_ID = S.Training_ID, T.Required = ‘No’

GROUP BY C.Counselor_ID, P.Year;

Page 18: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Query 5: Optimal Number of Employees to Hire

Uses linear programming to determine optimal number of employees to recruit and hire in order to meet student demand.

Assumptions:•Optimality is defined as minimizing costs while meeting a required service levels.•Coordinators do not contribute to the required student to staff employment ratio.•SQL code below extracts necessary inputs to perform the proposed linear programming analysis. All calculations will be executed in MS Excel using macros written in Visual Basic for Applications (VBA).

Page 19: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Query 5: Optimal Number of Employees to Hire

The following linear program is used:

 

SELECT C.Wage, CO.Wage, CO.Experience, P.Year

FROM People as P, Counselor as C, Coordinator as CO

WHERE P.PID = C.PID, C.Supervisor = CO.PID, C.Counselor_ID = NS.Counselor_ID

GROUP BY P.Year, CO.Experience

UNION

SELECT COUNT(NS.PID)

FROM NewStudent as NS;

Decision VariablesX1 = # new hiresX2 = # experience hiresX3 = # coordinators Fixed VariablesC1 = wage of new hiresC2 = wage of experienced hiresC3 = wage of coordinatorsS = # new students User InputsB = total budgetG = required counselor to coordinator ratioR = required student to staff employment ratioP = required percentage of experienced hires per total number of hires

Page 20: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Query 5: Optimal Number of Employees to Hire

Access Output

Excel

Page 21: IEOR 115: Industrial and Commercial Data Systems University of California, Berkeley UC Berkeley Undergraduate Orientation Database Group 5: Peter Chang

UC Berkeley Undergraduate Orientation Database

Thank You