Dashboarding dirty data with Dave Tarrant

Dr David Tarrant (@davetaz), The Open Data Institute

Upload: oditraining

Post on 10-Jan-2017


Category: Technology



TRANSCRIPT


Dashboarding dirty data with Dave

Dr David Tarrant
@davetaz
The Open Data Institute

Content created by The Open Data Institute


Course aim

Create a dashboard from dirty input data

Outcomes

Design a properly structured spreadsheet
Create a schema for a given set of data
Clean a set of dirty data
Sort, filter and analyse data in a spreadsheet
Create a dashboard using data

Part 1: Organising data

Outcomes
Design a properly structured spreadsheet
Create a schema for a given set of data

Exercise 1: Organising data

bit.ly/tz_source
Download and open
What would you do (practically) to improve this spreadsheet?

Top 3 tips

A single sheet for all data
A simple schema without abbreviations
No mixed data types in columns
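The third tip can be checked mechanically. The following is a minimal sketch, not part of the original slides, that reports columns holding both numbers and text. The sample data, the column names and the helper names (`classify`, `mixed_type_columns`) are all invented for illustration.

```python
import csv
import io

# Hypothetical sample data: column names and values are invented for illustration.
SAMPLE_CSV = """name,year,amount
Alpha,2015,1200
Beta,2015,n/a
Gamma,2016,950
"""

def classify(value):
    """Return a coarse type label for one cell: 'empty', 'number' or 'text'."""
    text = value.strip()
    if text == "":
        return "empty"
    try:
        float(text)
        return "number"
    except ValueError:
        return "text"

def mixed_type_columns(csv_text):
    """Map each column that holds both numbers and text to the set of types found."""
    reader = csv.DictReader(io.StringIO(csv_text))
    seen = {name: set() for name in reader.fieldnames}
    for row in reader:
        for name, value in row.items():
            if name in seen:  # ignore overflow cells from malformed rows
                seen[name].add(classify(value or ""))
    return {name: kinds for name, kinds in seen.items()
            if {"number", "text"} <= kinds}

print(mixed_type_columns(SAMPLE_CSV))
```

Here the `amount` column is flagged because one cell contains `n/a` instead of a number. Leaving such a cell empty would keep the column a single type.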

Structured and unstructured

Documents vs Data

For documents, the machine is told where to put different things on screen to suit humans. The output is very fixed.

Given data, the machine can decide how to use it and how to display it best without the need to be told explicitly by a human.

Part 2: Cleaning

Outcomes
Clean a set of dirty data

OpenRefine

Exercise 2: Cleaning

bit.ly/tz_unclean
Download and open with OpenRefine (available from http://training.theodi.org/InADay)
Explore clustering and other cleaning features to ensure this data is ready for analysis
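To give a feel for what clustering does, here is a simplified sketch in the spirit of OpenRefine's "fingerprint" key-collision method. It is not OpenRefine's own implementation, and the example values and function names are invented for illustration. Values that reduce to the same normalised key are grouped as likely variants of one entry.

```python
import string
import unicodedata
from collections import defaultdict

def fingerprint(value):
    """Build a normalised key so that near-duplicate spellings collide."""
    text = value.strip().lower()
    # Decompose accented characters and drop anything that is not ASCII.
    text = unicodedata.normalize("NFKD", text).encode("ascii", "ignore").decode("ascii")
    # Remove punctuation, then sort and de-duplicate the remaining tokens.
    text = text.translate(str.maketrans("", "", string.punctuation))
    return " ".join(sorted(set(text.split())))

def cluster(values):
    """Group values sharing a fingerprint; keep groups with more than one spelling."""
    groups = defaultdict(set)
    for value in values:
        groups[fingerprint(value)].add(value)
    return {key: sorted(variants)
            for key, variants in groups.items() if len(variants) > 1}

# Invented example values, as they might appear in a dirty column.
names = [
    "Open Data Institute",
    " open data institute",
    "Open Data Institute.",
    "Institute, Open Data",
    "Open Refine",
]

for key, variants in cluster(names).items():
    print(key, "->", variants)
```

The four spellings of "Open Data Institute" fall into one cluster, which a person can then review and merge into a single preferred value.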

Part 3: Sort, filter & basic analysis

Outcomes
Sort, filter and analyse data in a spreadsheet

Exercise 3: Filtering and analysing

bit.ly/tz_clean
Download and open with Excel
Instructor-facilitated session

Key spreadsheet features

Sort
Filter
Formula
Pivot table
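The course uses Excel for these four features. For readers who prefer to see the underlying operations, the sketch below shows rough equivalents in plain Python. The rows and column names are invented and do not come from the course dataset.

```python
from collections import defaultdict

# Invented rows standing in for a cleaned spreadsheet.
rows = [
    {"region": "North", "year": 2015, "amount": 120.0},
    {"region": "South", "year": 2015, "amount": 80.0},
    {"region": "North", "year": 2016, "amount": 150.0},
    {"region": "South", "year": 2016, "amount": 95.0},
]

# Sort: order rows by amount, largest first.
by_amount = sorted(rows, key=lambda r: r["amount"], reverse=True)

# Filter: keep only the rows for one year.
only_2016 = [r for r in rows if r["year"] == 2016]

# Formula: a derived value, comparable to a SUM over the amount column.
total = sum(r["amount"] for r in rows)

# Pivot table: regions as rows, years as columns, summed amounts as cells.
pivot = defaultdict(lambda: defaultdict(float))
for r in rows:
    pivot[r["region"]][r["year"]] += r["amount"]

print(by_amount[0]["region"], len(only_2016), total, dict(pivot["North"]))
```

The pivot step is the one that most resembles a dashboard: it collapses many rows into a small summary grid that can then be charted.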

Part 4: Dashboarding your data

Outcomes
Create a dashboard using data

Exercise 4: Dashboarding

bit.ly/tz_clean
Upload this CSV dataset to dataseedapp.com (you will need to register for a free account)


Dataseed: Editing

Re-design elements
Change colour
Change measurement
Export/embed

Summary

What did we need to do in order to dashboard the original dirty data?

Outcomes

Design a properly structured spreadsheet
Create a schema for a given set of data
Clean a set of dirty data
Sort, filter and analyse data in a spreadsheet
Create a dashboard using data

Thank you

Dr David Tarrant
@davetaz
The Open Data Institute

Tools used
Microsoft Excel
OpenRefine
Dataseedapp
