cis 895 – mse project kdd-research entity search tool (krest) presentation 3 eric davis...

27
CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis [email protected]

Upload: matilda-flowers

Post on 13-Dec-2015

223 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

CIS 895 – MSE Project

KDD-Research Entity Search Tool (KREST)

Presentation 3

Eric Davis

[email protected]

Page 2: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Outline

Action Items Component Design Test Evaluation Project Evaluation Future Work Lessons Learned Demonstration Questions / Comments

Page 3: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Action Items

Future Project Use and Extension

Created a FileLoader class for easy project extension

Created Technical Instructions for adding onto the project

Page 4: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Action Items (cont.)

Update Project Plan to include new cost estimate and details

Project Plan was revised to include the new information

Page 5: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Action Items (cont.)

Investigate derivation of Webpage and KrestEntity in UML Model Updated WebObject and KrestEntity objects to show that

they are abstract classes Updated the links to show the derivation relationship

Investigate mutual exclusion of types for WebObject in formal specification Updated OCL to include invariants for WebObject

Page 6: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Component Design

Developed using Fujaba Contains class variables and functions Added separate table to describe the roles of

functions

Page 7: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Component Design

Overall Package View

Page 8: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Component Design

Controller Package:

Page 9: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Component Design

View Package:

Page 10: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Component Design

Model Package:

Page 11: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Component Design (cont.)

EntityObserver Class

Class Visibility Extends Implements  

EntityObserver public none none  

Attribute Visibility Type Other  

matchingEntitiesTable public JTable    

Function Visibility Parameters Returns Actions

EntityObserver public JTable void Constructor to set up the class

sortListByMatchSize private ArrayList ArrayList Sorts the entities found by the number of pages they were found on

updateEntitySearchResults public KrestObjectLibary,String

void Updates the matching entities found

Page 12: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Test Evaluation

Test Case Main Functionality Tested Pass/Fail

Test Case 1 Application Functionality PASS

Test Case 2 Web Crawling Functionality PASS

Test Case 3 Web Searching PASS

Test Case 4 Entity Searching PASS

Test Case 5 Reproducing the results from Tao Cheng’s work PASS

Page 13: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Test Evaluation (cont.)

Reproducing Tao Cheng’s work Downloaded 12 separate data sets from

WebBase repositoryData sets ranged from 300 – 1200 web pages eachThe data sets each contained one item of contact info

from Tao’s workHelped ensure that the entity searcher could find

specific contact info

Page 14: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Test Evaluation (cont.)

Problems encountered: Two problems found

Web crawl stopping at random time Due to internet connection issues at home

Case sensitivity in web and entity search

Page 15: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation

Problems encountered Web Crawler Thread Control Java Class Size Limitations Jigloo GUI Builder

Page 16: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation (cont).

SLOC Evaluation Phase One Estimate: 2K Phase Two Estimate: 2.3K Final SLOC Developed: 2.9K

1.4K Comment Lines

Analysis Estimates fairly close Most of the extra code was generated by Jigloo

get( ) methods for widgets

Page 17: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation (cont).

Project Schedule:

Phase Expected Completion Date Actual Completion Date

1 November 13, 2007 November 13, 2007

2 February 15, 2008 February 13, 2008

3 April 25, 2008 April 21, 2008

Page 18: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation (cont).

Time Spent During in Each Phase:

Time Spent Per Phase (in Hours)

Phase 1, 55.92, 37%

Phase 2, 57.83, 38%

Phase 3, 37.67, 25%

Phase 1

Phase 2

Phase 3

Page 19: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation (cont).

Overall Task Breakdown:

Time Spent Per Project Activity (in Hours)

Coding, 57.92, 38.25%

Integration, 0.75, 0.50%

Documentation, 57.17, 37.75%

Webpage, 2.00, 1.32%

Presentation, 11.58, 7.65%

Reading, 9.50, 6.27%

Timelog, 1.50, 0.99%

Environment, 4.58, 3.03%

Research, 1.92, 1.27%

Discussion, 4.50, 2.97%

Discussion

Research

Reading

Timelog

Environment

Coding

Integration

Documentation

Presentation

Webpage

Page 20: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation (cont).

Phase 1 Task Breakdown:

Time Spent Per Project Activity During Phase 1 (in Hours)

Coding, 12.58, 22.50%Integration, 0.00, 0.00%

Documentation, 17.50, 31.30%

Discussion, 4.33, 7.75%

Research, 1.92, 3.43%

Environment, 4.58, 8.20%

Timelog, 0.83, 1.49%

Reading, 9.50, 16.99%

Presentation, 3.67, 6.56%

Webpage, 1.00, 1.79%

Discussion

Research

Reading

Timelog

Environment

Coding

Integration

Documentation

Presentation

Webpage

Page 21: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation (cont).

Phase 2 Task Breakdown:

Time Spent Per Project Activity During Phase 2 (in Hours)

Coding, 30.33, 52.45%Documentation, 18.67, 32.28%

Webpage, 0.25, 0.43%

Presentation, 7.92, 13.69%

Timelog, 0.67, 1.15% Discussion

Research

Reading

Timelog

Environment

Coding

Integration

Documentation

Presentation

Webpage

Page 22: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Project Evaluation (cont).

Phase 3 Task Breakdown:

Time Spent Per Project Activity During Phase 3 (in Hours)

Coding, 15.00, 40%

Integration, 0.75, 2%

Documentation, 21.00, 56%

Presentation, 0.00, 0%

Timelog, 0.00, 0%

Discussion, 0.17, 0%Webpage, 0.75, 2%

Discussion

Research

Reading

Timelog

Environment

Coding

Integration

Documentation

Presentation

Webpage

Page 23: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Future Work

Described in more detail in Software Technical Instruction for Reuse and Extension Document Add Open Source Web Crawler Implement Database for Crawled/Loaded Web

Pages Add support for more loading more types (e.g.

XML)

Page 24: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Lessons Learned

Design Development MS Visio Lots of design tools exist, many aren’t very

robust Java Tool Usage

Eclipse IDE GUI builders – should do more in depth search

before choosing one

Page 25: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Demonstration

Page 26: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Final Steps

Final Examination Ballot

Make any necessary changes to the MSE Portfolio

Deliver the Portfolio

Put ISO of all MSE files on KDD Tiki page

Page 27: CIS 895 – MSE Project KDD-Research Entity Search Tool (KREST) Presentation 3 Eric Davis efd3467@ksu.edu

Questions / Comments