signature based

Upload: sankar-rocking

Post on 05-Apr-2018

225 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/2/2019 Signature Based

    1/12

    A Signature-Based Indexing Methodfor Efficient Content-Based Retrieval

    of Relative Temporal Patterns

  • 8/2/2019 Signature Based

    2/12

    AbstractA number of algorithms have been proposed for thediscovery of temporal patterns. However, since the number ofgenerated patterns can be large, selecting which patterns toanalyze can be nontrivial. There is thus a need for algorithms

    and tools that can assist in the selection of discovered patternsso that subsequent analysis can be performed in an efficientand, ideally, interactive manner. In this project, we propose asignature-based indexing method to optimize the storage andretrieval of a large collection of relative temporal patterns.

  • 8/2/2019 Signature Based

    3/12

    Introduction MANY rule discovery algorithms in data mining generate a large number of

    patterns/rules, sometimes even exceeding the size of the underlying database,with only a small fraction being of interest to the user It is generally understoodthat interpreting the discovered patterns/rules to gain insight into the domainis an important phase in the knowledge discovery process. However, when

    there are a large number of generated rules, identifying and analyzing thosethat are interesting becomes difficult. For example, providing the user with alist of association rules ranked by their confidence and support might not be agood way of organizing the set of rules as this method would overwhelm theuser and not all rules with high confidence and support are necessarilyinteresting for a variety of reasons.

    Therefore, to be useful, a data mining system must manage thegenerated rules by offering flexible tools for rule selection. In the caseof association rule mining, several approaches for the post processingof discovered association rules have been discussed. One approach is togroup similar rules which works well for a moderate number of rules.However, for a larger number of rules it produces too many clusters.

  • 8/2/2019 Signature Based

    4/12

    A more flexible approach is to allow the identification of rules that areof special importance to the user through templates or data miningqueries.This approach can complement the rule grouping approach

    and has been used to specify interesting and uninteresting classes ofrules (for both association and episodic rules).The importance of datamining queries has been highlighted by the introduction of theinductive database concept, which allows the user to both query thedata and query patterns, rules, and models extracted from these data.

  • 8/2/2019 Signature Based

    5/12

    Existing System

    Mine-Rule DMQL and OLE DB . These languages are designedto generate the rules from the data rather than allow queriesover the discovered rules.

    Post processing of association rules, an area in which littleresearch to date has been conducted.

  • 8/2/2019 Signature Based

    6/12

    De-Merits of Existing System In previous number of algorithm proposed for discover the temporal

    pattern, in that generated patterns are large .

    Mine-Rule, DMQL to generate the rules from the data rather thanallow queries over the discovered rules.

  • 8/2/2019 Signature Based

    7/12

    Proposed SystemEfficiently retrieving subsets of a large collection of previouslydiscovered temporal patterns.

    Focuses on supporting content-based queries of temporalpatterns, as opposed to point- or range-based queries .

  • 8/2/2019 Signature Based

    8/12

    Advantages of Proposed System

    we address the problem of efficiently retrieving subsets of a large

    collection of previously discovered temporal patterns.

    improves the performance of temporal pattern retrieval.

    This retrieval system is currently being combined with

    visualization techniques for monitoring the behavior of a single

    pattern or a group of patterns over time.

  • 8/2/2019 Signature Based

    9/12

    Dataflow DiagramLarge Database

    Finding temporalpattern similarity

    ConstructingSignature

    Filesfor temporal patterns

    AnsweringContent-Based

    Queries Using theSignature File

    Effectively retrievedthe data

  • 8/2/2019 Signature Based

    10/12

    Modules and Description Finding temporal pattern similarity

    1. Temporal patterns are variable-length objects thatcannot be represented in a k-dimensional metricspace.

    2. Each pattern contains a list of states.3. Each pattern contains a set of state relationships.

    Constructing Signature Files for temporal patterns.

    Given that T and Q denote the target set and query set,respectively, three commonly used set-valued queries are

    1. Subset query T Q. The target set is a superset ofthe query set.

    2. Superset query T Q. The target set is a subset ofthe query set.

    3. Equality query T Q. The target set is equal to the

    query set.

  • 8/2/2019 Signature Based

    11/12

    Answering Content-Based Queries Using the SignatureFile.

    1.Subpattern Queries

    2.Superpattern Queries

    3.Equality Queries

  • 8/2/2019 Signature Based

    12/12

    Hardware:

    PROCESSOR : PENTIUM IV 2.6 GHzRAM : 512 MB DD RAMMONITOR : 15 COLORHARD DISK : 20 GB

    FLOPPY DRIVE : 1.44 MBCDDRIVE : LG 52XKEYBOARD : STANDARD 102 KEYSMOUSE : 3 BUTTONS

    Software:

    Front End : Java.Back End : MS SqlOperating System : WindowsXP

    System Requirements