algorithm to find hidden links in a web page

Post on 16-Nov-2014

298 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

[1]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Under the guidance of

Mr. Indraneel Mukhopadhyay

ALGORITHM TO FIND HIDDEN LINKS IN A WEB PAGE

Presented by

Pradyut Kumar MallickRoll # IT200127292

[2]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Introduction

Hidden links are ones that real people aren’t supposed to actually notice or click on

Hidden links is a way to guide a search engine to our doorway pages.

New dynamic “hidden link” technique for linking a large highly connected graph in a simple hyperbolic space without cluttering the display.

[3]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

A cyclic hyperbolic space with hidden links

[4]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

In a hyperbolic space, the far away nodes/edges (paths) are diminished when the user is not focusing on them.

The user can dynamically warp the display to focus on thousands of different nodes for navigation.

This graph is a non-cyclic hierarchical hyperbolic structure without multiple connected paths.

A cyclic hyperbolic space with hidden links

[5]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

New Technique

The user can easily navigate through all possible paths without tracing many lines and intersections

Robot programs called spiders create search engine databases, computer robot programs that crawl the web seeking search engine content

Pages created as the result of a search are called "dynamically generated" pages .

[6]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

In a directed non-cyclic hierarchical space, there is a primary graph, which links all the nodes in a tree form. These links are primary tree links. The others are non-tree/cross links in a highly connected graph. A node can have one incoming primary link and many non-tree/cross links.

Definitions

[7]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Definition of Cyclic Hierarchical Space

[8]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Primary Path: (tree-link) “AE”

Secondary Path (non-tree/cross link) “AB

Hidden-Link Node

Primary Sub-Space Nodes

Secondary Sub-Space Nodes

Placeholder

Definition of Cyclic Hierarchical Space

[9]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Hidden Link States and Processing Flow

State 1: Idle State

State 2: Activate State

[10]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Hidden Link States and Processing Flow

State 3: Map/Unmap (move) State

State 4: Navigation State

[11]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Hidden Link States and Processing Flow

State 5: Reset

[12]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

“Hidden Link” Client-Server Web Structure

[13]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Code

The basic link tag looks something like <a href="hidden.html">click here</a>.

<a href="hidden.html" style="cursor:help">

<a href="hidden.html" style="color:#FF0080">

<a href="hidden.html" style="text-decoration:none">

Cursor Type …………. auto ……………crosshair ……………hand

[14]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Build hash table of links in the website.

Partition web log by visitor

For each visitor, partition web log file such that each subsequence terminates in a target page.

For each visitor and target page, find any expected locations for that page:

Algorithm

[15]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Website & Search Pattern of Hidden Links

[16]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Hidden Link Applications

CONTENT AND USAGE MINING

CUSTOMER INTERVIEW WEB SERVICE

[17]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

<div id="Links0" style="LEFT:0px;TOP:0px;

VISIBILITY:hidden; POSITION: absolute;">

<a href="index1.htm">hasdf hdkfh afhkj </a>

<a href="index2.htm">kjhf haksf hkasf </a>

<a href="index3.htm">kjhkjdf khdkf haf</a>

<a href="index4.htm">ghdf gdjf kgdf</a>

Related Work

[18]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Conclusion

The hidden link technique enables the mining

of large hierarchies with multiple secondary

paths

Hidden link enables the user to easily navigate

through different links without being

overwhelmed with large member of nodes and

paths.

[19]

Nati

onal In

stit

ute

of

Sci

en

ce &

Tech

nolo

gy

Algorithm to Find Hidden Links

Pradyut Kumar Mallick

Thank You!!

top related