pattern discovery fuzzy time series

Upload: prasanta-debnath

Post on 03-Apr-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    1/32

    Pattern Discovery of Fuzzy Time

    Series for Financial Prediction-IEEE Transaction of Knowledge and Data Engineering

    Presented by Hong YanchengFor COMP630P, Spring 2009

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    2/32

    Outline

    Introduction and target problem

    Background knowledge and related work

    Modeling the candlestick pattern

    Candlestick pattern for financial prediction

    Experiments and applications

    Conclusion and Discussion

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    3/32

    Problems with existing stock

    prediction tools

    A lot of tools exists for predicting stock price

    Artificial Neural Network, SVM, NeuroFuzzy,

    Nave Bayes and so on

    Three major problems with these tools Training process is nontrivial and training result

    cannot be further used for other target

    Prediction results are incomprehensible

    Hard for user to tuning the parameters

    Gap exists between prediction result and

    investment decision

    Improving prediction VS buy/sell decision

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    4/32

    Target problem

    Data preprocessing are needed before

    applying various of techniques

    Data mining, machine learning & pattern

    recognition Good knowledge representation method can

    assist investors

    Knowledge-based method to transfer

    financial data to comprehensible rules and

    visual patterns

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    5/32

    Outline

    Introduction and target problem

    Background knowledge and related work

    Modeling the candlestick pattern

    Candlestick pattern for financial prediction

    Experiments and applications

    Conclusion and Discussion

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    6/32

    Japanese Candlestick Theory

    Four general ways of represent stock price

    fluctuation

    Original daily fluctuation

    Single close price Bar chart

    Candlestick chart

    More visual information

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    7/32

    Fuzzy Time Series

    Fuzzy time series

    Assume U is the universe of discourse,

    where U = {x1, x2,, xn}. A fuzzy set Ai of U

    is defined byAi = Ai

    (x1)/x1 + Ai(x2)/x2+ + Ai

    (xn)/xn

    where Ai(xk) is membership function of the

    fuzzy set Ai ,Ai: U -> [0,1]

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    8/32

    Outline

    Introduction and target problem

    Background knowledge and related work

    Modeling the candlestick pattern

    Candlestick pattern for financial prediction

    Experiments and applications

    Conclusion and Discussion

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    9/32

    Fuzzy candlestick pattern

    A fuzzy candlestick pattern is composed of

    related fuzzy candlestick lines in a period

    A fuzzy candlestick line has seven parts

    Sequence, open style, close style, upper shadow,body, body color and lower shadow

    Sequence defines the location of the candlestick

    Open/Close style model the relationship between

    consecutive candlestick lines

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    10/32

    Candlestick line modeling

    Modeling the length of shadow and body

    Four linguistic variables EQUAL, SHORT,

    MIDDLE and LONG indicate the fuzzy sets

    of length Lupper= ([high MAX(open, close)]/open) * 100

    Llower= ([MIN(open, close) - low]/open) * 100

    Lbody = ([MAX(open, close) MIN(open, close)]/open)

    * 100

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    11/32

    Candlestick line modeling

    The membership function of four fuzzy sets

    are shown as follows

    The range is set to (0, 14) because the Taiwan

    stock price limitation

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    12/32

    Candlestick line modeling

    The body color is defined by three terms

    BLACK, WHITE and CROSS

    If openclose > 0 then body color is BLACK

    If openclose < 0 then body color is WHITE If openclose = 0 then body color is CROSS

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    13/32

    Candlestick line modeling

    The open/close style is another important

    feature

    Five linguistic variables LOW, EQUAL_LOW,

    EQUAL, EQUAL_HIGH, HIGH indicate fuzzysets of open/close style

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    14/32

    Trend modeling

    Two linguistic variables are used to model

    the trends before and afterthe candlestick

    pattern

    previous trend is represented by weeklycandlestick line

    Six fuzzy sets are used to define the trend

    CROSS, EQUAL, WEAK, NORMAL, STRONG,

    and EXTREME

    BEARISH and BULLISH define the body color

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    15/32

    Trend modeling

    Following trend is derived from the variation

    of close price

    (Closet+n Closet)/ Closet * 100

    Closet+n and Closet mean the close price at dayt+n and day t respectively

    n is a user-defined parameter

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    16/32

    Outline

    Introduction and target problem

    Background knowledge and related work

    Modeling the candlestick pattern

    Candlestick pattern for financial prediction

    Experiments and applications

    Conclusion and Discussion

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    17/32

    Three major pattern recognition

    problems

    Sensing problem

    Measured values are open, close, high, low

    Feature extraction problem

    Fuzzy candlestick patterns

    Pattern classification problem

    Can be determined by user

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    18/32

    Forecast procedure

    Step 1

    Calculate the variation percentage between two

    close prices.

    Use the minimum increase Imin and maximumincrease Imax to define the universe of discourse

    UoD = [IminD1, Imax +D2]

    E.g. Imin = -5.83, Imax = 7.66 then UoD = [-6, 8]

    Step 2 Partition UoD into several intervals

    E.g. partition [-6, 8] into seven intervals [-6, -4], [-

    4, -2], , [6, 8]

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    19/32

    Forecast procedure

    Step 3

    Define fuzzy sets on the UoD associate with the

    intervals in step 2

    Step 4 Fuzzifying the values calculated in step 1

    If v ux, and there is Ay in which maximum

    membership function occurs at ux, v is translate

    to Ay

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    20/32

    Forecast procedure

    Step 5

    Calculate all the candlestick patterns

    Step 6

    Refine extracted patterns, identify importantattributes

    Step 7

    Select pattern for forecasting based on

    probability P(Ax |Py )

    Statistic T = Count(Py Ax)/Count(Py) as the

    threshold to select the patterns

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    21/32

    Forecast procedure

    Step 8

    Forecast the trend follows

    Rule 1: test pattern not found, set variation v to 0

    Rule 2: test pattern found, set variation v toarithmetic average of midpoints of matched

    patterns

    Forecast = close + close * v

    Step 9 Evaluate the forecasting

    MSE = (Forecasti - Actuali)2 / N

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    22/32

    Outline

    Introduction and target problem

    Background knowledge and related work

    Modeling the candlestick pattern

    Candlestick pattern for financial prediction

    Experiments and applications

    Conclusion and Discussion

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    23/32

    Experiments and Applications

    The experiments are conducted based on

    TAIEX index from 2004-01-02 to 2005-01-31

    and 2330(TSMC) from 1997-10-23 to 2002-

    12-25

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    24/32

    Experiments and Applications

    Experiment for TAIEX index

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    25/32

    Experiments and Applications

    Experiment results for TAIEX

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    26/32

    Problems with existing stock

    prediction tools

    Three major problems with these tools

    Training process is nontrivial and training result

    cannot be further used for other target

    Prediction results are incomprehensible Hard for user to tuning the parameters

    Gap exists between prediction result and

    investment decision

    Improving prediction VS buy/sell decision

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    27/32

    Experiments and Applications

    Experiment with 2330

    (TSMC)

    The focus is to find the buying

    time of the stock The rule is: IF T>0.5 and the

    following trend is

    STRONG_INCREASE or

    EXTREME_INCREASE

    THEN select the pattern

    5-day return is 2.9% on

    average

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    28/32

    Experiments and Applications

    Fuzzy modifier can be implemented to help

    user tuning the parameters

    ABOVE, BELOW, PLUS, VERY, EXTREMELY,

    MORE_OR_LESS, SOMEWHAT, and NOT E.g. STRONG_BEARISH and

    EXTREME_BEARISH can be merged by ABOVE

    STRONG_BEARISH

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    29/32

    Outline

    Introduction and target problem

    Background knowledge and related work

    Modeling the candlestick pattern

    Candlestick pattern for financial prediction

    Experiments and applications

    Conclusion and Discussion

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    30/32

    Conclusion and Discussion

    Pros

    Knowledge-based method to represent the

    financial time series and to facilitate the

    knowledge discovery

    Comprehensible, computable and visual

    Can be used directly or as data preprocess

    Cons

    Time complexity How many candlestick lines for a pattern

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    31/32

    Thanks for listening

  • 7/28/2019 Pattern Discovery Fuzzy Time Series

    32/32

    Q & A