luan van loc thu dien tu theo giao thuc smtp

Upload: uy-nguyen-van

Post on 20-Feb-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    1/23

    HC VIN CNG NGH BU CHNH VIN THNG

    ---------------------------------------

    inh Quang nh

    NGHIN CU XY DNG H THNG

    LC NI DUNG TH IN T

    GI THEO GIAO THC SMTP

    Chuyn ngnh: Truyn d liu v mng my tnh

    M s: 60.48.15

    TM TT LUN VN THC S

    H NI- 2013

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    2/23

    Lun vn c hon thnh ti:HC VIN CNG NGH BU CHNH VIN THNG

    Ngi hng dn khoa hc: TS.Nguyn Thnh Phc

    Phn bin 1:

    Phn bin 2: ..

    Lun vn s c bo v trc Hi ng chm lun vn thc s ti Hc vin Cng ngh Bu

    chnh Vin thng

    Vo lc: ....... gi ....... ngy ....... thng ....... .. nm ...............

    C th tm hiu lunvn ti:

    - Th vin ca Hc vin Cng ngh Bu chnh Vin thng

    H NI- 2013

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    3/23

    1

    M US pht trin khng ngng ca khoa hc cng ngh, c bit l tc bng pht

    mnh m ca cng ngh thng tin khin cho Internet tr thnh mt cng c hu ch

    khng th thiu trong i sng hin nay. Khng ch l kho ti nguyn thng tin khng l,

    Internet cn cung cp cho ngi s dng cc cng c khai thc thng tin tin li, nhanh

    chng v hin i. Mt trong s cc dch v mng c ngi dng s dng rng ri nht l

    dch v th in t (e-Mail). Dch v ny cho php ngi s dng trao i th tn mt cch

    d dng, ng thi c th ph bin kin thc, thng tin, thng bo v ni dung chnh sch

    ca mt vn trong mt c quan, t chc Vi tnh cht d s dng, khng ph thuc

    vo khng gian v thi gian, th in t tr thnh mt phng tin trao i thng tin

    quan trng i vi nhiu ngi.

    Cng vi s pht trin tin li ca Internet, vic ly cp thng tin, thm nhp bt hp

    php, ph hoi thng qua Internet cng gia tng v s lng, loi hnh v k thut. Bn cnh

    , cc i tngc t tng chnh tr cc oan, cc th lc phn ng trong nc v

    ngoi nc tn dng trit nhng kh nng ca mng Internet phc v cho mc ch

    tuyn truyn, pht tn ti liu phn ng v thc hin cc hnh vi phn ng khc nhm

    chng ph nh nc cng ha x hi ch ngha Vit Nam. Do , vic u tranh chng cc

    hot ng s dng cc dch v trn mng phc v cho mc ch xu tr thnh mt yucu cp thit ca lc lng Cng an.

    Trc nhu cu thc t , ti nghin cu v mnh dn chn ti: Nghin cu

    xy dng h thng lc ni dung th in t gi theo giao thc SMTP lm bo co tt

    nghip nhm xut mt gii php kim sot cc ni dung ca th in t gi i trn mng

    Internet.

    Mc tiu ca ti

    Xy dng h thng lc nhm tm ra nhng th c gi trn mng Internet theo giao

    thc SMTP c ni dung lin quan n cng tc nghip v Cng an cn quan tm; phc

    v yu cu nghip v trong lc lng.

    Phm vi nghin cu

    Nhng th c ni dung lin quan n cc vn m cng tc Cng an quan tm (ni

    dung ca bc th c gi c th hin th di dng text .doc, .docx, .pdf, .txt; th c ni

    dung thuc cc a ch cn theo di v nhng cm t m cng tc nghip v cng an phiquan tm).

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    4/23

    2

    Gii hn ni dung nhng th c vit bng ngn ng ting Vit vi cc loi font

    ch ting Vit hin c (TCVN3, Unicode, VNI, UTF8). ng thi, th c gi i trn

    mng theo giao thc SMTP.

    Phng php nghin cu

    -Nghin cu mt th c gi theo giao thc SMTP phn tch ra cc phn trong

    ni dung bc th nh: a ch gi, nhn, ch bc th, ni dung bc th, cc file vn bn

    nh km Trn c s s dng phng php so snh a chui (multi matching) tm

    kim cc cm t cn quan tm v i snh xy dng chnh sch lc theo yu cu nghip

    v.

    -Nghin cu cc font ch ting Vit v phng php chuyn i cc font ch, t

    chuyn i cc font ch sang mt font tiu chun ri p dng phng php tm kim v sosnh a chui thc hin vic lc ni dung th.

    Ngoi phn li m u, kt lun, ti liu tham kho v phlc, lun vn c chia

    thnh 3 chng:

    Chng 1: Tng quan v h thng th in t

    Chng ny ni v cu trc ca mt th in t, mt h thng th tn in t v cc

    giao thc c s dng cho dch v th in t. Ngoi ra, chng ny cng gii thiu giao

    thc SMTP, i su vo nghin cu vic gi mt th in t qua giao thc SMTP, nghin

    cu proxy cho SMTP.

    Chng 2: Phng php v cc k thut s dng cho h thng lc ni dung th

    in t

    Chng ny trnh by v nhng thng tin cn kim sot i vi mt th in t;

    mt s bin php kim sot ni dung th in t; cc k thut s dng phc v cho vic

    lc th: k thut tm kim, so snh chui (multi matching), x l phng ting Vit(Unicode, TCVN3,VNI, UTF8)

    Chng 3: Lc ni dung th in t gi theo giao thc SMTP

    Chng ny xut phng php lc v m hnh, xy dng h thng lc ni dung

    th in t gi theo giao thc SMTP.

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    5/23

    3

    SMTP (via Internet )

    System Call

    SMTP

    System Call

    Vit System Call

    c/ Vit POP/IMAP

    SMTP( Direct connection )

    Chng1- TNG QUAN V H THNG TH IN T

    1.1 L thuyt chung v th in t

    H thng thin t cho php ngi dng trao i th in t vi nhau. H thng

    ny bao gm mt hoc nhiu my ch th tn (mail server) , trn c ci t mt phnmm mail server qun l ti khon ca ngi dng, thc hin vic trao i th gia

    nhng ngi dng v trao i th vi cc my ch th tn khc.

    1.1.1Cu trc mt h thng th in t

    thc hin vic trao i th vi ngi dng, gia my ch (mail server) v cc

    my khch (mail client) thng nht s dng chung mt b giao thc gi v nhn th, trong

    c quy nh c th v cng lm vic, quy trnh thao tc, cc cu lnh trao i, cu trc

    ca th in t... Hnh 1.1 trnh by m hnh mt h thng th tn in t vi giao thc gi

    thSMTP v giao thc nhn th POP hoc IMAP.

    H thng ny bao gm bn phn t chnh: MUA (Mail User Agent), MTA (MailTransfer Agent), MDA (Mail Delivery Agent), MRA (Mail Retrieval Agent).

    MUA

    gi

    MDAcc b

    TCPMDA

    MUA

    nhn

    MTA chuyn tip

    MTA nhn

    Hp th

    MailRetreival

    Agent

    Hnh 1.1: M hnh mt h thng th tn in t

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    6/23

    4

    Mail User Agent (MUA):l chng trnh phn mm ca my client c ngi

    dng s dng gi, nhn, son tho, x l th in t.

    Mail Tranfer Agent (MTA): l mt chng trnh th ca my ch, cho php

    truyn ti th in t t my ny sang my khc. Mail Delivery Agent (MDA):l chng trnh m MTA s dng chuynth

    vo hp th ca ngi dng hoc truyn ti th ti mt MTA khc. Mi MTA s dng

    mt hoc nhiu MDA, mi MDA c s dng cho mt loi yu cu phn pht th ring.

    Mail Retrieval Agent (MRA): l mt chng trnh hoc mt dch v c chc

    nng ly th in t v t mt hp th trn mt my ch xa v a chng ti mt MUA.

    Cc MRA truy vn cc th v cc phn header t nhng hp th xa v phn pht chng

    ti cc MUA trn my ca ngi dng.

    1.1.2Cu trc mt th in t

    Th in t thng c hai phnchnh: phn u(Header) v phn thn (Body) l vn

    bn cha ni dung ca th. Khi gi i, ton b th in t c gi trong ni dung

    (content). Ngoi ra, h thng th cn to thm mt phn na c gi l b th (envelope),

    phn ny cha cc thng tin cn thit cho vic chuyn th n ni nhn. Hnh 1.2 trnh by

    cu trc mt th in t.

    Hnh 1.2: Cu trc ca mt th in t

    B th

    Tiu

    Phnthn

    Ni dung

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    7/23

    5

    1.1.3Mt sgiao thc s dng gi v nhn th in t

    H thng Mail c xy dng da trn mt s giao thc: Simple Mail Transfer

    Protocol (SMTP), Post Office Protocol (POP), Multipurpose Internet Mail Extensions

    (MIME) vInteractive Mail Access Protocol(IMAP) c nh dng trong RFC 1176 lmt giao thc quan trng thay thPOP, n cung cp nhiu c ch tm kim vn bn,

    phn tch messaget xa m ta khng tm thy trongPOP.

    1.1.3.1 Mt sgiao thc s dng gi th in t

    1.1.3.1.1 Giao thc SMTP (Simple Mail Transfer Protocol)

    SMTP l giao thc tin cy, chu trch nhim phn pht th in t. N chuyn th

    in t t h thng mng ny sang h thng mng khc, chuyn th trong h thng mng

    ni b. Giao thc ny s c trnh by c th phn Xy dng proxy cho SMTP.

    1.1.3.1.2 Giao thc X.400

    X.400 l giao thc c ITU-T v ISO nh ngha v c ng dng rng ri

    Chu u, Canada. X.400 cung cp tnh nng iu khin v phn phi e-Mail, s dng nh

    dng nh phn, do khng cn m ha ni dung khi phn pht th trn mng Internet.

    1.1.3.2 Mt sgiao thc s dng nhn th in t

    C hai giao thc chnh thng c dng bi cc ng dng my th khch truy

    cp th tn t cc my ch: Post Office Protocol (POP) v Internet Message Access Protocol

    (IMAP).

    1.1.3.2.1 Giao thcPOP

    POP l giao thc c thit kt h tr tin trnh th offline, trong tin trnh ny

    th c phn pht ti mt my ch. Mt my tnh c nhn ngi dng gi nh k mt

    chng trnh th khch ckt ni ti my ch v ti tt c th treo ti my tnh cangi dng. Cch truy cp offline l mt loi dch v store-to-forward, c s dng

    chuyn th (theo n t hng) t my ch th (v tr a v) ti my ca ngi c th,

    thng l mt PC hoc Mac.

    1.1.3.2.2 Giao thcIMAP (Internet Message Access Protocol)

    IMAP l mt giao thc chun cho vic truy cp th in t t my ch th cc b.

    N l mt giao thc ch/khch trong th in t c nhn v duy tr bi my ch th.

    Vi nhng yu cu ny ch mt trao i d liu nh lm vic tt thm ch qua mt kt nichm nh l mt modem. Ch khi ngi dng yu cu c mt th in t c th th n s

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    8/23

    6

    SMTPCommands/Replies

    And Mail

    SMTP khch SMTP ch

    c ti v tmy ch th . Ngi dngc th to v ch tc cc th mc hoc cc hp

    th trn my ch, xa nhng tin nhn

    1.2Giao thc SMTP

    1.2.1

    Gii thiu c bn v SMTP

    SMTP (Simple Mail Transfer Protocol) l giao thc quy nh vic truyn th ch yu

    trn Internet, c s dng nh mt c ch chung cho vic chuyn ti th in t gia cc

    my tnh vi nhau trong giao thc TCP/IP. Khi mt tin trnh SMTP thc hin, SMTP client

    m mt kt ni TCP ti mt tin trnh SMTP server nm trn mt my ch xa v c gng

    gi mail thng qua kt ni. SMTP server lng nghe mt kt ni TCP trn mt cng 25.

    Hnh 1.3: M hnh truyn th s dng giao thc SMTP

    Khi SMTP client c mt thng ip ctruyn i, n thit lp mt knh truyn hai

    chiu ti mt SMTP server. Trch nhim ca SMTP client l chuyn giao nhng thng ip

    th cho mt hoc nhiu SMTP server (hoc bo nhng li sai khi thc hin).

    Hot ng ca giao thcSMTP trong h thng th in t

    Mi quan h gia SMTP v h thng th in tc trnh by trong hnh 1.4:

    Hnh 1.4: Hot ng ca giao thc SMTP trong h thng th in t

    Ngidn

    SMTPkhch

    SMTPch

    Hthng

    Hthng

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    9/23

    7

    Client lin quan n th i, Server lin quan nnhn th. H thng th cc b cha

    hp th (mailbox) ca mi user. Mailbox c 2 phn: phn cc b v phn ton cc.

    Sau khi tho bc th trong khun dng chun, h thng th cc b xc nh tn ngi

    nhn hp th cc b hay phi gi ra ngoi. bc thc gi i,Client SMTP phi bit

    a ch IP ca ni nhn thng qua DNS v gi qua cng a ch SMTP (25) bt u thit

    lp kt ni server SMTP ni nhn. Khi mi ni c thit lp, Client bt u chuyn th

    n Server bi cc lnh ca SMTP.

    1.2.2M hnh giao thc SMTP

    M hnh SMTP h tr c hai phng php truyn pht th end-to-end (khng c cc

    MTA trung gian) v store-and-forward. Phng php end-to-end c s dng gia cc

    mng ni b ca cc t chc v phng phpstore-and-forwardc la chncho cc h

    iu hnh gia cc t chc c mng s dng giao thc TCP/IP v SMTP c s.

    Mt tin trnh SMTP c bn c th truyn ti th in t ti mt tin trnh khc trn

    cng mt mng hoc ti mt mng khc thng qua mt tin trnh truyn tip hoc qua cng

    ni c th ti c c hai mng. Mt m hnh n gin cc thnh phn ca h thng SMTP

    c trnh by trong hnh 1.5:

    SMTP Commands,Replies and Mail

    Kt ni giao thcTCP, cng 25

    User

    ( ngi gi)

    UA(User Agent)

    UA(User Agent)

    Queue of mailHng i th

    UserMailboxes

    MTA client(Mail Transfer Agent)

    MTA Server(Mail Transfer Agent)

    User(ngi nhn)

    Hnh 1.5: M hnh giao thc SMTP

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    10/23

    8

    SMTP Commands,Replies and Mail

    Thng qua Internet

    1.2.3H thng chuyn tip th theo giao thc SMTP

    Ngi dng lm vic vi UA (User Agent). Vic trao i th s dng giao thc TCP

    c thc hin nh mt MTA. MTA gi truyn th qua mng ti cng 25 ca giao thc

    TCP ca MTA nhn. Vic truyn thng tin gia my ch gi v my ch nhn mngngoi th vic chuyn tip c th phc tp (xem Hnh 1.6). Vic thm mt MTA vo pha

    ngi gi v mt MTA vo pha ngi nhn, cc MTA khc thc hin nh my ch v my

    khch, c th chuyn tip th in t qua mng.

    H thng cc MTA relay cho php nhng ni khng s dng b giao thc TCP/IP

    gi th in t ti nhng ngi dng nhng ni khc c th hoc khng th s dng b

    giao thc TCP/IP.

    UA(UserAgent)

    LocalMTA

    User(nginhn)

    UA(User Agent)

    Queue of mail

    i thgi

    Relay MTALocalMTA

    LocalMTA

    Queue ofmail

    Relay MTA LocalMTA

    LocalMTA

    Queue ofmail

    User

    Mailboxes

    User(nginhn)

    LocalMTA

    My ch gi My ch nhn

    Hnh 1.6: M hnh SMTP vi cc MTA chuyn tip

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    11/23

    9

    S gi tr qu tnh phn pht th

    Giao thc SMTP cho php gi tr s phn pht, v th in t c th c gi tr ti

    v tr ngi gi, ch ngi nhn, hoc cc my ch trung gian.

    1.3

    Xy dngProxy cho SMTP

    1.3.1Khi nim Proxy

    Proxy l mt my ch trung gian cho php kt ni t my c nhn (client) ti cc

    my ch chy web trn Internet (theo ngha trc tip), khng cho php cc packet (gi tin)

    i trc tip gia h thng s dng v Internet m phi i gin tip thng qua cc dual home

    host hoc qua s kt hp gia basion host (pho i phng ng) v screening router.

    Hnh 1.7: M hnh proxy

    Cc dch v Proxy khng cho php kt ni trc tip, chng buc tt c cc gi tin trn

    mng phi c kim tra v lc theo quy tc thch hp. Thay v trao i thng tin trc tip

    vi dch v thc s, ngi dng phi trao i thng tin vi Proxy server.

    1.3.2Proxy tng ng dng

    Proxy tng ng dng l mt my chtrung gian cho php kt ni t my trm khai

    thc dch v (client) ti cc my ch cung cp dch v (server), khng cho php cc packet

    (gi tin) i trc tip t client ti server m phi i gin tip thng qua cc proxy.

    ProxyServer ging nh cu ni trung gian gia Client v Server. Khi c yu cu kt niClient n Server tht, ProxyServer s quyt nh p ng hay khng p ng.

    Mt dch v proxy c ba b phn ring bit, l: proxy server, proxy client, v b

    phn tch giao thc. Trong , proxy server lm nhim v chuyn cc yu cu c chp

    nhn ca client ti server thc v chuyn p ng ca yu cu ny ti client thc.

    Hnh 1.8 trnh by m hnh trao i thng tin gia Client v Server qua Proxy.

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    12/23

    10

    SMTPSMTP

    POP,IMAP

    POP,IMAP

    SMTP

    SMTP

    Hnh 1.8: M hnh trao i thng tin gia Client v Server qua Proxy

    1.3.3Xy dng proxy cho SMTP

    SMTP proxy l proxy thuc lp ng dng s dng cho dch v th tn kim sot

    th in t c gi i qua giao thc SMTP. N gm hai thnh phn thc hin nhng chc

    nng cch ring bit: Proxy Server v Proxy Client. Hnh 1.9 trnh by mt m hnh proxy

    cho giao thc SMTP.

    Hnh 1.9: M hnh hot ng ca SMTP Proxy

    Clientthc

    Server ClientSMTPServerthc

    SMTP Prox

    Spool

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    13/23

    11

    Chng 2- PHNG PHP V CC K THUT S DNG CHO H THNGLC NI DUNG TH IN T

    2.1 Nhng thng tin cn kim sot ca mt th in t

    Da trn mc tiu ca h thng lc l kim sot cc ni dung ca mt th in t

    c gi i, nhng thng tin cn kim sot l:

    a ch ngi gi (trong trng from), ngi nhn (trong trngto);

    Ch , tiu (trong trng Subject);

    Cc a ch trong trng CC v Bcc: y l trng gm nhiu a ch ngi

    nhn;

    Ni dung ca bc th c vit trong phn bodyc ngi gi son tho;

    Cc file nh km (c th c nn, m ho, hoc nh dng .doc,docx,txt,pdf.).

    2.2 Mt s bin php kim sotni dung th in t

    2.2.1Lc th rc

    Th rc (spam)l th in t, tin nhn c gi n ngi nhn m ngi nhn

    khng mong mun hoc khng c trch nhim phi tip nhn theo quy nh ca php lut.

    Vn th rc l vn gy nhc nhi trong x hi trong nhng nm gn y.Nhiucng trnh nghin cu v phng php lc th rc c u t v tin hnh t kh lu.

    nh gi hiu qu ca mt cng c lc th rc ngi ta thng da trn hai o sau:

    False PositiveT l th thng b lc nhm thnh th rc.

    False NegativeT l th rc b lc nhm thnh th thng.

    Tt c nhng cng c lc c gi tr ngy nay thng s dng mt trong s nhng

    phng php hoc kt hp ca cc phng php sau:

    Phng php lc Bayesian

    Phng php lc SpamAssassin

    Phng php dng danh sch trng/en

    Phng php lc th rc dng chui hi p (Challenge/Response filters)

    Phng php lc da vo v tr ca cc b lc (Filter Placement)

    Phng php lc da trn xc nhn danh tnh ca ngi gi

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    14/23

    12

    2.2.2Lc theo t kha

    Phng php lc theo t kha l mt phng php truyn thng trong vic lc th.

    Ngi ta da vo nhng t hay cm t c trong u ca th (subject) v ni dung ca

    th lc.Phng php nyc u im l:- Tnh thch nghi:Ngi dng c th d dng bin i b lcv thit lp chnh sch

    lc theo mc chca mnh n c th lcra nhng th c ni dung cn quan tm phc

    v mc ch ca ngi dng.

    - Tnh m rng: phng php ny cng c th s dng cho vic lc th rc. Khi mt

    th mi c gi i, cc t hay cm t ny s xc nh l th rc hay khng.

    Phng php ny c bit hiu qu vi mc ch kim sot ni dung thng tin, a

    ch ca th nhm phc v mc ch ca ngi dng. V d mt s th in t c nhng ni

    dung mang tnh cht phn ng nh sau: bn cht cp ca cng ng Vit Nam

    Nhng ni dung ny cn c kim sot, ngn chn v tm ra ngun gc ngi pht tn,

    cng nh c gi n ngi nhn no.

    2.3 Mt s thutton s dng so snh chui(multi matching)

    2.3.1 Khi nim so snh chui

    So snh chui l qu trnh tm kim mt hoc nhiu v tr xut hin ca mu

    (pattern), tm kim t kha trongvn bn. Phn ny s tm hiu v so snh ba gii thut c

    bn trong so snh chuibao gm Nave, Knutt MorrisPratt v BoyerMoore. Kthut

    ny c s dng lc ni dung theo phng php lc t kha.

    2.3.2 Mt sthut ton so snh chui

    2.3.2.1 Thut ton Nave

    y l gii thut c bn v n gin nht, s dng nguyn l vt cn kim tra tt

    c cc kh nng xut hin ca mt chui k t c trong vnbn.

    Hnh 2.1: Minh ha thut ton Nave String Matcher

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    15/23

    13

    Thut ton Knutt Morris Pratt c trnh by di y c ci thin, tt n

    nhiu so vi Nave v tn dng c cc thng tin hu ch khi tm kim.

    2.3.2.2 Thut ton KnuthMorrisPratt

    Thut ton Knuth-Morris-Pratt c xy dng da trn thut ton Nave vi tngli dng li nhng thng tin ca ln th trc cho ln sau.

    Hnh 2.2: Minh ha hm tin t

    (a)

    So snh mu P = ababaca vi vn bn T, cho ra kt qu 5 k t u tinc khp ni thnh cng. (b)Vi nhng kt qu bit v vn bn v 5 k t c ghp

    ni, c th thy dch chuyn s + 1 s khng hp l, nhng s = s + 2 c kh nng hp l.

    (c) Thng tin hu ch ny s gip gim s ln th, v n c th c tnh ton bng cch so

    snh chnh cc k t trong chui mu.

    Mc d gim c s ln so snh trn mi k t, thut ton KMP vn c gng

    kim tra mi k t mt ln. Do , thi gian x l cha c ci thin nhiu. iu quan

    trng l phi gim bt s k t cn so khp. y cng chnh l tng ca thut ton

    BoyerMoore c trnh by tip theo.

    2.3.2.3 Thut ton BoyerMoore

    Thut ton Boyer - Moore l thut ton tm kim chui rt c hiu qu trong thc

    tin, hot ng theo quy tc kim tra cc k t ca mu t phi sang tri v khi pht hin s

    khc nhau u tin, thut ton s tin hnh dch chuyn.

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    16/23

    14

    Bng cch loi b ti a s k t cn so snh, tc x l ca thut ton BM

    c tng ln ng k. Tuy nhin, khi di ca vn bn ln v s lng vn bn nhiu th

    qu trnh tm kim s rt tn km. Trn thc t, rt nhiu nghin cu xut thut ton mi

    v ci tin thut ton c c tin hnh nhm gii quyt bi ton so snh a mu . Bn

    cnh cc thut ton so snh a mu in hnh nh Aho Corasick, Wu Manber (WB)

    c cho l thut ton ti u hn c trong qu trnh tm kim v so snh chui. Phn tip

    theo s trnh by chi tit hn v thut ton ny.

    2.3.2.4 Thutton WuManber

    a) Gii thiu thut ton:

    Thut ton WM c xy dng vi hai c ch li, l c ch lc da trn cng

    ngh bm v c ch dch chuyn khi k t da trn cng ngh dch chuyn k t xu ca

    thut ton Boyer Moore.

    phc v vic tnh ton dch chuyn, giai on tin x l s xc nh kch

    thc ca ca s i snh; thng thi thit lp ba bng d liu c bn gm bng SHIFT lu

    trkhong cch dch chuyn ca cc khik txut hin trong vn bn; bng HASH cha

    danh sch lin kt cc mu c chui hu t ging nhau bn trong cas i snh; bng

    PREFIX chadanh sch lin kt cc mu c chuitin t ging nhaubn trong casisnh.

    Qu trnh so khp mu c thc hin thng qua tnh ton gi tr bm, thit lp

    danh sch nhng mu c chui hu t ging nhau, sau so snh vi gi tr bm ca khi

    k t bn trong ca s i snh hin thi. Khi mt chui mu c so khp, ca s tip tc

    c dch chuyn sang phi. Cng ngh dch chuyn k t xu c ng dng dch

    chuyn ca s i snh, tuy nhin y s dch chuyn theo khi k t, thay v tng k t

    n l nh trong Boyer Moore.

    Nhm khc phc nh hng ca cc mu ngn, thut ton High Concurrence Wu

    Manber (HCWM) c xut vi nhiu ci tin ng k. Bng cch tin hnh chia tt

    cmu thnh nhiu b mu khc nhau ty theo dicachng; thut ton HCWM sau

    ln lt x l cc b mu theo tng cch khc nhau.Bn cnh , cu trc d liu c lp

    c s dng cho cc bmu khc nhau to ra stng tranh cao, c th ng thi x

    l ccb mu, nh tng cng tc sosnh mu ca HCWM, lm cho thut ton ny

    t hiu sut cao hn nhiu so vi WM.

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    17/23

    15

    Server Client

    SMTP PROXY

    M unh thng lc ni

    dung th

    Chng 3- LC NI DUNG CHO TH IN T GI THEOGIAO THC SMTP c th kim sot c vic gi th in t qua giao thc SMTP, cn phi c mt

    h thng lc ni dung th nm gia ng i ca my gi th v cc mail server. Chnh

    sch lc th da trn vic kim tra cc phn envelope, contentv cc file nh km ca th

    in t pht hin nhng th vi phm chnh sch bo m an ninh an ton thng tin.

    3.1 M hnh h thng lc ni dung Th in t qua giao thc SMTP

    M hnh mt h thng lc th in t c trnh by trong Hnh 3.2.

    Hnh 3.1: M hnh h thng lcni dung th in t s dng giao thc SMTP

    B lc th in t c gi i theo giao thc SMTP gm cc m un chnh sau: m

    un Firewall, m un SMTP proxy, m un x l font ting Vit, m un lc v m unqun tr.

    Server Client

    M un x l font ting Vit

    M un Qun Tr b lc

    M unlc

    Chnh schlc

    CSDL

    Lu gi thvi phm

    E-mailClient Mail

    Server

    FIRE WALL

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    18/23

    16

    3.1.1M unFirewall

    M un nythc hinchc nng nh hng li gi tin ca th in t c gi t

    my Client ti mt SMTP server theo cng 25. Cc gi tin c a ch cng ch TCP l 25

    s c nh hng li i ti m un SMTPproxy server vi mt cng ch c xc

    nh.

    3.1.2M unSMTP proxy

    Khi h thng cha c c ch lc ni dung, m un SMTP proxy c chc nng

    nh mt Proxy bnh thng nhn th t pha my ngi gi ri chuyn tip th ti my

    mail server truyn pht th ti a ch ch.

    Khi h thng c c ch lc ni dung, th hot ng ca cc thnh phn ca Proxy

    c s thay i. Cc thnh phn ny s ng vai tr nh ngi trung gian tip nhn cc yu

    cu t my Client cng nh tip nhn cc yu cu t Mail server chuyn n m un x

    l font ting Vit.

    3.1.3M un x l font ting Vit

    Khi nhn th t SMTPproxy server gi n, m un ny s phn tch phn envelope

    vcontentca bc th.Mt th c ngi dng to ra v c gi i n c vit di

    nhiu dng font ch khc nhau. Hin nay, c nhiu cch m ha cc k t ting Vit khc

    nhau, dn ti c nhiu bng m khc nhau c s dng. Theo thng k, c ti trn 40 bng

    m ting Vit khc nhau c s dng, do , vic khai thc ti liu cng nh x l d liu

    rt phc tp.

    Tuy nhin, hin nay vic s dng ting Vit trn my tnh vn cha c s thng nht

    cao v chun m ting Vit, gy kh khn ln cho vic thu thp, khai thc v x l ting

    Vit. i hi cc h thng x l vn bn ting Vit cn phi c bc tin x l nhn

    dng v quy chun cc k t v mt bng m chung.M un x l font ting Vit c chc nng:

    + Chuyn cc t kha c m ha di nhiu dng font ch khc nhau v

    mt mt dng quy chun cc k t v mt bng m chung c h thng thit lp t

    trc, sau chuyn ti m un lc thc hin vic lc th.

    + Chuyn i m ca ni dung bc th v nh dng ban u ca ngi dng

    son tho chuyn tip cho SMTP proxy client gi th i n a ch nhn.

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    19/23

    17

    3.1.4M un lc

    y l phn quan trng nht ca h thnglc ni dung th in t. Khi ni dung bc

    th c m un x l font ting Vit thc hin, m un lcs s dng cc k thut tm

    kim v so snh chui (multi matching) thc hini snh ccphna ch gi i, a

    ch nhn th, v cc tkha caphn tiu bc thv trong ni dungvi cc tv cm

    tkha cm c trong c s d liu ca b lc (t in lc) kim tra nhng ni dung

    c trng khp hay khng? Nu mt trong cc thng tin i snh trng nhau th b lc s

    thc hin vic ghi li nht k v thc hin chnh sch lc.

    3.1.5M un Qun tr

    Cho php ngi qun tr h thng iu khin b lc, thit lp cc chnh sch lc v

    cc cng c lc nh: thit lp cc a ch cn lc, cc t kha cn lc, kt xut nht k viphm, lu gi th vi phm v.v...

    3.2 Lu hot ng ca h thng lc th in t

    3.2.1 Lu tip nhn v gi mt th khng vi phm

    Lu tip nhn v gi mt th khng vi phm c biu din Hnh 3.2.

    Khi SMTP proxy khi to kt ni, my mail client s gi yu cu kt ni ti SMTP

    proxy server. Sau khi SMTP proxy server xc nhn v cho php mail client gi th, hthng s tin hnh x l font ting Vit. Tip theo, h thng lc th s kim tra th gi ic

    vi phm chnh sch lc hay khng.Nu th c gi khng vi phm chnh sch lc, th

    s c chuyn ngay ti Proxy SMTP client.

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    20/23

    18

    Khng vi phm

    Vi phm

    Gi th Proxy SMTPServer

    Proxy SMTPkhi to

    E-mail Client

    X l font tingVit

    Kim tra th

    vi phm?

    Proxy Client

    SMTP SERVER

    Thc hin theoyu cu nghip

    v

    Hnh 3.2: Lu tip nhn v gi th in t khng vi phm

    Ngc li, nu th vi phm chnh sch lc th th s c x l theo cc chnh

    sch vi phm c trnh by phn 3.2.2.

    3.2.2 Lu x l th vi phm

    Hnh 3.3 th hin cc bc x l i vi mt th vi phm chnh sch an ninh an ton

    thng tin.

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    21/23

    19

    Vi phm

    Th gi i

    D liu cn i snh

    Khn vi h m

    Hnh 3.3: Lu x l th vi phm

    Th c SMTP proxy tip nhn t mail client s c lu vo mt vng nh. Ti

    y h thng lc sx l font ting Vit trc khily a ch v t kha cn ngn chnc

    trong CSDL ca h thng lc i snh vi a ch v ni dung ca th (nh ni phn

    trn). Khi c s trng khpth th vi phm chnh sch lc. H thng s ghi li nht k

    vi phm, ng thi thay i ni dung th ri chuyn tip th ti Proxy client chuyn

    th ti SMTP server.

    CSDLCn i snh

    Proxy SMTP

    server

    B nh

    X l font ting Vit

    Kim trath

    vi hm?

    SMTP SERVER

    Ghi nht k

    Client proxy

    Thay i ni dungth

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    22/23

    20

    3.2.3 Thit lpchnh sch lc

    Chnh sch lc th in t c thit lp theo cc tiu ch: lc theo a ch v lc

    theo t kha.

    Lc theo a chbao gm:lc a ch ngi gi v a ch ngi nhn.

    Lc theo a ch ngi gi.

    Lc theo a ch nhn gm c cc trng: To: ;Cc: ;Bcc: . Nu a ch gi i

    trng vi a ch c trong CSDL ca h thng lc cn chn th b lc s thay i trng a

    ch ngi nhn bng trng a ch ngi gi ng thi thay ni dung bc th bng thng

    bo Th vi phm an ninh an ton thng tin.

    Lc theo t khac thc hinlc t tiu bc th (subject) v phn ni dung

    ca bc th. Nhng t cn kim sot s c thit lp trong t in lc, h thng s tin

    hnh so snh vi ni dung th, nu vi phm s b h thng ngn chn hoc gi thng bo

    ngi qun tr c bin php x l kp thi.

    3.3 Ci t h thng v hot ng ca phn mm lc th in t

    3.3.1 Ci t Firewall

    3.3.2 Ci t h thng lc th in t

    ci t h thng lc th in t cn thao tc theo th t cc bc nh sau:

    Bc 1: Chpb ci temailrelay.tar vo trong th mc/opt dnglnh/tarxvzf

    emailrelay.tar gii nn b ci t vo trong th mcc la chn ci t

    cho b lc mail.Sa file cu hnh ca b lc theo chnh sch lc v cng lc

    c n nh. Trong :

    Th mc/usr/local/emailrelay/etccha file cu hnh ca chng trnh.

    Th mc/usr/local/emailrelay/libexec cha cc file th vin v file chy cachng trnh.

    Th mc/usr/local/emailrelay/tmp cha cc file nht k.

    Th mc/usr/local/emailrelay/var/spool lu gi th tm thi, ch h thng

    chuyn i.

    Bc 2: Kch hot h thng lc th bng vic nhlnh:

    /usr/local/emailrelay/libexec/emailrelay/init/emailrelay start

    Bc 3: Kim tra hot ng ca h thng:psef | grep emailrelay

  • 7/24/2019 Luan Van Loc Thu Dien Tu Theo Giao Thuc SMTP

    23/23

    21

    KT LUN

    Lc ni dung th in t theo giao thc SMTPnhm mc ch kim sot ni dung

    th in t c gi trn Internet l mt hng mi trong vic m bo an ninh an ton

    thng tin. Hng tip cn ti c bit hiu qutrong vic thu thp thng tin, theo di itng phc v cho cc hot ng nghip v ca lc lng Cng an. cng nh gp phn

    khng nh trong vic hn ch vic pht tn ti liu, tuyn truyn, kch ng v ni xu

    ng, lnh o, v Nh nc. ti cng p ng c mt phnyu cu thc tin t ra

    v vn kim sot ni dung thng tin trn Internet.

    Kha lun h thng ha c mt s vn l thuyt v h thng th in t, gii

    thiu mt s bin php kim sot ni dung th in t, mt s phng php c ng dng

    phbin. Tuy nhin vi mc tiu ti t ra th vic s dng phng php lc t kha sdng k thut multi matching thu c cc kt qu thc s kh quan v p ng c

    nhu cu rt cp thit hin nay i vi mt s vn cn quan tm.

    Tuy nhin, do thi gian nghin cu c hnchng trnh lc mi ch thc hin c

    cc chc nng lc c bn da theo cu trc ca mt bc th in t nh lc theo cc trng

    a ch ngi nhn, ngi gi, tiu th v ni dung th theo t kha ting vit.

    Vi cc kt qu t c, ti ang tin hnh th nghim ti phng th nghim

    An ninh an ton thng tin, Cc Tin hc nghip v, B Cng an. Nu c iu kin, thi gian

    v c s cho php, tc gimong mun c nghin cu pht trin hon thin cc chc

    nng lc ca chng trnh nh lc ni dung cc tp nh km vi cc nh dng khc nhau

    Trong thi gian ti s tin hnh th nghim tch hp vi cc m un lc khc nh:

    lc th in t s dng giao thc khc nh: giao thc POP, IMAP; lc ni dung trang web,

    kim tral hng an ninh trang web a h thng vo hot ng trong thc t.