yesha gupta plagiarism detection. string matching algorithms: kmp lcss rabin-karp fingerprints an...
TRANSCRIPT
![Page 1: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/1.jpg)
Yesha Gupta
Plagiarism detection
![Page 2: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/2.jpg)
String Matching Algorithms:
KMPLCSSRabin-Karp fingerprints• an algorithm of choice for multiple pattern search
![Page 3: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/3.jpg)
Testing text file information:
21 Lines Each line(treated as pattern) is of different length Max line size: 370 Minimum line size: 85
![Page 4: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/4.jpg)
LCSS is performing very slowRabin Karp performed better than KMP
Why? Efficient use of Hashing techniques
![Page 5: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/5.jpg)
KMP generated optimum output. Rabin Karp did not.
Why? Because of fixed length patterns in a text
![Page 6: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/6.jpg)
Testing text file information:
21 Lines Each line(treated as pattern) is of same length
![Page 7: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/7.jpg)
Result of RabinKarp and KMP is the same
Why?Each pattern has same length
![Page 8: Yesha Gupta Plagiarism detection. String Matching Algorithms: KMP LCSS Rabin-Karp fingerprints an algorithm of choice for multiple pattern search](https://reader036.vdocuments.mx/reader036/viewer/2022083008/56649f3b5503460f94c596fd/html5/thumbnails/8.jpg)
Execution time of RabinKarp is slightly better than KMP