0% found this document useful (0 votes)

97 views7 pages

Randomized Hash and Karp-Rabin Algorithm

1. Randomized hashing adds randomization to hashing to reduce collisions independently of the data model. It chooses a random prime number p and takes the hash value as the modulo of the string and p. 2. The Karp-Rabin algorithm uses randomized hashing to quickly find patterns in strings in expected O(n+m) time by rolling the hash of substrings. 3. Modifications include using multiple primes, regenerating p if a collision occurs, and other rolling hash functions. It can also search for multiple patterns in O(n+k*m) time to detect plagiarism.

Uploaded by

Kunal Jangid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

97 views7 pages

Randomized Hash and Karp-Rabin Algorithm

Uploaded by

Kunal Jangid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Randomized Hash

and Karp-Rabin Algorithm

Project for CSEP 531 by Stanislav Narivonchik (stanar@uw)
Hash Function
• H(S): where X is a string of any size (n), but H(S) is fixed-size (T)
• Used as checksums, fingerprints, error correction codes
• Hash collision: H(A) = H(B), while A != B.
• How to estimate chance of collision? Need some data model.

Example:
Distribution of a file A, with a checksum. How to estimate transfer
error? Need P(B|A), where B is a potential copy. Transmission model?
What if there's an adversary willing to fake a copy?
Use Randomized Algorithm
Add randomization independent of data model:
• Choose a uniformly random prime number p ∈ {2, 3, . . . , T}
• Hp(A) = A mod p
• Chance of collision: if A != B, then
P(Hp(A) = Hp(B)) < 1.26 (n / ln n) / (T / ln T)
E.g. if we have T = n * n, then collision probability is O(1/n)

This estimation does NOT depend on the values of A and B.

Proof Sketch
1. Let π(x) denote the number of primes <= x.
For x >=17: x / ln x < π(x) < 1.26 x / ln x

2. If Hp(A) = Hp(B) then A ≡ B (mod p).

P(Hp(A) = Hp(B)) <= (# primes dividing |A - B|) / π(T)

3. Number of primes dividing N is less than π(log2 N).

P(Hp(A) = Hp(B)) < π(n) / π(T) < 1.26 (n / ln n) / (T / ln T)
Application: Pattern Matching (Karp-Rabin)
1 function RabinKarp(string s[1..n], string pattern[1..m])
2 hpattern := hash(pattern[1..m]); hs := hash(s[1..m])
3 for i from 1 to n-m+1
4 if hs = hpattern
5 if s[i..i+m-1] = pattern[1..m]
6 return i
7 hs := hash(s[i+1..i+m])
8 return not found

Use rolling hash: H(s[i+1..i+m]) = R(H(s[i..i+m-1], s[i], s[i+m]), e.g. Hp(S):

Hp := (2 * (Hp - (1<<(m-1)) * s[i]) + s[i+m]) % p;
Karp-Rabin Algorithm Modifications
Expected running time is O(n+m). Worst case O(nm).
Need random prime generator – use Miller-Rabin primality test.
Modifications:
• Use k different primes, but never check that A=B: worst case O(n+m),
the result is not 100%, but anything close enough to 100%.
• Regenerate p, if Hp(A) = Hp(B), but A != B. Hedge against catastrophe
(long series of false matches). Expected running time is still O(n+m), if
p is not prime!
• Use other rolling hash functions, e.g. Rabin fingerprint.
Multiple Pattern Search
Used to detect plagiarism. Expected running time is O(n+k*m).

1 function RabinKarpSet(string s[1..n], set of string subs, m):

2 set hsubs := emptySet
3 foreach sub in subs
4 insert hash(sub[1..m]) into hsubs
5 hs := hash(s[1..m])
6 for i from 1 to n-m+1
7 if hs ∈ hsubs and s[i..i+m-1] ∈ subs
8 return i
9 hs := hash(s[i+1..i+m])
10 return not found

Algo Lab Project
No ratings yet
Algo Lab Project
9 pages
Rabin-Karp Algorithm
No ratings yet
Rabin-Karp Algorithm
3 pages
1) Draw A Red-Black Tree For The Following Values Inserted in This Order. Illustrate Each Operation That Occurs: K W o S y T P R 10 Points
No ratings yet
1) Draw A Red-Black Tree For The Following Values Inserted in This Order. Illustrate Each Operation That Occurs: K W o S y T P R 10 Points
18 pages
Topcoder Article
No ratings yet
Topcoder Article
8 pages
RABIN KARP ALGORITHM
No ratings yet
RABIN KARP ALGORITHM
3 pages
4101_Assignment_9
No ratings yet
4101_Assignment_9
5 pages
String Matching and Hashing
No ratings yet
String Matching and Hashing
10 pages
Rabin Karp
No ratings yet
Rabin Karp
11 pages
Lecture 04 Inaryseachtree
No ratings yet
Lecture 04 Inaryseachtree
20 pages
Rabin Karp Matching
No ratings yet
Rabin Karp Matching
11 pages
The Rabin-Karp Algorithm: String Matching
No ratings yet
The Rabin-Karp Algorithm: String Matching
18 pages
Hash Table 2010
No ratings yet
Hash Table 2010
43 pages
Rabin-Karp String Matching Algorithm
No ratings yet
Rabin-Karp String Matching Algorithm
11 pages
07 Hashing
No ratings yet
07 Hashing
73 pages
Rabin Karp CPP Hashing
No ratings yet
Rabin Karp CPP Hashing
2 pages
03-Rabinkarp Dfa Bitap
No ratings yet
03-Rabinkarp Dfa Bitap
55 pages
Rabin-Karp Algorithm For Pattern Searching: Examples
No ratings yet
Rabin-Karp Algorithm For Pattern Searching: Examples
5 pages
Rabin-Karp
No ratings yet
Rabin-Karp
7 pages
Ocslogs: Empty BJ
No ratings yet
Ocslogs: Empty BJ
9 pages
Exercise 1
No ratings yet
Exercise 1
17 pages
Rolling Hash (Rabin-Karp Algorithm) : Objective
No ratings yet
Rolling Hash (Rabin-Karp Algorithm) : Objective
4 pages
DAA-DA-output
No ratings yet
DAA-DA-output
9 pages
Hashing
No ratings yet
Hashing
111 pages
hw05 Solution PDF
No ratings yet
hw05 Solution PDF
8 pages
String Matching
No ratings yet
String Matching
4 pages
Rabin Karp Alorithm For String Search
No ratings yet
Rabin Karp Alorithm For String Search
3 pages
Steps
No ratings yet
Steps
3 pages
Rabin Karp Algorithm of Pattern Matching (Goutam Padhy)
No ratings yet
Rabin Karp Algorithm of Pattern Matching (Goutam Padhy)
15 pages
CS5800 Assignment 6
No ratings yet
CS5800 Assignment 6
10 pages
Rabin-Karp Algorithm
No ratings yet
Rabin-Karp Algorithm
2 pages
String Matching
No ratings yet
String Matching
16 pages
MIT6_006S20_ps3-solutions
No ratings yet
MIT6_006S20_ps3-solutions
9 pages
Blfilter Note
No ratings yet
Blfilter Note
2 pages
aoa.10
No ratings yet
aoa.10
2 pages
Rabin Karp
100% (1)
Rabin Karp
13 pages
Ch11 Soln 2
No ratings yet
Ch11 Soln 2
8 pages
Assignment 1
No ratings yet
Assignment 1
5 pages
Rabin-Karp Algorithm
No ratings yet
Rabin-Karp Algorithm
2 pages
Mit6 857S14 2.1.2
No ratings yet
Mit6 857S14 2.1.2
3 pages
Hash Data Structure
No ratings yet
Hash Data Structure
18 pages
Lecture 56string Matching
No ratings yet
Lecture 56string Matching
43 pages
Universal Hashing
No ratings yet
Universal Hashing
4 pages
Designe and Analysis of Algoritham Mid-Term Equivalent Assignment
No ratings yet
Designe and Analysis of Algoritham Mid-Term Equivalent Assignment
9 pages
20021519-050 Sec-B Hash
No ratings yet
20021519-050 Sec-B Hash
7 pages
CS2040 Tutorial4 Ans
No ratings yet
CS2040 Tutorial4 Ans
5 pages
CLRS Chapter 11 Solutions
No ratings yet
CLRS Chapter 11 Solutions
7 pages
Epasalic Hashfunc Zbirka Nalog 2
No ratings yet
Epasalic Hashfunc Zbirka Nalog 2
25 pages
MDCS LAB_MANUAL_F
No ratings yet
MDCS LAB_MANUAL_F
32 pages
Problem Idea of Universal Hashing
No ratings yet
Problem Idea of Universal Hashing
14 pages
Lec 11 Hash Table
No ratings yet
Lec 11 Hash Table
43 pages
Compsci Algorithms For Data Science: Cameron Musco University of Massachusetts Amherst. Fall 2019
No ratings yet
Compsci Algorithms For Data Science: Cameron Musco University of Massachusetts Amherst. Fall 2019
28 pages
Problem
No ratings yet
Problem
3 pages
1 Overview: Lecture 2 - February 3, 2005
No ratings yet
1 Overview: Lecture 2 - February 3, 2005
6 pages
DAA (Algorithms Knowledge Capsule 4 by Dr. Choudhary Ravi Singh)
No ratings yet
DAA (Algorithms Knowledge Capsule 4 by Dr. Choudhary Ravi Singh)
20 pages
String Matching Algorithm
No ratings yet
String Matching Algorithm
18 pages
Solutions to Exercises on Hash Tables
No ratings yet
Solutions to Exercises on Hash Tables
3 pages
patternmatching
No ratings yet
patternmatching
29 pages
54.string Inotes
No ratings yet
54.string Inotes
20 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
An Introduction to Linear Algebra and Tensors
From Everand
An Introduction to Linear Algebra and Tensors
M. A. Akivis
1/5 (1)
Why Take Therapy?: Find Answers To The Burning Question
No ratings yet
Why Take Therapy?: Find Answers To The Burning Question
8 pages
Iss 12 08
No ratings yet
Iss 12 08
62 pages
Vocabulary Words PDF 1
No ratings yet
Vocabulary Words PDF 1
3 pages
Lab Oriented Programming and C++
No ratings yet
Lab Oriented Programming and C++
86 pages
Research Paper on String Matching Algorithm
No ratings yet
Research Paper on String Matching Algorithm
8 pages
Design and Analysis of Algorithm Lab (BSCS2351) Lab Manual
No ratings yet
Design and Analysis of Algorithm Lab (BSCS2351) Lab Manual
46 pages
String Matching - RYS - Lect - 1 - 2 - 3 - Update
No ratings yet
String Matching - RYS - Lect - 1 - 2 - 3 - Update
61 pages
String Matching
No ratings yet
String Matching
9 pages
Crack The Interview-Part-1
No ratings yet
Crack The Interview-Part-1
167 pages
Implementation of Pattern Matching Algorithm
No ratings yet
Implementation of Pattern Matching Algorithm
4 pages
DAA - Notes-Unit-3 and 4
No ratings yet
DAA - Notes-Unit-3 and 4
21 pages
Daa notes
No ratings yet
Daa notes
21 pages
Ada Ans
No ratings yet
Ada Ans
42 pages
ADS UNIT5
No ratings yet
ADS UNIT5
26 pages
Exploring A Self-Replication Algorithm To Flexibly Match Patterns
No ratings yet
Exploring A Self-Replication Algorithm To Flexibly Match Patterns
18 pages
Lab Sesssion2-A or P
No ratings yet
Lab Sesssion2-A or P
7 pages
String Matching
No ratings yet
String Matching
30 pages
Data Structures and Algorithms Made Easy With Java Learn Data Structure Using Java in 7 Days
No ratings yet
Data Structures and Algorithms Made Easy With Java Learn Data Structure Using Java in 7 Days
364 pages
Princeton Substring Search
No ratings yet
Princeton Substring Search
14 pages
Programming Assignment 3: Hash Tables and Hash Functions
No ratings yet
Programming Assignment 3: Hash Tables and Hash Functions
19 pages
UNIT-V String Matching
No ratings yet
UNIT-V String Matching
24 pages
Unit5
No ratings yet
Unit5
106 pages
Strings and Pattern Searching
100% (1)
Strings and Pattern Searching
80 pages
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
No ratings yet
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
3 pages
Unit - I: Random Access Machine Model
No ratings yet
Unit - I: Random Access Machine Model
39 pages
String Matching Algorithm
No ratings yet
String Matching Algorithm
5 pages
Analysis of Algorithm Viva QA
No ratings yet
Analysis of Algorithm Viva QA
4 pages
Lecture 34, 35 36 - String Matching Algorithms
No ratings yet
Lecture 34, 35 36 - String Matching Algorithms
42 pages
Randomized Hash and Karp-Rabin Algorithm
No ratings yet
Randomized Hash and Karp-Rabin Algorithm
7 pages
2d Pattern Matching
No ratings yet
2d Pattern Matching
35 pages

Uploaded by

Uploaded by

Randomized Hash

and Karp-Rabin Algorithm

This estimation does NOT depend on the values of A and B.

2. If Hp(A) = Hp(B) then A ≡ B (mod p).

3. Number of primes dividing N is less than π(log2 N).

Use rolling hash: H(s[i+1..i+m]) = R(H(s[i..i+m-1], s[i], s[i+m]), e.g. Hp(S):

1 function RabinKarpSet(string s[1..n], set of string subs, m):

You might also like