Introduction to Algorithm

June 8, 2017 3 minute read

Some Notes of Introduction to AlgorithmPermalink

Fiboniacci NumberPermalink

[\begin{matrix} F_{n + 1} & F_{n} \\ F_{n} & F_{n - 1} \end{matrix}] = {[\begin{matrix} 1 & 1 \\ 1 & 0 \end{matrix}]}^{n}

$R u n n i n g T i m e = θ (\log_{2} (n))$

Order StatisticsPermalink

Given n elements in an array, find $k^{t h}$ smallest element.

Quick Select
- Expected running time $θ (n)$
- Worse case $θ (n^{2})$

Worse-case linear time order statistics

Select(i, n)
1. Divide n elements into [n/5] groups of 5 elements each. Find the median of each
group. O(n)
2. Recurrsively select the medium x of the [n/5] group medians. T(n/5)
3. Partition with x as pivot, let k = rank(x). O(n)
4. if i==k then return x
  if i<k then recurrsively select ith smallest element in left part
  else then recurrsively select (i-k)th smallest element in upper part

Hash FunctionsPermalink

Division MethodPermalink

$h (k) = k m o d m$

pick $m$ to be prime and not too close to power of $2$ or $10$ .

Multiplication MethodPermalink

$h (k)$ $=$ $A \cdot k$ $m o d$ $2^{w}$ » $(w - r)$ , $A o d d \land 2^{w - 1}$ < $A$ < $2^{w}$

Universal HashingPermalink

Let $u$ be a universe of keys, and let $H$ be a finite colleciton of hash functions mapping $U$ to { $0, 1, \dots, m - 1$ }.

$H$ is $u n i v e r s a l$ if $\forall x, y \in U, x \neq y$

| {h \in H; h (x) = h (y)} | = | H | / m

i.e. if $h$ is chosen randomly from $H$ , the probability of collision between $x$ and $y$ is $1 / m$ .

Perfect HashingPermalink

Given $n$ keys, construct a static hash table of size $m = O (n)$ such that searching takes $O (1)$ time in the worst case.

Idea: 2 level scheme with universal hashing at both levels and NO collisions at level 2.

if $n_{i}$ items that hashes to level 1 slot $i$ , then use $m_{i} = n_{i}^{2}$ slots in the level 2 table $S_{i}$ .

Augmented Data StructuresPermalink

Dynamic Order StatisticsPermalink

Supports: Insert, Delete, Search(x), Select(i), Rank(x).

Idea: use a R-B tree while keeping sizes of the subtree.

$s i z e [x] = s i z e [l e f t (x)] + s i z e [r i g h t (x)] + 1$

Select(root, i):
    k = size[left(x)] + 1 // k = rank(x)
    if i == k then return x
    if i < k then return Select(left(x), i)
    else return Select(right(x), i - k)

$R u n n i n g T i m e = θ (\log_{2} (n))$

Interval TreePermalink

Supports: Intert, Delete, Interval-Search: Find an interval in the set that overlaps a given query interval.

Idea: use a R-B tree while keeping the largest value $m$ in the subtree.

m [x] = m a x {h i g h [i n t [x]], m [r i g h t (x)], m [l e f t (x)]}

Interval-Search(i) // finds an interval that overlaps i
    x = root
    while x != nil and (low[i] > high[int[x]] or low[int[x]] > high[i]) do // i and int[x] don't overlap
        if left[x] != nil and low[i] <= m[left[x]] then x = left[x]
        else x = right[x]
    return x

Amortized AnalysisPermalink

Potential MethodPermalink

Framework:

Start with data structure $D_{0}$
operation $i$ transforms $D_{i - 1} \to D_{i}$
cost of the operation is $c_{i}$
Define a potential function:

Φ : {D_{i}} \to R s u c h t h a t Φ (D_{0}) = 0 \land Φ (D_{i}) \geq 0 \forall i

Amortized cost $\hat{c_{i}}$ with respect to $Φ$ is

\hat{c_{i}} = c_{i} + Φ (D_{i}) - Φ (D_{i - 1})

Total amortized cost of n operations is

\begin{aligned} \sum_{i = 1}^{n} \hat{c_{i}} & = \sum_{i = 1}^{n} (\hat{c_{i}} + Φ (D_{i}) - Φ (D_{i - 1})) \\ = \sum_{i = 1}^{n} \hat{c_{i}} + Φ (D_{n}) - Φ (D_{0}) \\ \geq \sum_{i = 1}^{n} c_{i} \end{aligned}

Competitive AnalysisPermalink

An online algorithm A is $α$ - $c o m p e t i t i v e$ if $\exists k$ such that for any sequence of operations $S$ ,

C o s t_{A} (S) \leq α \cdot C_{o p t} (S) + k

where $C_{o p t} (S)$ is the optimal, off-line, “God’s” algorithm.

Karp-Rabin Algorihm: Find s in tPermalink

Rolling Hash ADT:

r.append(c): r maintains a string x where $r = h (x)$ , add char c to the end of x
r.skip(): delete the first char of x. (assume it is c).

Then just use ADT to “roll over” t to find s.

Note: If their hashes are equal, there is still a probability $\leq 1 / | S |$ that they are actual not the same string.

To implement ADT: use hash simple hash function $h (k) = k mod m$ where $m$ is a random prime $\geq | S |$

We can treat $x$ as a multidigit number $u$ in base $a$ , where $a$ is just the alphabet size.

So:

$r () = u mod m$
$r$ stores $u mod m$ and $| x |$ , (really $a^{| x |}$ ), not $u$ .

r.append(c)
    u = u * a + ord(c) mod m 
      = [(u mod p) * a + ord(c)] mod m
      = [r() * a + ord(c)] mod m

r.skip(c) // assume char c is skipped
    u = [u − ord(c) * (pow(a, |u| - 1) mod p)] mod p
      = [(u mod p) − ord(c) * (pow(a, |u| - 1) mod p)] mod p
      = [r() − ord(c) * (pow(a, |u| - 1) mod p)] mod p

Share on

X Facebook LinkedIn Bluesky

Yanxi Chen