Jeff Edmonds, York University, COSC 2011 — Midterm Review


1

Jeff Edmonds

York University COSC 2011

Abstract Data Types · Positions and Pointers · Loop Invariants · System Invariants · Time Complexity · Classifying Functions · Adding Made Easy · Understand Quantifiers · Recursion · Balanced Trees · Heaps · Huffman Codes · Hash Tables · Graphs · Paradigms

Midterm Review

2

Midterm Review
• Review slides.

Midterm Review
• Review slides.
• Review the assignment notes and solutions!

8

Midterm Review
• Review slides.
• Review the assignment notes and solutions!
• Review 3101:
  – Steps 0: Basic Math
  – First-Order Logic
  – Time Complexity
  – Logs and Exponentials
  – Growth Rates
  – Adding Made Easy
  – Recurrence Relations

9

Midterm Review
• Review slides.
• Review the assignment notes and solutions!
• Review 3101:
  – Steps 0: Basic Math
  – Steps 1: Loop Invariants
  – Steps 2: Recursion

10

Jeff Edmonds

York University COSC 2011, Lecture 1

Abstractions (Hierarchy): Elements · Sets · Lists, Stacks, & Queues · Trees · Graphs · Iterators · Abstract Positions/Pointers

Abstract Data Types

11

Software Engineering
• Software must be:
– Readable and understandable
  • Allows correctness to be verified, and software to be easily updated.
– Correct and complete
  • Works correctly for all expected inputs.
– Robust
  • Capable of handling unexpected inputs.
– Adaptable
  • All programs evolve over time. Programs should be designed so that re-use, generalization, and modification are easy.
– Portable
  • Easily ported to new hardware or operating system platforms.
– Efficient
  • Makes reasonable use of time and memory resources.

James Elder

Abstract Data Types (ADTs)

• An ADT is a model of a data structure that specifies
– the type of data stored, and
– the operations supported on these data.

• An ADT does not specify how the data are stored or how the operations are implemented.

• The abstraction of an ADT facilitates
– Design of complex systems. Representing complex data structures by concise ADTs facilitates reasoning about and designing large systems of many interacting data structures.
– Encapsulation/Modularity. If I just want to use an object / data structure, all I need to know is its ADT (not its internal workings).

Abstraction

James Elder

13

Abstract Data Types

Restricted Data Structures: Sometimes we limit what operations can be done, for efficiency and for understanding.
• Stack: a list, but elements can only be pushed onto and popped from the top.
• Queue: a list, but elements can only be added at the end and removed from the front. Important in handling jobs.
• Priority Queue: the “highest priority” element is handled next.
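A minimal sketch of these three restricted behaviours, using `java.util.ArrayDeque` and `java.util.PriorityQueue` from the standard library (the class and method names below are ours for illustration, not from the slides):

```java
import java.util.ArrayDeque;
import java.util.PriorityQueue;

public class RestrictedAdts {
    // Stack: push 1,2,3 onto the top; the first pop comes off the top (LIFO).
    public static int firstPopped() {
        ArrayDeque<Integer> stack = new ArrayDeque<>();
        stack.push(1); stack.push(2); stack.push(3);
        return stack.pop();
    }

    // Queue: add 1,2,3 at the end; the first removal comes from the front (FIFO).
    public static int firstDequeued() {
        ArrayDeque<Integer> queue = new ArrayDeque<>();
        queue.addLast(1); queue.addLast(2); queue.addLast(3);
        return queue.removeFirst();
    }

    // Priority queue: the "highest priority" element (here: smallest) is handled next.
    public static int firstPolled() {
        PriorityQueue<Integer> pq = new PriorityQueue<>();
        pq.add(30); pq.add(10); pq.add(20);
        return pq.poll();
    }

    public static void main(String[] args) {
        System.out.println(firstPopped() + " " + firstDequeued() + " " + firstPolled());
        // prints "3 1 10"
    }
}
```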

14

Data Structure Implementations
• Array List
  – (Extendable) Array
• Node List
  – Singly or Doubly Linked List
• Stack
  – Array
  – Singly Linked List
• Queue
  – Circular Array
  – Singly or Doubly Linked List
• Priority Queue
  – Unsorted doubly-linked list
  – Sorted doubly-linked list
  – Heap (array-based)
• Adaptable Priority Queue
  – Sorted doubly-linked list with location-aware entries
  – Heap with location-aware entries
• Tree
  – Linked Structure
• Binary Tree
  – Linked Structure
  – Array

15

Jeff Edmonds

York University COSC 2011, Lecture 2

Abstract Positions/Pointers: Positions in an Array · Pointers in C · References in Java · Implementing Positions in Trees · Building Trees

Positions and Pointers

16

High Level Positions/Pointers

Positions: Given a data structure, we want to have one or more current elements that we are considering. Conceptualizations:
• Fingers in pies
• Pins on maps
• A little girl dancing there
• Me

See Goodrich Sec 7.3 Positional Lists

17

Positions/Pointers: Implementations of Positions/Pointers

Now let's redo it in Java.

(Figure: a linked list with nodes at memory addresses 2039 and 2182, each with element and next fields; expressions like head, head.next, and head.next.next name these locations.)

The right-hand side of the “=” specifies a memory location. So does its left-hand side. The action is to put the value contained in the first into the second.

18

Implementing Positions in Trees

class LinkedBinaryTree {
    class Node {
        E element;
        Node parent;
        Node left;
        Node right;
    }
    private Node root = null;

tree

19

Implementing Positions in Trees

class LinkedBinaryTree {
    Position sibling(Position p) {
        Node n = (Node) p;
        if (n.parent != null)
            if (n.parent.right == n) return n.parent.left;
            else return n.parent.right;
        else throw new IllegalArgumentException("p is the root");
    }

At any time the user can move a position to the sibling: p3 = tree.sibling(p2);

20

Implementing Positions in Trees

class LinkedBinaryTree {
    Position addRight(Position p, E e) {
        Node n = (Node) p;
        if (n.right == null) {
            n.right = new Node(e, n, null, null);
            return n.right;
        } else throw new IllegalArgumentException("p already has a right child");
    }

At any time the user can add a position/node to the right of a position: p3 = tree.addRight(p2, “Toronto”);

tree

21

Implementing Positions in Trees

To define the general class of trees, in which nodes can have many children, we use a Set or List data structure to store the Positions of a node's children.

class LinkedTree {

22

Jeff Edmonds

York University COSC 2011, Lecture 3

Contracts · Assertions · Loop Invariants · The Sum of Objects · Insertion and Selection Sort · Binary-Search-Like Examples · Bucket (Quick) Sort for Humans · Reverse Polish Notation (Stack) · Who's Blocking Your View (Stack) · Parsing (Stack) · Data Structure Invariants · Stack and Queue in an Array · Linked Lists

Contracts, Assertions, and Invariants

23

Precondition / Postcondition

One Step at a Time

I implored you to not worry about the entire computation. It can be difficult to understand where computations go. Trust whoever passes you the baton, and go around once.

24

Iterative Algorithm with Loop Invariants
• Precondition: what is true about the input.
• Postcondition: what is true about the output.

25

Iterative Algorithm with Loop Invariants

Goal: Your goal is to prove that
• no matter what the input is, as long as it meets the precondition,
• and no matter how many times your algorithm iterates, as long as eventually the exit condition is met,
• then the postcondition is guaranteed to be achieved.

This proves that IF the program terminates, then it works:

⟨PreCond⟩ & ⟨code⟩ ⇒ ⟨PostCond⟩

26

Iterative Algorithm with Loop Invariants
• Loop Invariant: a picture of what is true at the top of the loop.

27

Iterative Algorithm with Loop Invariants
• Establishing the Loop Invariant.
• Our computation has just begun.
• All we know is that we have an input instance that meets the precondition.
• Being lazy, we want to do the minimum work.
• And to prove that it follows that the Loop Invariant is then made true.

⟨preCond⟩ & code_A ⇒ ⟨loop-invariant⟩

Establishing the Loop Invariant

28

Iterative Algorithm with Loop Invariants
• Maintaining the loop invariant (while making progress).
• We arrive at the top of the loop knowing only that
  • the Loop Invariant is true, and
  • the Exit Condition is not.
• We must take one step (iteration), making some kind of progress.
• And then prove that the Loop Invariant will be true when we arrive back at the top of the loop.

⟨loop-invariant_t⟩ & ¬⟨exit-cond⟩ & code_B ⇒ ⟨loop-invariant_{t+1}⟩

Maintaining the Loop Invariant

Exit

29

⟨loop-invariant⟩ & ⟨exit-cond⟩ & code_C ⇒ ⟨postCond⟩

Obtaining the Post Condition
• We know the Loop Invariant is true because we have maintained it. We know the Exit Condition is true because we exited.
• We do a little extra work.
• And then prove that it follows that the Post Condition is then true.

Iterative Algorithm with Loop Invariants

30

Iterative Algorithm with Loop Invariants

(Example instance: input 88, 14, 98, 25, 62, 52, 79, 30, 23, 31; output 14, 23, 25, 30, 31, 52, 62, 79, 88, 98.)

• Precondition: what is true about the input.
• Postcondition: what is true about the output.

Insertion Sort

31

Iterative Algorithm with Loop Invariants
• Loop Invariant: a picture of what is true at the top of the loop.

(Example: sorted sub-list 23, 31, 52, 88; remaining elements 14, 98, 25, 62, 79, 30.)

Sorted sub-list

32

Iterative Algorithm with Loop Invariants
• Making progress while maintaining the loop invariant.

(Example: 62 is taken from the remaining elements and inserted into the sorted sub-list 23, 31, 52, 88, giving 23, 31, 52, 62, 88; 6 elements remain “to school”.)

33

Iterative Algorithm with Loop Invariants
• Beginning & Ending

(Beginning: all n elements remain “to school”. Ending: 0 elements remain, “0 km, Exit”, and the output is 14, 23, 25, 30, 31, 52, 62, 79, 88, 98.)

34

Iterative Algorithm with Loop Invariants
• Running Time

(Gauss pairing figure: pairs of terms, each summing to n+1.)

Total = 1 + 2 + 3 + … + n = Θ(n²)
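The insertion-sort loop invariant above can be sketched directly in code (a minimal version; the class name and the copy-before-sort choice are ours, not from the slides):

```java
import java.util.Arrays;

public class InsertionSortDemo {
    // Loop invariant: before iteration i, b[0..i-1] is a sorted
    // permutation of the first i elements of the input.
    public static int[] sort(int[] a) {
        int[] b = Arrays.copyOf(a, a.length);      // leave the input untouched
        for (int i = 1; i < b.length; i++) {
            int key = b[i], j = i - 1;
            while (j >= 0 && b[j] > key) {         // shift larger elements right
                b[j + 1] = b[j];
                j--;
            }
            b[j + 1] = key;                        // invariant restored for i+1
        }
        return b;                                  // worst case: 1+2+...+n = Theta(n^2) shifts
    }

    public static void main(String[] args) {
        int[] in = {88, 14, 98, 25, 62, 52, 79, 30, 23, 31};
        System.out.println(Arrays.toString(sort(in)));
        // prints [14, 23, 25, 30, 31, 52, 62, 79, 88, 98]
    }
}
```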

35

Define Problem · Define Loop Invariants · Define Measure of Progress · Define Step · Define Exit Condition · Maintain Loop Invariant · Make Progress · Initial Conditions · Ending

(Figure: the “km to school” road map of an iterative algorithm, from start to Exit.)

Iterative Algorithm with Loop Invariants

36

This proves that IF the program terminates, then it works:

⟨PreCond⟩ & ⟨code⟩ ⇒ ⟨PostCond⟩

⟨preCond⟩ & code_A ⇒ ⟨loop-invariant⟩   (Establishing the Loop Invariant)

⟨loop-invariant_t⟩ & ¬⟨exit-cond⟩ & code_B ⇒ ⟨loop-invariant_{t+1}⟩   (Maintaining the Loop Invariant)

⟨loop-invariant⟩ & ⟨exit-cond⟩ & code_C ⇒ ⟨postCond⟩   (Clean up loose ends)

Iterative Algorithm with Loop Invariants

37

Iterative Algorithm with Loop Invariants
• Precondition: what is true about the input.
• Postcondition: what is true about the output.

Binary Search — key: 25

3 5 6 13 18 21 21 25 36 43 49 51 53 60 72 74 83 88 91 95

38

Iterative Algorithm with Loop Invariants
• Loop Invariant: a picture of what is true at the top of the loop.

key 25

3 5 6 13 18 21 21 25 36 43 49 51 53 60 72 74 83 88 91 95

• If the key is contained in the original list, then the key is contained in the sub-list.

39

Iterative Algorithm with Loop Invariants
• Making progress while maintaining the loop invariant.

key 25

3 5 6 13 18 21 21 25 36 43 49 51 53 60 72 74 83 88 91 95

If key ≤ the middle element, then the key is in the left half.
If key > the middle element, then the key is in the right half.

40

Iterative Algorithm with Loop Invariants

key 25

3 5 6 13 18 21 21 25 36 43 49 51 53 60 72 74 83 88 91 95

If key ≤ the middle element, then the key is in the left half.
If key > the middle element, then the key is in the right half.

• Running Time

The sub-list is of size n, n/2, n/4, n/8, …, 1. Each step takes Θ(1) time. Total = Θ(log n).

41

Iterative Algorithm with Loop Invariants
• Beginning & Ending

key 25

3 5 6 13 18 21 21 25 36 43 49 51 53 60 72 74 83 88 91 95
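The binary-search invariant — "if the key is in the original list, it is in the sub-list" — can be sketched as follows (a minimal version; the class and method names are ours):

```java
public class BinarySearchDemo {
    // Loop invariant: if key is anywhere in the sorted array a,
    // then it lies in a[lo..hi].
    public static boolean contains(int[] a, int key) {
        if (a.length == 0) return false;
        int lo = 0, hi = a.length - 1;
        while (lo < hi) {                  // exit condition: one element left
            int mid = (lo + hi) / 2;
            if (key <= a[mid]) hi = mid;   // key (if present) is in the left half
            else lo = mid + 1;             // key (if present) is in the right half
        }
        return a[lo] == key;               // sub-list of size n, n/2, ..., 1: Theta(log n) steps
    }

    public static void main(String[] args) {
        int[] a = {3, 5, 6, 13, 18, 21, 21, 25, 36, 43, 49, 51, 53, 60, 72, 74, 83, 88, 91, 95};
        System.out.println(contains(a, 25) + " " + contains(a, 26));
        // prints "true false"
    }
}
```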

42

Parsing with a Stack

Input: a string of brackets.
Output: each “(”, “{”, or “[” must be paired with a matching “)”, “}”, or “]”.

Loop Invariant: a prefix has been read.
• Matched brackets have been matched and removed.
• Unmatched brackets are on the stack.

(Figure: an example input string of brackets and the stack holding the still-unmatched opening brackets.)

Opening bracket:
• Push it on the stack.
Closing bracket:
• If it matches the bracket on top of the stack, pop and match.
• Else return(unmatched).
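A minimal sketch of this bracket matcher (class and method names are ours, not from the slides):

```java
import java.util.ArrayDeque;

public class BracketMatcher {
    // Loop invariant: in the prefix read so far, all matched pairs have
    // been removed; the still-unmatched opening brackets sit on the stack.
    public static boolean matched(String s) {
        ArrayDeque<Character> stack = new ArrayDeque<>();
        for (char c : s.toCharArray()) {
            if (c == '(' || c == '[' || c == '{') {
                stack.push(c);                     // opening bracket: push
            } else if (c == ')' || c == ']' || c == '}') {
                if (stack.isEmpty()) return false; // nothing left to match
                char open = stack.pop();
                if ((c == ')' && open != '(') ||
                    (c == ']' && open != '[') ||
                    (c == '}' && open != '{')) return false;
            }
        }
        return stack.isEmpty();                    // nothing may remain unmatched
    }

    public static void main(String[] args) {
        System.out.println(matched("[()(()){}]") + " " + matched("[(])"));
        // prints "true false"
    }
}
```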

43

Dude! You have been teaching 3101 too long. This is not a course on Algorithms, but on Data Structures!

Data Structure Invariants

The importance of invariants is the same.

Differences:
1. An algorithm must terminate with an answer, while systems and data structures may run forever.
2. An algorithm gets its full input at the beginning, while a data structure gets a continuous stream of instructions from the user.

• Both have invariants that must be maintained.

44

Data Structure Invariants

Assume we fly in from Mars and ⟨invariants_t⟩ of the data structure are true.

Assume the user correctly calls the Push operation, so ⟨preCond_push⟩ holds: the input is the info for a new element.

The implementer must ensure ⟨postCond_push⟩: the element is pushed on top of the stack, and ⟨invariants_{t+1}⟩ hold again.

⟨invariants_t⟩ & ⟨preCond_push⟩ & Push ⇒ ⟨postCond_push⟩ & ⟨invariants_{t+1}⟩

45

Data Structure Invariants

⟨invariants_t⟩ & ⟨preCond_push⟩ & Push ⇒ ⟨postCond_push⟩ & ⟨invariants_{t+1}⟩

top = top + 1;
A[top] = info;
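The two-line push above, wrapped in a minimal array-based stack whose invariant is that A[0..top] holds the stack contents (the bounds checks and class name are our additions):

```java
public class ArrayStack {
    // Invariant: A[0..top] holds the stack contents, bottom to top;
    // top == -1 means the stack is empty.
    private final int[] A;
    private int top = -1;

    public ArrayStack(int capacity) { A = new int[capacity]; }

    public void push(int info) {
        if (top + 1 == A.length) throw new IllegalStateException("stack full");
        top = top + 1;        // the two lines from the slide
        A[top] = info;
    }

    public int pop() {
        if (top == -1) throw new IllegalStateException("stack empty");
        return A[top--];      // invariant holds again with one fewer element
    }

    public boolean isEmpty() { return top == -1; }

    public static void main(String[] args) {
        ArrayStack s = new ArrayStack(10);
        s.push(5); s.push(7);
        System.out.println(s.pop() + " " + s.pop());
        // prints "7 5"
    }
}
```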

46

Data Structure Invariants — Queue: add and remove from opposite ends.

Algorithm dequeue()
    if isEmpty() then
        throw EmptyQueueException
    else
        info ← A[bottom]
        bottom ← (bottom + 1) mod N
        return info
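A minimal circular-array queue around that dequeue (the `size` counter and names are our choices; the slides track the same invariant with pointers):

```java
public class CircularQueue {
    // Invariant: the queue's elements occupy A[bottom], A[(bottom+1) mod N],
    // ..., for 'size' slots; new elements go in at (bottom + size) mod N.
    private final int[] A;
    private int bottom = 0, size = 0;

    public CircularQueue(int N) { A = new int[N]; }

    public boolean isEmpty() { return size == 0; }

    public void enqueue(int info) {
        if (size == A.length) throw new IllegalStateException("queue full");
        A[(bottom + size) % A.length] = info;
        size++;
    }

    public int dequeue() {
        if (isEmpty()) throw new IllegalStateException("EmptyQueueException");
        int info = A[bottom];                 // info <- A[bottom]
        bottom = (bottom + 1) % A.length;     // bottom <- (bottom + 1) mod N
        size--;
        return info;
    }

    public static void main(String[] args) {
        CircularQueue q = new CircularQueue(3);
        q.enqueue(1); q.enqueue(2);
        System.out.println(q.dequeue() + " " + q.dequeue());
        // prints "1 2"
    }
}
```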

47

Data Structure Invariants

⟨invariants_t⟩ & ⟨preCond_push⟩ & Push ⇒ ⟨postCond_push⟩ & ⟨invariants_{t+1}⟩

Don't panic. Just draw the pictures and move the pointers.

48

Data Structure Invariants

⟨invariants_t⟩ & ⟨preCond_push⟩ & Push ⇒ ⟨postCond_push⟩ & ⟨invariants_{t+1}⟩

49

Data Structure Invariants

⟨invariants_t⟩ & ⟨preCond_push⟩ & Push ⇒ ⟨postCond_push⟩ & ⟨invariants_{t+1}⟩

Special Case: Empty

50

Data Structure Invariants

⟨invariants_t⟩ & ⟨preCond_removeRear⟩ & RemoveRear ⇒ ⟨postCond_removeRear⟩ & ⟨invariants_{t+1}⟩

How about removing an element from the rear? Is it so easy?

last must point at the second-last element. How do we find it? You have to walk there from first! That takes time Θ(# of elements) instead of constant time.

51

Data Structure Invariants — singly linked list:

                  Front            Rear
Add Element       constant time    constant time
Remove Element    constant time    Θ(n) time

Stack: add and remove from the same end. Actually, for a Stack the last pointer is not needed.

52

Data Structure Invariants — singly linked list:

                  Front            Rear
Add Element       constant time    constant time
Remove Element    constant time    Θ(n) time

Stack: add and remove from the same end.

Queue: add and remove from opposite ends.

53

Data Structure Invariants — doubly linked list:

                  Front            Rear
Add Element       constant time    constant time
Remove Element    constant time    constant time

(Figure: header and trailer sentinel nodes/positions, with the elements between them.)

Doubly-linked lists allow more flexible list operations.

54

Data Structure Invariants

Exit

55

Jeff Edmonds

York University COSC 2011, Lecture 4

Asymptotic Analysis of Time Complexity: History of Classifying Problems · Growth Rates · Time Complexity · Linear vs Constant Time · Binary Search Time Θ(log n) · Insertion Sort Time (Quadratic) · Don't Redo Work · Test (Linear) vs Search (Exponential) · Multiplying (Quadratic vs Exponential) · Bits of Input · Cryptography · Amortized Time Complexity · Worst Case Input · Classifying Functions (BigOh) · Adding Made Easy · Logs and Exponentials · Understand Quantifiers

56

Some Math

Time Complexity: t(n) = Θ(n²)   (time as a function of input size)

Classifying Functions: f(n) = n^Θ(n)

Logs and Exps: 2^a × 2^b = 2^{a+b},  2^{log n} = n

Adding Made Easy: ∑_{i=1..n} f(i)

Logic Quantifiers: ∃g ∀b Loves(b, g)  vs  ∀b ∃g Loves(b, g)

Recurrence Relations: T(n) = a T(n/b) + f(n)

57

The Time Complexity of an Algorithm
• Specifies how the running time depends on the size of the input.
• A function mapping the “size” of the input (work for me to give you the instance) to the “time” T(n) executed (work for you to solve it).

58

History of Classifying Problems

(Figure: nested complexity classes.)
• Constant: time does not depend on the input.
• log n: Binary Search.
• Linear = n: look at the input.
• n log n: fast sorting.
• Quadratic = n²: slow sorting.
• Poly = n^c: considered feasible — the mathematicians' dream.
• Exp = 2^n: brute force (infeasible).
• Computable; beyond that, Halting: impossible.

59

Growth Rates

(Figure: time versus input size for the functions 5, log n, n, n², and 2^n, annotated with the same classes: constant, Binary Search, look at input, slow sorting, brute force.)

60

Linear vs Constant Time — # of records = n.

Search:
• Input: a linked list.
• Output: find the end.
• Alg: walk there.
• Time = n.

Insert Front:
• Input: a linked list.
• Output: add a record to the front.
• Alg: play with pointers.
• Time = 4.

61

Linear vs Constant Time
• Time = 4 = constant time = O(1). Time does not “depend” on the input.

∃ a Java program J, ∃ an integer k, ∀ inputs I, Time(J, I) ≤ k

Is this “constant time” = O(1)?

(Figure: a time-vs-n curve that wiggles but stays below a horizontal line.)

Yes, because it is bounded by a constant.

62

Test vs Search

Test/Evaluate:
• Input: a circuit & an assignment.
• Output: the value at the output.
• Alg: let values percolate down.
• Time: # of gates.

Search/Satisfiability:
• Input: a circuit.
• Output: an assignment giving true.
• Alg: try all assignments (brute force).
• Time: 2^n.

(Figure: a circuit of AND/OR/NOT gates over x1, x2, x3, with the values F, T, F percolating down.)

63

(Figure: the n×n grid of partial products in grade-school multiplication — n² single-digit multiplications.)

Grade School vs Kindergarten

Kindergarten: a × b = a + a + a + … + a (b copies). Running time T(n) = Time(multiply) = θ(b) = linear time.

Grade school: T(n) = Time(multiply) = θ(n²) = quadratic time.

Which is faster?

92834765225674897 × 838839775901103948759

64

Size of Input Instance
• Size of paper: n = 2 in²  (intuitive)
• # of bits: n = 17 bits  (formal)
• # of digits: n = 5 digits  (reasonable)
• Value: n = 83920  (unreasonable)

# of bits = log₂(Value);  Value = 2^{# of bits}

65


66

(Figure: the n×n grid of partial products in grade-school multiplication — n² single-digit multiplications.)

Grade School vs Kindergarten

Kindergarten: a × b = a + a + … + a (b copies): T(n) = θ(b) = linear time.
Grade school: T(n) = θ(n²) = quadratic time.
Which is faster?

92834765225674897 × 8388397759011039475
n = # digits = 20; grade-school time ≈ 20² ≈ 400.
b = value = 8388397759011039475; kindergarten time ≈ 8388397759011039475.

67

(Figure: the n×n grid of partial products in grade-school multiplication — n² single-digit multiplications.)

Grade School vs Kindergarten

Kindergarten: a × b = a + a + … + a (b copies): T(n) = θ(b) = linear time.
Grade school: T(n) = θ(n²) = quadratic time.
Which is faster?

92834765225674897 × 8388397759011039475
n = # digits = 20; grade-school time ≈ 20² ≈ 400.
b = value ≈ 10^n; kindergarten time ≈ 10^n ≈ exponential!!!

68

(Figure: the n×n grid of partial products in grade-school multiplication — n² single-digit multiplications.)

Grade School vs Kindergarten

Kindergarten: a × b = a + a + … + a (b copies): T(n) = θ(b) = linear time.
Grade school: T(n) = θ(n²) = quadratic time.
Which is faster?

92834765225674897 × 8388397759011039475
n = # digits = 20; grade-school time ≈ 20² ≈ 400.

Adding a single digit multiplies the kindergarten time by 10!

69

Time Complexity of Algorithm

O(n²): prove that for every input of size n, the algorithm takes no more than c·n² time.

Ω(n²): find one input of size n for which the algorithm takes at least this much time.

θ(n²): do both.

The time complexity of an algorithm is the largest time required on any input of size n.

70

Time Complexity of Problem

O(n²): provide an algorithm that solves the problem in no more than this time.

Ω(n²): prove that no algorithm can solve it faster.

θ(n²): do both.

The time complexity of a problem is the time complexity of the fastest algorithm that solves the problem.

71

Classifying Functions

Constant << Poly-Logarithmic << Polynomial << Exponential << Double Exponential

Examples: 5 << (log n)^5 << n^5 << 2^{5n} << 2^{2^{5n}}

General forms: θ(1), (log n)^{θ(1)}, n^{θ(1)}, 2^{θ(n)}, 2^{2^{θ(n)}}

72

Classifying Functions — Polynomial = n^{θ(1)}

Linear θ(n), Quadratic θ(n²), Cubic θ(n³), θ(n⁴), …

Others: θ(n³ log⁷ n) — the log(n) factor is not absorbed, because it is not a multiplicative constant.

73

BigOh and Theta?

• 5n² + 8n + 2 log n = θ(n²)
• 5n² log n + 8n + 2 log n = θ(n² log n)

Drop low-order terms. Drop the multiplicative constant.

74

Notations

Theta          f(n) = θ(g(n))    f(n) ≈ c·g(n)
BigOh          f(n) = O(g(n))    f(n) ≤ c·g(n)
Omega          f(n) = Ω(g(n))    f(n) ≥ c·g(n)
Little Oh      f(n) = o(g(n))    f(n) << c·g(n)
Little Omega   f(n) = ω(g(n))    f(n) >> c·g(n)

75

Definition of Theta

∃c₁, ∃c₂, ∃n₀, ∀n ≥ n₀, c₁·g(n) ≤ f(n) ≤ c₂·g(n)

f(n) = θ(g(n))

76

Definition of Theta

f(n) is sandwiched between c1g(n) and c2g(n)

∃c₁, ∃c₂, ∃n₀, ∀n ≥ n₀, c₁·g(n) ≤ f(n) ≤ c₂·g(n)

f(n) = θ(g(n))

77

Definition of Theta

f(n) is sandwiched between c1g(n) and c2g(n)

for some sufficiently small c1 (= 0.0001)

for some sufficiently large c2 (= 1000)

∃c₁, ∃c₂, ∃n₀, ∀n ≥ n₀, c₁·g(n) ≤ f(n) ≤ c₂·g(n)

f(n) = θ(g(n))

78

Definition of Theta

For all sufficiently large n

∃c₁, ∃c₂, ∃n₀, ∀n ≥ n₀, c₁·g(n) ≤ f(n) ≤ c₂·g(n)

f(n) = θ(g(n))

79

Definition of Theta

For all sufficiently large n

For some definition of “sufficiently large”

∃c₁, ∃c₂, ∃n₀, ∀n ≥ n₀, c₁·g(n) ≤ f(n) ≤ c₂·g(n)

f(n) = θ(g(n))

80

Adding Made Easy — Arithmetic Sum

Gauss: ∑_{i=1..n} i = 1 + 2 + 3 + … + n = Θ(# of terms · last term) = Θ(n²)

(Figure: pairing the first and last terms, each pair summing to n+1.)

81

Gauss-like: ∑_{i=1..n} i³ = 1³ + 2³ + 3³ + … + n³ = Θ(# of terms · last term)

Arithmetic Sum: true whenever the terms increase slowly.

Adding Made Easy

82

∑_{i=0..n} rⁱ = r⁰ + r¹ + r² + … + rⁿ = Θ(biggest term)

Geometric Increasing

Adding Made Easy

83

• Geometric-like: if f(n) ≥ 2^{Ω(n)}, then ∑_{i=1..n} f(i) = θ(f(n)).

• Arithmetic-like: if f(n) = n^{θ(1)−1}, then ∑_{i=1..n} f(i) = θ(n · f(n)).

• Harmonic: if f(n) = 1/n, then ∑_{i=1..n} f(i) = logₑ n + θ(1).

• Bounded tail: if f(n) ≤ n^{−1−Ω(1)}, then ∑_{i=1..n} f(i) = θ(1).

(For functions f(n) built from +, −, ×, ÷, exp, and log.)

This may seem confusing, but it is really not.

It should help you compute most sums easily.

Adding Made Easy
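A quick numerical sanity check of three of these rules (the class name and the specific test values are ours):

```java
public class AddingMadeEasy {
    // Sums f(1) + f(2) + ... + f(n).
    public static double sum(java.util.function.DoubleUnaryOperator f, int n) {
        double s = 0;
        for (int i = 1; i <= n; i++) s += f.applyAsDouble(i);
        return s;
    }

    public static void main(String[] args) {
        int n = 1000;
        // Arithmetic-like: f(i) = i, so the sum is Theta(n * f(n)); exactly n(n+1)/2.
        System.out.println(sum(i -> i, n) == n * (n + 1) / 2.0);
        // Geometric-like: f(i) = 2^i (n = 30); the sum is Theta(f(n)) --
        // in fact 2^{n+1} - 2, less than twice the biggest term.
        double g = sum(i -> Math.pow(2, i), 30);
        System.out.println(g < 2 * Math.pow(2, 30));
        // Harmonic: f(i) = 1/i; the sum is ln(n) + Theta(1).
        double h = sum(i -> 1.0 / i, n);
        System.out.println(Math.abs(h - Math.log(n)) < 1);
        // prints "true" three times
    }
}
```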

84

Logs and Exp

• Properties of logarithms:

log_b(xy) = log_b x + log_b y

log_b(x/y) = log_b x − log_b y

log_b(x^a) = a · log_b x

log_b a = log_x a / log_x b

• Properties of exponentials:

a^{b+c} = a^b · a^c

a^{bc} = (a^b)^c

a^b / a^c = a^{b−c}

b = a^{log_a b}

b^c = a^{c · log_a b}

85

Say, I have a game for you. We will each choose an integer. You win if yours is bigger. I am so nice, I will even let you go first.

Easy. I choose a trillion trillion.

Well done. That is big! But I choose a trillion trillion and one, so I win.

Understand Quantifiers!!!

86

You laugh but this is a very important game

in theoretical computer science.

You choose the size of your Java program. Then I choose the size of the input.

Likely |I| >> |J|, so you had better be sure your Java program can handle such long inputs.

Understand Quantifiers!!!

87

In first-order logic we can state that I win the game:

∀x, ∃y, y > x

The proof: let x be an arbitrary integer. Let y = x + 1. Note y = x + 1 > x.

Understand Quantifiers!!!

Good game.Let me try again. I will win this time!

88

Understand Quantifiers!!!

Fred

LaytonJohn

Bob

Sam

One politician

Fred

LaytonJohn

HarperBob

Sam

Could be a different politician.

∃ politician p, ∀ voters v, Loves(v, p)

∀ voters v, ∃ politician p, Loves(v, p)

89

Fred

LaytonJohn

Bob

Sam

Fred

LaytonJohn

HarperBob

Sam

“There is a politician that is loved by everyone.”

This statement is “about” a politician.

The existence of such a politician.

We claim that this politician is “loved by everyone”.

∃ politician p, ∀ voters v, Loves(v, p)

∀ voters v, ∃ politician p, Loves(v, p)

[ ]

[ ]

“Every voter loves some politician.”

This statement is “about” voters.

Something is true about every voter.

We claim that he “loves some politician.”

Understand Quantifiers!!!

90

A Computational Problem P states, for each possible input I, what the required output P(I) is.

An Algorithm/Program/Machine M is a set of instructions (described by a finite string “M”) that, on a given input I, follows the instructions and either produces output M(I) or runs forever.

Eg: Sorting

Eg: Insertion Sort

Understand Quantifiers!!!

91

Problem P is computable if ∃M, ∀I, M(I) = P(I)

There exists a single algorithm/machine that solves P for every input.

Understand Quantifiers!!!

Play the following game to prove it!

92

Problem P is computable if ∃M, ∀I, M(I) = P(I)

Understand Quantifiers!!!

Two players:a prover and a disprover.

93

Problem P is computable if ∃M, ∀I, M(I) = P(I)

Understand Quantifiers!!!

They read the statement left to right.

Prover: I produce the object when it is an ∃.
Disprover: I produce the object when it is a ∀.

The prover can always win if and only if the statement is true. The order the players go REALLY matters.

94

Prover: I have a machine M that I claim works. I win if M on input I gives the correct output.
Disprover: Oh yeah? I have an input I for which it does not.

Problem P is computable if ∃M, ∀I, M(I) = P(I)

What we have been doing all along.

Understand Quantifiers!!!

95

Problem P is computable if ∃M, ∀I, M(I) = P(I)
Problem P is uncomputable if ∀M, ∃I, M(I) ≠ P(I)

Disprover: I have a machine M that I claim works.
Prover: I win if M on input I gives the wrong output — I find one counter-example input I for which his machine M fails us.

Generally very hard to do.

Understand Quantifiers!!!

96

Problem P is computable if ∃M, ∀I, M(I) = P(I):   ∃M, ∀I, M(I) = Sorting(I) is true.
Problem P is uncomputable if ∀M, ∃I, M(I) ≠ P(I):   ∀M, ∃I, M(I) ≠ Halting(I) is true.
But: ∀I, ∃M, M(I) = Halting(I) is also true!

The order the players go REALLY matters. If you don't know whether it is true or not, trust the game.

Understand Quantifiers!!!

97

A tricky one: ∀I, ∃M, M(I) = Halting(I) is true.

Given I, either Halting(I) = yes or Halting(I) = no.

Disprover: I give you an input I.
Prover: ∀I, M_yes(I) says yes; ∀I, M_no(I) says no. I don't know which, but one of these two machines does the trick.

Meanwhile, ∃M, ∀I, M(I) = Sorting(I) is true (Sorting is computable), and ∀M, ∃I, M(I) ≠ Halting(I) is true (Halting is uncomputable).

Understand Quantifiers!!!

98

• Problem P is computable in polynomial time:
  ∃M, ∃c, ∃n₀, ∀I, M(I) = P(I) & (|I| < n₀ or Time(M, I) ≤ |I|^c)

• Problem P is not computable in polynomial time:
  ∀M, ∀c, ∀n₀, ∃I, M(I) ≠ P(I) or (|I| ≥ n₀ & Time(M, I) > |I|^c)

• Problem P is computable in exponential time:
  ∃M, ∃c, ∃n₀, ∀I, M(I) = P(I) & (|I| < n₀ or Time(M, I) ≤ 2^{c|I|})

• The computational class “Exponential Time” is strictly bigger than “Polynomial Time”:
  ∃P, [∀M, ∀c, ∀n₀, ∃I, M(I) ≠ P(I) or (|I| ≥ n₀ & Time(M, I) > |I|^c)]
     & [∃M, ∃c, ∃n₀, ∀I, M(I) = P(I) & (|I| < n₀ or Time(M, I) ≤ 2^{c|I|})]

Understand Quantifiers!!!

99

Jeff Edmonds

York University COSC 2011, Lecture 5

One Step at a Time · Stack of Stack Frames · Friends and Strong Induction · Recurrence Relations · Towers of Hanoi · Check List · Merge & Quick Sort · Simple Recursion on Trees · Binary Search Tree · Things Not to Do · Heap Sort & Priority Queues · Trees Representing Equations · Pretty Print · Parsing · Iterate over all s-t Paths · Recursive Images · Ackermann's Function

Recursion

100

Precondition / Postcondition

One Step at a Time

I implored you to not worry about the entire computation. It can be difficult to understand where computations go.

Strange(x, y):
    x1 = x/4;      y1 = 3y;
    f1 = Strange(x1, y1);
    x2 = x − 3·x1; y2 = y;
    f2 = Strange(x2, y2);
    return f1 + f2;

101

Friends & Strong Induction
• Consider your input instance.
• If it is small enough, solve it on your own.
• Allocate work: construct one or more sub-instances.
  • Each must be smaller and must meet the precondition.
• Assume by magic your friends give you the answers for these.
• Use this help to solve your own instance.
• Do not worry about anything else:
  – do not micro-manage friends by tracing out what they and their friends' friends do;
  – do not worry about who your boss is.

Strange(x, y):
    if x < 4 then return xy;
    x1 = x/4;      y1 = 3y;
    f1 = Strange(x1, y1);
    x2 = x − 3·x1; y2 = y;
    f2 = Strange(x2, y2);
    return f1 + f2;

Know — Precond: integers x, y. Postcond: ???

Example trace: x = 30, y = 5. First friend: x1 = 7, y1 = 15, f1 = 105. Second friend: x2 = 9, y2 = 5, f2 = 45. Return 150.

102

Recurrence Relations ≈ Time of a Recursive Program

procedure Eg(In)
    n = |In|
    if (n ≤ 1) then put “Hi”
    else
        loop i = 1..n^c: put “Hi”
        loop i = 1..a: Eg(In/b)    // In/b = In cut into b pieces

T(1) = 1;  T(n) = a·T(n/b) + n^c

• n is the “size” of our input.
• a is the number of “friends”.
• n/b is the “size” of a friend's input.
• n^c is the work I personally do.

103

(Recursion tree for T(n): n at the root; children of size n/2; grandchildren of size n/4; …; leaves of size 1.)

104

Evaluating: T(n) = aT(n/b) + f(n)

Level             | Instance size | Work in stack frame | # stack frames     | Work in level
0                 | n             | f(n)                | 1                  | 1 · f(n)
1                 | n/b           | f(n/b)              | a                  | a · f(n/b)
2                 | n/b²          | f(n/b²)             | a²                 | a² · f(n/b²)
i                 | n/bⁱ          | f(n/bⁱ)             | aⁱ                 | aⁱ · f(n/bⁱ)
h = log n / log b | n/b^h = 1     | T(1)                | n^{log a / log b}  | n^{log a / log b} · T(1)

Total work: T(n) = ∑_{i=0..h} aⁱ · f(n/bⁱ)

105

Evaluating: T(n) = aT(n/b)+f(n)

106

Evaluating: T(n) = aT(n/b) + n^c = 4T(n/2) + n

Time for the top level: n^c = n¹ = n.
Time for the base cases: θ(n^{log a / log b}) = θ(n^{log 4 / log 2}) = θ(n²).
Dominated? c = 1 < 2 = log a / log b, so the base cases dominate.
Hence T(n) = θ(base cases) = θ(n^{log a / log b}) = θ(n²).

If we reduce the number of friends from 4 to 3, is the savings just 25%?

107

Evaluating: T(n) = aT(n/b) + n^c = 3T(n/2) + n

Time for the top level: n^c = n¹ = n.
Time for the base cases: θ(n^{log a / log b}) = θ(n^{log 3 / log 2}) = θ(n^{1.58}).
Dominated? c = 1 < 1.58 = log a / log b, so the base cases dominate.
Hence T(n) = θ(base cases) = θ(n^{log a / log b}) = θ(n^{1.58}).

Not just a 25% savings! θ(n²) vs θ(n^{1.58…}).

108

Evaluating: T(n) = aT(n/b) + n^c = 3T(n/2) + n²

Time for the top level: n², so c = 2.
Time for the base cases: θ(n^{log a / log b}) = θ(n^{log 3 / log 2}) = θ(n^{1.58}).
Dominated? c = 2 > 1.58 = log a / log b, so the top level dominates.
Hence T(n) = θ(top level) = θ(n²).

109

Evaluating: T(n) = aT(n/b) + n^c = 3T(n/2) + n^{1.58}

Time for the top level: n^{1.58}, so c = 1.58.
Time for the base cases: θ(n^{log a / log b}) = θ(n^{log 3 / log 2}) = θ(n^{1.58}).
Dominated? c = 1.58 = log a / log b, so all θ(log n) levels require θ(n^{1.58}) work each; the sum over the levels is no longer geometric.
Hence T(n) = θ(n^{1.58} log n).

110

Evaluating: T(n) = aT(n/b) + f(n)

(The same level-by-level table as before: level i has aⁱ stack frames, each doing f(n/bⁱ) work, down to n^{log a / log b} base cases doing T(1) each.)

All levels the same: from the top level to the base cases.

111

Evaluating: T(n) = aT(n/b)+f(n)

112

Check Lists for Recursive Programs

This is the format of “all” recursive programs. Don't deviate from this. Or else!

113

Merge Sort

Input: 88, 14, 98, 25, 62, 52, 79, 30, 23, 31.

Split the set into two halves (no real work).

Get one friend to sort the first half: 25, 31, 52, 88, 98.
Get another friend to sort the second half: 14, 23, 30, 62, 79.

114

Merge Sort

Merge the two sorted lists into one:
25, 31, 52, 88, 98  and  14, 23, 30, 62, 79
→ 14, 23, 25, 30, 31, 52, 62, 79, 88, 98
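The merge step can be sketched as follows (a minimal version with our own class and method names, not the implementation from the slides):

```java
import java.util.Arrays;

public class MergeDemo {
    // Merge two sorted arrays into one sorted array in linear time.
    public static int[] merge(int[] left, int[] right) {
        int[] out = new int[left.length + right.length];
        int i = 0, j = 0, k = 0;
        while (i < left.length && j < right.length)   // take the smaller front element
            out[k++] = (left[i] <= right[j]) ? left[i++] : right[j++];
        while (i < left.length) out[k++] = left[i++]; // copy any leftovers
        while (j < right.length) out[k++] = right[j++];
        return out;
    }

    public static void main(String[] args) {
        int[] a = {25, 31, 52, 88, 98}, b = {14, 23, 30, 62, 79};
        System.out.println(Arrays.toString(merge(a, b)));
        // prints [14, 23, 25, 30, 31, 52, 62, 79, 88, 98]
    }
}
```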

115

Java Implementation

Andranik Mirzaian

116

Java Implementation

Andranik Mirzaian

117

Quick Sort

Input: 88, 14, 98, 25, 62, 52, 79, 30, 23, 31.

Partition the set into two parts using a randomly chosen pivot (here 52):
elements ≤ 52: 14, 25, 30, 23, 31;  pivot: 52;  elements ≥ 52: 88, 98, 62, 79.

118

Quick Sort

Get one friend to sort the first part: 14, 23, 25, 30, 31.
Get another friend to sort the second part: 62, 79, 88, 98.

119

Quick Sort

Glue the pieces together (no real work):
14, 23, 25, 30, 31  +  52  +  62, 79, 88, 98
→ 14, 23, 25, 30, 31, 52, 62, 79, 88, 98

Faster because it is all done “in place”, i.e., in the input array.
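An in-place sketch of this scheme (a minimal version using a Lomuto-style partition with the last element as pivot, where the slides pick the pivot randomly; the class and method names are ours):

```java
import java.util.Arrays;

public class QuickSortDemo {
    // Sorts a[lo..hi] in place: partition around a pivot, then
    // hand each side to a "friend" (recursive call).
    public static void quickSort(int[] a, int lo, int hi) {
        if (lo >= hi) return;                  // 0 or 1 elements: already sorted
        int pivot = a[hi], p = lo;
        for (int i = lo; i < hi; i++)          // move elements <= pivot to the front
            if (a[i] <= pivot) { int t = a[i]; a[i] = a[p]; a[p] = t; p++; }
        int t = a[hi]; a[hi] = a[p]; a[p] = t; // pivot lands in its final spot
        quickSort(a, lo, p - 1);
        quickSort(a, p + 1, hi);
    }

    public static void main(String[] args) {
        int[] a = {88, 14, 98, 25, 62, 52, 79, 30, 23, 31};
        quickSort(a, 0, a.length - 1);
        System.out.println(Arrays.toString(a));
        // prints [14, 23, 25, 30, 31, 52, 62, 79, 88, 98]
    }
}
```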

120

Java Implementation

Andranik Mirzaian


121

Recursion on Trees  (define)

A binary tree is:
– the empty tree, or
– a node with a right and a left sub-tree.

(Figure: an example binary tree.)

122

Recursion on Trees  (friends)

Number of nodes = ?  Get help from friends.

(Figure: the example tree split into a left sub-tree of 6 nodes and a right sub-tree of 5 nodes.)

123

Recursion on Trees  (friends)

Number of nodes = number on the left + number on the right + 1 = 6 + 5 + 1 = 12.
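The friends scheme above, written out as code (a minimal self-contained sketch; the slides' version works on the LinkedBinaryTree Node class shown later):

```java
public class CountNodes {
    static class Node {
        Node left, right;
        Node(Node l, Node r) { left = l; right = r; }
    }

    // Empty tree: 0 nodes (base case). Otherwise ask one friend per
    // sub-tree and add 1 for the root.
    public static int numberOfNodes(Node tree) {
        if (tree == null) return 0;
        return numberOfNodes(tree.left) + numberOfNodes(tree.right) + 1;
    }

    public static void main(String[] args) {
        // A 4-node tree: root, its left child, that child's left leaf, and a right leaf.
        Node root = new Node(new Node(new Node(null, null), null), new Node(null, null));
        System.out.println(numberOfNodes(root));
        // prints 4
    }
}
```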

124

Recursion on Trees  (base case)

Base case? The number of nodes of the empty tree is 0. Base case!

125

Recursion on Trees  (communication)

Being lazy, I will only consider my root node and my communication with my friends. I will never look at my children's subtrees, but will trust them to my friends.

126

Recursion on Trees (code)

127

Recursion on Trees — Designing Program/Test Cases  (cases)

• Generic tree: left sub-tree of size n1, right of size n2 → n1 + n2 + 1.
• One empty sub-tree: n1 + 0 + 1. Try the same code — same code works!
• Both sub-trees empty: 0 + 0 + 1 = 1. Try the same code — same code works!
• Base case: the empty tree → 0.

128

Recursion on Trees  (time)

Time: T(n) = ∑_{stack frames} work done by the stack frame = Θ(n) × Θ(1) = Θ(n).

One stack frame for each node in the tree (and for each empty tree hanging off), and constant work per stack frame.

129

Recursion on Trees — Many Children  (friends)(mult-children)

One friend for each sub-tree.

Number of nodes = 4 + 2 + 4 + 2 + 1 = 13
(the root's sub-trees have 4, 2, 4, and 2 nodes, plus 1 for the root itself).

130

Recursion on Trees

3

Designing Program/Test Cases

generic generic

3

generic

3 0+1=1

n1

n1 + 1

Same code works!

Same code works!

Try same code

Try same code

generic

n1 n3

n1 + n2 + n3 + 1

n2

But is this needed (if not the input)

(cases)(mult-children)

131

Recursion on Trees (code)(mult-children)

132

Recursion on Trees
Time: T(n) = ∑stack frames (work done by stack frame)
= ∑stack frames Θ(# subroutine calls)
= ∑nodes Θ(# edges at node)
= Θ(edgesTree) = Θ(nodesTree) = Θ(n)

133

We pass the recursive program a “binary tree”. But what type is it really? This confused Jeff at first.

class LinkedBinaryTree<E> {
    class Node {
        E element;
        Node parent;
        Node left;
        Node right;
    }
    Node root = null;
}

Tree

Recursion on Trees

134

One would think tree is of type LinkedBinaryTree. Then getting its left subtree would be confusing.

class LinkedBinaryTree { class Node { E element; Node parent; Node left; Node right; } Node root = null;

Tree

Recursion on Trees

left_Tree

(LinkedBinaryTree tree)

135


136

It is easier to have tree be of type Node, but think of it as the subtree rooted at the node pointed at. The left child is then

class LinkedBinaryTree { class Node { E element; Node parent; Node left; Node right; } Node root = null;

Tree

Recursion on Trees

(Node tree)

tree

tree.left or tree.Getleft()

or Tree.leftSub(tree) or leftSub(tree)

137

But the outside user does not know about pointers to nodes.

class LinkedBinaryTree<E> {
    class Node {
        E element;
        Node parent;
        Node left;
        Node right;
    }
    Node root = null;

    public int NumberNodes() { return NumberNodesRec( root ); }
}

Tree

Recursion on Treestree

private int NumberNodesRec (Node tree)

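Putting the pieces together, a sketch of this public-wrapper pattern: the outside user calls a pointer-free method, while the private recursion passes "the subtree rooted here" as a Node. (The addLeftChain helper is hypothetical, added only so the sketch can be exercised.)

```java
public class LinkedBinaryTree<E> {
    private static class Node<E> {
        E element;
        Node<E> parent, left, right;
    }
    private Node<E> root = null;

    // Outside users never see Node pointers...
    public int numberNodes() { return numberNodesRec(root); }

    // ...while the private recursion treats a Node as "the subtree rooted here".
    private int numberNodesRec(Node<E> tree) {
        if (tree == null) return 0;   // empty tree: base case
        return numberNodesRec(tree.left) + numberNodesRec(tree.right) + 1;
    }

    // Hypothetical helper: push each element on as a new root with the old
    // tree as its left subtree, so the sketch has something to count.
    public void addLeftChain(E e) {
        Node<E> n = new Node<>();
        n.element = e;
        n.left = root;
        if (root != null) root.parent = n;
        root = n;
    }
}
```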

138

Jeff Edmonds

York University COSC 2011Lecture 6

Balanced Trees

Dictionary/Map ADT
Binary Search Trees
Insertions and Deletions
AVL Trees
Rebalancing AVL Trees
Union-Find Partition
Heaps & Priority Queues
Communication & Huffman Codes
(Splay Trees)

139

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

k1,v1k2,v2k3,v3k4,v4

Examples:• key = word, value = definition• key = social insurance number

value = person’s data

140

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

k1,v1k2,v2k3,v3k4,v4

Insert SearchUnordered Array O(n)O(1)Implementations:

Array

0

1

2

3

4

5

6

7

k5,v5

141

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Insert Search

Ordered ArrayUnordered Array

O(n)O(n)O(1)O(logn)

Implementations:

Array

0

1

2

3

4

5

6

7

6,v5

142

trailerheader

nodes/positions

entries

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Insert Search

Ordered ArrayOrdered Linked List

Unordered ArrayO(n)O(n)

O(n)O(1)O(logn)

Implementations:

O(n)

6,v5

Inserting is O(1) if you have the spot, but O(n) to find the spot.

143

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Insert Search

Ordered ArrayUnordered Array

Binary Search Tree

O(n)O(n)

O(logn)

O(1)

O(logn)

O(logn)

Implementations:

38

25

17

4 21

31

28 35

51

42

40 49

63

55 71

144

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Insert Search

Ordered ArrayUnordered Array

Binary Search Tree

O(n)O(n)

O(logn)

O(1)

O(logn)

O(logn)

Heaps Faster: O(logn) O(n)

Implementations:

Max O(1)

Heaps are good for Priority Queues.

145

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Hash Tables Avg: O(1) O(1)

Next

O(1) (Avg)

O(1)

O(n)

Hash Tables are very fast,but keys have no order.

Insert Search

Ordered ArrayUnordered Array

Binary Search Tree

O(n)O(n)

O(logn)

O(1)

O(logn)

O(logn)

Heaps Faster: O(logn)

Implementations:

Max O(1)

146

Unsorted List

SortedList

Balanced Trees

Splay Trees

Heap Hash Tables

•Search

•Insert•Delete

•Find Max

•Find Next in Order

O(log(n))

Balanced Trees

O(n)

O(1) O(n)

O(1)

O(1)

O(n)

O(n)

O(log(n))

O(log(n))

O(log(n))

O(log(n))

O(1)

O(n)Amortized

O(1)

(Priority Queue)

O(1)

O(n)

O(n)

Worst case O(n)

Practice better

O(log(n))better

(Dictionary)

O(n)

(Static)

From binary search toBinary Search Trees

147

38

25

17

4 21

31

28 35

51

42

40 49

63

55 71

Binary Search Tree

All nodes in left subtree ≤ Any node ≤ All nodes in right subtree

≤≤ ≤

key 17

38

25

17

4 21

31

28 35

51

42

40 49

63

55 71

Algorithm TreeSearch(k, v):
    v = T.root()
    loop
        if T.isExternal(v): return “not there”
        if k < key(v): v = T.left(v)
        else if k = key(v): return v
        else { k > key(v) }: v = T.right(v)
    end loop

Move down the tree. Loop Invariant: If the key is contained in the original tree, then the key is contained in the sub-tree rooted at the current node.

Iterative Algorithm

key 17

38

25

17

4 21

31

28 35

51

42

40 49

63

55 71

Recursive Algorithm: If the key is not at the root, ask a friend to look for it in the appropriate subtree.

Algorithm TreeSearch(k, v):
    if T.isExternal(v): return “not there”
    if k < key(v): return TreeSearch(k, T.left(v))
    else if k = key(v): return v
    else { k > key(v) }: return TreeSearch(k, T.right(v))

3

1

11

9 12

8

v

w

2

10

6

5 7

4

Insertions/Deletions

To insert(key, data):

We search for key.

Not being there, we end up in an empty tree.

Insert the key there.

Insert 10

3

1

11

9 12

8

v

w

2

10>

6

5 7

4

Insertions/Deletions

To Delete(keydel, data):

If it does not have two children,

point its one child at its parent.

Delete 4

keydel

3

1

11

9 12

8w

2

10>

6

5 7

4

Insertions/Deletions

To Delete(keydel, data):

else find the next key in order, keynext:
    go right once, then left, left, left, … until reaching an empty tree.
Replace keydel (the key to delete) with keynext,
then point keynext’s one child at its parent.

Delete 3

keynext

keydel

Performance: find, insert and remove take O(height) time.

In a balanced tree, the height is O(log n)

In the worst case, it is O(n)

Thus it is worthwhile to balance the tree (next topic)!
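The search and insert operations above can be sketched in a few lines of Java (null plays the role of an external, empty tree; names are illustrative, not the text's implementation):

```java
public class BST {
    static class Node { int key; Node left, right; Node(int k) { key = k; } }
    Node root;

    // Iterative search: move down, keeping the loop invariant that if the key
    // is in the original tree, it is in the subtree rooted at v.
    boolean contains(int k) {
        Node v = root;
        while (v != null) {
            if (k < v.key) v = v.left;
            else if (k > v.key) v = v.right;
            else return true;
        }
        return false;                       // fell off into an empty tree: not there
    }

    // Insert: search for k and place it in the empty tree where the search ends.
    void insert(int k) { root = insertRec(root, k); }

    private Node insertRec(Node v, int k) {
        if (v == null) return new Node(k);  // the empty spot the search reached
        if (k < v.key) v.left = insertRec(v.left, k);
        else if (k > v.key) v.right = insertRec(v.right, k);
        return v;                           // duplicates are ignored in this sketch
    }
}
```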

AVL Trees: AVL trees are “mostly” balanced. A tree is an AVL Tree if and only if the heights of siblings differ by at most 1, i.e.
balanceFactor(v) = height(rightChild(v)) - height(leftChild(v)) ∈ { -1, 0, 1 }.

Claim: The height of an AVL tree storing n keys is ≤ O(log n).

88

44

17 78

32 50

48 62

2subtreeheight 3

subtreeheight

balanceFactor = 2-3 = -1

Cases

Rebalancing after an Insertion

x

z

y

height = h

T0 T1

T2

T3

h-1 h-3

h-2

one is h-3 & one is h-4

h-3 x

z

y

height = h

T1 T2

T0

T3

h-1 h-3

h-2

one is h-3 & one is h-4

h-3

+2

+1

+2

-1

x

z

y

height = h

T3T2

T1

T0

h-1h-3

h-2

one is h-3 & one is h-4

h-3 x

z

y

height = h

T2T1

T3

T0

h-1h-3

h-2

one is h-3 & one is h-4

h-3

-2

-1

-2

+1

Rebalancing after an Insertion

7

4

3

8

5

Problem!

Increases heights along path from leaf to root.

6 balanceFactor0

-1

+2

-1

Inserting new leaf 6 in to AVL tree

Rebalancing after an Insertion

7

4

3

8

5

Problem!

6

-1

+2

-1

Denote

z = the lowest imbalanced node

y = the child of z with highest subtree

x = the child of y with highest subtree

Inserting new leaf 6 in to AVL tree

Rebalancing after an Insertion

z

y

x

T2 T3

T4

T1

rotateL(y) z

x

y

y ≤ x ≤ z

Inserting new leaf 6 in to AVL tree

Rebalancing after an Insertion

z

y

x

T2 T3

T4

T1

rotateL(y)

rotateR(z)

z

x

y

y ≤ x ≤ zy z

x

Inserting new leaf 6 in to AVL tree

Inserting new leaf 6 in to AVL tree

Rebalancing after an Insertion

z

y

x

T2 T3

T4

T1

y z

x

T1 T2 T3T4

• This subtree is balanced.• And shorter by one.• Hence the whole is an AVL Tree
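The single rotation rotateR(z) used above can be sketched as follows, assuming height-augmented nodes (field and method names are illustrative). The key point is that subtree T2 = y.right changes parent, and the order y ≤ (keys in T2) ≤ z is preserved, so the BST property survives:

```java
public class AvlRotation {
    static class Node {
        int key, height = 1;
        Node left, right;
        Node(int k) { key = k; }
    }

    static int h(Node n) { return n == null ? 0 : n.height; }
    static void update(Node n) { n.height = 1 + Math.max(h(n.left), h(n.right)); }

    // rotateR(z): lift z's left child y to be the new subtree root.
    static Node rotateRight(Node z) {
        Node y = z.left;
        z.left = y.right;   // T2 moves under z
        y.right = z;        // z becomes y's right child
        update(z);          // recompute heights bottom-up
        update(y);
        return y;           // y is the new subtree root
    }
}
```

rotateL is the mirror image, and the double rotation in the trinode case is just two of these in sequence.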

Rest of Tree Rest of Tree

162

7

19

31

23

13

2

1

15

1

0 0

3

0 0 0 0

0

1

3 2

1

4

5

3

40

0 0

1

2

9

11

0 0

1

2

8

0 0

1

Rebalancing after an InsertionExample: Insert 12

163

7

19

31

23

13

2

1

15

1

0 0

3

0 0 0 0

0

1

3 2

1

4

5

3

40

0 0

1

2

9

11

0 0

1

2

8

0 0

1

w

Step 1.1: top-down search

Rebalancing after an InsertionExample: Insert 12

164

7

19

31

23

13

2

1

15

1

0 0

3

0 0 0 0

0

1

3 2

1

4

5

3

40

0 0

1

2

9

11

0

1

2

8

0 0

1

w

Step 1.2: expand and insert new item in it

12

0 0

1

Rebalancing after an InsertionExample: Insert 12

165

7

19

31

23

13

2

1

15

1

0 0

3

0 0 0 0

0

1

4 2

1

4

5

3

40

0 0

1

2

9

11

0

2

3

8

0 0

1

w

Step 2.1: move up along ancestral path of ; updateancestor heights; find unbalanced node.

12

0 0

1

imbalance

Rebalancing after an InsertionExample: Insert 12

166

7

19

31

23

13

2

1

15

1

0 0

3

0 0 0 0

0

1

4 2

1

4

5

3

40

0 0

1

2

9

11

0

2

3

8

0 0

1

Step 2.2: trinode discovered (needs double rotation)

12

0 0

1

x

y

z

Rebalancing after an InsertionExample: Insert 12

167

7

19

31

23

11

2

1

15

1

0 0

3

0 0

0 0

01

3 2

1

4

5

3

40

0 0

1

2

9 13

0

22

8

0 0

1

Step 2.3: trinode restructured; balance restored. DONE!

12

0 0

1

zy

x

Rebalancing after an InsertionExample: Insert 12

Rebalancing after a deletion

Very similar to before.

Unfortunately, trinode restructuring may reduce the height of the subtree, causing another imbalance further up the tree.

Thus this search and repair process must in the worst case be repeated until we reach the root.

See text for implementation.

169

End

Midterm Review

170

Union-Find Data structure.

Average Time = Ackermann’s⁻¹(E) ≤ 4 (the inverse Ackermann function)

171

Heaps, Heap Sort, &Priority Queues

J. W. J. Williams, 1964

172

Abstract Data Types

Restricted Data Structure: Sometimes we limit what operations can be done
• for efficiency
• for understanding
Stack: A list, but elements can only be pushed onto and popped from the top.
Queue: A list, but elements can only be added at the end and removed from the front. Important in handling jobs.
Priority Queue: The “highest priority” element is handled next.

173

Priority Queues

Sorted List

UnsortedList

Heap

•Items arrive with a priority.

O(n) O(1) O(logn)

•Item removed is that with highest

priority.

O(1) O(n) O(logn)

174

Heap Definition
• Completely Balanced Binary Tree
• The value of each node ≥ each of the node's children.
• Left or right child could be larger.

Where can 1 go?Maximum is at root.

Where can 8 go?

Where can 9 go?

175

Heap Data StructureCompletely Balanced Binary Tree

Implemented by an Array

176

Heap Pop/Push/Changes

With Pop, a Priority Queue returns the highest priority data item.

This is at the root.

21

21

177

Heap Pop/Push/Changes

But this is now the wrong shape!To keep the shape of the tree,

which space should be deleted?

178

Heap Pop/Push/Changes

What do we do with the element that was there?Move it to the root.

3

3

179

Heap Pop/Push/Changes

But now it is not a heap!The left and right subtrees still are heaps.

3

3

180

Heap Pop/Push/Changes

But now it is not a heap!

3

3

The 3 “bubbles” down until it finds its spot.

The max of these three moves up.

Time = O(log n)

181

When inserting a new item,to keep the shape of the tree,

which new space should be filled?

21

21

Heap Pop/Push/Changes

182

21

21

Heap Pop/Push/Changes

But now it is not a heap!The 21 “bubbles” up until it finds its spot.

The max of these two moves up.30

30

Time = O(log n)
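The pop (bubble down) and push (bubble up) operations above, with the completely balanced tree stored in an array, can be sketched as (a minimal max-heap; names are illustrative):

```java
import java.util.ArrayList;

public class MaxHeap {
    // Completely balanced tree stored in an array:
    // children of index i live at 2i+1 and 2i+2, parent at (i-1)/2.
    private final ArrayList<Integer> a = new ArrayList<>();

    public void push(int x) {               // fill the next leaf, then bubble up
        a.add(x);
        int i = a.size() - 1;
        while (i > 0 && a.get(i) > a.get((i - 1) / 2)) {
            swap(i, (i - 1) / 2);           // max of the two moves up
            i = (i - 1) / 2;
        }
    }

    public int pop() {                      // max is at the root
        int max = a.get(0);
        int last = a.remove(a.size() - 1);  // keep the shape: delete the last leaf
        if (!a.isEmpty()) {
            a.set(0, last);                 // move its element to the root...
            siftDown(0);                    // ...and bubble it down
        }
        return max;
    }

    private void siftDown(int i) {
        while (true) {
            int l = 2 * i + 1, r = 2 * i + 2, big = i;
            if (l < a.size() && a.get(l) > a.get(big)) big = l;
            if (r < a.size() && a.get(r) > a.get(big)) big = r;
            if (big == i) return;           // max of the three is already on top
            swap(i, big);
            i = big;
        }
    }

    private void swap(int i, int j) {
        int t = a.get(i); a.set(i, a.get(j)); a.set(j, t);
    }
}
```

Both operations follow one root-to-leaf path, hence the O(log n) time.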

183

Adaptable Heap Pop/Push/Changes

But now it is not a heap!The 39 “bubbles” down or up until it finds its spot.

Suppose some outside user knows about some data item c, remembers where it is in the heap, and changes its priority from 21 to 39.

21

39

3927

184

Adaptable Heap Pop/Push/Changes

But now it is not a heap!The 39 “bubbles” down or up until it finds its spot.

Suppose some outside user also knows about data item f, and its location in the heap just changed. The Heap must be able to find this outside user and tell him it moved.

27

39

39

27 f

Time = O(log n)

185

Heap Implementation
• A location-aware heap entry is an object storing: key, value, and the position of the entry in the underlying heap.
• In turn, each heap position stores an entry.
• Back pointers are updated during entry swaps.

Last Update: Oct 23, 2014

Andy 185

4 a

2 d

6 b

8 g 5 e 9 c

186

Selection Sort

Largest i values are sorted on the side. Remaining values are off to the side.
[Figure: highway-exit metaphor for selecting the next maximum]

Max is easier to find if a heap.

Selection

187

Heap Sort

Largest i values are sorted on the side. Remaining values are in a heap.
[Figure: highway-exit metaphor]

O(n log n) time.
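The whole of heap sort is n pushes followed by n pops, O(log n) each, hence O(n log n) total. Using Java's built-in heap as a stand-in for the one above:

```java
import java.util.PriorityQueue;

public class HeapSortDemo {
    // Push everything into a heap, then repeatedly pop the extreme element.
    public static int[] heapSort(int[] in) {
        PriorityQueue<Integer> heap = new PriorityQueue<>(); // java.util min-heap
        for (int x : in) heap.add(x);                        // n pushes: O(n log n)
        int[] out = new int[in.length];
        for (int i = 0; i < out.length; i++) out[i] = heap.poll(); // n pops
        return out;
    }
}
```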

188

Communication & Entropy

 Claude Shannon (1948) 

[Figure: binary code tree; each left edge is labelled 0 and each right edge 1]

Use a Huffman Code described by a binary tree.

001000101

I first get , then I start over to get

189

Communication & Entropy

 Claude Shannon (1948) 

[Figure: Huffman code tree with 0/1 edge labels]

Objects that are more likely will have shorter codes.

I get it.I am likely to answer .

so you give it a 1 bit code.

190

Jeff Edmonds

York University COSC 2011Lecture 7

Hash Tables

Dictionary/Map ADT
Direct Addressing
Hash Tables
Random Algorithms and Hash Functions
Key to Integer
Separate Chaining
Probe Sequence
Put, Get, Del, Iterators
Running Time
Simpler Schemes

191

Random Balls in Bins Throw m/2 balls (keys)

randomlyinto m bins (array cells)

The balls get spread out reasonably well.
- Exp( # of balls a ball shares a bin with ) = O(1)
- O(1) bins contain O(log n) balls.

192

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

k1,v1k2,v2k3,v3k4,v4

Examples:• key = word, value = definition• key = social insurance number

value = person’s data

193

• Map ADT methods:– put(k, v): insert entry (k, v) into the map M.

• If there is a previous value associated with k return it.• (Multi-Map allows multiple values with same key)

– get(k): returns value v associated with key k. (else null)– remove(k): remove key k and its associated value.– size(), isEmpty()– Iterator:

• keys(): over the keys k in M• values(): over the values v in M• entries(): over the entries k,v in M

Dictionary/Map ADT

194

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

k1,v1k2,v2k3,v3k4,v4

Insert SearchUnordered Array O(n)O(1)Implementations:

Array

0

1

2

3

4

5

6

7

k5,v5

195

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Insert Search

Ordered ArrayUnordered Array

O(n)O(n)O(1)O(logn)

Implementations:

Array

0

1

2

3

4

5

6

7

6,v5

196

trailerheader

nodes/positions

entries

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Insert Search

Ordered ArrayOrdered Linked List

Unordered ArrayO(n)O(n)

O(n)O(1)O(logn)

Implementations:

O(n)

6,v5

Inserting is O(1) if you have the spot, but O(n) to find the spot.

197

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

2,v34,v47,v19,v2

Insert Search

Ordered ArrayOrdered Linked List

Unordered Array

Binary Search Tree

O(n)O(n)

O(n)

O(logn)

O(1)

O(logn)

O(logn)

Implementations:

O(n)

38

25

17

4 21

31

28 35

51

42

40 49

63

55 71

198

Dictionary/Map ADT

Problem: Store value/data associated with keys.

Inputkey, value

Insert Search

Ordered ArrayOrdered Linked List

Unordered Array

Binary Search Tree

O(n)O(n)

O(n)

O(logn)

O(1)

O(logn)

O(logn)

Hash Tables Avg: O(1) O(1)

NextImplementations:

O(n)

O(n)

O(1) (Avg)

O(1)

O(n)

O(1)

Hash Tables are very fast,but keys have no order.

5,v19,v22,v37,v4

199

Inputkey, value

Hash Tables

0

1

2

3

4

5

6

7

8

9

The Mappingfrom key

to Array Cellis many to one

CalledHash FunctionHash(key) = i

Universe of Keys

Consider an array # items stored.

0123456789

Universe of keys is likely huge.

(eg social insurance numbers)

200

Inputkey, value

Hash Tables

0

1

2

3

4

5

6

7

8

9

The Mappingfrom key

to Array Cellis many to one

CalledHash FunctionHash(key) = i

Universe of Keys

Consider an array # items stored.

0123456789

5,v1

Hash Function O(1) O(1)Insert SearchImplementations:

4,v55,?

9,v22,v37,v4

5,v1

4,v5

9,v22,v37,v4

Collisionsare a problem.

201

I have an algorithm A that I claim works.

Actually my algorithm always gives the right answer.

Oh yeah, I have a worst case input I for which it does not.

Understand Quantifiers!!! Problem P is computable if
∃ A, ∀ I, A(I) = P(I) & Time(A, I) ≤ T

Ok, but I found a set of keys I = key1, …, keyn
for which lots of collisions happen and hence the time is bad.

202

I have a random algorithm A that I claim works.

I know the algorithm A, but not its random coin flips R. I do my best to give you a worst case input I.

Problem P is computable by a random algorithm if
∃ A, ∀ I, ∀ R, A_R(I) = P(I) and Expected_R Time(A_R, I) ≤ T

Understand Quantifiers!!! Remember Quick Sort.

The random coin flips R are independent of the input.

Actually my algorithm always gives the right answer.

And for EVERY input I,the expected running time (over choice of R)

is great.

There are worst case coin flips but not worst case inputs.

203

Inputkey, value

Random Hash Functions

0

1

2

3

4

5

6

7

8

9

Universe of Keys

Fix the worst case input I

Choose a random mapping

Hash(key) = i

287 005,v1

193 005,v5287 005,?

923 005,v2394 005,v3482 005,v4

We don’t expect there to be a lot of collisions.

(Actually, the random Hash function likely is

chosen and fixed before the input comes, but the key is that the worst case input

does not “know” the hash function.)

287 005

193 005

923 005394 005

482 005

204

Inputkey, value

Random Hash Functions

0

1

2

3

4

5

6

7

8

9

Universe of Keys

287 005,v1

193 005,v5287 005,?

923 005,v2394 005,v3482 005,v4

287 005

193 005

923 005394 005

482 005

Throw m/2 balls (keys)randomly

into m bins (array cells)

The balls get spread out reasonably well.
- Exp( # of balls a ball shares a bin with ) = O(1)
- O(1) bins contain O(log n) balls.

205

Random Hash Functions

0

1

2

3

4

5

6

7

8

9

Universe of Keys

0123456789

Choose a random mapping Hash(key) = i

We want Hash to be computed in O(1) time.

Theory people useHash(key) = (akey mod p) mod N

N = size of arrayp is a prime > |U|

a is randomly chosen [1..p-1]n is the number of data items.

a adds just enough

randomness.

The integers mod p form a finite field

similar to the reals.

The mod N ensures the result indexes a

cell in the array.
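This hash function is one line of Java (small toy constants here for illustration; in practice p must exceed the size of the key universe):

```java
public class Hashing {
    // Hash(key) = (a*key mod p) mod N, with p prime and a random in [1, p-1].
    static final long P = 1009;   // prime, > |U| in this toy setup
    static final int  N = 11;     // size of the array

    static int hash(long a, long key) {
        return (int) (((a * key) % P) % N);  // mod p mixes, mod N picks a cell
    }
}
```

For example, with a = 832 and key = 103 this lands in cell 5, matching the deck's worked example.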

.

206

Random Hash Functions

0

1

2

3

4

5

6

7

8

9

Universe of Keys

0123456789

Pairwise Independence: ∀ distinct k1 & k2, Pra( Hasha(k1) = Hasha(k2) ) = 1/N

Choose a random mapping Hash(key) = i

We want Hash to be computed in O(1) time.

Theory people useHash(key) = (akey mod p) mod N

N = size of arrayp is a prime > |U|

a is randomly chosen [1..p-1]n is the number of data items.

Proof: Fix distinct k1 & k2 ∈ U. Because p > |U|, (k2 - k1) mod p ≠ 0. Because p is prime, every nonzero element has an inverse, e.g. 2·3 mod 5 = 1. Let e = (k2 - k1)⁻¹. Let D = a·(k2 - k1) mod p, so a = D·e mod p, and let d = D mod N. k1 & k2 collide iff d = 0 iff D = j·N for some j ∈ [0..p/N] iff a = j·N·e mod p. The probability that a has one of these p/N values is (1/p)·(p/N) = 1/N.

207

Random Hash Functions

0

1

2

3

4

5

6

7

8

9

Universe of Keys

0123456789

Pairwise Independence: ∀ distinct k1 & k2, Pra( Hasha(k1) = Hasha(k2) ) = 1/N

Choose a random mapping Hash(key) = i

We want Hash to be computed in O(1) time.

Theory people useHash(key) = (akey mod p) mod N

N = size of arrayp is a prime > |U|

a is randomly chosen [1..p-1]n is the number of data items.

Insert key k. Exp( # other keys in its cell ) = ∑k1≠k Pr( k1 collides with k ) = n · 1/N = O(1).

208

Random Hash Functions

0

1

2

3

4

5

6

7

8

9

Universe of Keys

0123456789

Pairwise Independence: ∀ distinct k1 & k2, Pra( Hasha(k1) = Hasha(k2) ) = 1/N

Choose a random mapping Hash(key) = i

We want Hash to be computed in O(1) time.

Theory people useHash(key) = (akey mod p) mod N

N = size of arrayp is a prime > |U|

a is randomly chosen [1..p-1]n is the number of data items.

Not much more independence: knowing that Hasha(k1) = Hasha(k2) decreases the range of a from p to p/N values. Doing this log p / log N times likely determines a, and hence all further collisions.

209

Random Hash Functions

0

1

2

3

4

5

6

7

8

9

Universe of Keys

0123456789

This is usually written a·key + b. The b adds randomness to which cells get hit, but does not help with collisions.

Choose a random mapping Hash(key) = i

We want Hash to be computed in O(1) time.

Theory people useHash(key) = (akey mod p) mod N

N = size of arrayp is a prime > |U|

a is randomly chosen [1..p-1]n is the number of data items.

210

Inputkey, value

Handling Collisions

0

1

2

3

4

5

6

7

8

9

10

Universe of Keys

0123456789

583, v3394,v1

482,v2

394,v1482,v2 583,v3

Handling CollisionsWhen different data items are mapped to

the same cell

211

Inputkey, value

Separate Chaining

Universe of Keys

0123456789

583, v3394,v1

482,v2

394,v1482,v2 583,v3

Separate ChainingEach cell uses external

memory to store all the data items hitting that cell.

Simplebut requires

additional memory
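A minimal separate-chaining table in Java (a toy sketch: the key-mod-N hash stands in for the randomized hash function above, and values are just strings):

```java
import java.util.LinkedList;

public class ChainedTable {
    static class Entry {
        long key; String value;
        Entry(long k, String v) { key = k; value = v; }
    }

    private final LinkedList<Entry>[] cells;   // one chain per array cell

    @SuppressWarnings("unchecked")
    ChainedTable(int n) {
        cells = new LinkedList[n];
        for (int i = 0; i < n; i++) cells[i] = new LinkedList<>();
    }

    private int hash(long key) { return (int) (key % cells.length); } // toy hash

    public void put(long key, String value) {
        for (Entry e : cells[hash(key)])
            if (e.key == key) { e.value = value; return; }  // overwrite old value
        cells[hash(key)].add(new Entry(key, value));        // else chain it on
    }

    public String get(long key) {
        for (Entry e : cells[hash(key)])                    // scan only this chain
            if (e.key == key) return e.value;
        return null;                                        // not there
    }
}
```

Since the expected chain length is O(1), put and get are expected O(1).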

482,v2

394,v1 583,v3

0

1

2

3

4

5

6

7

8

9

10

212

Inputkey, value

A Sequence of Probes

Universe of Keys

0123456789

583, v3394,v1

482,v2

394,v1482,v2 583,v3

Open addressingThe colliding item is placed in a different cell of the table

0

1

2

3

4

5

6

7

8

9

10

213

Inputkey, value

A Sequence of Probes

Universe of Keys

0123456789

583, v3

394,v1

482,v2

103,v6Cells chosen by a

sequence of probes.

put(key k, value v) i1 = (ak mod p) mod N

903,v5

290,v4 Theory people usei1 = Hash(key) = (akey mod p) mod N

8321009

11N = size of arrayp is a prime > |U|

a[1,p-1] is randomly chosen

a·key = 832 · 103 = 85696
85696 mod 1009 = 940
940 mod 11 = 5 = i1

5 0

1

2

3

4

5

6

7

8

9

10

214

Inputkey, value

A Sequence of Probes

Universe of Keys

0123456789

394,v1

482,v2

103,v6Cells chosen by a

sequence of probes.

put(key k, value v) i1 = (ak mod p) mod N

903,v5

290,v4

5

103,v6

This was our first in the sequence of probes.

1

0

1

2

3

4

5

6

7

8

9

10

583, v3

215

Inputkey, value

A Sequence of Probes

Universe of Keys

0123456789

394,v1

482,v2

103,v6

put(key k, value v):
    i1 = (a·k mod p) mod N
    d = (b·k mod q) + 1

903,v5

290,v4

5

Double Hashto get sequence distance d.

1

0

1

2

3

4

5

6

7

8

9

10

3

583, v3

216

Inputkey, value

A Sequence of Probes

Universe of Keys

0123456789

394,v1

482,v2

103,v6

903,v5

290,v4

5

1

0

1

2

3

4

5

6

7

8

9

10

3

put(key k, value v):
    i1 = (a·k mod p) mod N
    d = (b·k mod q) + 1
    for j = 1..N:
        i = i1 + (j-1)·d mod N

2

d=3

583, v3

d=3

3

3

4

5If N is prime,

this sequence will reach each cell.

Double Hashto get sequence distance d.

217

Inputkey, value

A Sequence of Probes

Universe of Keys

0123456789

394,v1

482,v2

103,v6

903,v5

290,v4

1

0

1

2

3

4

5

6

7

8

9

10

put(key k, value v):
    i1 = (a·k mod p) mod N
    d = (b·k mod q) + 1
    for j = 1..N:
        i = i1 + (j-1)·d mod N
        if ( cell(i) == empty )
            cell(i) = k,v
            return
        if ( cellkey(i) == k )
            vold = cellvalue(i)
            cell(i) = k,v
            return vold

2

583, v3

3

3

4

5

Stop this sequence of probes when:Cell is empty

103,v6

or key already there

was not there

value53

218

Inputkey, value

A Sequence of Probes

Universe of Keys

0123456789

394,v1

482,v2

103,v6

903,v5

290,v4

0

1

2

3

4

5

6

7

8

9

10

583, v3

103,v6

value del(key k)

115,v7

115,v7

Del 583

103,?477,?

deleted
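The probe sequence with "deleted" markers (tombstones) can be sketched as a compact open-addressing table (a simplified toy: the start and step functions here are illustrative stand-ins for the two randomized hash functions above, and put does not hunt past a reused tombstone for an older copy of the key):

```java
public class ProbeTable {
    private static final long DELETED = Long.MIN_VALUE;  // tombstone marker
    private final long[] keys;
    private final String[] vals;
    private final boolean[] used;

    ProbeTable(int n) {
        keys = new long[n]; vals = new String[n]; used = new boolean[n];
    }

    // Probe sequence i1, i1+d, i1+2d, ... (mod N). N prime and d in [1, N-1]
    // guarantee the sequence reaches every cell.
    private int start(long k) { return (int) (k % keys.length); }
    private int step(long k)  { return (int) (k % (keys.length - 1)) + 1; }

    public void put(long k, String v) {
        int i = start(k), d = step(k);
        for (int j = 0; j < keys.length; j++, i = (i + d) % keys.length)
            if (!used[i] || keys[i] == DELETED || keys[i] == k) {
                used[i] = true; keys[i] = k; vals[i] = v;  // empty, tombstone, or same key
                return;
            }
        throw new IllegalStateException("table full");
    }

    public String get(long k) {
        int i = start(k), d = step(k);
        for (int j = 0; j < keys.length && used[i]; j++, i = (i + d) % keys.length)
            if (keys[i] == k) return vals[i];  // tombstones are skipped, not stopped at
        return null;                           // hit a never-used cell: not there
    }

    public void del(long k) {
        int i = start(k), d = step(k);
        for (int j = 0; j < keys.length && used[i]; j++, i = (i + d) % keys.length)
            if (keys[i] == k) { keys[i] = DELETED; vals[i] = null; return; }
    }
}
```

The tombstone matters: if del simply emptied the cell, later gets would stop there and miss keys placed further along the same probe sequence.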

219

Inputkey, value

Running Time
The load factor a = n/N < 0.9

# data items / # of array cells.

Universe of Keys

0123456789

394,v1

0

1

2

3

4

5

6

7

8

9

10

938,v2

193,v3472,v4

873,v5093,v6

N = 11n = 6a = 6/11

For EVERY input: expected number of probes is O(1/(1-a)) = O(1)

220

Input: key, value

When the load factor gets bigger than some threshold, rehash all items into an array that is double the size.

Universe of Keys

0123456789

394,v1

0

1

2

3

4

5

6

7

8

9

10

938,v2

193,v3472,v4

873,v5093,v6

Total cost of doubling

= 1 + 2 + 4 + 8 + 16 + … + n = 2n-1

amortized time = TotalTime(n)/n = (2n-1)/n = O(1).

Running Time

221

Searching in a Graph

Jeff Edmonds

York University COSC 3101Lecture 5

Generic Search
Breadth First Search
Dijkstra's Shortest Paths Algorithm
Depth First Search
Linear Order

Graphs• A graph is a pair (V, E), where

– V is a set of nodes, called vertices– E is a collection of pairs of vertices, called edges– Vertices and edges are positions and store elements

• Example:– A vertex represents an airport and stores the three-letter airport code– An edge represents a flight route between two airports and stores the

mileage of the route

Andy Mirzaian 222

[Figure: flight network with vertices ORD, PVD, MIA, DFW, SFO, LAX, LGA, HNL and edge mileages, e.g. 849, 802, 1387, 1743, 2555]

Last Update: Dec 4, 2014

Edge Types• Directed edge

– ordered pair of vertices (u,v)– first vertex u is the origin– second vertex v is the destination– e.g., a flight

• Undirected edge– unordered pair of vertices (u,v)– e.g., a flight route

• Directed graph– all the edges are directed– e.g., route network

• Undirected graph– all the edges are undirected– e.g., flight network

Andy Mirzaian 223

ORD PVDflight

AA 1206

ORD PVD849

miles

Last Update: Dec 4, 2014

Applications• Electronic circuits

– Printed circuit board– Integrated circuit

• Transportation networks– Highway network– Flight network

• Computer networks– Local area network– Internet– Web

• Databases– Entity-relationship diagram

Andy Mirzaian 224Last Update: Dec 4, 2014

Terminology• End vertices (or endpoints) of an edge

– U and V are the endpoints of a• Edges incident on a vertex

– a, d, and b are incident on V• Adjacent vertices

– U and V are adjacent• Degree of a vertex

– X has degree 5 • Parallel edges

– h and i are parallel edges• Self-loop

– j is a self-loop

Andy Mirzaian 225

XU

V

W

Z

Y

a

c

b

e

d

f

g

h

i

j

Last Update: Dec 4, 2014

Terminology (cont.)• Path

– sequence of alternating vertices and edges – begins with a vertex– ends with a vertex– each edge is preceded and followed

by its endpoints• Simple path

– path such that all its vertices and edges are distinct

• Examples:– P1 = (V,b,X,h,Z) is a simple path– P2 = (U,c,W,e,X,g,Y,f,W,d,V) is a path that is not simple

Andy Mirzaian 226

P1

XU

V

W

Z

Y

a

c

b

e

d

f

g

hP2

Last Update: Dec 4, 2014

Terminology (cont.)• Cycle

– circular sequence of alternating vertices and edges – each edge is preceded and followed by its endpoints

• Simple cycle– cycle such that all its vertices and

edges are distinct

• Examples:– C1 = (V,b,X,g,Y,f,W,c,U,a,)

is a simple cycle– C2 = (U,c,W,e,X,g,Y,f,W,d,V,a,)

is a cycle that is not simple

Andy Mirzaian 227

C1

XU

V

W

Z

Y

a

c

b

e

d

f

g

hC2

Last Update: Dec 4, 2014

Properties
Notation: n = number of vertices, m = number of edges, deg(v) = degree of vertex v

Property 1: ∑v deg(v) = 2m
Proof: each edge is counted twice.

Property 2: In an undirected graph with no self-loops and no multiple edges, m ≤ n(n-1)/2
Proof: each vertex has degree at most (n-1).

What is the bound for a directed graph?

Andy Mirzaian 228Last Update: Dec 4, 2014

Vertices and Edges• A graph is a collection of vertices and edges. • We model the abstraction as a combination of three data

types: Vertex, Edge, and Graph. • A Vertex is a lightweight object that stores an arbitrary

element provided by the user (e.g., an airport code)– We assume it supports a method, element(), to retrieve the stored

element.

• An Edge stores an associated object (e.g., a flight number, travel distance, cost), retrieved with the element( ) method.

Andy Mirzaian 229Last Update: Dec 4, 2014

Graph ADT: part 1

Andy Mirzaian 230Last Update: Dec 4, 2014

Graph ADT: part 2

Andy Mirzaian 231Last Update: Dec 4, 2014

Edge List Structure

Andy Mirzaian 232Last Update: Dec 4, 2014

• Vertex object– element– reference to position in vertex sequence

• Edge object– element– origin vertex object– destination vertex object– reference to position in edge sequence

• Vertex sequence– sequence of vertex objects

• Edge sequence– sequence of edge objects

Adjacency List Structure

Andy Mirzaian 233Last Update: Dec 4, 2014

• Incidence sequence for each vertex– sequence of references to edge objects

of incident edges

• Augmented edge objects– references to associated positions in

incidence sequences of end vertices

Adjacency Map Structure

Andy Mirzaian 234Last Update: Dec 4, 2014

• Incidence sequence for each vertex– sequence of references to adjacent

vertices, each mapped to edge object of the incident edge

• Augmented edge objects– references to associated positions in

incidence sequences of end vertices

Adjacency Matrix Structure

Andy Mirzaian 235Last Update: Dec 4, 2014

• Edge list structure• Augmented vertex objects

– Integer key (index) associated with vertex• 2D-array adjacency array

– Reference to edge object for adjacent vertices– Null for non-adjacent vertices

• The “old fashioned” version just has0 for no edge and 1 for edge

Performance (n vertices, m edges, no parallel edges, no self-loops):

Operation             Edge List   Adjacency List         Adjacency Matrix
Space                 n + m       n + m                  n^2
incidentEdges(v)      m           deg(v)                 n
areAdjacent(v, w)     m           min(deg(v), deg(w))    1
insertVertex(o)       1           1                      n^2
insertEdge(v, w, o)   1           1                      1
removeVertex(v)       m           deg(v)                 n^2
removeEdge(e)         1           max(deg(v), deg(w))    1

Andy Mirzaian 236Last Update: Dec 4, 2014

Subgraphs• A subgraph S of a graph G is a graph such that

– The vertices of S are a subset of the vertices of G– The edges of S are a subset of the edges of G

• A spanning subgraph of G is a subgraph that contains all the vertices of G

Andy Mirzaian 237

Subgraph Spanning subgraph

Last Update: Dec 4, 2014

Connectivity• A graph is connected if there is a path between every

pair of vertices• A connected component of a graph G is a maximal

connected subgraph of G

Andy Mirzaian 238

Connected graphNon connected graph with

two connected components

Last Update: Dec 4, 2014

Trees and Forests• A (free) tree is an undirected graph T such that

– T is connected– T has no cyclesThis definition of tree is different from the one of a rooted tree

• A forest is an undirected graph without cycles• The connected components of a forest are trees

Andy Mirzaian 239

Tree Forest

Last Update: Dec 4, 2014

Spanning Trees and Forests• A spanning tree of a connected graph is a spanning

subgraph that is a tree• A spanning tree is not unique unless the graph is a tree• Spanning trees have applications to the design of

communication networks• A spanning forest of a graph is a spanning subgraph

that is a forest

Andy Mirzaian 240

Graph Spanning tree

Last Update: Dec 4, 2014

241

sa

c

hk

fi

m

j

eb

gd

We know found nodes are reachable from s because we have traced out a path.

If a node has been handled, then all of its neighbors have been found.

Graph Search

l

242

Graph Search

Which foundNotHandled node do we handle?

• Queue: Handle in order found.

• Breadth-First Search

• Stack: Handle most recently found

• Depth-First Search

• Priority Queue: Handle node that seems to be closest to s.

• Shortest (Weighted) Paths:
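The generic search with a queue (breadth-first) can be sketched as follows; swapping the queue's add/poll for a stack's push/pop turns it into depth-first search (a sketch, with adjacency lists as a plain Map):

```java
import java.util.*;

public class GraphSearch {
    // adj.get(u) = neighbours of u. Returns every node reachable from s.
    public static Set<Integer> reachable(Map<Integer, List<Integer>> adj, int s) {
        Set<Integer> found = new HashSet<>();
        Deque<Integer> foundNotHandled = new ArrayDeque<>(); // the queue
        found.add(s);
        foundNotHandled.add(s);
        while (!foundNotHandled.isEmpty()) {
            int u = foundNotHandled.poll();         // handle in the order found
            for (int v : adj.getOrDefault(u, List.of()))
                if (found.add(v))                   // a newly found neighbour
                    foundNotHandled.add(v);
        }
        return found;   // every found node got handled: all neighbours explored
    }
}
```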

243

Dijkstra'sHandled Nodes

uvs

Found Nodes

Handled paths go through handled edges through any number of handled nodes

followed by last edge to an unhandled node.

For handled w,d(w) is the length of the

shortest paths to w.

Handle node with smallest d(u).

d(v) is the length of the shortest handled

path to v.
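A sketch of Dijkstra's algorithm in Java: d[v] holds the length of the shortest handled path to v, and the priority queue always handles the found node with smallest d. (This version re-inserts nodes and skips stale entries rather than using an adaptable heap; names are illustrative.)

```java
import java.util.*;

public class Dijkstra {
    static class Edge {
        final int to, w;
        Edge(int to, int w) { this.to = to; this.w = w; }
    }

    // adj.get(u) = weighted out-edges of u; returns shortest distances from s.
    public static int[] shortestFrom(List<List<Edge>> adj, int s) {
        int n = adj.size();
        int[] d = new int[n];
        Arrays.fill(d, Integer.MAX_VALUE);  // "found" distance, MAX = not found
        d[s] = 0;
        // Found-not-handled nodes as {node, tentative distance} pairs.
        PriorityQueue<int[]> pq = new PriorityQueue<>(Comparator.comparingInt(x -> x[1]));
        pq.add(new int[]{s, 0});
        while (!pq.isEmpty()) {
            int[] top = pq.poll();
            int u = top[0];
            if (top[1] > d[u]) continue;    // stale entry: u was already handled
            for (Edge e : adj.get(u))
                if (d[u] + e.w < d[e.to]) { // a shorter handled path to e.to
                    d[e.to] = d[u] + e.w;
                    pq.add(new int[]{e.to, d[e.to]});
                }
        }
        return d;
    }
}
```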

244

Dijkstra's

[Figure: Dijkstra example. Handling node c sets π(d) = c and π(e) = c]

245

DFS

[Figure: DFS from s. The FoundNotHandled nodes sit on a stack of <node, # edges> pairs, e.g. s,1  a,1  c,2  i,0.]

246

Algorithmic Paradigms

Jeff Edmonds
York University COSC 2011
Lecture 9

Brute Force: Optimization Problems
Greedy Algorithm: Minimal Spanning Tree
Dual Hill Climbing: Max Flow / Min Cut
Linear Programming: Hotdogs
Recursive Back Tracking: Bellman-Ford
Dynamic Programming: Bellman-Ford
NP-Complete Problems

247

• Ingredients:
  • Instances: The possible inputs to the problem.
  • Solutions for an Instance: Each instance has an exponentially large set of solutions.
  • Cost of Solution: Each solution has an easy-to-compute cost or value.
• Specification
  • <preCond>: The input is one instance.
  • <postCond>: A valid solution with optimal cost (minimum or maximum).

Optimization Problems

248

Iterative Greedy Algorithm:

Loop: grab the best object, then the second best, ...

  If it conflicts with committed objects or fulfills no new requirements, reject this next-best object;
  else commit to it.

Problem: Choose the best m prizes.

Greedy Algorithms

249

Loop Invariant

We have not gone wrong: there is at least one optimal solution St that extends the choices At made so far.

Take the lion because it looks best.

Consequence: If you take the lion, you can't take the elephant.

Maybe some optimal solutions do not contain the lion, but at least one does.

Greedy Algorithms

250

Minimal Spanning Tree
Instance: An undirected graph with weights on the edges.

[Figure: a weighted graph on nodes s, a–k.]

251

Minimal Spanning Tree

[Figure: the same weighted graph.]

Instance: An undirected graph with weights on the edges.
Solution: A subset of the edges that is
• a tree (no cycles, not rooted)
• spanning: nodes that were connected stay connected.
Cost: Sum of edge weights.
Goal: Find a Minimal Spanning Tree.

252

Minimal Spanning Tree

[Figure: the greedy algorithm committing to edges in order of weight; one candidate edge can't be added because it would create a cycle.]

Instance: An undirected graph with weights on the edges.
Solution: A subset of the edges that is
• a tree (no cycles)
• spanning: nodes that were connected stay connected.
Cost: Sum of edge weights.
Goal: Find a Minimal Spanning Tree.
Greedy Alg: Commit to the edge that looks the "best."

Must prove that the result is
• acyclic
• spanning
• optimal.

Done
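This greedy loop (commit to the cheapest edge unless it creates a cycle) is Kruskal's algorithm. A minimal sketch; detecting cycles with a union-find structure is an implementation choice not shown on the slides:

```python
def kruskal(n, edges):
    # edges: list of (weight, u, v); nodes are numbered 0..n-1.
    parent = list(range(n))          # union-find: one set per component

    def find(x):                     # root of x's component
        while parent[x] != x:
            parent[x] = parent[parent[x]]   # path halving
            x = parent[x]
        return x

    tree = []
    for w, u, v in sorted(edges):    # fixed priority: best edge first
        ru, rv = find(u), find(v)
        if ru != rv:                 # no cycle: commit to this edge
            parent[ru] = rv
            tree.append((w, u, v))
        # else: reject, it would close a cycle
    return tree
```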

253

Fixed Priority: Sort the objects from best to worst and loop through them.

Adaptive Priority:
– The greedy criterion depends on which objects have been committed to so far.
– At each step, the next "best" object is chosen according to the current greedy criterion.
– Searching or re-sorting takes too much time.
– Use a priority queue.

Adaptive Greedy

254

Network Flow
• Max Flow
• Min Cut

Goal: Max Flow

[Figure: a flow network; the cut separates U = Canada from V = USA.]

255

Primal-Dual Hill Climbing

Make small local changes to your solution to construct a slightly better solution.

We have a valid solution (not necessarily optimal). Initially we have the "zero flow."

Measure of progress: the value of our solution. Take a step that goes up.

Exit: we can't take a step that goes up.

Problems:
• Local vs. global max: can our Network Flow algorithm get stuck in a local maximum?
• Running time? If you take small steps, it could take exponential time.

256

Primal-Dual Hill Climbing

No Gap: the flow is the alg's witness that the network has this flow; the cut is the alg's witness that the network has no bigger flow.

Prove: for every location L to stand at, either
• the alg takes a step up, or
• the alg gives a reason that explains why not, by giving a ceiling of equal height.

i.e. ∀L [ ∃L′ height(L′) > height(L) or ∃R height(R) = height(L) ]

257

Linear Programming

Given today's prices, what is a fast algorithm to find the cheapest hotdog?

258

Linear Programming (Abstract Out Essentials)

Ingredients: pork, grain, water, sawdust
Cost: 29, 8, 1, 2
Amount to add: x1, x2, x3, x4

Cost of Hotdog: 29x1 + 8x2 + 1x3 + 2x4

Constraints (moisture, protein, …):
3x1 + 4x2 – 7x3 + 8x4 ≤ 12
2x1 – 8x2 + 4x3 – 3x4 ≤ 24
–8x1 + 2x2 – 3x3 – 9x4 ≤ 8
x1 + 2x2 + 9x3 – 3x4 ≤ 31
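To make the hotdog instance concrete, here is a brute-force sketch that grid-searches the mix against the four constraints above. This is NOT a linear-programming algorithm (simplex solves such instances without enumeration); it only illustrates "minimize a linear cost subject to linear constraints." The extra normalization constraint x1+x2+x3+x4 = 1 is my assumption, not on the slide:

```python
def cheapest_hotdog(step=10):
    # Minimize 29x1 + 8x2 + 1x3 + 2x4 subject to the slide's constraints,
    # ASSUMING the amounts are fractions of one hotdog (they sum to 1).
    cost = (29, 8, 1, 2)
    A = [(3, 4, -7, 8), (2, -8, 4, -3), (-8, 2, -3, -9), (1, 2, 9, -3)]
    b = (12, 24, 8, 31)
    best = None
    for i in range(step + 1):
        for j in range(step + 1 - i):
            for k in range(step + 1 - i - j):
                l = step - i - j - k
                x = (i / step, j / step, k / step, l / step)
                feasible = all(sum(a * xi for a, xi in zip(row, x)) <= bi
                               for row, bi in zip(A, b))
                if feasible:
                    c = sum(ci * xi for ci, xi in zip(cost, x))
                    if best is None or c < best[0]:
                        best = (c, x)
    return best
```

Under this normalization, pure water (x3 = 1) is feasible and cheapest at cost 1.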

259

Recursive Back Tracking: Bellman-Ford

• Consider your instance I.
• Ask a little question (of the little bird) about its optimal solution.
• Try all answers k.
• Knowing k about the solution restricts your instance to a subinstance subI.
• Ask your recursive friend for an optimal solution subsol for it.
• Construct a solution optS<I,k> = subsol + k for your instance that is the best of those consistent with the kth bird's answer.
• Return the best of these best solutions.

260

Specification: All-Nodes Shortest-Weighted Paths
• <preCond>: The input is a graph G (directed or undirected) with edge weights (possibly negative), and an integer l.
• <postCond>: For each u,v, find a shortest path from u to v with at most l edges, stored in a matrix Dist[u,v].

[Figure: a weighted graph, with shortest u–v paths of at most l = 3 and l = 4 edges.]

For a recursive algorithm, we must give our friend a smaller subinstance. How can this instance be made smaller? Remove a node? An edge?

Recursive Back Tracking: Bellman-Ford

261

Recursive Back Tracking: Bellman-Ford

[Figure: the same weighted graph; a u–v path with l = 4 edges, split at its middle node k.]

• Consider your instance I = <u,v,l>.
• Ask a little question (of the little bird) about its optimal solution: "What node is in the middle of the path?"
• She answers node k.
• I ask one friend subI = <u,k,l/2> and another subI = <k,v,l/2>.
• optS<I,k> = subsol<u,k,l/2> + k + subsol<k,v,l/2> is the best solution for I consistent with the kth bird's answer.
• Try all k and return the best of these best solutions.

262

Recursive Back Tracking

Dynamic Programming Algorithm
• Given an instance I,
• imagine running the recursive alg on it.
• Determine the complete set of subI ever given to you, your friends, their friends, …
• Build a table indexed by these subI.
• Fill in the table in order so that nobody waits.

Given graph G, find Dist[u,v,l] for l = 1,2,4,8,…,n.

[Figure: the same weighted graph with l = 4.]

263

[Figure: the same weighted graph with l = 4.]

Dynamic Programming Algorithm
Loop Invariant: For each u,v, Dist[u,v,l] = the length of a shortest path from u to v with ≤ l edges.

Exit

for l = 2,4,8,16,…,2n
    % Find Dist[u,v,l] from Dist[u,v,l/2]
    for all u,v ∈ Vertices
        Dist[u,v,l] = Dist[u,v,l/2]
        for all k ∈ Vertices
            Dist[u,v,l] = min( Dist[u,v,l], Dist[u,k,l/2] + Dist[k,v,l/2] )
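The pseudocode above can be sketched directly in Python. This is a minimal sketch of the doubling ("repeated squaring") recurrence, with my own names; it takes an n×n weight matrix (`INF` where there is no edge) and returns the all-pairs shortest distances, in O(n³ log n) time:

```python
INF = float("inf")

def all_pairs_shortest(n, weight):
    # dist[u][v] = length of a shortest u-to-v path with <= l edges.
    # Base case l = 1: the edge weights themselves (0 on the diagonal).
    dist = [[0 if u == v else weight[u][v] for v in range(n)]
            for u in range(n)]
    l = 1
    while l < n - 1:                 # paths need at most n-1 edges
        # Combine two <= l-edge halves through every middle node k.
        new = [[min(dist[u][v],
                    min(dist[u][k] + dist[k][v] for k in range(n)))
                for v in range(n)] for u in range(n)]
        dist, l = new, 2 * l
    return dist
```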

NP-Complete Problems

[Figure: Venn diagram of problem classes — Computable, Exp, Poly, Known — with examples GCD, Matching, and Halting.]

Jack Edmonds   Steve Cook

NP: Non-Deterministic Polynomial Time
• exponential time to search
• poly time to verify a given witness

Circuit-SAT Problem: Does a circuit have a satisfying assignment?

Industry would love a free lunch:

• Given a description of a good plane, automatically find one.

• Given a circuit, find a satisfying assignment.

• Given a graph, find a bichromatic coloring.

• Given course descriptions, find a schedule.

(Each is crossed out: none is known to have a fast algorithm.)

NP-Complete Problems

NP-Complete Problems

Find the biggest clique, i.e. a subset of nodes that are all pairwise connected.

NP-Complete Problems

Find the LONGEST simple s-t path.

NP-Complete Problems

Find a partition of the nodes into two sets with the most edges between them.

Colour each node

Use the fewest # of colours.

Nodes with lines between them must have different colours.

NP-Complete Problems

Try all possible colourings?

Too many to try: a 50-node graph has more colourings than the number of atoms.
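To see the exponential blow-up concretely, here is the brute-force colouring search as a minimal sketch (names are mine): it tries all kⁿ assignments for k = 1, 2, … colours, which is hopeless beyond tiny graphs.

```python
from itertools import product

def min_colours(n, edges):
    # Smallest k such that nodes 0..n-1 can be coloured with k colours
    # so that no edge joins two nodes of the same colour.
    for k in range(1, n + 1):
        # k**n candidate colourings: exponential in n.
        for colouring in product(range(k), repeat=n):
            if all(colouring[u] != colouring[v] for u, v in edges):
                return k
    return n
```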

NP-Complete Problems

Is there a fast algorithm?

Most people think not.

We have not been able to prove that there is not.

It is one of the biggest open problems in the field.

NP-Complete Problems

272

End
