segmentation by zvi solomon & seri khoury lecturer : hagit hel-or

SEGMENTATION

By Zvi solomon & Seri khoury

Lecturer : Hagit Hel-Or

Segmentation

In computer vision, image segmentation is the process of partitioning a digital image into multiple segments (sets of pixels, also known as superpixels).

Goal: move from array of pixel values to a collection of regions, objects, and shapes.

Segmentation(cont.)

• The goal of segmentation is to simplify and/or change the representation of an image into something that is more meaningful and easier to analyze.

• Image segmentation is typically used to locate objects and boundaries (lines, curves, etc.) in images.

• More precisely, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics.

Segmentation(cont.)

• Each of the pixels in a region are similar with respect to some characteristic or computed property, such as color, intensity, or texture.

• Adjacent regions are significantly different with respect to the same characteristic.

Region based segmentation

• We have seen that threshold as a segmentation tool is a bit limited because there is no notion of spatial context.

• Therefore we will formalize segmentation in a little more abstract way.

• We will discuses about region based segmentation as a technique for determining the region directly.

Edge vs. Region segmentation

What is a region?

• A group of pixels with similar properties.

• Let define a region R of an image f as a connected homogenous subset of the image with respect to some criterion.

• The criterion makes pixels that correspond to each other to be grouped together / marked.

• Some possible criterions are: gray value difference , gray value variance, Euclidean distance and so on.

Region based approaches

• Completeness: . The segmentation must be complete. Every pixel must be in some region.

• Disjointness: . Regions must be disjoint. No overlap.

Region based approaches

• Satisfiability: for every i. Pixels of a region must satisfy one common property P at least. i.e. the region must satisfy a homogeneity predicate P.

• Segmentability: . Different regions satisfy different properties. i.e. any two adjacent regions cannot be merged into single region.

Region growing segmentation

• Region growing is the simplest region based segmentation that groups pixels or sub-regions into larger regions based on pre-defined criteria / predicate.

• The pixels aggregation starts with a set of seed points in a way that the corresponding regions grow by appending to each seed points those neighboring pixels that have similar properties (defined by the criteria: such as gray level, texture, color, shape…).

Example

• Notice that region growing based techniques are better than the edge based techniques in noisy images where edges are difficult to detect.

Original image

Seed point

Growing aggregation

Final region

Seed based region growing segmentation

• Pseudocode:

• Let R be a region to extract.• Initially, the region R only contains its seed point p.

• Let F be a FIFO queue that contains the boundary points of R.• Initially, F contains the 8-neighborhood of the seed point p.

Seed based region growing segmentation

• While F is not empty• For each neighbor pixel p* of p in F

• If p* is similar to p • p* is added to R• Neighbor pixels of p* (not in R) are added to F.

• Else• Set p* as non-similar (a new seed point).

Example

• Original image of a gray scale lighting image with values between 0 and 255. Apply region growing and mark the strongest lightning part.

Wikipedia.com

Example (cont.)

Choose the seeds to be the points with highest grayscale

value (255).

After determining the seed points we have to determine the range

of the threshold (the criteria/predicate). In this image

the threshold chosen is 225-255.

Wikipedia.com

Example (cont.)

190-255 155-255

The result grew from the same regions. And the points will not be grown without being connected with the seed points. Therefore, there are still lots of points in the original image having grayscale level above 155 which are

not marked in the last image.

Wikipedia.com

Advantages and disadvantages

• Region growing methods can correctly separate the regions that have the same properties we define.

• The concept is simple and fast. We only need a small number of seed points to represent the property we want and then grow the region.

• The method is local. We have no global view of the problem.

• So lets view another tool for segmentation.

Region splitting and merging segmentation

• Unlike region growing which starts from a set of seed points, region splitting starts with the whole image as a single region and subdivides it into sub-regions recursively while a condition of homogeneity is not satisfied.

• Region merging is the opposite of region splitting and is being used to avoid over segmentation.

Pseudocode

• Step 1: Splitting steps, for every region , which P()=FALSE (Predicate) split the region into (usually 4) sub-regions.

• Step 2: Merging steps, when no further splitting is possible, merge any adjacent regions and for which =TRUE.

• Step3: Stop only if no further merging is possible.

Pseudocode

Original Splitted

Merged

Pseudocode

1 1 1 1 1 1 1 2

1 1 1 1 1 1 1 0

3 1 4 9 9 8 1 0

1 1 8 8 8 4 1 0

1 1 6 6 6 3 1 0

1 1 5 6 6 3 1 0

1 1 5 6 6 2 1 0

1 1 1 1 1 1 0 0

1 1 1 1 1 1 1 2

1 1 1 1 1 1 1 0

3 1 4 9 9 8 1 0

1 1 8 8 8 4 1 0

1 1 6 6 6 3 1 0

1 1 5 6 6 3 1 0

1 1 5 6 6 2 1 0

1 1 1 1 1 1 0 0

Original First Split

Example (cont.)

1 1 1 1 1 1 1 2

1 1 1 1 1 1 1 0

3 1 4 9 9 8 1 0

1 1 8 8 8 4 1 0

1 1 6 6 6 3 1 0

1 1 5 6 6 3 1 0

1 1 5 6 6 2 1 0

1 1 1 1 1 1 0 0

1 1 1 1 1 1 1 2

1 1 1 1 1 1 1 0

3 1 4 9 9 8 1 0

1 1 8 8 8 4 1 0

1 1 6 6 6 3 1 0

1 1 5 6 6 3 1 0

1 1 5 6 6 2 1 0

1 1 1 1 1 1 0 0

Second Split Third Split

Example (cont.)

Merge

1 1 1 1 1 1 1 2

1 1 1 1 1 1 1 0

3 1 4 9 9 8 1 0

1 1 8 8 8 4 1 0

1 1 6 6 6 3 1 0

1 1 5 6 6 3 1 0

1 1 5 6 6 2 1 0

1 1 1 1 1 1 0 0

Final result

1 1 1 1 1 1 1 2

1 1 1 1 1 1 1 0

3 1 4 9 9 8 1 0

1 1 8 8 8 4 1 0

1 1 6 6 6 3 1 0

1 1 5 6 6 3 1 0

1 1 5 6 6 2 1 0

1 1 1 1 1 1 0 0

Split and merge criterias

• Sometimes we need to be careful with the criteria we choose for the split and merge algorithm.

• For example if we use a intensity difference as a criteria a single noisy pixel (black or white) could decide about splitting or merging large regions.

• Lets try and use a standard deviation as a criteria.

Example

Original Splitting with

Example

Original Merging with

Example

Original Splitting with

Advantages

• The image could be split progressively according to our demanded resolution because the number of splitting level is determined by us.

• We could split the image using the criteria we decide, such as mean or variance of segment pixel value. In addition, the merging criteria could be different to the splitting criteria.

Disadvantages

• It may produce the “blocky” segments. This problem could be reduced by splitting in higher level, but the trade off is that computation time will arise.

• A partially processed image would not contain a few clear, dominant regions in the image, but would contain many small unmerged regions.

Watershed

Definition: • Watershed (drainage basin) is an area of land where surface water (from rain, melting snow or ice) converges to a single point at lower elevation.

Watershed transformation

• The intuitive idea underlying this method comes from geography.

• Watershed transformation belongs to the region-based approach.

• Will be talking about the concept introduced by Beucher S. and Lantuejoul C. (1979).

Example

Gradient (rem)

|𝑔𝑟𝑎𝑑 𝑓 (𝑥 )|¿ [(𝜕 𝑓𝜕𝑥 )2

+( 𝜕 𝑓𝜕 𝑦 )2]

1 /2As we have seen, the gradient image helps

us to detect edges. Points with higher gradient are suspicious to be edges

Some Definitions…

• Image can be represented by a function

• f(x) is the gray value of the image at point x.

• A section of at level is a set defined as:

• And in the same way we define as:

Definitions (cont.)

• Geodesic distance: Let be a set. And x,y are two points in X. the geodesic distance between x and y defined as the length of the shortest path included in X and linking x and y.

Definitions (cont.)

• Let Y be any set included in X. We can compute the set of all points of X that are at finite geodesic distance from Y:

• is called the X-reconstructed set by the marker set Y. It is made of all the connected components of X that are marked by Y.

𝑋

Definitions (cont.)

• Suppose now that is composed of n connected components . The geodesic zone of influence of is the set of points in at a finite distance from and closer to than any other :

Definitions (cont.)

• The boundaries between the various zones of influence give the geodesic skeleton by zones of influence of in And we shall write:

The catchment basins

The watershed

Minima and Maxima of a function

• Lets look on an image as a topographic surface. The lighter the gray value in some point, the higher the altitude of the corresponding point on the topographic surface.

Minima and Maxima of a function (cont.)

• Now consider two points and on the surface. Define a ascending path as a sequence of points , ,…, that:

• We say a point s is a minima iff there exists ascending path starting from s and no other ascending path reaches s. on the topographic surface a minima will look like a sink.

• Define the set M of all the minima of f to be made of various – the minima’s until height i.

• Similar definition for the maxima.

Example

The watershed transformation

• Looking again on the image f as a topographic surface. Look on all the minima’s while we start flooding the surface with water. During the flooding two or more floods coming from different minima may merge. Avoid this event by building a dam of those merging points of the surface.

The watershed transformation (cont.)

• Define the group as the catchment basins flooded until height i on the topographic surface.

• At the end of the process the dams will emerge. These dams are the watershed of f.

• The dams separate the various catchment basins , each one containing one and only minimum from the group .

Demo

http://cmm.ensmp.fr/~beucher/lpe1.gif

https://www.youtube.com/watch?v=C8u3yzsNjpA



Building the watershed

• Consider a section that the flood has reached, Consider now the flood reach the section , we can see that the flooding of is performed in the zones of influence of the connected components of in .

• That is, .

Building the watershed (cont.)

• If we define as the catchment basin of f at level and as the minima of f at height then:

• This iterative algorithm is initiated with .• At the end of the process the watershed line is

with

Over segmentation

• Unfortunately using watershed transform on some of the gradient images we may get to much catchment basins.

• Each one of them corresponds to a minimum of the gradient.

• These minima are produced by small variations mainly due to image noise in the gray values.

Over segmentation (cont.)

Solution

• Mark the patterns to be segmented before performing the watershed transform.

• And now instead of taking the whole image M set of minima’s we will start the algorithm on the minima’s at the markers patterns we chose.

Solution (cont.)

A full example

Read a color image and convert it to

grayscale

Use gradient magnitude (Sobel

edge mask).

Mark the foreground objects

Mathworks.com

A full example (cont.)

Take a better threshold to differ

object from background

Compute watershed transformation

Mathworks.com

A full example (cont.)

Color the watershed label matrix

Use the color label matrix on the original image to better visualize

the segmentation results

Mathworks.com

Another algorithm

• The watershed algorithms can be divided in two groups.

• The first group contains algorithms which simulate the flooding process. The second group is made of procedures aiming at the direct detection of the watershed points.

• The algorithm we saw belongs to the first group. It simulates the flooding of the surface S starting from the minima of f.

Another algorithm (cont.)

• Lets briefly present another algorithm belonging to the second group based on the arrow representation of a function f.

• From , define a graph whose vertices are the points of and with edges or arrows from x to any adjacent point y iff f(x) < f(y).

Example

Function f Graph of arrows


• The definition of this graph does not allow the arrowing of the plateaus of the topographic surface.

• Any point receiving arrows from more than one connected component of its neighborhood may be flooded by different lakes. Consequently, this point will be treated as a watershed and a dam will be built.

• Doing so , we will change the arrowing of the neighborhood points and consequently the graph of arrows.

Example

Selection of primary points

Final result


• This procedure of selecting points can be re-run on each graph that we build so more divide may appear.

• Re-run the procedure until no new divide points is selected.

• The algorithm produces local watershed lines. The true divide lines will be extracted easily but the divide points with smaller curves will be harder to detect.

Active contours (snakes)

• Other example of segmentation that we are going to elaborate today is Active Contours (or Snakes).

I’m catching the object!!!

History

• A framework in Computer vision and image processing.

• Appeared in the first ICCV conference in 1987.

• Michael Kass, Andrew Witkin, and Demetri Terzopoulos.

The goal

• delineating an object outline from a possibly noisy 2D image.

• Subdividing or partitioning an image into its constituent regions or objects (segmentation).

The goal (cont.)

• What is the problem? we have learned edge detection in the Image Processing course :

Taken from the slides of Kristen GraumanUT-Austin

Problem with other methods

• Now we are looking for boundaries, not edges.

• Other methods aren’t effective, for example, in presence of noise and sampling artifacts (e.g. medical images).

What is snake ?

• A snake is deformable spline influenced by constraint and image forces that pull it towards object contours and internal forces that resist deformation (internal and external energies).

• They autonomously and adaptively search for the minimum state.

• They can be used to track dynamic objects.

Applications

• Object tracking

• Shape recognition

• Segmentation

• Edge detection

Demonstrations

Procedure

• Snakes do not solve the entire problem of finding contours in images.• They depend on other mechanisms such as interaction with a user or

with some other higher-level computer vision mechanism:

• (1) First, the snake is placed near the image contour of interest.

• (2) During an iterative process, the snake is attracted towards the target contour by various forces that control the shape and location of the snake within the image.

I smell boundariesLeave me alone

So, how it works ( example )

Initial snake position Convergence

Curves in the Plane

Before we jump to the sea, we need to learn how to swim,

Lets talk about curves :

V(s)={x(s),y(s)}, s [0,1]

V(0.2)V(0)

V(0.9)V(0.3)

V(0.5)

V(0.6)

V(0.7)V(0.8)

x

y

Curves in the Plane (cont.)

We are very familiar with functions, but not with curves.

In a function, we have a coordinates system that for every value of x there is one unique value of y, that doesn’t happen in normal curves.

So, our curves are parametrized, lets say in time p : V(s)={x(s),y(s)}.

x

y

Curves in the Plane (cont.)

Interesting facts :

The derivative of C taken by some point p, gives us the tangent at this point:

The second derivative of C taken by some point p, gives us the curvature at this point.

How these two fact will help us? Stay tuned!

What are we dealing with ?

• Representation of the contours

• Defining the energy functions• Internal• external

• Minimizing the energy function

Measuring snake’s quality: Energy function

Usually, the total energy(cost function) of snake is a combination of internal and external energies

exin EEE

A good fit between the current deformable contour and the target shape in the image will yield a low value for this cost function.

External energy: intuition

• Measure how well the curve matches the image data.

• “Attract” the curve toward different image features edges, lines, etc.

• Think of external energy from image as gravitational pull towards areas of high contrast.

External energy: intuition(cont.)

- (Magnitude of gradient)

Magnitude of gradient

22 )()( IGIG yx

22 )()( IGIG yx

External energy: intuition(cont.)

• Suppose we have an image I(x,y).

• We can compute image gradient at any point.

• Edge strength at pixel (x,y) is

• External energy of a contour point v=(x,y) could be

|)y,x(I|

22 |),(||)(|)( yxIIEex vv

1

0

))(( dssEE exex continuous case ]}1,0[s|)s({ νC

External energy term for the whole snake is

}ni0|{ i νCdiscrete case

Internal energy

The smoothness energy at contour point V(s) could be evaluated as

sd

ddsd

sssEin 2

2

)()())((

22

Elasticity/stretching Stiffness/bending

Then, the interior energy (smoothness) of the whole snake is

1

0

inin ds))s((EE ]}1,0[s|)s({ νC

Internal energy (cont.)

5v4v

3v

2v

1v 6v

7v

8v

10v

9v

elastic energy(elasticity)

i1ivds

d

bending energy(stiffness)

1ii1i1iii1i2

2

2)()(ds

d

• Elasticity/stretching.

• Abs(dist(V(i),V(i-1)).

• Goal = Smaller distance between all points of the snake.

• The importance of the distance between points is related to α.

Neighbouring PointsCurrent Point

Possible New Points


• Stiffness/bending.

• Abs(V(i-1) -2·V(i) + V(i+1))2

• Smaller curvature between all points of the snake.

• The importance of the curvature between points is related to β.

Neighbouring PointsCurrent Point

Possible New Points



Elasticity Stiffness

i1ivds

d

11112

2

2)()( iiiiiiids

d

1

0

211

21 |2|||

n

iiiiiiinE

)( iii y,xν


The weights α and β dictate how much influence each component

has.

Elasticity

High curvature

Low curvature


• Notice that the strength of the internal elastic component can be controlled by a parameter,

• Increasing this increases elasticity of curve

large small

1

0

2n

iiE Elastic energy

Makes the curve insensitive to stretch


• While the curvature component can be controlled by ʙ parameter,

• Increasing this increases curvature

large smallmedium

1

0

2n

iiC

Curvature energy

The greedy algorithm

• The greedy algorithm makes locally optimal choices, hoping that the final solution will be globally optimum.

• Step1 (greedy minimization): each point of the snake is moved within a small neighborhood (e.g. M) to the point which minimizes the energy functional.

• Step 2 (corner elimination): search for corners (curvature extrema) along the contour; if a corner is found at point p, set ʙ for point p to zero.

Each iteration = )( nmO

The greedy algorithm (cont.)

At first the snake start to shrink untill it become closer to the object boundaries, where the external energy in high.

Demo

https://www.youtube.com/watch?v=Xw1tmVlUsgc

https://www.youtube.com/watch?v=Xw1tmVlUsgc

The Viterbi algorithm

In many cases, snake energy can be written as a sum of pair-wise interaction potentials:

More generally, it can be written as a sum of higher-order interaction potentials (e.g. triple interactions).

1

0110 ),(),,(

n

iiiintotal EE

1

01110 ),,(),,(

n

iiiiintotal EE

Snake energy: pair-wise interactions

1

0

210 ||)(||),,(

n

iintotal GE

1

0

21 ||||

n

iii

1

0110 ),(),,(

n

iiiintotal EE

21

21 ||||||)(||),( iiiiii GE where

We can call it the elastic energy (β=0)

1v2v

3v

4v6v

5v

With this form of the energy function, we can minimize using dynamic programming, with the Viterbi algorithm.

Iterate until optimal position for each point on the snake is optimal in the local search space constrained by boxes.

[Amini, Weymouth, Jain, 1990]Fig from Y. Boykov

The Viterbi algorithm (cont.)

5-91

),( 44 nvvE),( 433 vvE

)3(3E

)4(3E )4(4E

)3(4E

)2(4E

)1(4E

)4(nE

)3(nE

)2(nE

)1(nE

)2(3E

)1(3E

)4(2E

)3(2E

),(...),(),( 11322211 nnn vvEvvEvvE

),( 322 vvE

)1(2E

)2(2E

),( 211 vvE

)( 2nmOComplexity:

0)1(1 E

0)2(1 E

0)3(1 E

0)4(1 E

states

1

2

…

m

site

s

1v 2v 3v 4v nv

The Viterbi algorithm (cont.)

Taken from the slides of Kristen GraumanUT-Austin

Viterbi algorithm for higher order interaction

(e.g. if bending energy is added into the “model” of the snake)

Now β ≠ 0

),,(...),,(),,( 12243223211 nnnn vvvEvvvEvvvE

Note that we have to compute a table of values at each node instead of only m values per node for the simpler snakes with β=0.

)( 3nmOComplexity:

Extensions

• What if the input curve starts inside the object we are looking for?.

• What happens if we have multiple object we want to detect?.

Balloon force

Sometimes we want our contours to expand:

The idea is to add another force or another energy function (inflating force or balloon force).

Demo

https://www.youtube.com/watch?v=TWhyRUDKhuI

https://www.youtube.com/watch?v=TWhyRUDKhuI

Splitting snakes

The level set approach:

• Define level set function z = (x,y,t) where the (x,y) plane contains the contour, and z = signed Euclidean distance transform value (negative means inside closed contour, positive means outside contour).

• Move the level set function, (x,y,t), so that it rises, falls, expands, etc.

• Contour = cross section at z = 0, i.e. {(x,y) | (x,y,t) = 0}

The level set approach(cont.)• The zero level set (in blue) at one point in time as a slice of the level

set surface (in red).

Taken from the slides ofFan Ding and Charles DyerComputer Sciences DepartmentUniversity of Wisconsin

The level set approach(cont.)• Later in time the level set surface (red) has moved and the new zero

level set (blue) defines the new contour.


The level set approach(cont.)

1. Define a velocity field, F, that specifies how contour points move in time (Based on application-specific physics such as time, position, normal, curvature, image gradient magnitude).

2. Build an initial value for the level set function, (x,y,t=0), based on the initial contour position.

3. Adjust over time; contour at time t defined by (x(t), y(t), t) = 0.

0y

Φ

x

ΦF

t

Φ

0ΦFt

Φ

2122

Hamilton-Jacobi equation


The level set approach(cont.)

Demo

https://www.youtube.com/watch?v=YhqhuhfEPWQ

https://www.youtube.com/watch?v=YhqhuhfEPWQ

Live wire (Intelligent Scissors)

• Livewire, is a segmentation technique which allows a user to select regions of interest to be extracted quickly and accurately, using simple mouse clicks.

• It is based on the lowest cost path, by Dijkstra.

• Implemented on the gradient image.

• Each pixel of the resulting image is a vertex of the graph and has edges going to the 8 pixels around it.

• The edge costs are defined based on a cost function(energy function).

Live wire algorithm

• The user sets the starting point clicking on an image’s pixel, known as an anchor.

• Then, as he starts to move the mouse over other points, the smallest cost path is drawn from the anchor to the pixel where the mouse is over, changing itself if the user moves the mouse.

• One can easily see in the image that the places where the user clicked to outline the desired region of interest are marked with a small square. It is also easy to see that the livewire has snapped on the object borders.

Live wire (cont.)

The anchor

Snapping on the object

The image is take from the book Computer Vision:Algorithms and ApplicationsBy:Richard Szeliski

Demo

Summary

• Segmentation: essential first step in the most scenes of analysis and recognition problems of images.

• A universal algorithm of segmentation doesn’t exist, as each type of image corresponds of a specific approach.

• Applications: finding tumors, veins etc. in medical images, finding targets in satellite/aerial images, finding people in surveillance images,snakes,scissors etc.

References

• Region based segmentation:

Marc Polmlun

http://www.cs.umb.edu/~marc/cs675/cpv10-28.pdf• Split and Merge tutorial: (CV-online) http://

homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/MARBLE/medium/segment/split.htm

• Watershed-Based Segmentation and Region Merging

Andre bleau, L.joshua Leon.

http://www.sciencedirect.com/science/article/pii/S1077314299908226

Michael Kass, Andrew Witkin, Demetri Terzopoulos.• http://link.springer.com/article/10.1007/BF00133570#page-1• http://

link.springer.com/article/10.1023/A:1007922224810#page-1• http://en.wikipedia.org/wiki/Image_segmentation

http://www.cs.umb.edu/~marc/cs675/cpv10-28.pdf



http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/MARBLE/medium/segment/split.htm






http://link.springer.com/article/10.1007/BF00133570#page-1

http://link.springer.com/article/10.1007/BF00133570#page-1

http://link.springer.com/article/10.1023/A:1007922224810#page-1



http://en.wikipedia.org/wiki/Image_segmentation

http://en.wikipedia.org/wiki/Image_segmentation

References

George Bebis

Kristen Grauman.

Jana Kosecka

http://www.cse.unr.edu/~bebis/CS791E/Notes/DeformableContours.pdf

http://www.cs.utexas.edu/~grauman/courses/378/handouts/snakes.pdf

http://cs.gmu.edu/~kosecka/cs682/lect-snakes.pdf





http://cs.gmu.edu/~kosecka/cs682/lect-snakes.pdf

segmentation by zvi solomon & seri khoury lecturer : hagit hel-or

Documents