Intro to Computer Vision: HW 2

Alvin Lin

August 2018 - December 2018

Part (i)

My implemented solution fetches the Leung-Malik filters from the provided file and builds the filter bank as a 48 × 49 × 49 matrix before applying each filter to the grayscale image. This generates a 48 × image_x × image_y matrix, which we transpose to image_x × image_y × 48 so that we have a 48-dimensional vector for each pixel of the image. In order to apply the K-means clustering algorithm to it, I flatten this into a two-dimensional (image_x × image_y) × 48 matrix. Running the K-means clustering algorithm yields one cluster index per row of that matrix, that is, one per pixel. This cluster result is then reshaped back to the dimensions of the image, image_x × image_y. From this, we can select the cluster IDs that represent the animal we want to segment out and transfer only the pixels in those clusters onto the background.
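The steps above can be sketched roughly as follows. This is a minimal illustration and not the code used for the homework: the function names `segment_by_texture` and `transfer` are hypothetical, the image and the filter bank are random stand-ins, and `scipy.ndimage.convolve` is an assumed choice for applying each filter.

```python
import numpy as np
from scipy.ndimage import convolve
from sklearn.cluster import KMeans


def segment_by_texture(gray, filter_bank, k, seed=0):
    """Cluster pixels by their filter-bank responses.

    gray:        (H, W) float array, the grayscale image
    filter_bank: (F, fh, fw) float array, one filter per leading index
    Returns an (H, W) integer array of cluster indices.
    """
    # One response map per filter: shape (F, H, W).
    responses = np.stack(
        [convolve(gray, f, mode="reflect") for f in filter_bank]
    )
    # Move the filter axis last so each pixel owns an F-dimensional vector.
    per_pixel = responses.transpose(1, 2, 0)                 # (H, W, F)
    # Flatten the two image axes: one row per pixel.
    features = per_pixel.reshape(-1, per_pixel.shape[-1])    # (H*W, F)
    labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(features)
    # Back to image layout: one cluster index per pixel.
    return labels.reshape(gray.shape)


def transfer(source, label_image, keep_ids, background):
    """Copy pixels whose cluster index is in keep_ids onto a copy of background."""
    mask = np.isin(label_image, keep_ids)
    out = background.copy()
    out[mask] = source[mask]
    return out


# Synthetic stand-ins (assumptions): a small random image and a bank of
# 48 small random filters. The real bank described above is 48 x 49 x 49.
rng = np.random.default_rng(0)
gray = rng.random((32, 40))
bank = rng.standard_normal((48, 7, 7))

label_image = segment_by_texture(gray, bank, k=4)
composite = transfer(gray, label_image, keep_ids=[0, 1],
                     background=np.zeros_like(gray))
```

Which cluster IDs belong to the animal still has to be chosen by inspecting the label image; the `keep_ids=[0, 1]` above is arbitrary.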

I ran into many problems implementing this. The provided file stores the filter bank as a 49 × 49 × 48 matrix, which needed to be transposed. I ran into several issues because I was using numpy’s reshape() function on the matrices instead of transpose(): reshape() keeps the elements in their original order and only changes how they are grouped, so it does not move the filter axis to the front. Additionally, my version of OpenCV had some bugs with resize(), so I did not resize the source image before transferring it to the background.
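The difference between the two calls is easy to see on a small array. The sketch below uses a 2 × 3 × 4 stand-in (an assumption, chosen for readability) in place of the 49 × 49 × 48 bank.

```python
import numpy as np

# Stand-in for a bank stored as (height, width, n_filters).
bank_hwf = np.arange(2 * 3 * 4).reshape(2, 3, 4)

# transpose() permutes the axes: filter i is exactly bank_hwf[:, :, i].
by_transpose = bank_hwf.transpose(2, 0, 1)   # shape (4, 2, 3)

# reshape() keeps the flat element order and only re-cuts it, so values
# from different filters end up mixed together in each slice.
by_reshape = bank_hwf.reshape(4, 2, 3)       # shape (4, 2, 3)

print(np.array_equal(by_transpose[0], bank_hwf[:, :, 0]))  # True
print(np.array_equal(by_reshape[0], bank_hwf[:, :, 0]))    # False
```

Both results have the same shape, which is why the mistake does not raise an error and only shows up as scrambled filters.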

Some of the edges around the cheetah had gaps and holes due to the texture of the grass underneath the cheetah. The spots on the cheetah created a lot of complexity, and I had to choose k = 40 to separate them out correctly. Using a lower k value in the K-means clustering algorithm resulted in the cheetah’s back being included in the same cluster as the grassy background. I achieved a much better result segmenting the gecko. This was probably due to the higher contrast between the gecko and the white background and the lack of complex textures like grass.

Part (ii)

I used scikit-learn’s implementation of the K-means clustering algorithm to do the image segmentation described in part (i), but I also implemented my own version of the algorithm. My own version was many orders of magnitude slower. Since I have implemented this algorithm in the past, it was heavily based on my past work, which can be found here: https://gist.github.com/omgimanerd/ff0b9a8dd225f2d2e94aa4be76bc91d9
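For reference, a hand-written K-means can be as short as the sketch below. It is a generic version of Lloyd's algorithm written for illustration and is not the code from the gist linked above; the function name `kmeans` and its parameters are assumptions.

```python
import numpy as np


def kmeans(points, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm. points: (N, D). Returns (labels, centroids)."""
    rng = np.random.default_rng(seed)
    # Initialize the centroids as k distinct input points.
    start = rng.choice(len(points), size=k, replace=False)
    centroids = points[start].astype(float)
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iter):
        # Squared distance from every point to every centroid: shape (N, k).
        diffs = points[:, None, :] - centroids[None, :, :]
        dists = (diffs ** 2).sum(axis=2)
        # Assignment step: each point joins its nearest centroid.
        labels = dists.argmin(axis=1)
        # Update step: each centroid moves to the mean of its members.
        new_centroids = centroids.copy()
        for j in range(k):
            members = points[labels == j]
            if len(members):  # an empty cluster keeps its old centroid
                new_centroids[j] = members.mean(axis=0)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids
```

The broadcasted distance computation allocates an N × k × D array, so for a full image with 48-dimensional features and a large k it would need to be processed in chunks.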

Part (iii)

(a) Original image of cheetah (b) Segmentation of cheetah image (c) Cheetah transferred to background

(a) Original image of gecko (b) Segmentation of gecko image (c) Gecko transferred to background

Part (iv)

The segmentation of the cheetah has many holes in it, most likely due to the coloration of the cheetah. The white regions around the anterior edge and the mouth of the cheetah blend with the background and do not form a clear edge to be detected by the filter bank. This is evident in the visualization of the cheetah’s segmentation. Note that the light regions around the cheetah’s head also blend with the horizon lines in the background.

It seems like oversegmenting the image resulted in more distinctly separated regions, so choosing a higher k for the K-means clustering algorithm would likely result in less ambiguity between the animal and the background region. Increasing the contrast of the image would probably help with the segmentation.
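One simple way to increase contrast before filtering is a percentile-based linear stretch. The sketch below is only an illustration of that idea and was not part of the homework; the function name `stretch_contrast` and the 2nd/98th percentile defaults are assumptions.

```python
import numpy as np


def stretch_contrast(gray, low_pct=2.0, high_pct=98.0):
    """Linearly rescale intensities so the chosen percentiles map to 0 and 1."""
    lo, hi = np.percentile(gray, [low_pct, high_pct])
    if hi <= lo:
        # Flat image: there is no intensity range to stretch.
        return np.zeros_like(gray, dtype=float)
    # Values outside the percentile range are clipped to 0 or 1.
    return np.clip((gray.astype(float) - lo) / (hi - lo), 0.0, 1.0)
```

Whether this actually improves the cheetah segmentation would have to be checked experimentally.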

If you have any questions, comments, or concerns, please contact me at [email protected]
