outline · outline 1. problem 2. teori ... “rich feature hierarchies for accurate object...
Post on 07-Sep-2018
216 Views
Preview:
TRANSCRIPT
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
Outline1. Problem2. Teori
○ Terminologi○ Definisi & klasifikasi transfer learning
3. Review algoritma4. Scatter Component Analysis (SCA) - [TPAMI 2017]
○ Algoritma transfer learning untuk object recognition○ (Kernel) Principal Component Analysis○ Hasil eksperimen
5. Kesimpulan
1
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
1. Sketch-based face recognition
B. F. Klare, A. K. Jain. “Heterogenous Face Recognition using Kernel Prototype Similarities”, TPAMI 2013
Bagaimana komputer mencocokan gambar wajah asli dengan sketsa?
3
Sketch
Original
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
1. Dataset biasA.Torralba & A. Efros. “Unbiased look at dataset bias”, CVPR 2011
SUN09 LabelMe PASCAL ImageNet Caltech101 MSRC Self Mean others
Percent drop
SUN09 69.8 50.7 42.2 42.6 54.7 69.4 69.8 51.9 26%
LabelMe 61.8 67.6 40.8 38.5 53.4 67.0 67.6 52.3 23%
PASCAL 55.8 55.2 62.1 56.8 54.2 74.8 62.1 59.4 4%
ImageNet 43.9 31.8 46.9 60.7 59.3 67.8 60.7 49.9 18%
Caltech101 20.2 18.8 11.0 31.4 100 29.3 100 22.2 78%
MSRC 28.6 17.1 32.3 21.5 67.7 74.3 74.3 33.4 55%
Mean others 42.0 34.7 34.6 38.2 57.9 61.7 72.4 44.8 48%
4
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
1. Problem
● Data training tidak sepenuhnya mewakili realita/target ! -- domain mismatch● Tidak mudah untuk mengoleksi dan melakukan anotasi data training● Target domain tidak sepenuhnya diketahui
6
Gambar diambil dari Y. Ganin & V. Lempitsky, “Unsupervised Domain Adaptation by Backpropagation”, ICML 2015
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
2. Definisi transfer learning
8
“Kemampuan sistem untuk mentransfer pengetahuan/skill yang telah didapatkan dari domain / task tertentu (source), untuk diaplikasikan di domain / task yang lain (target)”
● Psikologi/Neurobiologi:
Indonesian → Malay, Japanese → Korean, Karate → Tinju, Piano → Gitar, C++ → Java, MATLAB → Python (numpy), Math →{Computer Science, Physics, Economics, …}
● Machine learning○ mengidentifikasi kesamaan pola/atribut/fitur antar domain
● Domain:
● Task:
● Training data
2. Definisi transfer learning
9
● Source domain & task:
● Target domain & task:
● Supervised learning:
● Transfer learning:
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
2. Klasifikasi transfer learning
1. Inductive transfer ( )○ Multi-task learning, self-taught learning, one-shot learning, zero-shot learning
2. Transductive transfer learning ( )○ Domain adaptation○ Domain generalization○ Covariate shift○ Sample selection bias
10
S. J. Pan & Q. Yang. “A Survey on Transfer Learning”, TKDE 2009
3. DASVM (1)
12
SVM recap● Data training:
● Classifier:
● Formulasi:
L. Bruzzone, M. Marconcini, “Domain adaptation problems: a DASVM classification technique and a cricular validation strategy.”, TPAMI 2010
Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. DASVM (2) [Bruzzone & Marconcini, TPAMI 2010]
13
Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. DASVM (3) [Bruzzone & Marconcini, TPAMI 2010]
14
Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. DASVM (4) [Bruzzone & Marconcini, TPAMI 2010]
15
Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. DASVM (5) [Bruzzone & Marconcini, TPAMI 2010]
16
Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. DASVM (6) [Bruzzone & Marconcini, TPAMI 2010]
17
Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. Feature Augmentation
Daume-III, “Frustratingly easy domain adaptation”, ACL 2007
Li et al, “Learning with augmented features for supervised and semi-supervised heterogeneous domain adaptation”, TPAMI 2014
18
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. Transfer Component Analysis
Memanfaatkan fungsi jarak antar distribusi probabilitas
● Maximum Mean Discrepancy:
● Formulasi
19
Sumber: http://informationtransfereconomics.blogspot.co.nz/2015/05/the-economic-allocation-problem.html
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. Geodesic Interpolation
Gopalan et al., “Domain adaptation for object recognition: an unsupervised approach”, ICCV 2011
20
Gong et al., “Geodesic flow kernel for unsupervised domain adaptation”, CVPR 2012
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
3. Deep Learning
Donahue et al., “DeCAF: a deep convolutional activation feature for generic visual recognition”, ICML 2014Girshick et al., “Rich feature hierarchies for accurate object detection and semantic segmentation”, CVPR 2014Yosinski et al., “How transferable are features in deep neural networks”, NIPS 2014Long et al., “Learning transferable features with deep adaptation networks”, ICML 2015Ganin & Lempitsky, “Unsupervised domain adaptation through backpropagation”, ICML 2015Ghifary et al., “Deep reconstruction-classification networks for unsupervised domain adaptation”, ECCV 2016
21
[Girshick et al. CVPR2014]
4. Scatter Component Analysis(SCA)
22
M. Ghifary, D. Balduzzi, W. B. Kleijn, M. Zhang, “Scatter Component Analysis: A Unified Framework for Domain Adaptation and Domain Generalization”, TPAMI 2017
Preprint: https://arxiv.org/abs/1510.04373
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. SCA
23
* persebaran = variance = scatter
Algoritma representation/feature learning yang bertujuan:1. Memaksimalkan persebaran data keseluruhan2. Meminimalkan persebaran data pada kelas yang sama3. Memaksimalkan persebaran data yang memiliki kelas berbeda4. Meminimalkan persebaran domain
Discriminative
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. SCA
SCA menggabungkan unsur yang ada pada Principal Component Analysis (PCA)*, Linear Discriminant Analysis (LDA), Maximum Mean Discrepancy (MMD), dan kernel trick.
24
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. Principal Component Analysis (1)● Mereduksi dimensi data menjadi beberapa komponen● Komponen-komponen tersebut mempertahankan
informasi yang dimiliki data
25
Sumber: http://www.nlpca.org/fig_pca_principal_component_analysis.png
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. PCA (2)
PCA berbentuk model linear
Goal:
26
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. PCA (3)
Tulis
sehingga
PCA berbasis linear projection → matrix W ortogonal ( ) →
Asumsi , persebaran / variance dari Z dapat dinyatakan dengan
27
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. PCA (4)
Optimisasi: hitung matriks W
Ekspansi Var(Z)
Bentuk Lagrangian
dimana variabel-variabel Lagrange
28
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. PCA (5)
Hitung
Eigen-decomposition:
Eigen-decomposition dari covariance matrix input/data menghasilkan basis yang memaksimalkan !
Komputasi representasi/component scores:
29
4. PCA (6)
(Trained) loadings/basis:
Diberikan sebuah data
30
Training● Input
○ Pastikan memiliki rata-rata 0
● Bentuk matrix input ● Hitung covariance matrix
● Hitung eigen-decomposision:
Test:● Input● Ekstrak representasi/fitur/components:
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. PCA (7)
Python (numpy)
31
Scikit-learn: http://scikit-learn.org/stable/
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. Kernel PCA
B. Scholkopf, A. Smola, “Nonlinear Component Analysis as a Kernel Eigenvalue”, Neural Computation 1998
32
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. PCA vs Kernel PCA
Sumber: http://2.bp.blogspot.com/_slrAR0IXTL0/TF-OZaNbRCI/AAAAAAAAAUo/SdYS3hXd4MI/s1600/figure.png
33
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. SCA: ExperimentsOffice dataset [Saenko et al., ECCV2010]
38
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
4. SCA: ExperimentsOffice dataset [Saenko et al., ECCV2010]
39
Muhammad Ghifary | Transfer LearningWeta Digital Ltd.
5. Kesimpulan1. Transfer learning bertujuan untuk mengatasi problem ketidaksesuaian
antara data dan target/realita (domain mismatch)○ Pengoleksian data sulit dilakukan○ Target tidak diketahui secara pasti
2. Klasifikasi transfer learning○ Inductive transfer: multi-task learning, self-taught learning, one-shot learning, zero-shot
learing○ Transductive transfer: domain adaptation, domain generalization, covariate shift, sample
selection bias
3. Berbagai algoritma TL untuk aplikasi computer vision telah tersedia, namun masih open problem
40
top related