christel vrain, thi-bich-hanh dao · christel vrain, thi-bich-hanh dao lifo université...

29
Descriptive clustering Christel VRAIN , Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 1 / 29

Upload: others

Post on 25-Jun-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Descriptive clustering

Christel VRAIN, Thi-Bich-Hanh DAO

LIFOUniversité d’Orléans

Workshop on Machine Learning and Explainability

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 1 / 29

Page 2: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

MotivationClustering used extensively in AI applicationsIn many domains, data have very good features/attributes to formcompact clusters, but

I features cannot explain the clustering wellI data also described by another set of (potentially sparse and noisy)

descriptors/tags that are useful for explanation

Setting Features/attributes Descriptors/tagsTwitter network mention/retweet graph hashtag usageImages SIFT features tags

Needs to balance compact clusters (w.r.t. to a distance betweenobjects) with

I their consistency with human expectationsI their explanations to human

Aims:1 find clusters close to the expert expectations by leveraging

knowledge2 discover simultaneously explanations during the clustering process

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 2 / 29

Page 3: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Mainly two frameworks for clustering

Conceptual Clustering:I introduced in the 80’s [Michalski & Stepp, 1983, Fisher, 1985]I presently based on closed patterns (FCA and pattern mining)I based on qualitative propertiesI does not take into account quantitative attributes, nor distance

between objects (no notion of compactness, e.g. clusters diameter)

Distance-based clustering:I based on dissimilarities between objectsI appropriate for quantitative dataI qualitative properties must be encapsulated in a distance

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 3 / 29

Page 4: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

A declarative framework for constrained clustering inCPDao, Duong, Vrain, AIJ 2017

Input: a dataset or a dissimilarity measure between pairs of pointsClusters are defined by an assignment of points to clusters:

G[o] = c, c ∈ [1, k ]Optimization criterion, e.g. minimizing the maximum diameterConstraints are put

I for representing a partitionI for breaking symmetriesI user constraints: size, diameter, split, . . .

G1 = 1Gi ≤ maxj∈[1,i−1](Gj) + 1, for i ∈ [2,n]#{i | Gi = kmin} ≥ 1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 4 / 29

Page 5: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

How to make clustering interpretable?

Before the clustering process: leverage human knowledge beforeclustering→ actionable clusteringAfter the clustering process→ explain the cluster :

I CharacterizationI GeneralizationI Statistics

During the clustering process. Two assumptionsI Clustering and explanations are in the same representation space→ conceptual clustering

I Clustering and explanations are in two different representationspaces.

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 5 / 29

Page 6: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Actionable clusteringDao, Vrain, Duong, Davidson, ECAI 2016Express constraints that makes the clustering useful for a givenpurpose

Find useful groups each of which you can invite to a different dinnerparty

equal number of males and femaleswidth of a cluster in terms of age at most 10each person in a cluster should have at least r other people withthe same hobby

Instances 3,9 are in the same cluster if 11,15 are in different clusters.

B1 ↔ (G11 6= G15)B2 ↔ (G3 = G9)B1 ≤ B2

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 6 / 29

Page 7: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Unifying conceptual and distance clusteringDao, Lesaint, Vrain, JFPC 2015

- taking into account quantitative and qualitative data- combining conditions/criteria from both frameworks

Data:I a set O of objects, a set I of Boolean propertiesI a dissimilarity measure d(o,o′) for any o, o′ in OI a binary database D: Dop = 1, when o satisfies property p

Clusters are defined by:1 assignment of points to clusters: G[o] = c, c ∈ [1, k ]2 description of clusters: A[c,p] = 1 iff p is in the description of

cluster c.Constraints

I Constraints of the distance-based model: partition, breakingsymmetries

I Constraints from the conceptual model: an object is in a cluster iff itsatisfies all its properties.

∀o ∈ O, ∀c ∈ C G[o] = c ⇔∑

p∈I A[c,p](1− Dop) = 0

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 7 / 29

Page 8: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Car Dataset

193 objectstechnical properties (22 attributes) :

I motorization (diesel or not)I drive wheels (4, 2 front, 2 rear)I power (between 48 and 288)I etc.

discretization : 64 qualitative attributesprice (quantitative attribute)

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 8 / 29

Page 9: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Car dataset

Conceptual setting

→ (e) concepts + maximizing min. size ofclusters

→ (f) concepts + maximizing min. size ofconceptsPrice distribution not convincing

Distance-based setting

→ (g) minimizing max diameterNo convincing concepts

Unified framework

→ (h) concepts + minimizing max diameterA better modeling of the 3 car rangeswith concepts based on size, enginepower, fuel consumption, . . .

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 9 / 29

Page 10: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Descriptive clustering formulationDao, Kuo, Ravi, Vrain, Davidson, IJCAI 2018

Data: n data instances described by numerical features X andinterpretable boolean descriptors/tags DAims: Simultaneously look for clusters which are both

I good/compact in one modality (e.g. SIFT features for images orgraph distance)

I useful/descriptive in another modality (e.g. tags)

The objectives are not compatible

→ computation of a Pareto front corresponding to Pareto optimalsolutions, allowing to model a trade-off with both objectivesf : feature-focused objective to minimize compactnessg: descriptor-focused objective to maximize interpretability

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 10 / 29

Page 11: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Pareto optimal solutions and Pareto front

g

fCriterion space

Partition P ′ dominates P iff better in one criterion and not worse inthe otherP is a Pareto optimal solution iff there is no P ′ which dominates PPareto front = {(f (P),g(P)) | P is a Pareto optimal solution}

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 11 / 29

Page 12: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Compute the complete Pareto front

P ← ∅;sf

1 ← minimize f subject to C;i ← 1;while sf

i 6= NULL dosg

i ← maximize g subject to C ∪ {f ≤ f (sfi )};

P ← P ∪ {sgi };

i ← i + 1;sf

i ← minimize f subject to C ∪ {g > g(sgi−1)};

return P;

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 12 / 29

Page 13: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Data and variables

Data:I X : n × f matrix of n data instances with f numerical featuresI D: n × r matrix of the same n instances with r tag indicators

Variables:I cluster indication matrix Z : n × k boolean matrix

Zic = 1 indicates the i-th instance is in the c-clusterI cluster description matrix S: k × r boolean matrix

Scp = 1 means the p-th tag is included in the description of the c-thcluster

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 13 / 29

Page 14: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Partitioning constraints

Each instance is in one clusterEach cluster has at least one elementBreaking symmetries between clusters

∀i = 1, . . . ,n,∑k

c=1 Zic = 1∀c = 1, . . . , k ,

∑ni=1 Zic ≥ 1

Z11 = 1∀i = 2, . . . ,n,∀c = 2, . . . , k ,

∑i−1j=1 Zjc−1 ≥ Zic

Each cluster description has at least one tag

∀c = 1, . . . , k ,n∑

i=1

Scp ≥ 1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 14 / 29

Page 15: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Cluster description constraintsEach cluster is described by a non empty subset of tagsAn instance in a cluster must satisfy most of its descriptions (up toα exceptions):

∀c = 1, . . . , k , ∀i = 1, . . . ,n,

Zic = 1 =⇒r∑

p=1

Scp(1− Dip) ≤ α

A tag is included in a cluster description if and only if most of theinstances in the cluster (up to β exceptions) possess it:

∀c = 1, . . . , k , ∀p = 1, . . . , r ,

Scp = 1 ⇐⇒n∑

i=1

Zic(1− Dip) ≤ β

With dense tags dataset, stronger version with α = β = 0Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 15 / 29

Page 16: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Feature-focused optimization criteria

Finding compact clusters, based on their distance d(·, ·) defined overpairs of instances

f(Z, S) =n

maxi<j,i,j=1

ZiZTj d(Xi, Xj)

<latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AAAC83icjVHLSsNAFD3G97vq0s1gESqUmoiggoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu22WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+55qdOSaCXxZxt0bkuufqQ==</latexit><latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AAAC83icjVHLSsNAFD3G97vq0s1gESqUmoiggoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu22WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+55qdOSaCXxZxt0bkuufqQ==</latexit><latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AAAC83icjVHLSsNAFD3G97vq0s1gESqUmoiggoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu22WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+55qdOSaCXxZxt0bkuufqQ==</latexit><latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AAAC83icjVHLSsNAFD3G97vq0s1gESqUmoiggoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu22WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+55qdOSaCXxZxt0bkuufqQ==</latexit>

Diameter arg min f (Z ,S)

f(Z, S) = ⌃ni<j,i,j=1ZiZ

Tj d(Xi, Xj)

<latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AAAC9XicjVHLSsNAFD3G97vq0s1gESqUkoiggoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xgg9EJyQ5c+49Z+be68a+SKVpPncZ3T29ff0Dg0PDI6Nj44WJyb00yhKPV73Ij5Ka66TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmmWu3RKT69CSkZZkkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR99ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezzg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit><latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AAAC9XicjVHLSsNAFD3G97vq0s1gESqUkoiggoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xgg9EJyQ5c+49Z+be68a+SKVpPncZ3T29ff0Dg0PDI6Nj44WJyb00yhKPV73Ij5Ka66TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmmWu3RKT69CSkZZkkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR99ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezzg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit><latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AAAC9XicjVHLSsNAFD3G97vq0s1gESqUkoiggoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xgg9EJyQ5c+49Z+be68a+SKVpPncZ3T29ff0Dg0PDI6Nj44WJyb00yhKPV73Ij5Ka66TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmmWu3RKT69CSkZZkkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR99ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezzg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit><latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AAAC9XicjVHLSsNAFD3G97vq0s1gESqUkoiggoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xgg9EJyQ5c+49Z+be68a+SKVpPncZ3T29ff0Dg0PDI6Nj44WJyb00yhKPV73Ij5Ka66TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmmWu3RKT69CSkZZkkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR99ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezzg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit>

Sum of within-cluster distances

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 16 / 29

Page 17: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Descriptor-focused optimization criteria

Tags may be dense, sparse or noisy⇒ different objective functions

1 Minimize tag disagreement (MTD)I minimize α+ βI useful when the tags contain noise

2 Max-min complete tag agreement (MMCTA):I the tags of a cluster are shared by all its instances (α = β = 0)I maximize the tag set of each cluster (size of the smallest)I Use: tags well populated with little noise

3 Max-min neighborhood agreement (MMNA):I each pair of instances in a same cluster must share at least q tagsI maximize qI Use: tags are sparse

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 17 / 29

Page 18: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Minimize tag disagreement (MTD)

allows disagreementsα number of tags an instance may not possesβ number of instances a tag may not coveruseful when tags are sparse and/or noisy

A B C D E F G

1 3 4 5

Inst.

Tags

2α = 1, β = 2

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 18 / 29

Page 19: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Max-min complete tag agreement (MMCTA)

the tags of a cluster are shared by all its instances (α = β = 0)useful when the tags are well populated with little noise

A B C D E F G

1 2 3 4 5

Inst.

Tags

MMCTA=1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 19 / 29

Page 20: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Max-min neighborhood agreement (MMNA)

Every pair of instances in a cluster must share at least q tagsuseful when tags are sparse

A B C D E F G

1 2 3 4 5

Inst.

Tags

MMNA=1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 20 / 29

Page 21: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Two methods

Integer linear programming (ILP): all the constraints andobjectives can be transformed into linear formConstraint programming (CP): using global constraints, reifiedconstraints and restart search

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 21 / 29

Page 22: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

CP formulationSupplementary variables Gi ∈ {1, . . . , k} for i = 1, ..,nGi = c means i-th instance is in c-clusterChanneling constraints:

Zic = 1⇐⇒ Gi = c

Global constraints:

precede(G, [1, .., k ])atleast(G,1, k)diameter(G, f ,d)

Reified constraints to express description constraintsFor MMCTA, MMNA, new global constraints to enforce:

∀i , j = 1, ..,n, Gi = Gj =⇒r∑

p=1

DipDjp ≥ q

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 22 / 29

Page 23: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Clustering tagged images

[Lampert C.H, Nickisch H., Harmeling S., CVPR 2009]30000 images from 50 classes of animalsEach image described by 2000 SIFT features and variablenumber of tags (black, fast, timid, etc.)Animal names are not given to algorithmRandomly sample 100 images from 10 first animal classes

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 23 / 29

Page 24: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Trade off compactness vs. useful description

7,200 7,600 8,000 8,400

12

14

16

18

Diameter

MM

NA

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 24 / 29

Page 25: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

First Pareto point: Diameter minimized. MMCTA=4. MMNA=11Cl# Composition by animals Description by tagsC1 1 grizzly bear, 2 dalmatian, 1 horse, 2 blue whale big, fast, strong, muscle,

newworld, smartC2 5 antelope, 2 grizzly bear, 2 beaver, 5 dalmatian,

5 persian cat, 5 horse, 6 german shepherd, 3siamese cat

furry, chewteeth, fast,quadrapedal, newworld,ground

C3 69 beaver, 64 dalmatian, 42 persian cat, 29 bluewhale, 42 siamese cat

tail, fast, newworld, timid,smart, solitary

C4 100 killer whale, 69 blue whale, 1 siamese cat tail, fast, fish, smartC5 95 antelope, 97 grizzly bear, 29 beaver, 29 dalma-

tian, 53 persian cat, 94 horse, 94 german shep-herd, 54 siamese cat

furry, chewteeth, fast,quadrapedal, newworld,ground

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 25 / 29

Page 26: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Third Pareto point. MMCTA=9, MMNA=15Cl# Composition by animals Description by tagsC1 2 antelope, 4 dalmatian, 2 horse,

3 german shepherd, 4 siamesecat

furry, lean, longleg, tail, chewteeth, walks, fast,muscle, quadrapedal, active, agility, newworld,oldworld, ground

C2 2 beaver, 1 persian cat, 1 horse,1 german shepherd

furry, tail, chewteeth, fast, quadrapedal, agility,newworld, ground, smart

C3 100 grizzly bear, 98 beaver, 99persian cat, 1 siamese cat

furry, paws, chewteeth, claws, fast,quadrapedal, fish, newworld, ground, smart,solitary

C4 100 killer whale, 100 blue whale spots, hairless, toughskin, big, bulbous, flip-pers, tail, strainteeth, swims, fast, strong, fish,plankton, arctic, ocean, water, smart, group

C5 98 antelope, 96 dalmatian, 97horse, 96 german shepherd, 95siamese cat

furry, lean, longleg, tail, chewteeth, walks, fast,muscle, quadrapedal, active, agility, newworld,oldworld, ground

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 26 / 29

Page 27: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Fifth Pareto point: MMNA maximized. MMCTA=15, MMNA=18Cl# Composition by animals Description by tagsC1 100 antelope, 100 dalmatian furry, big, lean, longleg, tail, chewteeth, walks,

fast, strong, muscle, quadrapedal, active,agility, newworld, oldworld, ground, timid, group

C2 100 horse, 99 german shepherd,98 siamese cat

black, brown, gray, patches, furry, lean, lon-gleg, tail, chewteeth, walks, fast, muscle,quadrapedal, active, agility, newworld, old-world, ground, smart, domestic

C3 100 grizzly bear, 100 beaver, 1siamese cat

brown, furry, paws, chewteeth, claws, fast, mus-cle, quadrapedal, active, nocturnal, fish, new-world, ground, smart, solitary

C4 100 killer whale, 100 blue whale spots, hairless, toughskin, big, bulbous, flip-pers, tail, strainteeth, swims, fast, strong, fish,plankton, arctic, ocean, water, smart, group

C5 100 persian cat, 1 german shep-herd, 1 siamese cat

gray, furry, pads, paws, tail, chewteeth, meat-teeth, claws, walks, fast, quadrapedal, agility,meat, newworld, oldworld, ground, smart, soli-tary, domestic

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 27 / 29

Page 28: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Towards actionable/explainable clustering

Con

stra

inab

ility

Explanability

Classicclustering

ConstrainedClusteringAIJ 2017

Actionable clusteringECAI 2016

HIL clusteringAAAI 2017

Conceptualclustering

Descriptiveclustering

IJCAI 2018

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 28 / 29

Page 29: Christel VRAIN, Thi-Bich-Hanh DAO · Christel VRAIN, Thi-Bich-Hanh DAO LIFO Université d’Orléans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive

Future work

Framework is extendable to other types of compactness anddescription

ScalabilityI Smart dataI Sampling ??I Relaxing the search for an optimal solution

Learning constraints and preferences

Leveraging knowledge by means of constraints in otherframeworks

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 29 / 29