Spectral Clustering


Description
Spectral Clustering. Course: Cluster Analysis and Other Unsupervised Learning Methods (Stat 593 E). Speakers: Rebecca Nugent (Statistics), Larissa Stanberry (Radiology), University of Washington. Outline. What is spectral clustering?
Transcripts
Slide 1

Spectral Clustering. Course: Cluster Analysis and Other Unsupervised Learning Methods (Stat 593 E). Speakers: Rebecca Nugent (Statistics), Larissa Stanberry (Radiology), University of Washington

Slide 2

Outline. What is spectral clustering? The clustering problem in graph theory. On the nature of the affinity matrix. Overview of the available spectral clustering algorithms. Iterative Algorithm: A Possible Alternative.

Slide 3

Spectral Clustering. Algorithms that cluster points using eigenvectors of matrices derived from the data. Obtain a data representation in a low-dimensional space that can be easily clustered. A variety of methods use the eigenvectors in different ways.

Slide 4

(Diagram: a data-driven matrix derived from the data, fed into Method 1 and Method 2.)

Slide 5

Spectral Clustering. Empirically very successful. Authors disagree on which eigenvectors to use and how to derive clusters from these eigenvectors. Two general methods.

Slide 6

Method #1. Partition using only one eigenvector at a time. Use the procedure recursively. Example: image segmentation. Uses the 2nd (smallest) eigenvector to define the optimal cut. Recursively generates two clusters with each cut.

Slide 7

Method #2. Use k eigenvectors (k chosen by the user). Directly compute a k-way partitioning. Experimentally it has been seen to be "better".

Slide 8

Spectral Clustering Algorithm (Ng, Jordan, and Weiss). Given a set of points S = {s_1, …, s_n}: form the affinity matrix A; define the diagonal matrix D with D_ii = Σ_k A_ik; form the matrix L = D^{-1/2} A D^{-1/2}; stack the k largest eigenvectors of L to form the columns of the new matrix X; renormalize each of X's rows to have unit length, giving Y; cluster the rows of Y as points in R^k.
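A minimal sketch of these steps in Python (numpy, scipy, scikit-learn), assuming the Gaussian affinity defined on the later algorithm slides; the function name and parameter defaults are illustrative, not part of the slides.

```python
import numpy as np
from scipy.linalg import eigh
from sklearn.cluster import KMeans

def njw_spectral_clustering(S, k, sigma=1.0):
    """Sketch of the Ng-Jordan-Weiss steps for an n x d array of points S."""
    # Affinity matrix with zero diagonal: A_ij = exp(-||s_i - s_j||^2 / (2 sigma^2))
    sq_dists = np.sum((S[:, None, :] - S[None, :, :]) ** 2, axis=-1)
    A = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(A, 0.0)

    # Diagonal matrix D with D_ii = sum_k A_ik, and L = D^{-1/2} A D^{-1/2}
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    L = A * np.outer(d_inv_sqrt, d_inv_sqrt)

    # Columns of X are the k largest eigenvectors of L (eigh sorts ascending)
    _, eigvecs = eigh(L)
    X = eigvecs[:, -k:]

    # Renormalize each row of X to unit length to obtain Y
    Y = X / np.linalg.norm(X, axis=1, keepdims=True)

    # Cluster the rows of Y as points in R^k with k-means
    return KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(Y)
```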

Slide 9

Cluster analysis & graph theory. Good old example: MST → SLD. The minimal spanning tree is the graph of minimum length connecting all data points. All the single-linkage clusters can be obtained by deleting the edges of the MST, starting from the largest one.
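A small illustration of that equivalence, assuming Euclidean data and scipy's sparse-graph routines: cut the k-1 largest MST edges and read the k single-linkage clusters off the connected components. The function name and the simple tie handling are illustrative.

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import minimum_spanning_tree, connected_components

def single_linkage_via_mst(points, k):
    """k single-linkage clusters obtained by deleting the k-1 largest MST edges."""
    dists = squareform(pdist(points))                      # pairwise Euclidean distances
    mst = minimum_spanning_tree(csr_matrix(dists)).toarray()

    # Delete the k-1 longest edges of the MST, starting from the largest one
    if k > 1:
        cutoff = np.sort(mst[mst > 0])[-(k - 1)]
        mst[mst >= cutoff] = 0.0

    # The remaining connected components are the single-linkage clusters
    _, labels = connected_components(csr_matrix(mst), directed=False)
    return labels
```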

Slide 10

Cluster analysis & graph theory II. Graph formulation: view the data set as a set of vertices V = {1, 2, …, n}. The similarity between objects i and j is viewed as the weight A_ij of the edge connecting these vertices. A is called the affinity matrix. We get a weighted undirected graph G = (V, A). Clustering (segmentation) is equivalent to partitioning G into disjoint subsets. The latter can be achieved by simply removing connecting edges.
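A toy sketch of that graph view: threshold the affinity matrix (i.e., remove the weak connecting edges) and read the partition off the connected components. The matrix and threshold below are made up for illustration.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components

def partition_by_edge_removal(A, threshold):
    """Segment the weighted graph G = (V, A) by deleting edges lighter than `threshold`."""
    A_cut = np.where(A >= threshold, A, 0.0)               # remove weak connecting edges
    n_parts, labels = connected_components(csr_matrix(A_cut), directed=False)
    return n_parts, labels

# Two obvious blocks joined by one weak edge
A = np.array([[0.0, 0.9, 0.1, 0.0],
              [0.9, 0.0, 0.0, 0.0],
              [0.1, 0.0, 0.0, 0.8],
              [0.0, 0.0, 0.8, 0.0]])
print(partition_by_edge_removal(A, threshold=0.5))         # 2 components: {0, 1} and {2, 3}
```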

Slide 11

Nature of the Affinity Matrix. "Closer" vertices get larger weight. Weight as a function of σ.
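A short numerical look at this behaviour, assuming the Gaussian affinity A_ij = exp(-||s_i - s_j||^2 / (2σ^2)) used on the later algorithm slides: nearby points get weights close to 1, and every weight grows toward 1 as σ grows.

```python
import numpy as np

def gaussian_affinity(points, sigma):
    """A_ij = exp(-||s_i - s_j||^2 / (2 sigma^2)): closer vertices get larger weight."""
    sq_dists = np.sum((points[:, None, :] - points[None, :, :]) ** 2, axis=-1)
    return np.exp(-sq_dists / (2.0 * sigma ** 2))

points = np.array([[0.0, 0.0], [0.1, 0.0], [3.0, 0.0]])    # two close points, one far away
for sigma in (0.1, 1.0, 10.0):
    A = gaussian_affinity(points, sigma)
    print(f"sigma={sigma}: A[0,1]={A[0, 1]:.3f}, A[0,2]={A[0, 2]:.3f}")
```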

Slide 12

Simple Example. Consider two 2-dimensional, slightly overlapping Gaussian clouds, each containing 100 points.
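One way to reproduce this toy setup, using scikit-learn's built-in spectral clustering as a stand-in for the algorithm described above; the RBF parameter gamma = 1/(2σ^2) = 0.5 is an illustrative choice.

```python
import numpy as np
from sklearn.cluster import SpectralClustering

rng = np.random.default_rng(0)
# Two slightly overlapping 2-D Gaussian clouds, 100 points each
cloud1 = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(100, 2))
cloud2 = rng.normal(loc=[2.5, 0.0], scale=1.0, size=(100, 2))
X = np.vstack([cloud1, cloud2])

model = SpectralClustering(n_clusters=2, affinity='rbf', gamma=0.5, random_state=0)
labels = model.fit_predict(X)
print(np.bincount(labels))   # roughly 100 points per cluster
```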

Slide 13

Simple Example, cont'd I

Slide 14

Simple Example, cont'd II

Slide 15

Magic σ. Affinities grow as σ grows. How does the choice of the σ value affect the results? What would be the optimal choice for σ?

Slide 16

Example 2 (not so simple)

Slide 17

Example 2, cont'd I

Slide 18

Example 2, cont'd II

Slide 19

Example 2, cont'd III

Slide 20

Example 2, cont'd IV

Slide 21

Spectral Clustering Algorithm (Ng, Jordan, and Weiss). Motivation: given a set of points, we would like to cluster them into k subsets.

Slide 22

Algorithm. Form the affinity matrix A: define A_ij = exp(-||s_i - s_j||^2 / (2σ^2)) if i ≠ j, and A_ii = 0. The scaling parameter σ is chosen by the user. Define D, a diagonal matrix whose (i, i) element is the sum of A's row i.

Slide 23

Algorithm. Form the matrix L = D^{-1/2} A D^{-1/2}. Find x_1, …, x_k, the k largest eigenvectors of L. These form the columns of the new matrix X. Note: we have reduced the dimension from n×n to n×k.

Slide 24

Algorithm. Form the matrix Y by renormalizing each of X's rows to have unit length: Y_ij = X_ij / (Σ_j X_ij^2)^{1/2}. Treat each row of Y as a point in R^k. Cluster into k clusters via K-means.
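The row renormalization in one line of numpy, with a small random X standing in for the n×k eigenvector matrix:

```python
import numpy as np

X = np.random.default_rng(0).normal(size=(6, 3))        # stand-in for the n x k matrix of eigenvectors
Y = X / np.linalg.norm(X, axis=1, keepdims=True)        # Y_ij = X_ij / sqrt(sum_j X_ij^2)
print(np.linalg.norm(Y, axis=1))                        # every row of Y now has unit length
```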

Slide 25

Algorithm. Final cluster assignment: assign point s_i to cluster j if and only if row i of Y was assigned to cluster j.

Slide 26

Why? If we eventually use K-means, why not simply apply K-means to the original data? Because this method allows us to cluster non-convex regions.
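A standard demonstration of that point, assuming scikit-learn: two concentric rings form non-convex clusters; K-means on the raw coordinates splits them incorrectly, while spectral clustering (here with a nearest-neighbour graph affinity as a robust stand-in for the Gaussian affinity) separates the rings.

```python
import numpy as np
from sklearn.datasets import make_circles
from sklearn.cluster import KMeans, SpectralClustering
from sklearn.metrics import adjusted_rand_score

# Two concentric rings: a non-convex clustering problem
X, y_true = make_circles(n_samples=400, factor=0.4, noise=0.05, random_state=0)

kmeans_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
spectral_labels = SpectralClustering(n_clusters=2, affinity='nearest_neighbors',
                                     n_neighbors=10, random_state=0).fit_predict(X)

print("k-means ARI: ", adjusted_rand_score(y_true, kmeans_labels))    # typically well below 1
print("spectral ARI:", adjusted_rand_score(y_true, spectral_labels))  # typically close to 1
```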

Slide 28

User's Prerogative. Choice of k, the number of clusters. Choice of the scaling factor σ; realistically, search over σ and pick the value that gives the tightest clusters. Choice of the clustering method.
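A hedged sketch of that search over σ, using the K-means distortion (inertia) of the embedded, row-normalized points as the measure of cluster tightness; the candidate σ grid and the toy data are arbitrary choices.

```python
import numpy as np
from scipy.linalg import eigh
from sklearn.cluster import KMeans

def embedding_distortion(S, k, sigma):
    """K-means distortion of the embedded points for one value of sigma (smaller = tighter)."""
    sq = np.sum((S[:, None, :] - S[None, :, :]) ** 2, axis=-1)
    A = np.exp(-sq / (2.0 * sigma ** 2))
    np.fill_diagonal(A, 0.0)
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    L = A * np.outer(d_inv_sqrt, d_inv_sqrt)
    _, vecs = eigh(L)
    X = vecs[:, -k:]
    Y = X / np.linalg.norm(X, axis=1, keepdims=True)
    return KMeans(n_clusters=k, n_init=10, random_state=0).fit(Y).inertia_

rng = np.random.default_rng(0)
S = np.vstack([rng.normal([0, 0], 1.0, (100, 2)), rng.normal([2.5, 0], 1.0, (100, 2))])
sigmas = [0.1, 0.5, 1.0, 2.0, 5.0]
best = min(sigmas, key=lambda s: embedding_distortion(S, k=2, sigma=s))
print("sigma giving the tightest embedded clusters:", best)
```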

Slide 29

Comparison of Methods

Slide 30

Advantages/Disadvantages. Perona/Freeman: for block-diagonal affinity matrices, the first eigenvector finds points in the "dominant" cluster; not very stable. Shi/Malik: the 2nd generalized eigenvector minimizes the affinity between groups relative to the affinity within each group; no guarantees; constrained problem.

Slide 31

Advantages/Disadvantages. Scott/Longuet-Higgins: depends largely on the choice of k; good results. Ng, Jordan, Weiss: again depends on the choice of k; claim: effectively handles clusters whose overlap or connectedness varies across clusters.

Slide 32

(Figure: the affinity matrix alongside the Perona/Freeman 1st eigenvector, the Shi/Malik 2nd generalized eigenvector, and the Scott/Longuet-Higgins Q matrix.)
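A sketch of the three quantities that comparison refers to, computed for a small block-structured affinity matrix. The constructions follow the usual formulations of each method (Perona/Freeman: leading eigenvector of A; Shi/Malik: second smallest generalized eigenvector of (D - A)x = λDx; Scott/Longuet-Higgins: Q = V V^T with V the row-normalized top-k eigenvectors of A) and are stated here as assumptions rather than quotes from the slides.

```python
import numpy as np
from scipy.linalg import eigh

# Small affinity matrix with two blocks plus a weak cross-link
A = np.array([[0.0, 0.9, 0.8, 0.1, 0.0],
              [0.9, 0.0, 0.7, 0.0, 0.0],
              [0.8, 0.7, 0.0, 0.0, 0.1],
              [0.1, 0.0, 0.0, 0.0, 0.9],
              [0.0, 0.0, 0.1, 0.9, 0.0]])
D = np.diag(A.sum(axis=1))

# Perona/Freeman: leading eigenvector of A (largest eigenvalue)
_, vecs = eigh(A)
pf_vector = vecs[:, -1]

# Shi/Malik: 2nd smallest generalized eigenvector of (D - A) x = lambda D x
_, gen_vecs = eigh(D - A, D)
sm_vector = gen_vecs[:, 1]

# Scott/Longuet-Higgins: Q = V V^T, with V the row-normalized top-k eigenvectors of A
k = 2
V = vecs[:, -k:]
V = V / np.linalg.norm(V, axis=1, keepdims=True)
Q = V @ V.T

print("Perona/Freeman 1st eigenvector:", np.round(pf_vector, 2))
print("Shi/Malik 2nd gen. eigenvector:", np.round(sm_vector, 2))
print("Scott/Longuet-Higgins Q:\n", np.round(Q, 2))
```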

Slide 33

Inherent Weakness. At some point, a clustering method must be chosen. Each clustering method has its strengths and weaknesses. Some methods also require a priori knowledge of k.

Slide 34

One enticing alternative: the Polarization Theorem (Brand & Huang). Consider the eigenvalue decomposition of the affinity matrix, V Λ V^T = A. Define X = Λ^{1/2} V^T. Let X_(d) = X(1:d, :) be the top d rows of X: the d principal eigenvectors scaled by the square roots of the corresponding eigenvalues. A_d = X_(d)^T X_(d) is the best rank-d approximation to A with respect to the Frobenius norm (||A||_F^2 = Σ_ij a_ij^2).
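A quick numerical check of the rank-d claim, assuming a positive semi-definite affinity matrix (a Gaussian kernel here): the truncation A_d = X_(d)^T X_(d) should match the Frobenius-optimal rank-d approximation given by the truncated SVD.

```python
import numpy as np
from scipy.linalg import eigh, svd

rng = np.random.default_rng(1)
pts = rng.normal(size=(30, 2))
A = np.exp(-np.sum((pts[:, None] - pts[None, :]) ** 2, axis=-1) / 2.0)  # PSD Gaussian affinity

vals, vecs = eigh(A)                    # ascending eigenvalues
vals, vecs = vals[::-1], vecs[:, ::-1]  # sort descending
X = np.sqrt(np.clip(vals, 0.0, None))[:, None] * vecs.T   # X = Lambda^{1/2} V^T

d = 3
A_d = X[:d, :].T @ X[:d, :]             # rank-d truncation from the theorem

U, s, Vt = svd(A)                       # Frobenius-optimal rank-d approximation
A_svd = (U[:, :d] * s[:d]) @ Vt[:d, :]

print(np.linalg.norm(A - A_d, 'fro'), np.linalg.norm(A - A_svd, 'fro'))  # equal up to rounding
```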

Slide 35

The Polarization Theorem II. Build Y_(d) by normalizing the columns of X_(d) to unit length. Let θ_ij be the angle between x_i and x_j, columns of X_(d). Claim: as A is projected to successively lower ranks A_(N-1), A_(N-2), …, A_(d), …, A_(2), A_(1), the sum of squared angle cosines Σ (cos θ_ij)^2 is strictly increasing.
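A numerical sketch of the claim, again assuming a positive semi-definite Gaussian affinity: build X = Λ^{1/2} V^T, keep the top d rows, and track the sum of squared cosines between its columns as d decreases.

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(0)
points = np.vstack([rng.normal([0, 0], 0.5, (20, 2)), rng.normal([3, 0], 0.5, (20, 2))])
sq = np.sum((points[:, None, :] - points[None, :, :]) ** 2, axis=-1)
A = np.exp(-sq / 2.0)                                      # PSD Gaussian affinity, sigma = 1

# Eigendecomposition A = V Lambda V^T, eigenvalues sorted descending
vals, vecs = eigh(A)
vals, vecs = vals[::-1], vecs[:, ::-1]
X = np.sqrt(np.clip(vals, 0.0, None))[:, None] * vecs.T    # X = Lambda^{1/2} V^T

def sum_sq_cosines(Xd):
    """Sum of squared cosines of the angles between the columns of X_(d)."""
    Yd = Xd / np.linalg.norm(Xd, axis=0, keepdims=True)     # normalize columns to unit length
    C = Yd.T @ Yd                                           # cos(theta_ij)
    return np.sum(C ** 2)

n = A.shape[0]
for d in (n, n // 2, 10, 4, 2, 1):
    print(f"d={d:3d}  sum of squared cosines = {sum_sq_cosines(X[:d, :]):.2f}")
# The printed values grow as d drops (polarization)
```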

Slide 36

Brand-Huang algorithm. Basic approach: two alternating projections: projection to low rank, and projection to the set of zero-diagonal doubly stochastic matrices (all rows and columns sum to unity).

Slide 37

Brand-Huang algorithm II. While {number of unit eigenvalues} < 2 do: A → P, A_(d), P, A_(d), … (alternate the two projections). The projection is done by suppressing the negative eigenvalues and the unity eigenvalue. The presence of two or more stochastic (unit) eigenvalues implies reducibility of the resulting P matrix. A reducible matrix can be row- and column-permuted into block-diagonal form.
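A loose sketch of such an alternating-projection loop, with the stand-ins stated up front: the doubly-stochastic step is approximated by Sinkhorn-style row/column normalization rather than the exact projection used by Brand and Huang, the zero-diagonal constraint is not enforced, and the stopping rule simply checks for a second near-unit eigenvalue.

```python
import numpy as np
from scipy.linalg import eigh

def low_rank_projection(A, d):
    """Keep the d largest non-negative eigenvalues of A (suppress the rest)."""
    vals, vecs = eigh(A)
    vals, vecs = vals[::-1], vecs[:, ::-1]
    vals = np.clip(vals, 0.0, None)
    vals[d:] = 0.0
    return (vecs * vals) @ vecs.T

def sinkhorn_projection(A, iters=50):
    """Approximate projection toward a doubly stochastic matrix (rows/columns sum to 1)."""
    P = np.maximum(A, 1e-12)
    for _ in range(iters):
        P = P / P.sum(axis=1, keepdims=True)
        P = P / P.sum(axis=0, keepdims=True)
    return 0.5 * (P + P.T)                 # keep the result symmetric

def brand_huang_like(A, d, max_iters=200, tol=1e-6):
    """Alternate the two projections until a second (near-)unit eigenvalue appears."""
    P = A.copy()
    for _ in range(max_iters):
        P = sinkhorn_projection(low_rank_projection(P, d))
        vals = np.sort(eigh(P, eigvals_only=True))[::-1]
        if np.sum(vals > 1.0 - tol) >= 2:   # two unit eigenvalues => reducible P
            break
    return P
```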

Slide 38

Brand-Huang algorithm III

Slide 39

References
Alpert et al., Spectral partitioning with multiple eigenvectors
Brand & Huang, A unifying theorem for spectral embedding and clustering
Belkin & Niyogi, Laplacian eigenmaps for dimensionality reduction and data representation
Blatt et al., Data clustering using a model granular magnet
Buhmann, Data clustering and learning
Fowlkes et al., Spectral grouping using the Nystrom method
Meila & Shi, A random walks view of spectral segmentation
Ng et al., On spectral clustering: analysis and an algorithm
Shi & Malik, Normalized cuts and image segmentation
Weiss et al., Segmentation using eigenvectors: a unifying view
