Run T-SNE using 500 PC's:
During the experiment, I only adjusted the PC's value, and other parameters are the same as in Problem
1. By comparing the above experimental results, it can be found that when the PC's value gradually
increases, the points of each cluster in the dimensionality reduction result of T-SNE will become sparse and
the number of classes will change. Specifically, it can be found in the results that the number of clusters
corresponding to 100PC's and 250PC's is 5, and the number of clusters corresponding to 100PC's and
250PC's in general is 3 (two clusters in the 100PC's seem to be divisible into two classes internally, and
one cluster in the 100PC's seems to be divisible into two classes internally) and the points in the clusters
are sparser than before. Note that when the PC's value reaches 500, the corresponding number of clusters
is 4 (one cluster corresponds to very few points).
2. (13 points) Pick three hyper-parameters below (the 3 is the total number that a report needs to analyze.
It can take a) 2 from A, 1 from B, or b) 1 from A, 2 from B.) and analyze how changing the hyper-parameters
affect the conclusions that can be drawn from the data. Please choose at least one hyper-parameter from
each of the two categories (visualization and clustering/feature selection). At minimum, evaluate the hyper-
parameters individually, but you may also evaluate how joint changes in the hyper-parameters affect the
results. You may use any of the datasets we have given you in this project. For visualization hyper-
parameters, you may find it productive to augment your analysis with experiments on synthetic data, though
we request that you use real data in at least one demonstration.
Solution: The experimental parameters I selected are Category A (T-SNE perplexity, T-SNE learning rate),
Category B (Effect of number of PC's chosen on clustering). The experimental data I used is the
experimental data of Problem 1.
I adjusted the values of PC's independently during the experiment, and then adjusted the T-SNE perplexity
and T-SNE learning rate at the same time. 25 images can be obtained for each value of PC's, and I made a
5×5 arrangement for easy comparison. T-SNE perplexity trial parameters: 10,20,30,40,50. T-SNE Learning
Rate parameter:100,500,1500,2000,2500. PC quantity parameters: 10,50,100,250,500.