Cluster analysis

  1. Load the dataset CreditCards.csvPreview the document into memory.
  2. Perform the k-means cluster analysis
    1. Remove the first column: CUST_ID since it doesn’t provide any info for cluster.
    2. Determine the optimal number of clusters. Justify your answer. It may take longer running time since it uses a large dataset.
    3. Perform k-means clustering using the optimal number of clusters.
    4. Visualize the clusters in different colors.