PCA on the Wisconsin breast cancer dataset

In this project, the Wisconsin breast cancer dataset is visualized in two dimensions using Principal Component Analysis (PCA). The process involves the following steps:

Rescaling the Data:
- Utilize sklearn.preprocessing.StandardScaler to rescale the data, ensuring that every feature has a mean of 0 and a standard deviation of 1 across the various points in the dataset.
Compute Top Two Principal Components:
- Use two different approaches:
  - Direct SVD (Singular Value Decomposition):
    - Compute the top two principal components directly using SVD without using any PCA built-ins.
  - PCA from sklearn.decomposition:
    - Use sklearn.decomposition.PCA to compute the top two principal components.
Coordinate Calculation:
- For every data point, compute its coordinates (projections) along the two principal components.
Scatterplot Visualization:
- Create a scatterplot of the dataset in 2 dimensions.
- X-axis represents the first principal component, and the Y-axis represents the second.
- Color the points based on their diagnosis (malignant or benign).

The two approaches (direct SVD and PCA from sklearn.decomposition) should yield exactly the same results, with potential sign differences that can be resolved by flipping signs to ensure identical representations. The scatterplots generated for both approaches should be identical. The analysis aims to explore whether the data is roughly separable in two dimensions, providing insights into the inherent structure of the dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
PCA.ipynb		PCA.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PCA on the Wisconsin breast cancer dataset

About

Releases

Packages

Languages

saraabme/PCA_Wisconsin_breast_cancer_dataset

Folders and files

Latest commit

History

Repository files navigation

PCA on the Wisconsin breast cancer dataset

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages