Biology Cluster Analysis

This project uses the Iris dataset, which dates back to 1936. The dataset was published by the biologist and statistician Ronald Fisher and contains measurements of various iris plants. It has 150 instances, 5 attributes and a class named Species. The class attribute can take three values: Iris setosa, Iris versicolor and Iris virginica.

Any data analysis should start with a question that we want the data to answer. In this case, the question was: “Is there a pattern in the data by which we can group the three species of iris, so that when we see a new sample we can identify the species to which it belongs?”

To identify and observe the clusters, I went through the following steps:
1. Environment setup and load data;
2. View the data;
3. Build the K-Means model;
4. View cluster results.
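
The four steps above can be sketched in a few lines of Python. This is only a minimal illustration, not necessarily the setup used in this project: it assumes scikit-learn and NumPy are installed, and it loads the copy of the dataset bundled with scikit-learn (four measurement columns and no Id column), which may differ from the file used here. The choice of 3 clusters comes from the three species; `random_state=0` and `n_init=10` are arbitrary assumptions made for reproducibility.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris

# 1. Environment setup and load data (scikit-learn's bundled copy, an assumption)
iris = load_iris()
X = iris.data      # measurements, one row per plant
y = iris.target    # species codes, used only to inspect the clusters afterwards

# 2. View the data
print(iris.feature_names)
print(X[:5])

# 3. Build the K-Means model (3 clusters, one per expected species)
model = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = model.fit_predict(X)

# 4. View cluster results: how many plants of each species fall in each cluster
for cluster in range(3):
    counts = np.bincount(y[labels == cluster], minlength=3)
    summary = {str(name): int(n) for name, n in zip(iris.target_names, counts)}
    print(f"cluster {cluster}: {summary}")
```

K-Means never sees the species labels during fitting; they are used in step 4 only to check how well the discovered clusters line up with the known species. The cluster numbers themselves are arbitrary, so cluster 0 does not necessarily correspond to the first species.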
