K-Means Clustering

This is an unsupervised cluster analysis for identifying groups of approaches most similar in functionality to MidSemI. We evaluated the integration systems in related works section against the assessment presented in Table 3. The results are summarized in Table 4. In order to achieve this purpose, we transformed Table 4 into a matrix with numerical values that denote normalized distances (in the range [0-1]) among features, values close to 1 indicate proximity to the best implementation of this feature or dimension. We calculated distances related to dimensions by summing distances of all the features belongs to given dimension divided by their number. Figure 14 (a) reports the distances values for both features and dimensions. We employed Weka [https://www.cs.waikato.ac.nz/ml/weka/] which is a well-known machine learning analysis tool to perform the clustering. We applied the K-Means clustering algorithm with K=4. By clustering ten systems to four clusters, we aim at identifying a medium groups of systems which share similar function and to reveal dependencies encoded in the vector representation. Figure 14 (b) depicts the clustering results. The first cluster includes MidSemI along with FuhSen and LDIF systems. The second cluster contains OD CleanStore and LSM tools. The Third cluster holds MOMIS and SWIS systems. The rest of systems Mastro, KRAFT and OBSERVER form the last cluster. We generated the positions of clusters randomly, and the radius of each bubble is proportional to the sum of features distances for each system. Figure 15 (a) and (b) plots comparison of features and dimensions coverage respectively concerning only the first cluster that holds systems closest to our MidSemI approach (FuhSen and LDIF).

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
evaluation		evaluation
1- Table_3 and Table_4.xlsx		1- Table_3 and Table_4.xlsx
Classeur_DI.arff		Classeur_DI.arff
Classeur_DI.csv		Classeur_DI.csv
Classeur_F.arff		Classeur_F.arff
Classeur_F.csv		Classeur_F.csv
Clustering_Resulats.txt		Clustering_Resulats.txt
Clustering_Resultas_Classeur_DI		Clustering_Resultas_Classeur_DI
Clustering_Resultas_Classeur_F		Clustering_Resultas_Classeur_F
Figure 14 _ Table 4.png		Figure 14 _ Table 4.png
Figure_15.png		Figure_15.png
README.md		README.md
Table_3.png		Table_3.png
Table_4-Figure_14.png		Table_4-Figure_14.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

K-Means Clustering

About

Releases

Packages

SamRepository/K-Means_Clustering

Folders and files

Latest commit

History

Repository files navigation

K-Means Clustering

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages