Click here to Skip to main content
15,885,914 members
Articles / Artificial Intelligence / Machine Learning

Intermediate Data Clustering with k-means

Rate me:
Please Sign up or sign in to vote.
5.00/5 (36 votes)
7 Jun 2023Public Domain54 min read 51.1K   1.2K   61  
Add features to k-means for missing data, mixed data, and choosing the number of clusters
The k-means data clustering algorithm is extended to support missing data, mixed data, and to choose the number of clusters. A modified distance calculation finds solutions with a lower total error. Multi-dimensional scaling with missing data support is included. Full source code with no library dependencies is provided in FORTRAN with a full translation to C. Several example problems are discussed.

Views

Daily Counts

Downloads

Weekly Counts

License

This article, along with any associated source code and files, is licensed under A Public Domain dedication


Written By
Engineer Kruger Optical
United States United States
I work on an assembly line and have been doing amateur programming since 1992. Those rare times away from the computer I spend bicycling, canoeing the Columbia River, or cheering on the Seattle Seahawks.

Comments and Discussions