kpax2 is an R package for clustering (large) datasets of categorical statistical variables. Its main application is to genetic datasets, such as DNA or protein multiple sequence alignments, but as a general method it can easily be applied to any kind of categorical dataset. The output of kpax2 is a classification of both the rows (statistical units) and the columns (statistical variables) of the provided data matrix.

The code is available on GitHub.

Note: development of this package has been discontinued. kpax2 has been superseded by Kpax3.jl.

Alberto Pessia
Postdoctoral researcher