cluster-analysis

kpax2

R package for bi-clustering multivariate categorical data.

Kpax3.jl

Julia package for bi-clustering multivariate categorical data.

Convergent amino acid signatures in polyphyletic Campylobacter jejuni sub-populations suggest human niche tropism

Human infection with the gastrointestinal pathogen C. jejuni is dependent upon the opportunity for zoonotic transmission and the ability of strains to colonize the human host. Certain lineages of this diverse organism are more common in human …

Kpax3: Bayesian bi-clustering of large sequence datasets

Motivation: Estimation of the hidden population structure is an important step in many genetic studies. Often the aim is also to identify which sequence locations are the most discriminative between groups of samples for a given data partition. …

Bayesian cluster analysis with applications to pathogen population genomics

Identifying similarity patterns in heterogeneous observations is a very common problem in many branches of science. When the similarities and dissimilarities are encoded by a group structure, the task of dividing the observed sample into an unknown …

K-Pax2: Bayesian identification of cluster-defining amino acid positions in large sequence datasets

The recent growth in publicly available sequence data has introduced new opportunities for studying microbial evolution and spread. Because the pace of sequence accumulation tends to exceed the pace of experimental studies of protein function and the …

Dense genomic sampling identifies highways of pneumococcal recombination

Evasion of clinical interventions by Streptococcus pneumoniae occurs through selection of non-susceptible genomic variants. We report whole-genome sequencing of 3,085 pneumococcal carriage isolates from a 2.4 km² refugee camp. This sequencing …