SSC Logo


Ma, Castillo-Davis, Zhong, & Liu. 2006. Nucl. Acids Res. 34:1261-1269
Supplementary Information


What is Smoothing Spline Clustering?

Smoothing Spline Clustering is a statistical method for clustering time-series gene expression data.

In particular, Smoothing Spline Clustering is useful for clustering genes in microarray experiments performed over several time-points, for example, over the course of development, a drug treatment, or other temporally based experiments.

What can Smoothing Spline Clustering tell me?

Smoothing Spline Clustering provides clusters of similary expressed genes using a statistically rigorous, biologically based, data-driven method. Importantly, SSC provides the number of gene clusters in a given dataset without an a priori specification the genes that belong to each cluster, a mean curve for each cluster describing the average expression profile of each cluster, and associated 95% confidence bands.

Example of an SSC cluster from D. melanogaster developmental data [1] showing mean expression curve and 95% confidence bands Example Cluster

Why Use Smoothing Spline Clustering?

The big advantage of Smoothing Spline Clustering over other clustering algorithms is that you do not have to specify a priori the number of clusters in your dataset or specify the expected functional forms (curves) of genes in the data. SSC achieves this by modelling the natural properties of gene expression over time, taking into account gene-specific differences in gene expression within a cluster of similarly expressed genes, the effects of experimental measurement error, and missing data. Furthermore, SSC provides a visual summary of each cluster's gene expression function and goodness-of-fit as shown above.

Additionally:

Availability

SSClust is freely available as a stand-alone package utilizing R.

Download SSClust

Documentation

Details on the smoothing spline clustering statistical model and algorithm are provided in Ma, Castillo-Davis, Zhong, Liu. 2006.

Information on running SSClust is available in the documentation and SSClust manual.


[1] Arbeitman M., Furlong, E., Imam, F., Johnson, E., Null, B. H., Baker, B. S., Krasnow, M. A., Scott, M. P., Davis, R. W., & White, K. P. (2002) Science 297, 2270-2275.



Smoothing Spline Clustering - for time course gene expression data.