new cluster version available

Andreas Stolcke stolcke at ICSI.Berkeley.EDU
Wed Jan 20 14:22:01 EST 1993


I'm releasing a new version of the time-honored cluster program (that also
does PCA).  I recently made a small change to the algorithm
that speeds clustering up by a factor of n (the number of data points).
The algorithm now runs in time O(n^2) (formerly O(n^3)) and uses memory
O(n) (formerly O(n^2)).   On a sparcstation2, this means you can cluster
a 1000-by-10 data set in 39 secs as opposed to 230 secs.  Systems short on
memory should see even more dramatic improvements due to reduced paging.

As before, the source code is availabe by ftp:

	% mkdir cluster; cd cluster
	% ftp ftp.icsi.berkeley.edu
	ftp> cd pub/ai
	ftp> binary
	ftp> get cluster-2.5.tar.Z
	ftp> quit
	% zcat cluster-2.5.tar.Z | tar xf -
	% make  # after looking over the Makefile


--Andreas


More information about the Connectionists mailing list