comp.lang.idl-pvwave archive
Messages from Usenet group comp.lang.idl-pvwave, compiled by Paulo Penteado

Home » Public Forums » archive » Cluster analysis
Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend 
Return to the default flat view Create a new topic Submit Reply
Re: Cluster analysis [message #41889 is a reply to message #11153] Sun, 05 December 2004 06:43 Go to previous messageGo to previous message
George N. White III is currently offline  George N. White III
Messages: 56
Registered: September 2000
Member
On Fri, 3 Dec 2004, Chris Jacobsen wrote:

> In our work we started out using the "stock" IDL
> cluster routine but we have added to it a bit.
> Still, we have not changed the basic algorithm.
> We've found that preparation of the data can
> make a big difference. If the variation in
> variable X is 100 times bigger than the variation
> in variable Y, then the clustering (which looks
> at simple Euclidian distance) will not see the
> variation in Y very well. One approach is
> to subtract the mean of each variable, and
> apply a scale factor to the data in variable
> Y so that it is spread out over the same distance
> as in variable X.

Sound advice. You should also consider whether a
non-linear transform (e.g. alog()) should be applied
to some variables. Many people overlook the rank
transform, which gives you a distance measure that
is essentially the number of observations with values
between the two points. This is a way to make sense
of comparisons between different physical quantities.

--
George N. White III <aa056@chebucto.ns.ca>
[Message index]
 
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Previous Topic: READCOL compilation error
Next Topic: Re: Canny Edge Detection

-=] Back to Top [=-
[ Syndicate this forum (XML) ] [ RSS ] [ PDF ]

Current Time: Wed Oct 08 19:26:04 PDT 2025

Total time taken to generate the page: 0.00437 seconds