comp.lang.idl-pvwave archive: archive » Need Some Advice on Seperating Out Some Data

Re: Need Some Advice on Seperating Out Some Data [message #49699 is a reply to message #49667]

Fri, 11 August 2006 11:44

JD Smith

On Fri, 11 Aug 2006 10:22:35 -0700, kuyper wrote:

> rdellsy@gmail.com wrote:
>> I'm working on building a cluster tree and taking, say, the lower-right
>> cluster and its one or two nearest neighbors (sp?). I may still be
>> losing some data though. Another possibility would be compressing the
>> data, say, by half, and seeing if that helps.
>> Thanks,
>> Rob
>
> IDL> help,data
> DATA FLOAT = Array[2, 681]
>
> If all of the dimensions of your data have the same physical meaning,
> then you don't need to do anything to your data. However, I got the
> following results:
>
> IDL> print,stddev(data[0,*]),stddev(data[1,*])
> 2748.5689 1.7135388
>
> Which implies to me that your x and y coordinates probably have
> drastically different meanings, so they need to be scaled to have a
> meaningful distance measurement. The simplest way is to base the scale
> factors on the standard deviations:
>
> IDL> scaled = data
> IDL> scaled[0,*] /= stddev(data[0,*])
> IDL> scaled[1,*] /= stddev(data[1,*])
>
> I recommend, since you're analyzing many different but comparable
> datasets, to use a single scaling factor on each axis for all the
> datasets; otherwise it will be difficult to compare your results
> between one dataset and another.
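
To make the shared-scale suggestion concrete, here is a minimal sketch,
assuming two comparable datasets held in hypothetical arrays data1 and
data2 (each 2 x n; neither name appears in the thread). The scale factors
are taken from the pooled points, so both datasets end up in the same
units:

```idl
; data1, data2: hypothetical [2, n1] and [2, n2] float arrays
pooled = [[data1], [data2]]          ; concatenate along the point dimension
sx = STDDEV(pooled[0,*])             ; one x scale shared by every dataset
sy = STDDEV(pooled[1,*])             ; one y scale shared by every dataset

scaled1 = data1
scaled1[0,*] /= sx
scaled1[1,*] /= sy

scaled2 = data2
scaled2[0,*] /= sx
scaled2[1,*] /= sy
```

Pooling is only one possible choice; any fixed pair of factors applied to
all datasets would keep the distances comparable.
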
>
> IDL> pairdistance = DISTANCE_MEASURE(scaled)
> IDL> clusters = CLUSTER_TREE(pairdistance,linkdistance,LINKAGE=0,data=scaled)
>
> I'm surprised by the fact that I haven't been able to locate an IDL
> function or procedure for taking the output from CLUSTER_TREE and using
> it to determine cluster membership at the point when there are N
> clusters left, so I wrote my own:
>
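> FUNCTION cluster_member, clusters
>   ; clusters: the 2 x (num-1) array returned by CLUSTER_TREE
>   dims = SIZE(clusters, /DIMENSIONS)
>   num = dims[1] + 1
>   ; membership[*,i] holds each point's cluster label after merge i
>   membership = INTARR(num, num-1)
>   work = INDGEN(num)
>   FOR i=0, num-2 DO BEGIN
>     ; the two items joined at step i become the new cluster num+i
>     newclust = WHERE(work EQ clusters[0,i] OR work EQ clusters[1,i])
>     work[newclust] = num+i
>     membership[0,i] = work
>   ENDFOR
>
>   RETURN, membership
> END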
>
> There's probably a more efficient way of handling that loop.

Very cool! I'll have to remember this one. If you only care about n
remaining clusters, you can simplify somewhat to:

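function cluster_member, clusters, n
  dims = SIZE(clusters, /DIMENSIONS)
  num = dims[1] + 1L
  n >= 1                     ; need at least one remaining cluster
  work = lindgen(num)        ; each point starts as its own cluster
  ; apply only the first num-n merges, leaving n clusters
  for i=0L, num-1L-n do $
    work[where(work eq clusters[0,i] OR work eq clusters[1,i])] = num+i
  return, work
end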
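
As a quick sanity check, a toy merge table can be written by hand instead
of coming from CLUSTER_TREE. The sketch below assumes four points where 0
and 1 merge first (becoming item 4), then 2 and 3 (item 5), then 4 and 5
(item 6). The table and the names toy and labels are made up for
illustration:

```idl
; hypothetical merge table in CLUSTER_TREE's 2 x (num-1) layout
toy = [[0,1], [2,3], [4,5]]

work = cluster_member(toy, 2)        ; stop with 2 clusters left
print, work                          ; -> 4 4 5 5

; list the members of each remaining cluster
labels = work[UNIQ(work, SORT(work))]
for k=0, N_ELEMENTS(labels)-1 do $
  print, labels[k], ' : ', WHERE(work EQ labels[k])
```

The returned labels are the merge ids (num+i), not 0..n-1, so the
UNIQ/SORT step is one way to recover the distinct clusters.
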

JD


