Interesting. I found that if I take the 4th power of the y variable, I
get a few decent clusters. I was planning on doing a linear fit of
those good clusters, and then using that to find the trailing data. Add
in a small histogram to find the lower x bound of what I want and it
should work. Your idea also sounds promising. I'm getting close.
Hopefully, I'll have something that'll work by Tuesday, and I'll let
you all tear it to pieces... err, find more efficient ways of doing it.
=)
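Roughly what I have in mind, as an untested IDL sketch (x and y are just
the two columns of my data array, and the cluster cutoff and histogram
threshold are placeholders I'd still have to tune by eye):

  x  = reform(data[0,*])
  y  = reform(data[1,*])
  y4 = y^4                                 ; 4th power spreads the clusters out

  ; linear fit through the points that fall in the "good" clusters
  good = where(y4 lt 0.5*max(y4), ngood)   ; placeholder cluster selection
  if ngood gt 1 then coeff = linfit(x[good], y4[good])

  ; small histogram over x to pick off the lower x bound I care about
  h = histogram(x, nbins=20, locations=xbins)
  lower = xbins[(where(h gt max(h)/10))[0]]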
Thanks,
- Rob
JD Smith wrote:
> On Fri, 11 Aug 2006 10:22:35 -0700, kuyper wrote:
>
>> rdellsy@gmail.com wrote:
>>> I'm working on doing a cluster tree and getting say the lower-right
>>> cluster and the one or two nearest neighbors. I may still be
>>> losing some data though. Another possibility would be compressing the
>>> data, say, by half, and seeing if that helps.
>>> Thanks,
>>> Rob
>>
>> IDL> help,data
>> DATA            FLOAT     = Array[2, 681]
>>
>> If all of the dimensions of your data have the same physical meaning,
>> then you don't need to do anything to your data. However, I got the
>> following results:
>>
>> IDL> print,stddev(data[0,*]),stddev(data[1,*])
>> 2748.5689 1.7135388
>>
>> Which implies to me that your x and y coordinates probably have
>> drastically different meanings, so they need to be scaled to have a
>> meaningful distance measurement. The simplest way is to base the scale
>> factors on the standard deviations:
>>
>> IDL> scaled = data
>> IDL> scaled[0,*] /= stddev(data[0,*])
>> IDL> scaled[1,*] /= stddev(data[1,*])
>>
>> Since you're analyzing many different but comparable datasets, I
>> recommend using a single scaling factor on each axis for all of them;
>> otherwise it will be difficult to compare results between one dataset
>> and another.
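>> For instance (just a sketch; ref_data here stands in for whichever
>> dataset you choose as the reference, and the same two factors would be
>> reused for every dataset):
>>
>> IDL> xscale = stddev(ref_data[0,*])
>> IDL> yscale = stddev(ref_data[1,*])
>> IDL> scaled = data
>> IDL> scaled[0,*] /= xscale
>> IDL> scaled[1,*] /= yscale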
>>
>> IDL> pairdistance = DISTANCE_MEASURE(scaled)
>> IDL> clusters = CLUSTER_TREE(pairdistance, linkdistance, LINKAGE=0, DATA=scaled)
>>
>> I'm surprised that I haven't been able to locate an IDL function or
>> procedure that takes the output from CLUSTER_TREE and determines
>> cluster membership at the point when there are N clusters left, so I
>> wrote my own:
>>
>> FUNCTION cluster_member, clusters
>>   ;; clusters is the 2 x (num-1) merge list returned by CLUSTER_TREE
>>   dims = SIZE(clusters, /DIMENSIONS)
>>   num = dims[1] + 1                  ; number of original data points
>>   membership = INTARR(num, num-1)    ; one column of labels per merge step
>>   work = INDGEN(num)                 ; current cluster id of each point
>>   FOR i=0, num-2 DO BEGIN
>>     ;; merge the two clusters joined at step i into new cluster num+i
>>     newclust = WHERE(work EQ clusters[0,i] OR work EQ clusters[1,i])
>>     work[newclust] = num + i
>>     membership[0,i] = work           ; record the labels after this merge
>>   ENDFOR
>>
>>   RETURN, membership
>> END
>>
>> There's probably a more efficient way of handling that loop.
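>>
>> For example, to pull out the labels at the point where n clusters
>> remain (just a sketch; n = 3 is arbitrary):
>>
>> IDL> membership = cluster_member(clusters)
>> IDL> num = (SIZE(clusters, /DIMENSIONS))[1] + 1
>> IDL> n = 3
>> IDL> labels = membership[*, num-1-n]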
>
> Very cool! I'll have to remember this one. If you only care about n
> remaining clusters, you can simplify somewhat to:
>
> function cluster_member, clusters, n
>   ;; label each point with the cluster it belongs to once n clusters remain
>   dims = SIZE(clusters, /DIMENSIONS)
>   num = dims[1] + 1L
>   n >= 1                          ; clamp n to at least 1 (IDL max operator)
>   work = lindgen(num)
>   for i=0L, num-1L-n do $         ; stop n merge steps early
>     work[where(work eq clusters[0,i] OR work eq clusters[1,i])] = num+i
>   return, work
> end
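>
> Usage would be something along these lines (sketch; 3 clusters is an
> arbitrary choice):
>
> IDL> work = cluster_member(clusters, 3)
> IDL> ids = work[uniq(work, sort(work))]   ; the 3 surviving cluster labels
> IDL> first = where(work eq ids[0])        ; members of the first cluster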
>
> JD