comp.lang.idl-pvwave archive: archive » Re: Box-Whisker plots in IDL

Home » Public Forums » archive » Re: Box-Whisker plots in IDL

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

Re: Box-Whisker plots in IDL [message #55555 is a reply to message #55391]

Sun, 26 August 2007 18:50

JD Smith
Messages: 850
Registered: December 1999

Senior Member

On Mon, 20 Aug 2007 23:04:14 +0000, jschwab@gmail.com wrote:

> Pardon me if I'm mistaken, but I think these "quartiles with
> histogram" examples, including the one that's in JD's histogram
> tutorial are fundamentally incorrect.
>
> You are assuming "Equal bin widths" ==> "Equal #'s in each bin" !

I probably shouldn't have called them "quartiles", as they are really
quarter range bins (data quartiles). The data quartile is of course
as useful as the ordered quartile, but not for this problem.

One straightforward option for the ordered quartile is to use SORT,
picking out only the elements at nel/4 and 3*nel/4, e.g.:

n=n_elements(data)
s=sort(data)
qval=data[s[3*n/4]]

Unfortunately, this is fairly slow for large data sets. Another
faster but approximate option is to form a cumulative total of
HISTOGRAM's output with an appropriate bin size, and find where it
reaches 25% and 75% of the total count of data points.

Depending on your needs and bin width, you may want to dive into the
individual bin using REVERSE_INDICES to find the *exact* quartile
value itself. This isn't as hard as it sounds:

bs=0.05 ;something appropriate for bin size
h=histogram(data,REVERSE_INDICES=r,BINSIZE=bs,OMIN=om)
cum=total(h,/CUMULATIVE,/PRESERVE_TYPE)
quart=3*n/4
v=value_locate(cum,quart)
vals=data[r[r[v+1]:r[v+2]-1]]
qval=vals[(sort(vals))[quart-cum[v]]]

You'll find this is roughly 10x faster than using SORT by itself. And
if you only need the approximate value (good to the histogram bin
width), simply replace the last two lines with:

qval=om+bs*(v+1.5)

for a modest additional speed-up. All the usual caveats with
HISTOGRAM apply (e.g. beware when dealt overly sparse data).

This problem reminds me of the one quote I always remember from
Numerical Recipes: "Selection is Sorting's austere sister."

JD

Report message to a moderator

[Message index]

		Re: Box-Whisker plots in IDL By: David Fanning on Mon, 20 August 2007 21:01
		Re: Box-Whisker plots in IDL By: Marshall Perrin on Mon, 20 August 2007 17:37
		Re: Box-Whisker plots in IDL By: jschwab@gmail.com on Mon, 20 August 2007 17:06
		Re: Box-Whisker plots in IDL By: David Fanning on Mon, 20 August 2007 16:47
		Re: Box-Whisker plots in IDL By: jschwab@gmail.com on Mon, 20 August 2007 16:04
		Re: Box-Whisker plots in IDL By: teich on Mon, 20 August 2007 14:03
		Re: Box-Whisker plots in IDL By: David Fanning on Mon, 20 August 2007 13:56
		Re: Box-Whisker plots in IDL By: teich on Mon, 20 August 2007 13:22
		Re: Box-Whisker plots in IDL By: David Fanning on Mon, 20 August 2007 12:40
		Re: Box-Whisker plots in IDL By: teich on Mon, 20 August 2007 12:18
		Re: Box-Whisker plots in IDL By: Brian Larsen on Mon, 20 August 2007 12:01
		Re: Box-Whisker plots in IDL By: teich on Mon, 20 August 2007 10:27
		Re: Box-Whisker plots in IDL By: David Fanning on Mon, 20 August 2007 10:10
		Re: Box-Whisker plots in IDL By: mankoff on Mon, 20 August 2007 10:04
		Re: Box-Whisker plots in IDL By: JD Smith on Sun, 26 August 2007 18:50
		Re: Box-Whisker plots in IDL By: jschwab@gmail.com on Mon, 20 August 2007 21:46

Previous Topic:	Re: MODIS spectral radiance
Next Topic:	How to read ASTER in ENVI?

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Sun Nov 30 16:30:24 PST 2025

Total time taken to generate the page: 0.00943 seconds