comp.lang.idl-pvwave archive
Messages from Usenet group comp.lang.idl-pvwave, compiled by Paulo Penteado

Home » Public Forums » archive » Re: compute quartiles of a distribution
Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend 
Switch to threaded view of this topic Create a new topic Submit Reply
Re: compute quartiles of a distribution [message #77909] Tue, 18 October 2011 21:32
Jeremy Bailin is currently offline  Jeremy Bailin
Messages: 618
Registered: April 2008
Senior Member
On 10/18/11 3:48 PM, Jeremy Bailin wrote:
> On 10/18/11 12:12 PM, bing999 wrote:
>> Thanks to both of you for your answers.
>>
>> The procedures in summary.pro and cgBoxPlot.pro compute "real"
>> quartiles. Actually, I should not have used this word in my case i
>> guess.
>>
>> What I want is the interval [M-Q;M+Q] which encompass 75% of the
>> values of the sample around the mean (not the median) value M, where Q
>> is unique (i.e the same at lower and higher values around M). I do not
>> want the 37.5% above M and the 37.5% below. It makes a little
>> difference with what is calculated with your routines.
>> The idea would be to span the sample starting from the mean, and
>> counting the points at lower and higher values around the mean in an
>> iterative manner, until I have counted 75% of sample. This would give
>> the value of Q at which the 75% is reached. I have a crude idea to do
>> that with for loops but it will take forever...
>>
>> If you see what I mean, and if you have a piece of code, this could
>> help a lot!
>>
>> Thanks again.
>>
>>
>>> bing999 writes:
>>>> I have sample of data (which distribution is unknown) of mean M. I
>>>> would like to calculate the quartiles with IDL, i.e what is the value
>>>> of Q for which 25% (or 75%) of the sample is comprised between [M-Q;M
>>>> +Q] ?
>>>> Do you know a routine which does that?
>>>
>>> cgBoxPlot.
>>>
>>> Cheers,
>>>
>>> David
>>>
>>> --
>>> David Fanning, Ph.D.
>>> Fanning Software Consulting, Inc.
>>> Coyote's Guide to IDL Programming:http://www.idlcoyote.com/
>>> Sepore ma de ni thui. ("Perhaps thou speakest truth.")
>>
>
> Easy enough (untested):
>
> data = [......]
> frac_to_enclose = 0.75
> meanval = mean(data)
> absdiff = abs(data-meanval)
> quartile_index = floor(n_elements(absdiff) * frac_to_enclose)
> q = absdiff[quartile_index]
>
>
> But I share David's concern that this may not really be what you want...
>
> -Jeremy.

Okay, now that I've tested it, there's clearly a SORT missing.
Substitute the last line with:

q = absdiff[(sort(absdiff))[quartile_index]]

-Jeremy.
Re: compute quartiles of a distribution [message #77911 is a reply to message #77909] Tue, 18 October 2011 12:48 Go to previous message
Jeremy Bailin is currently offline  Jeremy Bailin
Messages: 618
Registered: April 2008
Senior Member
On 10/18/11 12:12 PM, bing999 wrote:
> Thanks to both of you for your answers.
>
> The procedures in summary.pro and cgBoxPlot.pro compute "real"
> quartiles. Actually, I should not have used this word in my case i
> guess.
>
> What I want is the interval [M-Q;M+Q] which encompass 75% of the
> values of the sample around the mean (not the median) value M, where Q
> is unique (i.e the same at lower and higher values around M). I do not
> want the 37.5% above M and the 37.5% below. It makes a little
> difference with what is calculated with your routines.
> The idea would be to span the sample starting from the mean, and
> counting the points at lower and higher values around the mean in an
> iterative manner, until I have counted 75% of sample. This would give
> the value of Q at which the 75% is reached. I have a crude idea to do
> that with for loops but it will take forever...
>
> If you see what I mean, and if you have a piece of code, this could
> help a lot!
>
> Thanks again.
>
>
>> bing999 writes:
>>> I have sample of data (which distribution is unknown) of mean M. I
>>> would like to calculate the quartiles with IDL, i.e what is the value
>>> of Q for which 25% (or 75%) of the sample is comprised between [M-Q;M
>>> +Q] ?
>>> Do you know a routine which does that?
>>
>> cgBoxPlot.
>>
>> Cheers,
>>
>> David
>>
>> --
>> David Fanning, Ph.D.
>> Fanning Software Consulting, Inc.
>> Coyote's Guide to IDL Programming:http://www.idlcoyote.com/
>> Sepore ma de ni thui. ("Perhaps thou speakest truth.")
>

Easy enough (untested):

data = [......]
frac_to_enclose = 0.75
meanval = mean(data)
absdiff = abs(data-meanval)
quartile_index = floor(n_elements(absdiff) * frac_to_enclose)
q = absdiff[quartile_index]


But I share David's concern that this may not really be what you want...

-Jeremy.
Re: compute quartiles of a distribution [message #77912 is a reply to message #77911] Tue, 18 October 2011 10:38 Go to previous message
Brian Wolven is currently offline  Brian Wolven
Messages: 94
Registered: May 2011
Member
Sounds like the kind of thing that the histogram routine would let you do quickly and easily. I am not, however, volunteering to do the coding. ;)
Re: compute quartiles of a distribution [message #77914 is a reply to message #77912] Tue, 18 October 2011 09:36 Go to previous message
Thibault Garel is currently offline  Thibault Garel
Messages: 55
Registered: October 2009
Member
:) On this one, I am my own reviewer !
I know what I ask sounds weird but that is really what I'd like to
compute. As I want to work with the means, not medians, "statistically
justifiable real" quartiles do not really help. In my case, means and
median may be quite different so that normal 75% quartiles may be out
of the sample...
I am gonna try to find a way to code that.
Thanks again,

Cheers
bing

> bing999 writes:
>> The procedures in summary.pro and cgBoxPlot.pro compute "real"
>> quartiles. Actually, I should not have used this word in my case i
>> guess.
>
>> What I want is the interval [M-Q;M+Q] which encompass 75% of the
>> values of the sample around the mean (not the median) value M, where Q
>> is unique (i.e the same at lower and higher values around M). I do not
>> want the 37.5% above M and the 37.5% below. It makes a little
>> difference with what is calculated with your routines.
>> The idea would be to span the sample starting from the mean, and
>> counting the points at lower and higher values around the mean in an
>> iterative manner, until I have counted 75% of sample. This would give
>> the value of Q at which the 75% is reached. I have a crude idea to do
>> that with for loops but it will take forever...
>
> I'm guessing you are going to have a hard time
> explaining to your reviewers why your "fake"
> quartiles are better than the statistically
> justifiable real quartiles. :-)
>
> Cheers,
>
> David
>
> --
> David Fanning, Ph.D.
> Fanning Software Consulting, Inc.
> Coyote's Guide to IDL Programming:http://www.idlcoyote.com/
> Sepore ma de ni thui. ("Perhaps thou speakest truth.")
Re: compute quartiles of a distribution [message #77916 is a reply to message #77914] Tue, 18 October 2011 09:25 Go to previous message
David Fanning is currently offline  David Fanning
Messages: 11724
Registered: August 2001
Senior Member
bing999 writes:

> The procedures in summary.pro and cgBoxPlot.pro compute "real"
> quartiles. Actually, I should not have used this word in my case i
> guess.
>
> What I want is the interval [M-Q;M+Q] which encompass 75% of the
> values of the sample around the mean (not the median) value M, where Q
> is unique (i.e the same at lower and higher values around M). I do not
> want the 37.5% above M and the 37.5% below. It makes a little
> difference with what is calculated with your routines.
> The idea would be to span the sample starting from the mean, and
> counting the points at lower and higher values around the mean in an
> iterative manner, until I have counted 75% of sample. This would give
> the value of Q at which the 75% is reached. I have a crude idea to do
> that with for loops but it will take forever...

I'm guessing you are going to have a hard time
explaining to your reviewers why your "fake"
quartiles are better than the statistically
justifiable real quartiles. :-)

Cheers,

David



--
David Fanning, Ph.D.
Fanning Software Consulting, Inc.
Coyote's Guide to IDL Programming: http://www.idlcoyote.com/
Sepore ma de ni thui. ("Perhaps thou speakest truth.")
Re: compute quartiles of a distribution [message #77917 is a reply to message #77916] Tue, 18 October 2011 09:12 Go to previous message
Thibault Garel is currently offline  Thibault Garel
Messages: 55
Registered: October 2009
Member
Thanks to both of you for your answers.

The procedures in summary.pro and cgBoxPlot.pro compute "real"
quartiles. Actually, I should not have used this word in my case i
guess.

What I want is the interval [M-Q;M+Q] which encompass 75% of the
values of the sample around the mean (not the median) value M, where Q
is unique (i.e the same at lower and higher values around M). I do not
want the 37.5% above M and the 37.5% below. It makes a little
difference with what is calculated with your routines.
The idea would be to span the sample starting from the mean, and
counting the points at lower and higher values around the mean in an
iterative manner, until I have counted 75% of sample. This would give
the value of Q at which the 75% is reached. I have a crude idea to do
that with for loops but it will take forever...

If you see what I mean, and if you have a piece of code, this could
help a lot!

Thanks again.


> bing999 writes:
>> I have sample of data (which distribution is unknown) of mean M. I
>> would like to calculate the quartiles with IDL, i.e what is the value
>> of Q for which 25% (or 75%) of the sample is comprised between [M-Q;M
>> +Q] ?
>> Do you know a routine which does that?
>
> cgBoxPlot.
>
> Cheers,
>
> David
>
> --
> David Fanning, Ph.D.
> Fanning Software Consulting, Inc.
> Coyote's Guide to IDL Programming:http://www.idlcoyote.com/
> Sepore ma de ni thui. ("Perhaps thou speakest truth.")
Re: compute quartiles of a distribution [message #77918 is a reply to message #77917] Tue, 18 October 2011 09:13 Go to previous message
Thibault Garel is currently offline  Thibault Garel
Messages: 55
Registered: October 2009
Member
Thanks to both of you for your answers.

The procedures in summary.pro and cgBoxPlot.pro compute "real"
quartiles. Actually, I should not have used this word in my case i
guess.

What I want is the interval [M-Q;M+Q] which encompass 75% of the
values of the sample around the mean (not the median) value M, where Q
is unique (i.e the same at lower and higher values around M). I do not
want the 37.5% above M and the 37.5% below. It makes a little
difference with what is calculated with your routines.
The idea would be to span the sample starting from the mean, and
counting the points at lower and higher values around the mean in an
iterative manner, until I have counted 75% of sample. This would give
the value of Q at which the 75% is reached. I have a crude idea to do
that with for loops but it will take forever...

If you see what I mean, and if you have a piece of code, this could
help a lot!

Thanks again.



> bing999 writes:
>> I have sample of data (which distribution is unknown) of mean M. I
>> would like to calculate the quartiles with IDL, i.e what is the value
>> of Q for which 25% (or 75%) of the sample is comprised between [M-Q;M
>> +Q] ?
>> Do you know a routine which does that?
>
> cgBoxPlot.
>
> Cheers,
>
> David
>
> --
> David Fanning, Ph.D.
> Fanning Software Consulting, Inc.
> Coyote's Guide to IDL Programming:http://www.idlcoyote.com/
> Sepore ma de ni thui. ("Perhaps thou speakest truth.")
Re: compute quartiles of a distribution [message #77927 is a reply to message #77917] Mon, 17 October 2011 18:26 Go to previous message
David Fanning is currently offline  David Fanning
Messages: 11724
Registered: August 2001
Senior Member
bing999 writes:

> I have sample of data (which distribution is unknown) of mean M. I
> would like to calculate the quartiles with IDL, i.e what is the value
> of Q for which 25% (or 75%) of the sample is comprised between [M-Q;M
> +Q] ?
> Do you know a routine which does that?

cgBoxPlot.

Cheers,

David



--
David Fanning, Ph.D.
Fanning Software Consulting, Inc.
Coyote's Guide to IDL Programming: http://www.idlcoyote.com/
Sepore ma de ni thui. ("Perhaps thou speakest truth.")
Re: compute quartiles of a distribution [message #77938 is a reply to message #77927] Mon, 17 October 2011 06:05 Go to previous message
Vincent Sarago is currently offline  Vincent Sarago
Messages: 34
Registered: September 2011
Member
There are plenty of self user routine on web.

maybe you can try this one :

http://www.star.le.ac.uk/~sav2/idl/summary.pro

vincent
  Switch to threaded view of this topic Create a new topic Submit Reply
Previous Topic: Re: Pass variables into Newton or BROYDEN for solving non-linear equations
Next Topic: ITT Suggestion box

-=] Back to Top [=-
[ Syndicate this forum (XML) ] [ RSS ] [ PDF ]

Current Time: Wed Oct 08 15:12:56 PDT 2025

Total time taken to generate the page: 0.00520 seconds