comp.lang.idl-pvwave archive: archive

Home » Public Forums » archive » mean() function

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

mean() function [message #46938]

Tue, 10 January 2006 14:34

biocpu
Messages: 3
Registered: June 2004

Junior Member

The following looks very odd. Have any clues?

IDL> y = fltarr(1008879)+35
IDL> id = where(y ne 35, cc)
IDL> print, cc
0
; so y is strictly 35.0 BUT
IDL> print, mean(y)
35.5249

IDL> print, mean(y(0:400000))
35.0000

Thanks,

Report message to a moderator

Re: mean() function [message #46952 is a reply to message #46938]

Mon, 16 January 2006 02:27

Foldy Lajos
Messages: 268
Registered: October 2001

Senior Member

Hi,

one small correction: instead of n_elements(...) you should use
n_elements(...)-1.

This approach has an other problem: the elements at the beginning of the
array are divided ~n_elements() times, which can introduce large rounding
errors.

Example:

IDL> print, !version
print, !version
{ x86 linux unix linux 6.2 Jun 20 2005 32 64}

IDL> a=fltarr(10000000l)
IDL> a[0]=1.0e7
IDL> print, alt_mean(a)
1.02190

vs.

IDL> print, mean(a)
% Compiled module: MEAN.
% Compiled module: MOMENT.
1.00000

there is no golden way :-)))

regards,
lajos

On Mon, 16 Jan 2006, Maarten wrote:

> I think that this comes close. I ignores infinite numbers on request.
> Is it fast: no. But implementing the thing is C should be near trivial
> if you have dealt with that before (I haven't, at least not in IDL).
>
> function alt_mean, D, nan=nan
> compile_opt defint32, strictarr, logical_predicate, strictarrsubs
>
> M = 0.0
>
> if keyword_set(nan) then begin
> idx = where(finite(D), cnt)
> if cnt gt 0 then begin
> for ii=0,n_elements(idx) do $
> M += (D[idx[ii]] - M)/(ii+1)
> endif else begin
> M = !values.d_nan
> endelse
> endif else begin
> for ii=0,n_elements(D) do $
> M += (D[ii] - M)/(ii+1)
> endelse
>
> return, M
> end
>
>

Report message to a moderator

Re: mean() function [message #46953 is a reply to message #46938]

Mon, 16 January 2006 00:01

Maarten[1]
Messages: 176
Registered: November 2005

Senior Member

I think that this comes close. I ignores infinite numbers on request.
Is it fast: no. But implementing the thing is C should be near trivial
if you have dealt with that before (I haven't, at least not in IDL).

function alt_mean, D, nan=nan
compile_opt defint32, strictarr, logical_predicate, strictarrsubs

M = 0.0

if keyword_set(nan) then begin
idx = where(finite(D), cnt)
if cnt gt 0 then begin
for ii=0,n_elements(idx) do $
M += (D[idx[ii]] - M)/(ii+1)
endif else begin
M = !values.d_nan
endelse
endif else begin
for ii=0,n_elements(D) do $
M += (D[ii] - M)/(ii+1)
endelse

return, M
end

Report message to a moderator

Re: mean() function [message #46956 is a reply to message #46938]

Sun, 15 January 2006 05:53

Foldy Lajos
Messages: 268
Registered: October 2001

Senior Member

HI,

sometimes there is a trade-off between speed and correctness.
Here is another example:

IDL> print, !version
{ x86 linux unix linux 6.2 Jun 20 2005 32 64}

IDL> print, smooth([1, 1.0e20, 1, 1, 1, 1], 3)
1.00000 3.33333e+19 3.33333e+19 0.00000 0.00000 1.00000

Instead of calculating the average for each 3-element window, IDL uses
the first average only, then slides the window by adding/subtracting
elements from the window edge. For width w it uses only 2 add/sub per
window instead of w additions, which is a huge speed-up for large widths,
but can give incorrect result for rare cases.

regards,
lajos

On Sun, 15 Jan 2006, Reimar Bauer wrote:

> mean is not useable if it results in this
>
> IDL> print,mean( make_array(500000,val=35,/float) )
> 35.0413
> IDL> print,mean( make_array(400000,val=35,/float) )
> 35.0000
>
> I prefer a slower routine if this is right.
>
> no one would accept 1.0 + 1.0 result = 1.5
>

Report message to a moderator

Re: mean() function [message #46976 is a reply to message #46938]

Fri, 13 January 2006 00:57

Maarten[1]
Messages: 176
Registered: November 2005

Senior Member

savoie@nsidc.org wrote:

> Would you mind explaining this a bit for me? What's a proper running
> average? And why is it better in general?

It assumes that values are resonably close the the average,
mathematically it is equivalent to taking the total, and then dividing,
but it avoids some precision problems.

In IDL code it requires an explicit loop over the data, which is slow.
An alternative is to do everything in double precision, but that is
just postponing the inevitable.

Code to calculate the mean in an array X:

mean = 0.
for ii = 0, n_elements(X)-1 do $
mean += (X[ii] - mean) / (ii + 1)

Maarten

Report message to a moderator

Re: mean() function [message #46991 is a reply to message #46938]

Thu, 12 January 2006 09:07

Kenneth P. Bowman
Messages: 585
Registered: May 2000

Senior Member

In article <ywkuzmm1l7bv.fsf@snowblower.colorado.edu>, savoie@nsidc.org
wrote:

> "Maarten" <maarten.sneep@knmi.nl> writes:
>
>> And the reason you need that page, is in part because IDL uses the
>> moment routine described in Numerical Recipes (take total first, divide
>> later), instead of a proper running average, like the GNU scientific
>> library does.
>>
>> However, since looping is slow in IDL, you don't want to implement that
>> in IDL, so the next best thing is to have that page.

The situation to avoid is adding values with different magnitudes. That
results in a loss of precision.

For example, summing a large number of values with similar magnitudes
will eventually result in adding small values to large values.

There are several tricks one can use to avoid this, particularly in the
case where all the values have similar magnitudes. The simplest is to
assume that the mean is close to the first value. Subtract the first
value from each value before accumulating, compute the mean, then add
the first value back into the total. Alternatively, compute a first
approximation to the mean the naive way, then recompute the mean by
subtracting the approximate mean from each value before summing. This
requires two passes through the data, so the tradeoff is computational
time and precision.

Ken Bowman

Report message to a moderator

Previous Topic:	Re: mean() function
Next Topic:	ASP.NET vs JSF

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Fri Nov 28 15:37:25 PST 2025

Total time taken to generate the page: 0.01052 seconds