comp.lang.idl-pvwave archive
Messages from Usenet group comp.lang.idl-pvwave, compiled by Paulo Penteado

Home » Public Forums » archive » find max in 3D array -- slow
Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend 
Switch to threaded view of this topic Create a new topic Submit Reply
find max in 3D array -- slow [message #70378] Sat, 10 April 2010 09:03 Go to next message
Timothy W. Hilton is currently offline  Timothy W. Hilton
Messages: 1
Registered: April 2010
Junior Member
Hello IDL users,

I have a 1200x1200x2900 array of floats. The dimensions correspond to
latitude x longitude x time. I need to find the maxium at each
location -- that is, I need the 1200x1200 array containing the max
along the 3rd dimsion. IDL takes almost 3 minutes to do this on my
system. This seemed slow. I compared it with Matlab, which took ten
seconds. Is there a better way to search for the maxima using IDL?

The demo code I used to compare IDL and Matlab is below (with output).

I'm wondering if I ought to switch to Matlab. I just spent a couple
of days writing IDL code to read my data, so I'd rather not.

Many thanks,
Tim

--

Timothy W. Hilton
PhD Candidate, Department of Meteorology
The Pennsylvania State University
503 Walker Building, University Park, PA 16802
hilton@meteo.psu.edu

========
scratch.pro:

foo = randomu(0, 1200, 1200, 2920)
PRINT, systime()
foo_max = max(foo, DIMENSION = 3)
PRINT, systime()
END

IDL> .run scratch
% Compiled module: $MAIN$.
Sat Apr 10 10:44:44 2010
Sat Apr 10 10:47:36 2010
IDL>

========
scratch.m:

foo = rand(1200,1200,2920);
fprintf('%s\n', datestr(now()));
foo_max = max(foo, [], 3);
fprintf('%s\n', datestr(now()));

>> scratch
10-Apr-2010 10:42:45
10-Apr-2010 10:42:55
Re: find max in 3D array -- slow [message #70457 is a reply to message #70378] Mon, 12 April 2010 10:48 Go to previous message
JDS is currently offline  JDS
Messages: 94
Registered: March 2009
Member
On Apr 10, 12:03 pm, "Timothy W. Hilton" <hil...@meteo.psu.edu> wrote:
> Hello IDL users,
>
> I have a 1200x1200x2900 array of floats.  The dimensions correspond to
> latitude x longitude x time.  I need to find the maxium at each
> location -- that is, I need the 1200x1200 array containing the max
> along the 3rd dimsion.  IDL takes almost 3 minutes to do this on my
> system.  This seemed slow.  I compared it with Matlab, which took ten
> seconds.  Is there a better way to search for the maxima using IDL?

It would be of interest to run your IDL vs. MATLAB test again with an
array size which can easily fit in memory (e.g. 500MB worth). I'd
guess that most of that 3 minutes is spent hitting the disk for page
faults. The data ordering of IDL vs. MATLAB could easily come into
play, though both are column-major. I get about 1s for 500MB, which
should scale to ~30s for your data set (and I'll guess your machine is
faster than mine).
Re: find max in 3D array -- slow [message #70462 is a reply to message #70378] Mon, 12 April 2010 09:20 Go to previous message
Juggernaut is currently offline  Juggernaut
Messages: 83
Registered: June 2008
Member
On Apr 10, 12:03 pm, "Timothy W. Hilton" <hil...@meteo.psu.edu> wrote:
> Hello IDL users,
>
> I have a 1200x1200x2900 array of floats.  The dimensions correspond to
> latitude x longitude x time.  I need to find the maxium at each
> location -- that is, I need the 1200x1200 array containing the max
> along the 3rd dimsion.  IDL takes almost 3 minutes to do this on my
> system.  This seemed slow.  I compared it with Matlab, which took ten
> seconds.  Is there a better way to search for the maxima using IDL?
>
> The demo code I used to compare IDL and Matlab is below (with output).
>
> I'm wondering if I ought to switch to Matlab.  I just spent a couple
> of days writing IDL code to read my data, so I'd rather not.
>
> Many thanks,
> Tim
>
> --
>
> Timothy W. Hilton
> PhD Candidate, Department of Meteorology
> The Pennsylvania State University
> 503 Walker Building, University Park, PA   16802
> hil...@meteo.psu.edu
>
> ========
> scratch.pro:
>
> foo = randomu(0, 1200, 1200, 2920)
> PRINT, systime()
> foo_max = max(foo, DIMENSION = 3)
> PRINT, systime()
> END
>
> IDL> .run scratch
> % Compiled module: $MAIN$.
> Sat Apr 10 10:44:44 2010
> Sat Apr 10 10:47:36 2010
> IDL>
>
> ========
> scratch.m:
>
> foo = rand(1200,1200,2920);
> fprintf('%s\n', datestr(now()));
> foo_max = max(foo, [], 3);
> fprintf('%s\n', datestr(now()));
>
>>> scratch
>
> 10-Apr-2010 10:42:45
> 10-Apr-2010 10:42:55

In my experience for large arrays it's best to simply do the following
foo = randomu(0, 1200, 1200, 2920)
maxArray = fltarr(1200,1200)
for i = 0, 2919 do maxArray = maxArray > foo[*,*,i]

Using IDLs routines is sloooow for this.
Re: find max in 3D array -- slow [message #70465 is a reply to message #70378] Mon, 12 April 2010 04:15 Go to previous message
Steve[5] is currently offline  Steve[5]
Messages: 10
Registered: September 2007
Junior Member
In a similar problem we found it much faster to read the data a slice at
a time and do a running minima. The IDL minima routine seems to take
values one-by-one across the dimension you specify and so page faults a
lot in this situation.
Re: find max in 3D array -- slow [message #70466 is a reply to message #70378] Mon, 12 April 2010 03:36 Go to previous message
Maxwell Peck is currently offline  Maxwell Peck
Messages: 61
Registered: February 2010
Member
On Apr 12, 6:07 pm, Maxwell Peck <maxjp...@gmail.com> wrote:
> On Apr 12, 7:41 am, Maxwell Peck <maxjp...@gmail.com> wrote:
>
>
>
>> On Apr 11, 4:23 am, FÖLDY Lajos <fo...@rmki.kfki.hu> wrote:
>
>>> On Sat, 10 Apr 2010, Timothy W. Hilton wrote:
>>>> Hello IDL users,
>
>>>> I have a 1200x1200x2900 array of floats.  The dimensions correspond to
>>>> latitude x longitude x time.  I need to find the maxium at each
>>>> location -- that is, I need the 1200x1200 array containing the max
>>>> along the 3rd dimsion.  IDL takes almost 3 minutes to do this on my
>>>> system.  This seemed slow.  I compared it with Matlab, which took ten
>>>> seconds.  Is there a better way to search for the maxima using IDL?
>
>>>> The demo code I used to compare IDL and Matlab is below (with output).
>
>>>> I'm wondering if I ought to switch to Matlab.  I just spent a couple
>>>> of days writing IDL code to read my data, so I'd rather not.
>
>>>> Many thanks,
>>>> Tim
>
>>>> --
>
>>>> Timothy W. Hilton
>>>> PhD Candidate, Department of Meteorology
>>>> The Pennsylvania State University
>>>> 503 Walker Building, University Park, PA   16802
>>>> hil...@meteo.psu.edu
>
>>>> ========
>>>> scratch.pro:
>
>>>> foo = randomu(0, 1200, 1200, 2920)
>>>> PRINT, systime()
>>>> foo_max = max(foo, DIMENSION = 3)
>>>> PRINT, systime()
>>>> END
>
>>>> IDL> .run scratch
>>>> % Compiled module: $MAIN$.
>>>> Sat Apr 10 10:44:44 2010
>>>> Sat Apr 10 10:47:36 2010
>>>> IDL>
>
>>>> ========
>>>> scratch.m:
>
>>>> foo = rand(1200,1200,2920);
>>>> fprintf('%s\n', datestr(now()));
>>>> foo_max = max(foo, [], 3);
>>>> fprintf('%s\n', datestr(now()));
>
>>>> >> scratch
>>>> 10-Apr-2010 10:42:45
>>>> 10-Apr-2010 10:42:55
>
>>> I think that randomu(0, 1200,1200,2920) should be rand(2920, 1200, 1200)
>>> in Matlab (an array of 2920 rows x 1200 columns x 1200 something). The
>>> memory layout makes a big difference.
>
>>> regards,
>>> lajos
>
>> That's probably a good point, maybe storing the dataset in the
>> equivalent of a Byte Interleaved by Pixel storage order would speed
>> things up considerably.
>
> It certainly does seem to be memory layout related. Here are some
> numbers.
>
> foo = randomu(seed, 100, 100, 2900)
>
> foo_max = max(foo, DIMENSION = 3)
> This takes 0.36 seconds
>
> Reforming and transposing the array as follows:
> h = transpose(reform(foo,100*100,2900))
>
> Then finding the max along the row dimension
> k=max(h,dimension=1)
> gives 0.11 seconds. This is NOT including the initial transpose/reform
> (or one after). This adds considerable time. There might be a smarter
> way to do this bit...
>
> Not using the transpose and finding the max along the columns gives
> similar times to using the dimension=3 as done initially.
>
> Clearly having the values stored contiguous in memory as it is across
> the rows gives much faster results. I'm not sure if there are paging
> issues happening as well though, you're using a pretty big array!
>
> I'm not sure what the best way in actual application is to do this in
> IDL, perhaps there is opportunity when the file is being read in to
> store it in this way as it's probably I/O limited anyway at this
> point. Someone smarter on here might have a better solution..
>
> Max
>
> pro testsort
> l=100L
> c=2900L
>
> foo = randomu(seed, l, l, c)
> t=systime(1)
> foo_max = max(foo, DIMENSION = 3)
> PRINT,systime(1) -t
>
> h = transpose(reform(foo,l*l,c))
> t=systime(1)
> k=max(h,dimension=1)
> PRINT,systime(1) -t
> j=reform(k,l,l)
>
> print,'make sure nothing stupid has happened', total(j-foo_max)
>
> END

actually use transpose(foo,[2,0,1]) to convert the file to BIL and
then find the max along dimension=1 instead of the reform stuff
  Switch to threaded view of this topic Create a new topic Submit Reply
Previous Topic: Re: IDL license costs in europe?
Next Topic: clever way to subregion an image?

-=] Back to Top [=-
[ Syndicate this forum (XML) ] [ RSS ] [ PDF ]

Current Time: Wed Oct 08 11:35:35 PDT 2025

Total time taken to generate the page: 0.00492 seconds