Re: Memory management when concatenating arrays [message #92189 is a reply to message #92185]
Wed, 28 October 2015 07:44
Yngvar Larsen
Messages: 134 Registered: January 2010
Senior Member
I ran the following script:
nx = 360
ny = 180
nt = 1000
N = 10
data = fltarr(nx, ny, nt)

print, 'Concatenation:'
tic
all_data = []
for ii=0, N-1 do all_data = [[[all_data]], [[data]]]
toc
print, '**************'

print, 'Preallocation with zero initialization:'
tic
all_data = fltarr(nx, ny, nt*N)
; insert each block at its own time offset along the third dimension
for ii=0, N-1 do all_data[0,0,ii*nt] = data
toc
print, '**************'

print, 'Preallocation without zero initialization'
tic
all_data = fltarr(nx, ny, nt*N, /NOZERO)
for ii=0, N-1 do all_data[0,0,ii*nt] = data
toc

end
I get the following:
IDL> .r test
% Compiled module: $MAIN$.
Concatenation:
% Time elapsed: 6.3571460 seconds.
**************
Preallocation with zero initialization:
% Time elapsed: 1.5204742 seconds.
**************
Preallocation without zero initialization
% Time elapsed: 0.62908983 seconds.
My script excludes the I/O part, which should be the same for all three versions.
Bottom line: I think preallocating your array with the /NOZERO flag set is your best option for the scenario you describe.
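For your scenario that would look roughly like the sketch below. The file names and the READU-based read are placeholders (I don't know your actual file format), so swap in whatever you use to load each [360,180,1000] block:

; Sketch: preallocate once, then drop each file's block into its time slot.
nx = 360
ny = 180
nt = 1000
nfiles = 10
all_data = fltarr(nx, ny, nt*nfiles, /NOZERO)
buffer = fltarr(nx, ny, nt, /NOZERO)
for ii=0, nfiles-1 do begin
   ; 'fileN.dat' and READU are placeholders for your actual read
   openr, lun, 'file' + strtrim(ii+1, 2) + '.dat', /GET_LUN
   readu, lun, buffer
   free_lun, lun
   all_data[0,0,ii*nt] = buffer
endfor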
Of course, as has already been commented in the thread, you need to make sure that your data fit in memory. Your example is 2.4 GB in single-precision float and 4.8 GB in double precision. And even if the array itself fits, you will likely want to do operations on it, and then you will quickly run out of memory. On my 8 GB laptop, the following would be enough to run out of memory:
all_data = dblarr(360,180,10000)
all_data_scaled = all_data*!dpi
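One way to soften that is to recycle the input array's memory with TEMPORARY when you no longer need the original values, so only one full-size copy is alive at a time:

; overwrite all_data with the scaled values instead of keeping both arrays
all_data = temporary(all_data)*!dpi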
On Wednesday, 28 October 2015 14:49:38 UTC+1, rj...@le.ac.uk wrote:
> I have a large multi-dimensional array that is split across several files by time.
>
> i.e. file1 contains the first 1000 timesteps, [360,180,1000], file2 contains the next 1000 timesteps [360,180,1000], etc.
>
> What I want to end up with is one big array that has, say, all 10 files read in and is (360, 180, 10000).
>
> What I'm doing is this in a loop:
>
> all_data=[[[all_data]], [[data]]]
>
> But I quickly run out of memory trying to concatenate in this way.
>
> I tried using temporary
>
> all_data=[[[temporary(all_data)]], [[data]]]
>
> but this doesn't help.
>
> Is there an efficient way of doing this?
>
> Cheers