Re: Determining length of a text file? [message #40952 is a reply to message #40889]
Tue, 21 September 2004 08:18  |
Craig Markwardt
lloyd@evilprofessor.co.uk (Lloyd Watkin) writes:
> [...]
>> It is usually more time efficient to allocate a big text array, read,
>> ignore the error, and truncate the empty text strings, thus avoiding a
>> FOR loop. Since all null strings point to the same (null) memory (I
>> think), you only use a small amount of temporary memory. For large
>> tables this can improve speed a lot.
>
> I have a file of a few million lines to read in. Obviously it takes
> a while to read in (using a WHILE/EOF loop), and I would like to speed
> it up.
>
> How do I get the code to ignore the error generated?
You can trap errors using ON_IOERROR.
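For reference, the quoted big-string-array approach with ON_IOERROR might
look something like the sketch below. (This is my own illustration: the
file name, the over-allocated array size, and the variable names are
assumptions, not from the original post.)
openr, unit, 'myfile.txt', /get_lun
lines = strarr(5000000L)       ; over-allocate: guess more lines than the file holds
on_ioerror, done               ; jump to DONE when READF runs off the end of the file
readf, unit, lines             ; one input line per string element
done:
on_ioerror, null               ; restore normal I/O error handling
free_lun, unit
good = where(lines NE '', nlines)          ; NLINES = number of lines actually read
if nlines GT 0 then lines = lines[good]    ; truncate the untouched (null) strings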
Another way you might approach this is to read the data in as
*bytes*. For example, allocate a 256 kB array like this and read
into it unformatted:
bb = bytarr(256L*1024L)   ; 256 kB read buffer
; ... inside the read loop ...
bb(*) = 0                 ; clear the buffer before each read
readu, unit, bb           ; unformatted (binary) read from the open UNIT
You will still need to trap the I/O error, which will probably occur on
the last read, since your file is unlikely to be an exact multiple of
256 kB. Re-initializing BB to zero before each read is important because,
on a partial read, the previous contents would otherwise remain in the
rest of the buffer.
Then just search for the line-ending character of choice, usually
either ASCII 13 (carriage return) or 10 (line feed):
wh = where(bb EQ 13, ct)   ; count carriage returns ...
; or
wh = where(bb EQ 10, ct)   ; ... or line feeds
total = total + ct         ; running line count across buffers
Roll that into a loop, and you probably can't get faster than that
from within IDL. Since you are reading a large amount of data at
once, 256k, the overhead for looping is very low.
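Rolled together, the loop might look roughly like this sketch (the file
name, the GOTO around the final count, and the NLINES variable are my
additions for illustration, not part of the original post):
openr, unit, 'myfile.txt', /get_lun
bb = bytarr(256L*1024L)          ; 256 kB read buffer
nlines = 0L
on_ioerror, done                 ; the short final read raises an end-of-file error
while not eof(unit) do begin
    bb(*) = 0                    ; clear leftovers from the previous read
    readu, unit, bb              ; unformatted read of up to 256 kB
    wh = where(bb EQ 10B, ct)    ; count line feeds (use 13B for CR-terminated files)
    nlines = nlines + ct
endwhile
goto, finished                   ; clean exit: every full buffer has been counted
done:
wh = where(bb EQ 10B, ct)        ; the short last read jumped here before counting,
nlines = nlines + ct             ; so count whatever landed in the pre-zeroed buffer
finished:
on_ioerror, null
free_lun, unit
print, nlines, ' lines'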
Craig
--
----------------------------------------------------------------------------
Craig B. Markwardt, Ph.D. EMAIL: craigmnet@REMOVEcow.physics.wisc.edu
Astrophysics, IDL, Finance, Derivatives | Remove "net" for better response
----------------------------------------------------------------------------
Re: Determining length of a text file? [message #40961 is a reply to message #40889]
Tue, 21 September 2004 01:58  |
lloyd
[...]
> It is usually more time efficient to allocate a big text array, read,
> ignore the error, and truncate the empty text strings, thus avoiding a
> FOR loop. Since all null strings point to the same (null) memory (I
> think), you only use a small amount of temporary memory. For large
> tables this can improve speed a lot.
I have a file of a few million lines to read in. Obviously it takes
a while to read in (using a WHILE/EOF loop), and I would like to speed
it up. How do I get the code to ignore the error generated?
Thanks
Lloyd