comp.lang.idl-pvwave archive: archive » Duplicate lat/long points

Home » Public Forums » archive » Duplicate lat/long points

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

Duplicate lat/long points [message #64690]

Wed, 14 January 2009 02:21

hethomas
Messages: 19
Registered: December 2008

Junior Member

From searching this forum for "duplicate points" I found that a while
back, under the thread entitled "duplicates - a new twist" my problem
was posted (almost exactly identically) by Martin Doyle.
[ http://groups.google.com/group/comp.lang.idl-pvwave/browse_t hread/
thread/470ca560db41c58a/df9dba74d5788f6c?lnk=gst&q=dupli cate
+points#df9dba74d5788f6c ]

In short, I have a list of latitude, longitude and data and need to
combine any duplicate lat longs by summing the data value.

Despite the many follow up answers to this I am still having problems
with using the UNIQ function on both latitude and longtude as they
each need to be sorted numerically for IDL to work. Is anyone able to
shed any light on this?! Or indeed, know of a quicker/easier method.
There is a function in R called "aggregate" which appears to do
exactly what I need, but I am unable to find an IDL equivalent.

Any help is greatly appreciated!

Helen

Report message to a moderator

Re: Duplicate lat/long points [message #64738 is a reply to message #64690]

Thu, 15 January 2009 08:00

Jeremy Bailin
Messages: 618
Registered: April 2008

Senior Member

On Jan 15, 3:54 am, RussellGrew <russell.g...@gmail.com> wrote:
> You make a histogram where the bins are each latitude and also each
> longitude.
>
> It takes some thinking, but it works.

Or just use HIST_ND from the Coyote library, which (unlike the built-
in HIST_2D) will return REVERSE_INDICES.

You might actually find my HIST_ND_WEIGHT useful for this, using your
data values as the weights (it's actually a wrapper to David's
HIST_ND):
http://web.astroconst.org/jbiu/jbiu-doc/math/hist_nd_weight. html

-Jer.

Report message to a moderator

Re: Duplicate lat/long points [message #64779 is a reply to message #64690]

Tue, 20 January 2009 05:57

hethomas
Messages: 19
Registered: December 2008

Junior Member

Thank you all for your responses - I am still trying to get the
histogram method working, but have come up with a temporary working
solution using UNIQ and running it over two FOR loops.
James - my problem wasn't as complicated as yours, as all my points
are the same precision, so that identical points are truly exactly the
same. Thanks for sharing your method though. May be useful later! (I
can't easily move to R as I am using IDL coupled with ENVI for
satellite data work . . .)

Helen

Report message to a moderator

Re: Duplicate lat/long points [message #64802 is a reply to message #64738]

Fri, 16 January 2009 11:55

mccreigh
Messages: 13
Registered: January 2008

Junior Member

Helen-
I've written a routine to do something very similar to what you are
describing.
My problem, however may be significantly different (or not, depending
on if you are really giving the full details).
Basically, I identify unique station lons and lats. The *extremely*
complicating factor is that the same stations may be reported in 2
completely different precisions. For example 49N10' would not equal
49N10'15". But are these different stations or not? If you dont have
this tolerance problem, then, use hist_nd. If your data are regularly
spaced, and your hist not tremendously sparse then use hist_nd.
Otherwise you may need to consider what I've done. I tried to use
hist_nd in vein for this problem, so i thought it would be worth
sharing. My problem requires a tolerance factor, eg any two stations
separated by more than 1 minute are distinct. This translates into a
bin size in histogram or in hist_nd. Even if your tolerance is
relatively low (like 1 minute), the distance between any two points
makes the size of the implied grid huge, ie the sparsity of the
histogram output is overwhelming and you'll probably get and error.
That's because of how huge the earth is and the implied grid in the
histogram is massive with virtually no actual data on it. You have to
use long64 to even represent numbers if you want them to be non-
floating. It gets ugly, the problem is not a very clean one. It's one
more level of complication you may need to consider before trying
hist_nd. So hist_nd might not be the way to go, depending on how mesy
your problem is. I think a searching and matching routine like i've
used is much more practical for this described case. I posted my two
codes (from a quick scan, i couldnt see any other externals in the
programs, but there probably are, so just let me know on the side if i
need to post those as well):

http://cires.colorado.edu/~mccreigh/idl/
qa_identify_unique_stations.pro
qa_find_data_precision.pro

the first calls the second in the case that the user does not supply a
tolerance, in that case the second routine trys to find a default
tolerance in the data.

But, since I love hist_nd so much, I thought I'd try to share a quick
example like came up just the other day when I was sorting some dates.
But you really must/should study JD's histgram blub on David's site.
Afer that, to use hist_nd, you really only need read the header for
hist_nd.

IDL> a=[3,3,3,4] ; i guess you could think of these like dates -
day
IDL> b=[8,9,9,9] ; month
IDL> c=[10,10,10,12] ; year
IDL> in=[transpose(a),transpose(b),transpose(c)] ;note the required
form of the input for hist_nd
IDL> hist=hist_nd(in,1,rev=ri) ;binsize 1
IDL> p,hist
[*]
1 0
2 0

0 0
0 0

0 0
0 1

;; note the sparsity and structure - [2,2,3] is because 2 numbers span
variable1, 2 number span variable 2, 3 span the third
;; will illustrate the following reverse indices from the hist_nd
header (the trickiest part, make sure you read the hist primer)
;; ind=[i+nx*(j+ny*k)]
;; ri[ri[ind]:ri[ind+1]-1]

IDL> dims=size(hist,/dim)
;you'd typically do this inside a "for ... do begin" loop and not
store the coeffs and it would look so much
;; nicer! but since I'm working from command line...
IDL> inds=intarr(dims[0],dims[1],dims[2])
IDL> for i=0,dims[0]-1 do for j=0,dims[1]-1 do for k=0,dims[2]-1 do
inds[i,j,k]=i+dims[0]*(j+dims[1]*k)
IDL> for i=0,dims[0]-1 do for j=0,dims[1]-1 do for k=0,dims[2]-1 do if
ri[inds[i,j,k]+1]-1-ri[inds[i,j,k]] ge 0 then $
IDL> print,i,j,k,ri[ri[inds[i,j,k]]:ri[inds[i,j,k]+1]-1]
0 0 0 0 ; this says
that hist [i,j,k]=[0,0,0] has and entry in the input array at 0
0 1 0 1 2 ; this says
that hist [i,j,k]=[0,1,0] has and entries in the input array at [1,2]
1 1 2 3 ; you
guessed it.

My only other routine in my code directory (variogram.pro) is a fancy
wrapper on hist_nd. Hist is multithreaded, so this routine can run
really well on a multiprocessor machine, one process will gladly take
around 4 cpus on the machine I use. That's why programmers love
histogram.

One last note, it may well be worth your while to ditch IDL for R,
unless you have serious memory requirements. R has way more libraries
and useful things available in general (from my experience). If you
dont have alot invested in IDL, I'd seriously consider R for a variety
of reasons (esp cause it's free). But IDL offers the satisfation of
writing your own programs in many cases!

Good luck, hope this is useful or at least reassuring if you've
already figured it out.

James

Report message to a moderator

Re: Duplicate lat/long points [message #64853 is a reply to message #64690]

Thu, 22 January 2009 06:58

Jeremy Bailin
Messages: 618
Registered: April 2008

Senior Member

On Jan 21, 11:02 am, James McCreight <mccre...@gmail.com> wrote:
> that's basically re-inventing hist_nd, but sometimes it's nice to
> understand it from the ground up.

To some degree... but you can avoid the sparseness problems by just
working directly on the 1D mappings. I.e. you do a second mapping from
the 1D indices into their ordinals and then do a histogram of those,
which is guaranteed to be completely non-sparse.

-Jeremy.

Report message to a moderator

Re: Duplicate lat/long points [message #64860 is a reply to message #64690]

Wed, 21 January 2009 08:02

mccreigh
Messages: 13
Registered: January 2008

Junior Member

that's basically re-inventing hist_nd, but sometimes it's nice to
understand it from the ground up.

Report message to a moderator

Re: Duplicate lat/long points [message #64863 is a reply to message #64779]

Wed, 21 January 2009 07:14

Jeremy Bailin
Messages: 618
Registered: April 2008

Senior Member

On Jan 20, 8:57 am, hethomas <het...@googlemail.com> wrote:
> Thank you all for your responses - I am still trying to get the
> histogram method working, but have come up with a temporary working
> solution using UNIQ and running it over two FOR loops.
> James - my problem wasn't as complicated as yours, as all my points
> are the same precision, so that identical points are truly exactly the
> same. Thanks for sharing your method though. May be useful later! (I
> can't easily move to R as I am using IDL coupled with ENVI for
> satellite data work . . .)
>
> Helen

How about turning each (lat,lon) pair into a single unique index?
Since the latitudes are bounded by (-90,+90) the following mapping
function should work:

onedcoord = (lats+90) + 200*lons

You can then just run onedcoord through sort and uniq and just use one
for loop (or probably none if you're clever enough...).

-Jeremy.

Report message to a moderator

Previous Topic:	FLOAT images instead of BYTE ones from IDL Object graphics ?
Next Topic:	BAR_PLOT with pairs of bars

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Fri Nov 28 08:25:59 PST 2025

Total time taken to generate the page: 0.01101 seconds