comp.lang.idl-pvwave archive: archive » Re: About the bits reserved for float variable

Home » Public Forums » archive » Re: About the bits reserved for float variable

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

Re: About the bits reserved for float variable [message #39497]

Sat, 22 May 2004 05:01

George N. White III is currently offline

George N. White III
Messages: 56
Registered: September 2000

Member

On Fri, 21 May 2004, Nuno Oliveira wrote:

> I looking at the Chapter 5 of the Bulding Aplication.
>
> It says, for float variables that it's a 32 bits number in the range of
> +/-10^38 withe approximately six or seven decimal places of significance.
> What I'm missing here? How can a number 32 bit number range between -10^38
> and +10^38?

Read chapter 1 of any decent introductory numerical analysis text and then
"What Every Computer Scientist Should Know About Floating-Point
Arithmetic" by D. Goldberg, ACM Computing Surveys, Vol 23, 1991, p5-48)
and reprinted in Sun's online manuals. See links on:

http://cch.loria.fr/documentation/IEEE754/

While it is true that many people do get by without understanding
f.p. arithmetic, there are also many examples of calculations going astray
due to failure to recognize situations where the difference between
f.p. and real numbers matters. It is becoming more important to
understand the material in Goldberg's paper because newer hardware
speedups (speculative execution, merged operations, parallel
processing) tend to make it harder to diagnose arithmetic problems.

--
George N. White III <aa056@chebucto.ns.ca>

Report message to a moderator

Re: About the bits reserved for float variable [message #39498 is a reply to message #39497]

Fri, 21 May 2004 15:22

Kenneth P. Bowman
Messages: 585
Registered: May 2000

Senior Member

In article <MPG.1b17c91c4c958ba1989761@news.frii.com>,
David Fanning <davidf@dfanning.com> wrote:

> it this way, you can have fast or accurate, but you
> can't have both. That's about as much as I've ever
> needed to know using a computer. :-)

It used to be that integer arithmetic was faster than floating point,
but that is generally no longer the case. Just about all machines
that I know of can do integer or floating point ops in a single
clock cycle. Some cpus can do more than one op per clock cycle.
(That's what many of those millions of transistors on
modern cpus are used for.)

Additionally, many (but not all) architectures have double-precision
floating point hardware units. DP operations on those systems are as fast as
single precision. On many machines the only drawbacks to doing everything
in DP are: twice as much memory is required and twice as much file
space.

My rules of thumb:
Use integers for things you can count (i.e., no fractions).
Use doubles for "real numbers", unless memory is a problem.
Write files in single or double precision, as needed.

Ken Bowman

Report message to a moderator

Re: About the bits reserved for float variable [message #39504 is a reply to message #39498]

Fri, 21 May 2004 08:07

David Fanning
Messages: 11724
Registered: August 2001

Senior Member

James Kuyper writes:

> It can do that by not representing every integer value in that range. A
> 32-bit type can represent a maximum of 2^32 different values. An
> ordinary 32 bit integer type represents 2^32 consecutive integer values.
> A 32-bit IEEE format floating point number represents a slightly smaller
> set of values (because some of the bit patters represent +infinity,
> -infinity, denormalized numbers, and NaNs), but those values are very
> closely spaced near 0, and more widely spaced out the larger the values
> are, which allows them to cover a much larger dynamic range.
>
> To be specific, an IEEE format number contains a sign bit, a mantissa,
> an exponent, and has an implicit offset which is used to interpret the
> value. The value represented by such a number is
>
> (-1)^sign *(1 + mantissa/2^n)*2^(exponent+offset)
>
> where 'n' is the number of bits in the mantissa, and offset is negative.
> Note that this formula provides no way to represent 0 (the mantissa is
> never negative). As a special exception, a mantissa and exponent that
> are both zero are treated as representing 0, rather than 2^offset, which
> is what the general formula would call for.
>
> Thus, for any given value of 'k' within a certain range, this format can
> represent exactly 2^n different value x in the range 2^k <= x < 2^k+1,
> evenly spaced within that interval.

Nuno, aren't you glad you asked. :-)

This kind of answer has always fallen into the
"Too Much Information" category for me. I think of
it this way, you can have fast or accurate, but you
can't have both. That's about as much as I've ever
needed to know using a computer. :-)

Cheers,

David

--
David Fanning, Ph.D.
Fanning Software Consulting, Inc.
Coyote's Guide to IDL Programming: http://www.dfanning.com/

Report message to a moderator

Re: About the bits reserved for float variable [message #39505 is a reply to message #39504]

Fri, 21 May 2004 08:00

James Kuyper
Messages: 425
Registered: March 2000

Senior Member

Nuno Oliveira wrote:
> I looking at the Chapter 5 of the Bulding Aplication.
>
> It says, for float variables that it's a 32 bits number in the range of
> +/-10^38 withe approximately six or seven decimal places of significance.
> What I'm missing here? How can a number 32 bit number range between -10^38
> and +10^38?

It can do that by not representing every integer value in that range. A
32-bit type can represent a maximum of 2^32 different values. An
ordinary 32 bit integer type represents 2^32 consecutive integer values.
A 32-bit IEEE format floating point number represents a slightly smaller
set of values (because some of the bit patters represent +infinity,
-infinity, denormalized numbers, and NaNs), but those values are very
closely spaced near 0, and more widely spaced out the larger the values
are, which allows them to cover a much larger dynamic range.

To be specific, an IEEE format number contains a sign bit, a mantissa,
an exponent, and has an implicit offset which is used to interpret the
value. The value represented by such a number is

(-1)^sign *(1 + mantissa/2^n)*2^(exponent+offset)

where 'n' is the number of bits in the mantissa, and offset is negative.
Note that this formula provides no way to represent 0 (the mantissa is
never negative). As a special exception, a mantissa and exponent that
are both zero are treated as representing 0, rather than 2^offset, which
is what the general formula would call for.

Thus, for any given value of 'k' within a certain range, this format can
represent exactly 2^n different value x in the range 2^k <= x < 2^k+1,
evenly spaced within that interval.

Report message to a moderator

Re: About the bits reserved for float variable [message #39506 is a reply to message #39505]

Fri, 21 May 2004 07:44

Paul Van Delst[1]
Messages: 1157
Registered: April 2002

Senior Member

Report message to a moderator

Re: About the bits reserved for float variable [message #39507 is a reply to message #39506]

Fri, 21 May 2004 07:36

Chris Lee
Messages: 101
Registered: August 2003

Senior Member

In article <c8l0c4$jle$1@pegasus.fccn.pt>, "Nuno Oliveira"
<nmoliveira@fc.ul.pt> wrote:

> I looking at the Chapter 5 of the Bulding Aplication. It says, for
> float variables that it's a 32 bits number in the range of +/-10^38
> withe approximately six or seven decimal places of significance. What
> I'm missing here? How can a number 32 bit number range between -10^38
> and +10^38?
>

For an 32 bit floating point number, the first bit is the sign bit. the
next 8 bits are the exponent, the last 23 bits are the mantissa (IEEE)

The exponent has 8 bits, it can do -128 -> 128 in base 2,
2^128 = 3.4 x 10^38
2^-128 = ...10^-38

The 23 bits of the mantissa represent a number between 0 and 2 (scaled).
2^23 = 8388608, a 7 digit number

There's an equation to convert them on the IEEE website I think.

Chris.

Report message to a moderator

Re: About the bits reserved for float variable [message #39539 is a reply to message #39504]

Tue, 25 May 2004 10:08

Nuno Oliveira
Messages: 75
Registered: October 2003

Member

Those were definitely careful answers. Indeed, more that I needed to know.
But anyway I enjoyed knowing all those things. And besides that it's a great
thing to know that are that like to help the others. Enchanted!, I can say.

"David Fanning" <davidf@dfanning.com> wrote in message
news:MPG.1b17c91c4c958ba1989761@news.frii.com...
>
> Nuno, aren't you glad you asked. :-)
>
> This kind of answer has always fallen into the
> "Too Much Information" category for me. I think of
> it this way, you can have fast or accurate, but you
> can't have both. That's about as much as I've ever
> needed to know using a computer. :-)
>
> Cheers,
>
> David
>
>
> --
> David Fanning, Ph.D.
> Fanning Software Consulting, Inc.
> Coyote's Guide to IDL Programming: http://www.dfanning.com/

Report message to a moderator

Previous Topic:	Re: 3D plot of set of curves
Next Topic:	Re: Depth visibility with Object Graphics !!!

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Tue Dec 02 19:04:19 PST 2025

Total time taken to generate the page: 0.55795 seconds