What are the 'real numbers', really?(math.vanderbilt.edu) |
What are the 'real numbers', really?(math.vanderbilt.edu) |
Infinitesimals have been made rigorous with modern mathematics.
Terry Tao has a wonderful series of posts about hard and soft analysis, ultrafilters, and non-standard analysis. He writes
I feel that one of the reasons that non-standard analysis is
not embraced more widely is because the transfer principle,
and the ultrafilter that powers it, is often regarded as some
sort of “black box” which mysteriously bestows some
certificate of rigour on non-standard arguments used to prove
standard theorems, while conveying no information whatsoever
on what the quantitative bounds for such theorems should
be. Without a proper understanding of this black box, a
mathematician may then feel uncomfortable with any
non-standard argument, no matter how impressive and powerful
the result.
and The main drawbacks to use of non-standard notation (apart
from the fact that it tends to scare away some of your
audience) is that a certain amount of notational setup is
required at the beginning, and that the bounds one obtains at
the end are rather ineffective (though, of course, one can
always, after painful effort, translate a non-standard
argument back into a messy but quantitative standard argument
if one desires)
(from http://terrytao.wordpress.com/2007/06/25/ultrafilters-nonsta...)Anybody with a small bit of curiosity or a dashing of non-conformity will be suspicious of this narrative.
If anything, infinitesimals in their various guises carry a certain explanatory heft, and are quite beguiling little creatures if you take the time to get to know them. I'd be happy to elaborate or leave a few links here if anybody is interested.
Suppose there's a location on the line that's somehow missing - call it x. Let A be all the numbers less than x, let B be all the numbers greater than x, and that gives you your Dedekind cut. That Dedekind cut is, in a very real sense, x, and that means x is a real. QED.
That needs tidying up and formalising, but it does work.
In-between any two rational numbers there's an infinite number of other rational numbers. So, in any reasonable sense and at any level of "magnification", if you can "see" two dots representing two rational numbers then they are connected by a line of other little dots (just like the reals). Perhaps you could argue though that at "infinite magnification" there are no rational numbers to be seen, it's just empty space, whereas the reals of course still make a nice line.
Holy hell that is clear, concise and compelling. If only my professors would have explained it like this more often in my freshman calc class which was so much more abstract and proof based than anything I had encountered before. The only thing I remember form that time is hellishly long study groups late into the night with my classmates.
We need to stop venerating the "real" numbers and start focusing on sets that are actually usable.
Still, as in the OP, mentioning Dedekind cuts is okay since it is one way to establish completeness, but there is much more, e.g., as in
John C. Oxtoby, Measure and Category.
and even that doesn't fathom all that is special about the reals. E.g., for just a little more, there is the continuum hypothesis, that little thing!
The OP wants to say that by mentioning Dedekind and completeness he is getting at what the reals really are; no, instead he is just cutting one layer deeper of something that has likely some infinitely many layers available.
Yes, yes, yes, I know; I know; the reals are the only complete, Archimedean ordered field, okay, after we have defined completeness, Archimedean ordered, and field and explained why these are important.
So, back to "points on the line" -- it's actually pretty good for a first cut.
The comments about how "few students take [Real Analysis]" doesn't square with my experience and survey of an undergraduate mathematics education. Such a course is often called "Advanced Calculus", and is a required course for a Bachelors-level education in Math. I also understand in the European-style approach to teaching Math, students start off with a foundational approach to Calculus through Real Analysis, and not the hand-wavy & computation-driven Calculus course.
The equivalence class approach attributed to Cantor is more generalizable in discussing sets. The theoretical foundation of Fourier Transforms lies in a similar completion of functions.
Yes. Where I graduated, all engineering majors learn the axiomatic definition of the real numbers including the "supremum (least upper bound) axiom" at the beginning of the first calculus class.
The foundations of analysis by Larry Clifton. I always enjoy checking out the references in his papers as they are often hundreds of years old or more.
This is a curious paper. It's a rigorous derivation of (positive) real numbers without the use of 0 or negative numbers anywhere. It isn't very useful, although the fact that this can easily be done is by itself interesting.
I have sometimes thought about the possibility of us encountering an advanced alien civilization and trying to match our math to theirs. Someone told me recently that if aliens were able to get into space, we can take it for granted that they knew negative numbers (in addition to more advanced concepts). I disagreed. Negative numbers are very convenient, but all the math that's needed for modern physics can, I think, be built up without them in a way that's more bulky and awkward, but not an order of magnitude bulky. This paper is weak evidence of my position.
Other than that, a great article!
It also states that you cannot order the field of complex numbers. Whereas I seem to recollect that there are ways to do so. For instance, z1 < z2 if x1 < x2 or x1 = x2 and y1 < y2.
It's wonderful to have this little insight now. It's unfortunate that my math knowledge is so filled with holes.
Unfortunately there is no such theory."
http://njwildberger.wordpress.com/2012/12/02/difficulties-wi...
These posts are always stimulating.
My understanding of a line is that it is delimited by two points, but does not contain any points. To elaborate, no point could be "on" a line because a point has no extension, whereas a line does. This is the crux of the matter. Therefore a line is not "made up of" points. (By analogy a plane could not be made up of lines.) This begs the question, what are lines made up of? Are they made up of anything? Is a point really where two (or more) lines would intersect if they could intersect. Is this what is meant by a Dedekind cut?
I don't remember the precise proof, but if memory serves it derives from the existance of opposites and inverses, and 0 and 1 being unique in the set, due the commutative properties of abelean groups.
Consider that the integral of the ruler function from 0 to 1 is 0 (as is stated in your reference 1). In layman's terms you could express this as "there are infinitely more irrational than rational numbers between 0 and 1". At the same time, "for every two rational numbers there are infinitely many rational numbers in-between them". What sort of "picture" is this compatible with?
I still think that the only picture that really makes any sense is a solid line at any finite magnification, yet empty space at infinite magnification.
(Compare e.g. with the fourier transform of a function. It consists of a sum series which comes "arbitrarily close" to the function, but "at the limit" when the number of terms approaches infinity the function and its fourier transform is one and the same.)
There exists a function of the reals which is continuous at every irrational point but discontinuous at every rational point. However, there is no function of the reals which is discontinuous on the irrationals but continuous on the rationals. In this sense, the irrationals are "more continuous" than the rationals.
That's about the best I can do, though, which I admit is a stretch.
One can definitely "work with" numbers that aren't easy to write. a + (-a) = 0, and this is valid for every real number a, not just "the ones which I can describe with a finite amount of information", or the ones I've written down at some point in my life.
Every number that we can construct can be constructed in a finite amount of symbols. For example sqrt(2) is an unambiguous description of itself. Without use of the sqrt function, we can also call it the number x such that x*x=2. However, every description is a finite string constructed from a finite alphabet. We can easily show that the set of all such descriptions is countably infinite. However, we can also show that the set of all real numbers is uncountably infinite. Therefore, there is an uncountable infinity of real numbers that cannot be constructed.
The Reals are constructed specifically to be the smallest set that has some nice algebraic properties, like Least Upper Bounds. Sets that model the real world, like the constructables, countables, computables, etc. tend to be subsets of the Reals, and therefore don't have those properties. That absence makes life difficult.
The Real Number system, like almost everything in mathematics, is an approximation of reality that makes a trade-off between faithfulness and tractibility. As it turns out, gaining more of the former loses you quite a bit of the latter. It's generally not worth it.
The line you are talking about in the rest of your post seems to be an 'unrelated' object that is used in geometry. I am not familiar with the formal definition of line that is used in geometry, but one way of defining a line is as the set of all points which satisfy "y=mx+b", for a given (m,b). A line segment would be the above definition with restrictions on the domain: x_0<x<x_f.
That is not how Euclid defined it and how it is still seen in geometry today. What you describe is called a (line) segment (http://en.wikipedia.org/wiki/Line_segment)
"but does not contain any points"
Lines extend indefinitely in two directions (if you go past Euclidean geometry, that 'indefinitely' changes meaning a bit)
One talks of a point being _on_ a line in geometry. 'contains' is something from set theory: "the set of all points on line l contains point P" is a perfectly valid expression (but "P is on l" is way shorter)
A line is (or can be viewed as) an infinite set of points.
> To elaborate, no point could be "on" a line because a point has no extension, whereas a line does.
That seems to be a consequence of an unusual definition of "on".
You can deal instead with Euclid's axiomatization of geometry, and there "line" is an abstract thing defined by two points. Different animal, although seldom explained clearly by teachers, who often themselves don't really understand what's going on. (Although some do, and don't get the chance to explore these things because of the pressure of the curriculum, and students who don't care, but need to pass.)
All too often people get confused about this and are told to shut up by their teacher, whereas in fact the student has had an insight, and demonstrated deeper understanding.
You may find the Fano plane (a three-dimensional finite projective space) interesting:
* http://en.wikipedia.org/wiki/Fano_plane (brief description)
* http://math.ucr.edu/home/baez/octonions/node4.html (connections with higher math)And besides, the rationals are totally ordered, and their completion is totally ordered, so it makes sense to think of them as arranged in a line. The problem is that the reals are very, very strange in some ways, and people do get seduced into thinking they understand them, whereas usually it's just a case that they've got used to them.
Personally, I'm not a constructivist; I think that these undefinable real numbers exist just as well as the ones that we can define. But that's a philosophical argument and I was never any good at those.
The phrase "all undescribable real numbers" does not introduce any problems, because we have still not described any specific undescribable number. We would run into a problem with a phrase such as "the smallest undescribable real number", as that would be a description of a specific undescribable real number. Fourtuantly, that particular phrase does not raise any problems because we can simply conclude that their is no smallest undescribable real number, in the same way that there is no smallest real number in general.
Then apply the diagonal argument. Take the computable numbers between 0 and 1, including 0, not including 1. These are countable, so we can write them in a list, taking a mapping k from the natural numbers: { 1, 2, 3, 4, ... } to the set of computable numbers in [0,1).
Now let's construct a new number. In the first decimal place we put 1 if the first decimal place of k(1) is 0, and 0 otherwise. In the second place we put 1 if the second decimal place of k(2) is 0, and 0 otherwise. And so on.
This results in a number that's not on the list, and is between 0 and 1. So it must, by our assumption, not be computable.
Things become tricky.
So there's a choice to be made, and most mainstream mathematicians have decided to talk about, use, study, and otherwise accept the existence of the real numbers because it's convenient.
Feel free to choose otherwise.
Let a constructable number be one which can be unambiguously described in a finite string. Because we are working from a finite alphabet, we can trivially see that their is a bijection between the constructables and the integers (if we have n symbols, then each string can be read as an integer in base n, so the amount of constructables is no larger than the integers. We can also show that all integers are constructable, so the amount of constructables is no smaller than the integers). Now, take the set of all constructables, and use the diagonal arguement to construct a new number. We can see that this number is not constructable, however, it would appear that I have just unambigously described it, meaning that it must be constructable.
The only potential hole I see is that the ordering of the constructables when I apply the diagonal arguement is ambigous, but we can unambiguously order them by the lexical ordering of their 'canocial' description, and we can unambiguous define the canonical description as the smallest one when translated into a base n integer.
I suspect that doing the above will run into problems with computable numbers (as it likely involves the halting problem), however it appears to be an unambiguous description of a real number that is not constructable. Obviously there is some flaw in this reasoning.
Second, what does it mean for something to be described (unambiguously or otherwise) with a "finite string?" What is a "string" here?
You're playing too loose with these ideas and it's biting you. You have to start by defining them precisely. For example, I don't see at all how the new number not on your list is "described unambiguously." It's presumably not enough to say "there is some number not on my list, we will call it x" since we know there is more than just one such number. How is that unambiguous?
In any case, that's why you have to define these things precisely.
So you cannot list all constructables in a constructive manner because the list itself is not constructible.
A related concept
The Cantor set shares the property that "for every two [points in the set] there are infinitely many [points in the set] in between", but no one would describe it as looking like a line. It's rather sparse.
The Cantor set is very different. It's even easy to give an example of two points in the set that can (sanely) be depicted with empty space in-between: 1/3 and 2/3. If I'm not mistaken that example also disproves your stated conjecture... ;)
> It's even easy to give an example of two points in the [Cantor] set that can (sanely) be depicted with empty space in-between: 1/3 and 2/3. If I'm not mistaken that example also disproves your stated conjecture... [that between any two points in the set, there is a third one] ;)
Fair enough. Consider, then, the intersection of the Cantor set with the irrational numbers (you can think of this as the "open Cantor set"). It is, obviously, a subset of the Cantor set, and really does have the property described.
Since I'm feeling embarrassed about that last time, a proof follows:
-----
The Cantor set consists of all real numbers in the interval [0,1] which have a "decimal" expansion in trinary which does not contain the digit 1. That is to say, they can be expressed in terms of powers of (1/3) such that the coefficient of each power of 1/3 is either 0 or 2. (1/3 would usually be represented in trinary as 0.1, but is in the Cantor set because of its representation as 0.02222222...)
Let a,b be two irrational numbers in the Cantor set, a less than b. There is some decimal place at which they diverge, and since a is smaller, it has a 0 at that point, while b has a 2. Since a is irrational, it also has a 0 at some later point in its expansion (if every digit after that were 2, then a's expansion would be repeating and a would be rational). The number constructed by substituting a 2 for a 0 at that index is greater than a, less than b, and in the Cantor set.
Graphical representation of the proof:
a = 0.......0......
b = 0.......2......
then a = 0.......0....0.....
c = 0.......0....2.....
b = 0.......2.......... If all you want to do is differentiate and integrate,
then non-standard analysis is probably, for most people,
a faster way to be able to do just that.
Now read on ...Non-standard analysis has been put on a firm, formal footing. Theorems have been proven showing that (largely) it's equivalent to the regular form of analysis. Some things are easier to prove in standard analysis, some things are easier to prove in non-standard analysis, etc, etc.
However, this is only really of use if all you want to do is calculus. If you want to go beyond calculus, almost everything (in this and related areas) is about sequences, limits, limiting processes, functions, and transformations. There, non-standard analysis tends not to help, and unless you've done calculus the standard way, you have to learn all this stuff in an unfamiliar and difficult-to-visualize, abstract area.
One of the main reasons for continuing to learn calculus in the epsilon-delta limiting process manner is exactly because it's not only formally sound, it's also giving you tools for moving beyond the rather limited world of differential calculus.
Speculating wildly from limited experience, it might also be the case that starting people with the non-standard approach in calculus is actually just as confusing. You may find that you really only got the insights you did because you had already struggled with the standard approach, and then were given something that made it all fall into place. Perhaps some people they think the non-standard approach is easier, but in fact it's only because they've actually got the foundations from the other. Just a thought.
Why do you say this? I ask because I've found internal set theory, Edward Nelson's axiomatic version of nonstandard analysis, to be a lovely tool for doing typical sorts of things in analysis.
You have to learn to wield the "standard" predicate [0], which is too dark an art for some mathematicians, I suppose. But, in my opinion, nonstandard characterizations of notions like convergence and continuity are delightfully simple and direct.
It also turns out that when you have nonstandard numbers at hand, infinity is an over-powerful abstraction for some purposes. Nelson came up with a new formalism for probability theory [1], for example, that makes finite spaces powerful enough to capture what's interesting for most purposes. Similarly, finite but unlimited sequences often are "long enough" to incorporate all the interesting behavior of infinite sequences.
0. Alain Robert's Nonstandard Analysis is a good starting point.
1. See his short book Radically Elementary Probability Theory. I love this book, and didn't much like probability theory before reading it.
So we are in agreement. My point is that if you teach calculus that way you have immediately ham-strung anyone who might go on and do anything other than engineering or physics. In fact, there are deep theoretical arguments in physics where you need to use the standard approach, and the non-standard approaches are much more difficult.
My point is that if all you want is calculus then it's very likely that the non-standard approach is fine. I'm also arguing that this is limited thinking. Clearly you were never going to go further in these sorts of subjects - does that mean that everyone else should also be taught in a similarly limited way?
I also observe that limiting arguments are essential in anything other than the most direct and practical versions of engineering, so again, the point isn't in the calculus, the point is learning about limits.
Many people don't need any math at all beyond arithmetic, and I know a lot of people who proudly announce that they can't even do that. And to some extent it's true - most people don't need any math at all. Why were you bothering to take calculus? I'm sure you've never needed it.
But let me add that if all you want to do is arithmetic, why bother? Just use a calculator. If all you want to be able to do is differentiate, why bother? Feed it to Wolfram Alpha. If all you want to do is program, why bother? Hire someone to do it.
But yes, if all you want to do is high-school calculus, there are easier ways to learn the processes to jump through the hoops, pass the exam, and get the piece of paper. For most people that's all they care about. We probably agree on that.
This meant that maths stopped having the same appeal to me as computer programming.
It was only years later when I revisited the epsilon delta arguments that it finally made sense. It was a revelation to me that you could explain all of calculus without ever talking about "infinite".
I wish it had been taught to me rigorously the first time around: I would have been much better off.
How can you tell whether it's standard analysis that's confusing per se or you just had poor math teachers?
It's not a great paper and most of the insights in it come from others but here is some of the arithmetic of nilpotent[1] infinitesimals as shown in the appendix.
Imagine an entity which is not equal to zero but that when raised to the power of 2 or higher is equal to zero! Sounds odd, doesn't it, but it works! (ϵ is an infinitesimal)
ϵ != 0 but ϵ^n = 0 | n>1
ok? so we get:
(ϵ + 1)^n = 1 + nϵ thus: (ϵ + 1)^−1 = 1 − ϵ
e^ϵ = 1+ϵ
(ϵ + 1)(ϵ−1) = −1, or alternately (1 + ϵ)(1 − ϵ) = −1
and finally (for calculus): ϵf′(x) = f(x + ϵ)−f(x)
1: http://leto.electropoiesis.org/propaganda/The_Analyst_Revisi...
2: https://en.wikipedia.org/wiki/Nilpotent
edit: clarity, line breaks!
(ϵ + 1)(ϵ−1) = −1, or alternately (1 + ϵ)(1 − ϵ) = −1
That alternative should surely be: (1 + ϵ)(1 − ϵ) = 1
Not least, in a commutative system (1+x)(1-x) = 1-x^2. Thus (1 + ϵ)(1 − ϵ) = 1 - ϵ^2 = 1You're right though that a more precise definition of "real-world numbers" is needed, but I confess that my attempts to think of one in the past few minutes have been essentially circular (coming down to "the ones we know how to compute")!
It's not clear whether the universe is computable, however, in the sense that we only find computable numbers in nature. This is kind of an epistemological catch-22, though. How would we know whether this were the case or not?
Can you make this rigorous? Because using the standard definitions, this statement is not true. It's true that a countable number of points must have total length zero (and you can even give a rigorous proof of this) but not necessarily true for a non-countable number of points. The study of "lengths of sets of points" is called measure theory.
I think it is unnecessary, however, to bring in the whole concept of length when defining lines. For example, we could simply define a line as a set of points obeying some special properties.
Infinities are weird; anybody who wants to learn math has to accept that. 0.99999… does equal 1, there are as many even numbers as integers, etc. these things are 'true' not because they make sense initially, but because they make the most sense of all the other things we have thought of so far. Similarly, a set of Aleph-0 points can completely cover a line.
Charles Sanders Peirce said in 1903, ”Now if we are to accept the common idea of continuity […] we must either say that a continuous line contains no points or […] that the principle of excluded middle does not hold of these points. The principle of excluded middle applies only to an individual […] but places being mere possibilities without actual existence are not individuals.”
This isn't true for an uncountably infinite set of points, assuming by 'extension' you mean what is usually called 'meausre' in modern mathematics. Modern theory is perfectly fine with saying that a line of nonzero length contains an infinite number of points of zero length, and trying to draw on Euclidean notions definitions of 'point' and 'line' to find conclusions about real analysis is going to be unhelpful.
I'm not sure what that Peirce quote is trying to say.