Table of contents:
- Introduction: the general strategy of finding homomorphic spaces
- Integral points on hyperplanes: prerequisites
- Integral points on hyperplanes: Bezout, Euclid's algorithm, Gaussian elimination and Smith normal form
Introduction
Number theory is just algebra done over the integers. This seems kind of weird at first -- the integer lattice seems like a rather arbitrary lattice to care about; I mean, if you're looking for solutions for an equation over the reals, you can always show existence and approximate the solution by analytic means, but whether or not an arbitrary real number is an integer is just so arbitrary, why would you even CARE about that--
But the integers do have some special structure. The most obvious interesting thing about the integers is divisibility. One can show something isn't a solution by taking both sides modb for some b -- i.e. one can get restrictions on the space of possible solutions by considering the problem in Zmodb.
This is an example of a much more general strategy in mathematics to restrict the space of possible solutions -- apply a homomorphism to both sides and eliminate non-solutions in the homomorphic space. If you apply an isomorphism, you can also do the converse -- i.e. accept solutions in the isomorphic space.
So we're interested in the structure of Z, so as to know what it can homomorph to. Addition and multiplication are rather important to the structure of Z -- we like to call this a ring. In fact, Z is the "free ring". If you know any category theory, this means Z is the ring that has homomorphisms to every other ring.
In basic number theory, there are two sorts of such rings to which we will be interested in homomorphisms to: Z/bZ (which we've discussed) and Z[α] for some non-integer α.
This is why the algebraic properties of these rings are of so much importance in number theory -- that's why algebra is so important in number theory. The Chinese remainder theorem -- the statement that the obvious projection map Z/(n1…nk)Z→Z/n1Z×⋯×Z/nkZ is a ring isomorphism -- is such an algebraic property. Stuff about quadratic and higher residues (squares mod b) are such algebraic properties. Stuff about when Z[α] is a UFD, PID blabla are such algebraic properties. We will make reference to them throughout this course without getting into the details of things.
So the topic is solving equations over the integers. In particular, solving polynomial equations over the integers.
Integral points on hyperplanes: prerequisites
The following animation is a picture illustration of Euclid's algorithm:
Integral points on hyperplanes: Bezout, Euclid's algorithm, Gaussian elimination and Smith normal form
381x+66y=b
So Bezout's identity tells us that the equation a1x+a2y=b (by convention a1,a2,b are known/"constants" while x,y,z are to be solved for/"variables") is solvable iff gcd(a1,a2) divides b. In fact it says the equation ∑aixi=b is solvable iff gcd((ai)) divides b.
In fact it tells us there is an algorithm to compute this solution xi -- namely, Euclid's algorithm.
To help us generalize this: the nicest way to think of Euclid's algorithm algorithmically is as a sequence of matrix multiplications. E.g.
381=5×66+5166=1×51+1551=3×15+615=2×6+36=2×3+0
[38166]=[5110][1110][3110][2110][2110][30]=[12752229][30][30]=[38166][−92252−127]
So in effect, we're solving the linear equation (where a,d are row vectors) ax=b by instead solving dy=b where d=aT is of the form (d,0) and T∈GL(Z2).
Note also that left- (resp. right-) multiplication by [λ110] can also be understood as "add λ copies of Row 1 to Row (resp. Col) 2, then switch Row (resp. Col) 1 and Row (resp. Col) 2" -- i.e.
[λ110]=[0110][10λ1]
But the nice thing about this matrix approach is that it can be used to solve systems of linear equations. It is not sufficient to solve these equations independently and state the conditions as d1∣c1,…dm∣cm, because the solution sets for each equation needn't intersect (think: 3x+6y=3,3x+9y=3).
Instead, we solve Ax=b by solving Dy=c where D=SAT, c=Sb and D is diagonal, S,T∈GL(Zn). This D is called the Smith normal form of A, and can be seen as a generalization of the Greatest Common Divisor to a set of integer vectors.
No comments:
Post a Comment