Content deleted Content added
clarification |
Link suggestions feature: 3 links added. |
||
(47 intermediate revisions by 33 users not shown) | |||
Line 1:
{{Short description|Quadratic homogeneous polynomial in two variables}}
{{About|binary quadratic forms with [[integer]] coefficients|binary quadratic forms with other coefficients|quadratic form}}
{{more footnotes|date=July 2009}}
In [[mathematics]], a '''binary quadratic form''' is a quadratic [[homogeneous polynomial]] in two variables
Line 8:
where ''a'', ''b'', ''c'' are the '''coefficients'''. When the coefficients can be arbitrary [[complex number]]s, most results are not specific to the case of two variables, so they are described in [[quadratic form]]. A quadratic form with [[integer]] coefficients is called an '''integral binary quadratic form''', often abbreviated to ''binary quadratic form''.
This article is entirely devoted to integral binary quadratic forms. This choice is motivated by their status as the driving force behind the development of [[algebraic number theory]]. Since the late nineteenth century, binary quadratic forms have given up their preeminence in algebraic number theory to [[quadratic field|quadratic]] and more general [[number field]]s, but advances specific to binary quadratic forms still occur on occasion.
Pierre Fermat stated that if p is an odd prime then the equation <math>p = x^2 + y^2</math> has a solution iff <math>p \equiv 1 \pmod{4}</math>, and he made similar statement about the equations <math>p = x^2 + 2y^2</math>, <math>p = x^2 + 3y^2</math>, <math>p = x^2 - 2y^2</math> and <math>p = x^2 - 3y^2</math>.
<math>x^2 + y^2, x^2 + 2y^2, x^2 - 3y^2</math> and so on are quadratic forms, and the theory of quadratic forms gives a unified way of looking at and proving these theorems.
Another instance of quadratic forms is [[Pell's equation]] <math>x^2-ny^2=1</math>.
Binary quadratic forms are closely related to ideals in quadratic fields. This allows the class number of a quadratic field to be calculated by counting the number of reduced binary quadratic forms of a given discriminant.
The classical [[theta function]] of 2 variables is <math> \sum_{(m,n)\in \mathbb{Z}^2} q^{m^2 + n^2}</math>, if <math>f(x,y)</math> is a positive [[definite quadratic form]] then <math> \sum_{(m,n)\in \mathbb{Z}^2} q^{f(m,n)}</math> is a theta function.
== Equivalence ==
Two forms ''f'' and ''g'' are called '''equivalent''' if there exist integers <math>\alpha, \beta, \gamma, \text{ and } \delta</math> such that the following conditions hold:
: <math>\begin{align} f(\alpha x + \beta y, \gamma x + \delta y) &= g(x,y),\\
\alpha \delta - \beta \gamma &= 1.\end{align}</math>
For example, with <math>f= x^2 + 4xy + 2y^2</math> and <math>\alpha = -3</math>, <math>\beta = 2</math>, <math>\gamma = 1</math>, and <math>\delta = -1</math>, we find that ''f'' is equivalent to <math>g = (-3x+2y)^2 + 4(-3x+2y)(x-y)+2(x-y)^2</math>, which simplifies to <math>-x^2+4xy-2y^2</math>.
The above equivalence conditions define an [[equivalence relation]] on the set of integral quadratic forms. It follows that the quadratic forms are [[partition of a set|partition]]ed into equivalence classes, called '''classes''' of quadratic forms. A '''class invariant''' can mean either a function defined on equivalence classes of forms or a property shared by all forms in the same class.
Lagrange used a different notion of equivalence, in which the second condition is replaced by <math> \alpha \delta - \beta \gamma = \pm 1</math>. Since Gauss it has been recognized that this definition is inferior to that given above. If there is a need to distinguish, sometimes forms are called '''properly equivalent''' using the definition above and '''improperly equivalent''' if they are equivalent in Lagrange's sense.
In [[matrix (mathematics)|matrix]] terminology, which is used occasionally below, when
: <math> \begin{pmatrix} \alpha & \beta \\ \gamma & \delta \end{pmatrix} </math>
has integer entries and determinant 1, the map <math> f(x,y) \mapsto f(\alpha x + \beta y, \gamma x + \delta y)</math> is a (right) [[Group action (mathematics)|group action]] of <math>\mathrm{SL}_2(\mathbb{Z})</math> on the set of binary quadratic forms. The equivalence relation above then arises from the general theory of group actions.
If <math>f=ax^2+bxy+cy^2</math>, then important invariants include
* The [[discriminant]] <math>\Delta=b^2-4ac</math>.
* The content, equal to the greatest common divisor of ''a'', ''b'', and ''c''.
Terminology has arisen for classifying classes and their forms in terms of their invariants. A form of discriminant <math>\Delta</math> is '''definite''' if <math>\Delta < 0</math>, '''degenerate''' if <math>\Delta</math> is a perfect square, and '''indefinite''' otherwise. A form is '''primitive''' if its content is 1, that is, if its coefficients are coprime. If a form's discriminant is a [[fundamental discriminant]], then the form is primitive.<ref>{{harvnb|Cohen|1993|loc=§5.2}}</ref> Discriminants satisfy <math>\Delta\equiv 0,1 \pmod 4. </math>
=== Automorphisms ===
If ''f'' is a quadratic form, a matrix
: <math> \begin{pmatrix} \alpha & \beta \\ \gamma & \delta \end{pmatrix} </math>
in <math>\mathrm{SL}_2(\mathbb{Z})</math> is an '''automorphism''' of ''f'' if <math>f(\alpha x + \beta y, \gamma x + \delta y) = f(x,y)</math>. For example, the matrix
: <math> \begin{pmatrix} 3 & -4 \\ -2 & 3 \end{pmatrix} </math>
is an automorphism of the form <math>f = x^2 - 2y^2</math>. The automorphisms of a form are a [[subgroup]] of <math>\mathrm{SL}_2(\mathbb{Z})</math>. When ''f'' is definite, the group is finite, and when ''f'' is indefinite, it is infinite and [[cyclic group|cyclic]].
==Representation==
A binary quadratic form <math>q(x,y)</math> ''represents'' an integer <math>n</math> if it is possible to find integers <math>x</math> and <math>y</math> satisfying the equation <math>n = q(x,y).</math> Such an equation is a ''representation'' of {{math|''n''}} by {{math|''q''}}.
=== Examples ===
[[Diophantus]] considered whether, for an odd integer <math>n</math>, it is possible to find integers <math>x</math> and <math>y</math> for which <math>n = x^2 + y^2</math>.<ref>{{harvnb|Weil|2001|p=30}}</ref> When <math>n=65</math>, we have
: <math>\begin{align} 65 &= 1^2 + 8^2,\\
65 &= 4^2 + 7^2,
Line 26 ⟶ 76:
A similar argument shows that for each <math>n</math>, the equation <math>n =x^2+y^2</math> can have only a finite number of solutions since <math>x^2+y^2</math> will exceed <math>n</math> unless the absolute values <math>|x|</math> and <math>|y|</math> are both less than <math>\sqrt{n}</math>. There are only a finite number of pairs satisfying this constraint.
Another ancient problem involving quadratic forms asks us to solve [[Pell's equation]]. For instance, we may seek integers ''x'' and ''y'' so that <math>1 = x^2 - 2y^2</math>. Changing signs of ''x'' and ''y'' in a solution gives another solution, so it is enough to seek just solutions in positive integers. One solution is <math>(x,y) = (3,2)</math>, that is, there is an equality <math>1 = 3^2 - 2 \cdot 2^2</math>. If <math>(x,y)</math> is any solution to <math>1 = x^2 - 2 y^2</math>, then <math>(3x+4y,2x+3y)</math> is another such pair. For instance, from the pair <math>(3,2)</math>, we compute
: <math>(3\cdot 3 + 4 \cdot 2, 2\cdot 3 + 3 \cdot 2) = (17,12)</math>,
Line 42 ⟶ 84:
: <math>\begin{align}
(3 \cdot 17 + 4 \cdot 12, 2 \cdot 17 + 3 \cdot 12) &= (99,70),\\
(3 \cdot 99 + 4 \cdot 70, 2 \cdot 99 + 3 \cdot 70) &= (
&\vdots \end{align}
</math>
These values will keep growing in size, so we see there are infinitely many ways to represent 1 by the form <math>x^2 - 2y^2</math>. This recursive description was discussed in Theon of Smyrna's commentary on [[Euclid's Elements]].
=== The representation problem ===
The oldest problem in the theory of binary quadratic forms is the '''representation problem''': describe the representations of a given number <math>n</math> by a given quadratic form ''f''. "Describe" can mean various things: give an algorithm to generate all representations, a closed formula for the number of representations, or even just determine whether any representations exist.
The examples above discuss the representation problem for the numbers 3 and 65 by the form <math>x^2 + y^2</math> and for the number 1 by the form <math>x^2 - 2y^2</math>. We see that 65 is represented by <math>x^2 + y^2</math> in sixteen different ways, while 1 is represented by <math>x^2 - 2y^2</math> in infinitely many ways and
3 is not represented by <math>x^2+y^2</math> at all. In the first case, the sixteen representations were explicitly described. It was also shown that the number of representations of an integer by <math>x^2+y^2</math> is always finite. The [[sum of squares function]] <math>r_2(n)</math> gives the number of representations of ''n'' by <math>x^2+y^2</math> as a function of ''n''. There is a closed formula<ref>{{harvnb|Hardy|Wright|2008|loc=Thm. 278}}</ref>
: <math> r_2(n) = 4(d_1(n) - d_3(n)), </math>
where <math>d_1(n)</math> is the number of [[divisor]]s of ''n'' that are [[Modular arithmetic|congruent]] to 1 modulo 4 and <math>d_3(n)</math> is the number of divisors of ''n'' that are congruent to 3 modulo 4.
There are several class invariants relevant to the representation problem:
* The set of integers represented by a class. If an integer ''n'' is represented by a form in a class, then it is represented by all other forms in a class.
* The minimum absolute value represented by a class. This is the smallest nonnegative value in the set of integers represented by a class.
* The congruence classes modulo the discriminant of a class represented by the class.
The minimum absolute value represented by a class is zero for degenerate classes and positive for definite and indefinite classes. All numbers represented by a definite form <math>f = ax^2 + bxy + cy^2</math> have the same sign: positive if <math>a>0</math> and negative if <math>a<0</math>. For this reason, the former are called '''positive definite''' forms and the latter are '''negative definite'''.
The number of representations of an integer ''n'' by a form ''f'' is finite if ''f'' is definite and infinite if ''f'' is indefinite. We saw instances of this in the examples above: <math>x^2+y^2</math> is positive definite and <math>x^2 - 2y^2</math> is indefinite.
=== Equivalent representations ===
The notion of equivalence of forms can be extended to '''equivalent representations'''. Representations <math>m = f(x_1,y_1)</math> and <math>n = g(x_2,y_2)</math> are equivalent if there exists a matrix
: <math> \begin{pmatrix} \alpha & \beta \\ \gamma & \delta \end{pmatrix} </math>
with integer entries and determinant 1 so that <math>f(\alpha x + \beta y, \gamma x + \delta y) = g(x,y)</math> and
: <math>\begin{pmatrix} \delta& -\beta \\ -\gamma & \alpha\end{pmatrix} \begin{pmatrix} x_1 \\ y_1 \end{pmatrix} = \begin{pmatrix} x_2 \\ y_2 \end{pmatrix}</math>
The above conditions give a (right) action of the group <math>\mathrm{SL}_2(\mathbb{Z})</math> on the set of representations of integers by binary quadratic forms. It follows that equivalence defined this way is an equivalence relation and in particular that the forms in equivalent representations are equivalent forms.
As an example, let <math>f = x^2 - 2y^2</math> and consider a representation <math>1 = f(x_1,y_1)</math>. Such a representation is a solution to the Pell equation described in the examples above. The matrix
: <math> \begin{pmatrix} 3 & -4 \\ -2 & 3 \end{pmatrix} </math>
has determinant 1 and is an automorphism of ''f''. Acting on the representation <math>1 = f(x_1,y_1)</math> by this matrix yields the equivalent representation <math>1 = f(3x_1 + 4y_1, 2x_1 + 3 y_1)</math>. This is the recursion step in the process described above for generating infinitely many solutions to <math>1 = x^2 - 2y^2</math>. Iterating this matrix action, we find that the infinite set of representations of 1 by ''f'' that were determined above are all equivalent.
There are generally finitely many equivalence classes of representations of an integer ''n'' by forms of given nonzero discriminant <math>\Delta</math>. A complete set of [[representative (mathematics)|representative]]s for these classes can be given in terms of ''reduced forms'' defined in the section below. When <math>\Delta < 0</math>, every representation is equivalent to a unique representation by a reduced form, so a complete set of representatives is given by the finitely many representations of ''n'' by reduced forms of discriminant <math>\Delta</math>. When <math>\Delta > 0</math>, Zagier proved that every representation of a positive integer ''n'' by a form of discriminant <math>\Delta</math> is equivalent to a unique representation <math>n = f(x,y)</math> in which ''f'' is reduced in Zagier's sense and <math>x > 0</math>, <math>y \geq 0</math>.<ref>{{harvnb|Zagier|1981|loc=}}</ref> The set of all such representations constitutes a complete set of representatives for equivalence classes of representations.
== Reduction and class numbers<!--'Class number (binary quadratic forms)' redirects here--> ==
Lagrange proved that for every value ''D'', there are only finitely many classes of binary quadratic forms with discriminant ''D''. Their number is the '''{{vanchor|class number}}'''<!--boldface per WP:R#PLA--> of discriminant ''D''. He described an algorithm, called '''reduction''', for constructing a canonical representative in each class, the '''reduced form''', whose coefficients are the smallest in a suitable sense.
Gauss gave a superior reduction algorithm in ''[[Disquisitiones Arithmeticae]]'', which
== Composition ==
'''Composition''' most commonly refers to a [[binary operation]] on primitive equivalence classes of forms of the same discriminant, one of the deepest discoveries of Gauss, which makes this set into a finite [[abelian group]] called the '''form class group''' (or simply class group) of discriminant <math>\Delta</math>. [[Class group]]s have since become one of the central ideas in algebraic number theory. From a modern perspective, the class group of a fundamental discriminant <math>\Delta</math> is [[isomorphic]] to the [[narrow class group]] of the [[quadratic field]] <math>\mathbf{Q}(\sqrt{\Delta})</math> of discriminant <math>\Delta</math>.<ref>{{harvnb|Fröhlich|Taylor|1993|loc=Theorem 58}}</ref> For negative <math>\Delta</math>, the narrow class group is the same as the [[ideal class group]], but for positive <math>\Delta</math> it may be twice as big.
"Composition" also sometimes refers to, roughly, a binary operation on binary quadratic forms. The word "roughly" indicates two caveats: only certain pairs of binary quadratic forms can be composed, and the resulting form is not well-defined (although its equivalence class is). The composition operation on equivalence classes is defined by first defining composition of forms and then showing that this induces a well-defined operation on classes.
"Composition" can also refer to a binary operation on representations of integers by forms. This operation is substantially more complicated{{citation needed|date=March 2017}}<!--<ref>{{harvnb|Shanks|1989}}</ref> full citation not in article yet --> than composition of forms, but arose first historically. We will consider such operations in a separate section below.
Composition means taking 2 quadratic forms of the same discriminant and combining them to create a quadratic form of the same discriminant, as follows from [[Brahmagupta's identity]].
=== Composing forms and classes ===
A variety of definitions of composition of forms has been given, often in an attempt to simplify the extremely technical and general definition of Gauss. We present here Arndt's method, because it remains rather general while being simple enough to be amenable to computations by hand. An alternative definition is described at [[Bhargava cube]]s.
Suppose we wish to compose forms <math>f_1 = A_1 x^2 + B_1 xy + C_1 y^2</math> and <math>f_2 = A_2 x^2 + B_2 xy + C_2 y^2</math>, each primitive and of the same discriminant <math>\Delta</math>. We perform the following steps:
# Compute <math>B_\mu = \tfrac{B_1 + B_2}{2}</math> and <math> e = \gcd(A_1, A_2, B_\mu)</math>, and <math>A = \tfrac{A_1 A_2}{e^2}</math>
# Solve the system of congruences <blockquote><math>\begin{align} x &\equiv B_1 \pmod{2 \tfrac{A_1}{e}}\\ x &\equiv B_2 \pmod{2 \tfrac{A_2}{e}}\\ \tfrac{B_\mu}{e} x &\equiv \tfrac{\Delta + B_1 B_2}{2e} \pmod{2A} \end{align} </math></blockquote>It can be shown that this system always has a unique integer solution modulo <math>2A</math>. We arbitrarily choose such a solution and call it ''B''.
# Compute ''C'' such that <math>\Delta = B^2 - 4AC</math>. It can be shown that ''C'' is an integer.
The form <math>Ax^2 + Bxy + Cy^2</math> is "the" composition of <math>f_1</math> and <math>f_2</math>. We see that its first coefficient is well-defined, but the other two depend on the choice of ''B'' and ''C''. One way to make this a well-defined operation is to make an arbitrary convention for how to choose ''B''—for instance, choose ''B'' to be the smallest positive solution to the system of congruences above. Alternatively, we may view the result of composition, not as a form, but as an equivalence class of forms modulo the action of the group of matrices of the form
: <math>\begin{pmatrix} 1 & n\\ 0 & 1\end{pmatrix}</math>,
where ''n'' is an integer. If we consider the class of <math>Ax^2 + Bxy + Cy^2</math> under this action, the middle coefficients of the forms in the class form a congruence class of integers modulo 2''A''. Thus, composition gives a well-defined function from pairs of binary quadratic forms to such classes.
It can be shown that if <math>f_1</math> and <math>f_2</math> are equivalent to <math>g_1</math> and <math>g_2</math> respectively, then the composition of <math>f_1</math> and <math>f_2</math> is equivalent to the composition of <math>g_1</math> and <math>g_2</math>. It follows that composition induces a well-defined operation on primitive classes of discriminant <math>\Delta</math>, and as mentioned above, Gauss showed these classes form a finite abelian group. The [[identity element|identity]] class in the group is the unique class containing all forms <math>x^2 + Bxy + Cy^2</math>, i.e., with first coefficient 1. (It can be shown that all such forms lie in a single class, and the restriction <math>\Delta \equiv 0 \text{ or } 1 \pmod{4}</math> implies that there exists such a form of every discriminant.) To [[inverse element|invert]] a class, we take a representative <math>Ax^2 + Bxy + Cy^2</math> and form the class of <math>Ax^2 - Bxy + Cy^2</math>. Alternatively, we can form the class of <math>Cx^2 + Bxy + Ay^2</math> since this and <math>Ax^2 - Bxy + Cy^2</math> are equivalent.
== Genera of binary quadratic forms ==
Gauss also considered a coarser notion of equivalence, with each coarse class called a '''genus''' of forms. Each genus is the union of a finite number of equivalence classes of the same discriminant, with the number of classes depending only on the discriminant. In the context of binary quadratic forms, genera can be defined either through congruence classes of numbers represented by forms or by '''genus characters''' defined on the set of forms. A third definition is a special case of the [[genus of a quadratic form]] in n variables. This states that forms are in the same genus if they are locally equivalent at all rational primes (including the [[Algebraic number field#Places|Archimedean place]]).
== History ==
There is circumstantial evidence of protohistoric knowledge of algebraic identities involving binary quadratic forms.<ref>{{harvnb|Weil|2001|loc=Ch.I §§VI, VIII}}</ref> The first problem concerning binary quadratic forms asks for the existence or construction of representations of integers by particular binary quadratic forms. The prime examples are the solution of [[Pell's equation]] and the representation of integers as
The general theory of quadratic forms was initiated by [[Lagrange]] in 1775 in his ''[[List of important publications in mathematics#Recherches d'Arithmétique|Recherches d'Arithmétique]]''. Lagrange was the first to realize that "a coherent general theory required the simulatenous consideration of all forms."<ref>{{harvnb|Weil|2001|loc=p.318}}</ref> He was the first to recognize the importance of the discriminant and to define the essential notions of equivalence and reduction, which, according to Weil, have "dominated the whole subject of quadratic forms ever since".<ref>{{harvnb|Weil|2001|loc=p.317}}</ref> Lagrange showed that there are finitely many equivalence classes of given discriminant, thereby defining for the first time an arithmetic [[Ideal class group|class number]]. His introduction of reduction allowed the quick enumeration of the classes of given discriminant and foreshadowed the eventual development of [[infrastructure (number theory)|infrastructure]]. In 1798, [[Adrien-Marie Legendre|Legendre]] published ''Essai sur la théorie des nombres'', which summarized the work of Euler and Lagrange and added some of his own contributions, including the first glimpse of a composition operation on forms.
The theory was vastly extended and refined by [[Carl Friedrich Gauss|Gauss]] in Section V of ''[[List of important publications in mathematics#Disquisitiones Arithmeticae|Disquisitiones Arithmeticae]]''. Gauss introduced a very general version of a [[composition operator]] that allows composing even forms of different discriminants and imprimitive forms. He replaced Lagrange's equivalence with the more precise notion of proper equivalence, and this enabled him to show that the primitive classes of given discriminant form a [[group (mathematics)|group]] under the composition operation. He introduced genus theory, which gives a powerful way to understand the quotient of the class group by the subgroup of squares. (Gauss and many subsequent authors wrote 2''b'' in place of ''b''; the modern convention allowing the coefficient of ''xy'' to be odd is due to [[Gotthold Eisenstein|Eisenstein]]).
These investigations of Gauss strongly influenced both the arithmetical theory of quadratic forms in more than two variables and the subsequent development of algebraic number theory, where quadratic fields are replaced with more general [[number field]]s. But the impact was not immediate. Section V of ''Disquisitiones'' contains truly revolutionary ideas and involves very complicated computations, sometimes left to the reader. Combined, the novelty and complexity made Section V notoriously difficult. [[Dirichlet]] published simplifications of the theory that made it accessible to a broader audience. The culmination of this work is his text ''[[List of important publications in mathematics#Vorlesungen über Zahlentheorie|Vorlesungen über Zahlentheorie]]''. The third edition of this work includes two supplements by [[Dedekind]]. Supplement XI introduces [[ring theory]], and from then on, especially after the 1897 publication of [[Hilbert|Hilbert's]] ''[[List of important publications in mathematics#Zahlbericht|Zahlbericht]]'', the theory of binary quadratic forms lost its preeminent position in [[algebraic number theory]] and became overshadowed by the more general theory of [[algebraic number fields]].
Even so, work on binary quadratic forms with integer coefficients continues to the present. This includes numerous results about quadratic number fields, which can often be translated into the language of binary quadratic forms, but also includes developments about forms themselves or that originated by thinking about forms, including [[Daniel Shanks|
==See also==
* [[Bhargava cube]]
*[[Fermat's theorem on sums of two squares]]
* [[Legendre symbol]]
* [[Brahmagupta's identity]]
==Notes==
{{reflist|30em}}
==References==
* Johannes Buchmann, Ulrich Vollmer: ''Binary Quadratic Forms'', Springer, Berlin 2007, {{ISBN
* Duncan A. Buell: ''Binary Quadratic Forms'', Springer, New York 1989
* David A. Cox, ''Primes of the form <math>x^2 + ny^2</math>, Fermat, class field theory, and complex multiplication''
* {{Citation
| last=Cohen
Line 125 ⟶ 209:
| last= Fröhlich
| first = Albrecht
|
| last2=Taylor
| first2=Martin
|
| title=Algebraic number theory
| publisher=[[Cambridge University Press]]
Line 137 ⟶ 221:
| volume=27
}}
* {{
| last1=Hardy
| first1=G. H.
Line 144 ⟶ 228:
| first2=E. M.
| author2-link=E. M. Wright
| edition=
| others=
| title=An Introduction to the Theory of Numbers
| publisher=
| ___location=Oxford
| isbn= 978-0-19-921986-5
| mr= 2445243
| zbl= 1159.11001
| year=2008
| orig-year=1938
}}
* {{Citation
| last = Weil
| first = André
|
| title = Number Theory: An approach through history from Hammurapi to Legendre
| publisher = Birkhäuser Boston
Line 167 ⟶ 250:
| last = Zagier
| first = Don
|
| title = Zetafunktionen und quadratische Körper: eine Einführung in die höhere Zahlentheorie
| publisher = Springer
Line 175 ⟶ 258:
==External links==
* [http://oeis.org/wiki/User:Peter_Luschny/BinaryQuadraticForms Peter Luschny, Positive numbers represented by a binary quadratic form]
* {{eom|id=b/b016370|author=A. V. Malyshev|title=Binary quadratic form}}
[[Category:Quadratic forms]]
|