on the moduli space of semistable sheavesybilu/algant/documents/...pairs (v,ϕ) where v is an...
TRANSCRIPT
-
On the moduli space of semistable sheaves
Author:
Matija Tapušković
Supervisor:
Prof. Qing Liu
Master Thesis
July 2016
-
1
-
Contents
Introduction 4
1 Moduli problems 7
1.1 The moduli functor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2 Coarse moduli space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3 Fine moduli space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.4 Pathological cases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2 The Quot scheme 14
2.1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.2 The Quot functor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.3 The Grassmannian . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.4 Representability theorem of Altman and Kleiman . . . . . . . . . . . . . . . . . . . 22
2.5 Construction of the Quot scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3 Geometric Invariant Theory 30
3.1 Actions of algebraic groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
3.2 Nice quotients . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
3.3 Hilbert’s 14th problem and Nagata’s theorem . . . . . . . . . . . . . . . . . . . . . . 35
3.4 Constructing the projective GIT quotient . . . . . . . . . . . . . . . . . . . . . . . . 36
3.5 G-linearization of quasi-coherent OX -modules . . . . . . . . . . . . . . . . . . . . . . 39
3.6 The Hilbert-Mumford criterion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
4 Stability of sheaves 47
4.1 Notions of stability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.2 Filtrations of sheaves and S-equivalence . . . . . . . . . . . . . . . . . . . . . . . . . 50
2
-
5 Construction of the moduli space of semistable sheaves 52
5.1 Boundedness of semistable sheaves . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
5.2 The moduli functor of semistable sheaves . . . . . . . . . . . . . . . . . . . . . . . . 58
5.3 Semistable sheaves as points on a Quot scheme . . . . . . . . . . . . . . . . . . . . . 59
5.4 Preliminaries for the GIT construction . . . . . . . . . . . . . . . . . . . . . . . . . . 60
5.5 Linearizing the action of SL(V ) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
5.6 Construction of the moduli space of semistable sheaves . . . . . . . . . . . . . . . . 64
5.6.1 Statement of the results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
5.6.2 Analysis of GIT semistability . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
5.6.3 Analysis of sheaf semistability - Le Potier’s theorem . . . . . . . . . . . . . 69
5.6.4 Proof of Theorem 12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Conclusion 74
3
-
Introduction
Moduli problems are, in a certain sense, extended classification problems. Suppose we have
defined a certain class M of geometric objects that we wish to classify up to some defined
equivalence. We might ask if some geometric space, such as that of a scheme or a variety, might
exists, such that its points correspond toMmodulo the equivalence. Moreover, since in algebraic
geometry many objects of interest vary in algebraically defined families, it would be useful if
the geometry of this space was in some way natural with respect to these families. If we can
construct such a space, we may obtain additional information about the objects that we started
with by studying the global and local geometry of the space parametrizing them. One can bear
in mind some simple examples: the j-invariant classifying elliptic curves up to isomorphism, the
Grassmannian variety, which prametrizes linear subspaces with a given dimension of a vector
space V , or the Jacobian variety of a non-singular projective algebraic curve, which parametrizes
degree 0 line bundles on the given curve.
In fact, a more general version of this last example — the moduli space of vector bundles on
a curve of a given rank and degree (or equivalently, of a given first Chern class) — was studied
by Mumford in the early ’60s. This led him to the concept of stability of vector bundles and
Geometric Invariant Theory, which is still an invaluable tool in the theory of moduli. Attention
then naturally turned to vector bundles on surfaces, and to higher dimensional varieties. In fact,
the problem of determining which cohomology classes on a projective variety can be realized
as Chern classes of vector bundles is a profound and important problem in algebraic geometry.
It is natural then to try to describe the moduli space of vector bundles with a fixed rank and
Chern classes on an arbitrary projective variety.
As additional motivation, it is worth mentioning that the study of moduli spaces often
appears in the applications of algebraic geometry to number theory, such as in the work of
Faltings on the original proof of the Mordell Conjecture, as well as in Wiles’ proof of Fermat’s
Last Theorem. Moreover, a shining example of its use in connection to differential geometry
and mathematical physics is in the work of Donaldson, who used the moduli space of instantons
(certain connections in a principal bundle over a 4-dimensional Riemannian manifold) to define
4
-
certain invariants as the intersection numbers on the moduli space. These invariants depend on
the differentiable structure of the manifold, and can be used to construct homeomorphic smooth
4-manifolds which are not diffeomorphic.
If the previous discussion provides ample motivation to study moduli spaces, it also hints
at the vastness of the field. Therefore, we must focus in this text on a manageable selection of
topics. In Chapter 1, a general introduction to the theory of moduli spaces is made, along with
some simple examples. In Chapter 2, we first introduce some preliminaries, most importantly the
notion of flat families, and then turn to our first serious, and highly useful, example of a moduli
space — the Quot scheme of Grothendieck. The Quot scheme parametrizes quotients of a fixed
coherent sheaf on a fixed projective scheme. Its existence was demonstrated by Grothendieck in
his Séminarie Bourbaki talks, and has since been used extensively in constructing other moduli
spaces. Its special case is the Hilbert scheme, which parametrizes closed subschemes of a given
scheme. The Hilbert scheme is important in its own right, as illustrated for example by the work
of Beauville, who studied the Hilbert scheme of n points on a K3 surface, and proved that it is
an irreducible hyperkähler manifold, thus giving the first example of such a manifold in higher
dimensions ( dim > 2). This example also illustrates the common situation of moduli spaces
having a rich and interesting geometry. The Quot scheme plays a crucial role in the rest of the
text, as our main result is obtained by taking its quotient by a certain action of an algebraic
group. However, before we can get to that, we must turn to Mumford and his Geometric
Invariant Theory in Chapter 3, as an understanding of its inner workings is necessary for the
main construction. Next, in Chapter 4, the various notions of stability for sheaves are introduced
and briefly discussed. It is necessary to limit our attention to the family of semistable sheaves
on the fiexed projective variety, in order to obtain boundedness, which we do at the beginning
of Chapter 5, following the arguments of Le Potier [10] and Simpson [18]. This result enables us
to see a semistable sheaf as a point on a certain Quot scheme, and to follow Simpson’s approach
in constructing the moduli space.
The main sources for this thesis were N. Nitsure’s article on the construction of the Quot
scheme [16], Mumford’s book on GIT [13], and Huybrechts and Lehn’s book [8] on the construc-
tion of the moduli space of semistable sheaves. The latter contains all the prerequisites for the
main result, though in a compressed form, which is why the other two books proved very useful.
Additionally, Le Potier and Newstead’s books [15, 9] , which deal with vector bundles on an
algebraic curve (with a chapter on semistable sheaves on the projective plane in [9]), were helpful
in grasping the special case, so that one might more easily understand the general one, which is
the topic of this text. Finally, Liu’s book [11] was essential in grasping the necessary elements
5
-
of scheme theory, as it was the author’s companion through his first course on the topic, and
throughout the writing of this text.
6
-
Chapter 1
Moduli problems
Let us give some elementary definitions, in order to formalise the concepts from the introduction
in the language of category theory. We will then illustrate them with simple examples.
1.1 The moduli functor
Definition 1. Let M be a collection of objects, and ∼ an equivalence relation defined on M.
For any S-scheme T , let MT be the set of families parametrized by T , with an equivalence
relation ∼T defined onMT . Then, a moduli functor is a contravariant functor from the category
of S-schemes, which we will denote Sch/S to the category of sets (i.e. a presheaf on Sch/S)
M ∶ (Sch/S)o → Set
which to each S-scheme T associates the set of equivalence classes of families of elements ofM
parametrized by T , and to each morphism of S-schemes T → T ′ a pullback map f∗ ∶ M(T ′) →
M(T ).
Example 1. (The Grassmannian of a vector bundle) Let S be a scheme, V a vector bundle on
S and r a positive integer less then the rank of V . Define
Grass(S,V, r) ∶ (Sch/S)o → Set
be the contravariant functor that associates to an S-scheme T vector subbundles of T ×S V
of rank r, with the equivalence relation given by equality, and to a morphism f ∶ T → T ′ of
S-schemes associates the pullback f∗.
7
-
1.2 Coarse moduli space
Definition 2. Given a moduli functor M we say that an S-scheme M is a coarse moduli scheme
for the functor M (or the class of objectsM) if there is a natural transformation of functors
Φ ∶M → HomS(−,M)
such that:
1. Φ(Speck) ∶ M(Speck) → HomS(Speck,M) is a bijection for every algebraically closed
field k,
2. For any S-scheme N and any natural transformation
Ψ ∶M → HomS(−,N)
there is a unique morphism of S-schemes e ∶M → N such that Ψ = he ○Φ
If only condition 2. is satisfied we say that M corepresents the functor M .
Remark 1. Let us unravel this categorical definition a bit. Consider the case when our base
scheme is simply a field k. Then condition 1. tells us that the elements of M, considered
as trivial families over Speck are in one-to-one correspondence with the rational points of M .
From the fact that Φ is a morphism of functors we get that for any family E ∈M(T ) its image
ϕE ∈ Homk(T,M) has the property that at each rational point t ∈ T , the point ϕE(t) ∈ M
corresponds to the equivalence class of the fiber Et ∈ M. We get this from the commutative
square 1.1 that is obtained when considering functoriality for the morphism q ∶ Speck → T
sending the point to t, combined with point 1. The condition 2. implies uniqueness up to
unique isomorphism of M if it exists: let (M,Φ) and (M ′,Φ′) be two coarse moduli spaces,
then by 2. we have morphisms f ∶ M → M ′ and f ′ ∶ M ′ → M . Since Φ = hf ′ ○ hf ○ Φ and
Φ = hidM ○ Φ we have by the uniqueness in 2. and the Yoneda lemma that f ′ ○ f = idM , and
similarly for f ○ f ′ = idM ′ .
M(T ) Homk(T,M)
M(Speck) Homk(Speck,M)
q∗ −○q (1.1)
Definition 3. IfM is a coarse moduli scheme for the moduli problemM, we define a tautological
family for M to be the family T such that for each k-point m, for any algebraically closed
field k, the fiber Tm is the element of M corresponding to m by the bijection M(Speck) →
HomS(Speck,M) above.
8
-
Example 2. Let us consider now a simple example of a coarse moduli space. LetM consist of
quadruples of ordered distinct points on the projective line. We recall that the automorphism
group of P1 is the projective linear group PGL2, and we define that two ordered quadruples of
distinct points (p1, p2, p3, p4) and (q1, q2, q3, q4) on P1 are equivalent ((p1, ..., p4) ∼ (q1, ..., q4)) if
there exists an automorphism f ∶ P1 → P1 such that f(pi) = f(qi) for all i = 1, ...,4. We recall
that for any 3 distinct points (p1, p2, p3) on P1, there exists a unique automorphism f ∈ PGL2which sends (p1, p2, p3) to (0,1,∞). Furthermore, we have a projective invariant associated to
any quadruple of distinct points on the projective line, the cross-ratio, defined by
(p1p2 ∶ p3p4) =(ξ1µ3 − ξ3µ1)(ξ2µ4 − ξ4µ2)(ξ1µ4 − ξ4µ1)(ξ2µ3 − ξ3µ2)
where pi = [ξi, µi], i = 1, ...,4. We can see that this is invariant by the action of PGL2 as
scaling any pair [ξi, µi] scales the numerator and the denominator by the same factor, and any
projective transformation⎛⎜⎝
ξi ξj
µi µj
⎞⎟⎠↦⎛⎜⎝
a b
c d
⎞⎟⎠
⎛⎜⎝
ξi ξj
µi µj
⎞⎟⎠
scales the cross terms (ξiµj − ξjµi) by the nonzero determinant ad − bc.
Now, as we said, we can move p1, p2, p3 by some element of f ∈ PGL2 to [1,0], [0,1], [1,1],
leaving the cross-ratio unchanged. Then
(p1p2 ∶ p3p4) = ([1,0][0,1] ∶ [1,1]f(p4)) =(1 ⋅ 1 − 0 ⋅ 1)(0 ⋅ µ̂4 − ξ̂4 ⋅ 1)(1 ⋅ µ̂4 − 0 ⋅ µ̂4)(0 ⋅ 1 − 1 ⋅ 1)
= ξ̂4µ̂4
where f(p4) = [ξ̂4, µ̂4]. Thus we get that the cross-ratio is given by f(p4) ∈ P1 − (0,1,∞).
Therefore, we see that the setM/ ∼ is in bijection with the set of k-points in the quasi-projective
variety P1 − (0,1,∞).
We can speak naturally about families of 4 distinct points on P1 over a scheme S. A family is
given by a proper flat morphism π ∶X → S, such that the fibers π−1(s) ≅ P1 are smooth rational
curves, and 4 disjoint sections (σ1, ..., σ4) of π. We say that two families (π ∶ X → S,σ1, ..., σ4)
and (π′ ∶ X ′ → S,σ′1, ..., σ′4) are equivalent if there is an isomorphism f ∶ X → X ′ over S (i.e.
π = π′ ○ f) such that f ○ σi = σ′i.
There exists a tautological family parametrized by P1−(0,1,∞). Let π ∶ P1−(0,1,∞)×P1 →
P1−(0,1,∞) be the projection map and choose sections (σ1(s) = 0, σ2(s) = 1, σ3(s) =∞, σ4(s) =
s). This family is in fact a universal family which makes P1 − (0,1,∞) a fine moduli space -
both to be defined shortly.
9
-
1.3 Fine moduli space
Definition 4. If the functor M is isomorphic to a functor of the form hM = Hom(−,M), then
we say that M is a representable functor, or that it is represented by M . In the case of moduli
functors we call the scheme M that represents a moduli functor M a fine moduli space for M
(or forM).
Definition 5. If M → Hom(−,M) is an isomorphism, then in particular we have a bijection
M(M)→ Hom(M,M), and there is a family U parametrized by M corresponding to the identity
map idM ∈ Hom(M,M). U is called the universal family associated to the fine moduli space
M . This family has the following special property. As any family E ∈ M(T ) parametrized by
a S-scheme T corresponds to a unique morphism p ∶ T → M , called the classifying morphism,
via Φ, and as the families p∗U and E correspond to the same classifying morphism idM ○ p = p,
because Φ is a natural transformation of functors we have
p∗U = E
that is any family can be obtained by pulling back the universal family. Conversely, if there
exists a scheme M parametrizing a family U with such a property, then M represents the functor
M .
Remark 2. If a moduli functor has a fine moduli space, then it is also a coarse moduli space
for that functor. In fact, we have the bijection M(Speck) → Hom(Speck,M) for any al-
gebraically closed k, since M represents the functor M , and one easily checks the universal
property. Conversely, if we have a coarse moduli space (M,Φ) such that there exists a family
U over M such that ΦM(U) = idM and for any two families F ,G over some scheme S we have
F ∼S G ⇐⇒ ΦS(F) = ΦS(G), then M is a fine moduli space.
1.4 Pathological cases
Of course, a fine moduli space may fail to exist. This is why we weaken our requirements and
come to the notion of a coarse moduli space. However, sometimes even a coarse moduli space
may fail to exist for a problem. In this subsection, we will consider a couple of pathologies which
prevent a moduli functor from having a moduli space.
The first situation in which a coarse moduli space fails to exist is when we have a so called
jump phenomenon for our moduli problem. Consider the following lemma describing this situa-
tion.
10
-
Lemma 1. Let M be a moduli functor and suppose there exists a family F ∈M(T ), where T is
an integral scheme of dimension at least 1 of finite type over k, such that Ft ∼ Ft1 for all t /= t0and Ft0 /∼ Ft1 (i.e. all fibers over T are isomorphic except one). Then for any scheme M and
natural transformation Φ ∶ M → hM we have that the classification morphism ΦT (F) ∶ T → M
is constant. In particular, there is no coarse moduli space for this problem.
Proof. By assumption we have a natural transformation Φ ∶M → hM , so we consider a morphism
f ∶ T → M given by ΦT (F). For any t ∶ Speck → T we have that f ○ t = ΦSpeck(Ft) by
functoriality, and Ft = Ft1 ∈ M(Speck), for t /= t0, so we have that f ∣T−{t0} is a constant map.
Let m ∶ Speck →M be the point corresponding to the equivalence class of Ft1 under Φ. Since the
M is integral of finite type over k all k-points are closed, so their preimages must also be closed.
Then, since T −{t0} ⊂ f−1(m), the closure T of T −{t0} must also be contained in f−1(m), so f
is a constant map, mapping T to m ∈M . In particular Φ(Speck) ∶M(Speck) → hM(Speck) is
not a bijection since Ft0 /∼ Ft1 ∈M(Speck) but they correspond to the same k-point m ∈M .
Example 3. Lets look at an example of a jump phenomenon. We consider the moduli problem
of classifying endomorphisms of an n-dimensional vector space. Our collection M consists of
pairs (V,ϕ) where V is an n-dimensional k-vector space and ϕ is an endomorphism of V . We
define an equivalence (V,ϕ) ∼ (V ′, ϕ′) when there exists an isomorphism h ∶ V → V ′ compatible
with the endomorphisms h ○ ϕ = ϕ′ ○ h. We extend the problem over any scheme of finite type
over k by defining a family over a scheme S to be a rank n vector bundle F over S with an
endomorphism ϕ ∶ F → F . Then we say that two families over S are equivalent if there exists
an isomorphism compatible with the endomorphisms as before.
For any n ≥ 2 we can construct a family which exhibits a jump phenomenon. Let n = 2 and
consider a family over A1 given by F = (O⊕2A1 , ϕ), where for s ∈ A1
ϕs =⎛⎜⎝
1 s
0 1
⎞⎟⎠
For any two s, t /= 0 these matrices are similar so we have an equivalence of families Ft ∼ Fs.
However F0 /∼ F1 since these matrices have distinct Jordan normal forms. Thus we have a jump
phenomenon and no coarse moduli space.
Apart from there being a jump phenomenon, a situation may occur that there is no family
over a scheme T which parametrises all objects in the moduli problem. In this case we say that
the problem is unbounded.
Example 4. Consider the moduli problem of vector bundles over P1 of rank 2 and degree 0.
Suppose there exists a family F over a scheme T such that for any vector bundle E on P1 of
11
-
rank 2 and degree 0 there is a k-point t ∈ T such that Ft ≅ E . For each n ∈ N we have a rank 2
degree 0 vector bundle OP1(n)⊕OP1(−n). Moreover
dimH0(P1,OP1(n)⊕OP1(−n)) = dimk(k[x0, x1]n, k[x0, x1]−n) =⎧⎪⎪⎪⎪⎨⎪⎪⎪⎪⎩
2, n = 0
n + 1, n ≥ 1
Now consider the subschemes Tn ∶= {t ∈ T ∶ dimH0(P1,Ft) ≥ n} of T , which are closed by
the semicontinuity theorem (see [6] III Theorem 12.8). We obtain a decreasing chain of closed
subschemes
S = S2 ⫌ S3 ⫌ S4 ⫌ ...
each of which is distinct since OP1(n) ⊕ OP1(−n) ∈ Sn+1 − Sn+2. But we assumed that T is
Noetherian, so this chain should not exist. We’ve arrived at a contradiction and thus shown
that the moduli problem of rank 2 degree 0 vector bundles is unbounded.
Let us also consider a necessary condition for the existence of a fine moduli space for a given
moduli functor. It is based on the fact that for a scheme X, the functor hX satisfies descent
under faithfully flat quasi-compact coverings. Descent theory, developed by Grothendieck, is an
important topic that we are not able to cover here, but let us at least sketch the problem that
arises. Suppose that (fi ∶ Ui → U) is an open cover of U in faithfully flat quasi-compact topology,
or fpqc for short. What is meant here by "topology" is a Grothendieck topology in which we
consider a family of morphisms (fi ∶ Ui → U) to be a covering family for an affine scheme U if
they are jointly surjective with each Ui affine and each fi a flat morphism. For an arbitrary U ,
we say that (fi ∶ Ui → U) is a covering family in the fpqc topology if it is a covering family in
the previous sense after base changing to an open affine subset of U . Then, we say that a set
valued functor F satisfies faithfully-flat descent if for any covering family (fi ∶ Ui → U) in the
fpqc topology it satisfies the sheaf condition i.e. if the following sequence is exact
F (U)→∏i
F (Ui)→→∏i,j
F (Ui ×U Uj)
Let us now consider the problem that Mumford considered. Let C be a connected smooth
projective algebraic curve of genus g ≥ 2 over k = k̄ an algebraically closed field, and let S be a
k-scheme. Denote by
Fr,d(T ) ∶= {E a vector bundle on C ×k T of rank r and degree d}/ ≅
where ≅ is the isomorphism of vector bundles, the moduli functor we are trying to represent.
Suppose that a k-scheme M represents this functor. Then we would have, by definition of fine
moduli space a bijection of sets
{ϕ ∶ T →M ∶ ϕ a k-morphism } = {E a vector bundle on C ×k T of rank r and degree d}/ ≅
12
-
To see why this cannot happen, we consider how the two sides of the above bijection behave
with respect to descent. Suppose we have a cover in fpqc topology (fi ∶ Ti → T ). For a k-scheme
M a k-morphism ϕ ∶ T → M is given by a k-morphism ϕi ∶ Ti → M for every i, satisfying the
condition ϕi∣Ti×kTj = ϕj ∣Ti×kTj for all i, j. For simplicity one can consider the Zariski topology
on T and note that the fiber product is the same as intersection in the cateogry of sets, so the
above is simply a generalization to fpqc topology of the situation where we have a covering of T
and ϕ is defined on the elements of the cover with the condition being that the maps coincide
on overlaps. Consider now vector bundles. A vector bundle E on C ×k T is given by:
- a vector bundle Ei on C ×k Ti for each i
- an isomorphism αi,j ∶ Ei ≅ Ej over C ×k Ti × Tj for each i, j
- satisfying the cocycle condition αi,k = αj,k ○ αi,j over C ×k Ti × Tj × Tk for all i, j, k
We can see from this explicitly the two sides of the above bijection behave differently under
"gluing", so F cannot be represented by any scheme M . We would have an exception to
this situation if we would consider isomorphism classes of objects without automorphisms. In
this case, the above isomorphisms αi,j are unique if they exist, so the cocycle condition is
automatically satisfied, so the sheaf condition for the fpqc topology is satisfied. This still doesn’t
guarantee representability, but in many cases it can be established. To deal with this problem
Mumford and Deligne introduced stacks, by following up on an idea that we should keep track
of both isomorphism classes as well as the isomorphisms themselves, even though we are only
interested in parametrizing isomorphisms classes, in order to understand "gluing". Therefore,
the moduli space should not be a set endowed with some geometric structure, but rather a
grupoid (of objects and their isomorphisms) i.e. a category in which every morphism is invertible,
endowed with some geometric structure. This is roughly the idea of stacks.
13
-
Chapter 2
The Quot scheme
The Quot scheme is a fine moduli space which generalises the Grassmannian in the sense that
it parametrises quotients of a fixed sheaf on a fixed projective scheme. Its existence was shown
by Grothendieck in [4, 5]. The Quot scheme, and its special case the Hilbert scheme, are of
fundamental importance in the theory of moduli spaces. We will use it in the sections to follow
to construct moduli spaces of semistable sheaves. In this section, we will define the moduli
problem that the Quot scheme represents and present its construction following [17].
2.1 Preliminaries
In this section, we will recall the notions from scheme theory that will be used often in what
follows. These notions are elementary and can be found in any textbook on the subject such
as Harshorne [6] and Liu [11]. We will use some elementary properties of these notions without
proof, and in those cases provide references to these textbooks. The choice for providing the def-
initions here was made for the purpose of completeness, and with a view towards an elementary
treatment of the subject matter.
Definition 6. Proper and locally closed morphisms Let f ∶ X → Y be a morphism of schemes.
f is said to be of finite type if f is quasi-compact (i.e. the inverse image of any affine open is
quasi-compact), and if for every affine open subset V ⊂ Y , and every affine open subset U of
f−1(V ), the canonical homomorphism OY (V )→ OX(U) makes OX(U) into a finitely generated
OY (V )-algebra. In fact, for a morphism to be of finite type, it is enough to be able cover Y
with finitely many affine open subsets such that their inverse images in X can again be covered
by a finite number of affine open subsets such that the same finiteness property of the canonical
morphism of rings is satisfied. We say that a scheme is of finite type if its structure morphism
14
-
is of finite type.
Now, define the diagonal morphism of f as (IdX , IdX) ∶ X → X ×Y X. Denote it by ∆X/Y .
We say that X is separated over Y if ∆X/Y is a closed immersion of schemes. We say that X is
separated if it is separated over Z.
Morphism f ∶X → Y is universally closed if it is closed and stays closed under base change.
A proper morphism of topological spaces is a morphism such that an inverse image of a
compact subset is compact. In the setting of schemes we generalize this to the following: we say
that a morphism f ∶X → Y is proper if it is of finite type, separated, and universally closed. We
say that a Y scheme is proper if the structural morphism is proper.
A morphism of schemes f ∶X → Y is called a locally closed immersion, or simply an immer-
sion, if it can be factored as j ○ i where i is a closed immersion and j is an open immersion.
Now we move on to some definitions related to sheaves.
Definition 7. An OX -module L is called invertible if for every x ∈X there exists an open neigh.
x ∈ U and an iso of OU -modules OX ∣U ≅ L∣U . If X is locally Noetherian this is the same as
saying L is coherent and Lx is of rank 1 over OX,x. A sheaf F is called free of rank n if it is
isomorphic to OnX . It is called locally free of finite rank if there exists a covering of X = ∪Xi and
integers ni such that F ∣Xi is free of rank ni for each i. We say it is locally free of rank n if ni = n
for all i. Locally free sheaves correspond to vector bundles, if one considers them as sheaves of
sections of the given vector bundle.
An OX -module F is called quasi-coherent if it has a local presentation, that is, if for any
x ∈X, there exists an open neighbourhood U of x and an exact sequence of OX -modules
O(J)X ∣U → O(I)X ∣U → F ∣U → 0
for some sets J and I, possibly infinite.
We say that F is finitely generated if for every x ∈ X, there exists an open neighbourhood
U ⊂ X, an integer n ≥ 1, and a surjective homomorphism OnX ∣U → F ∣U . We say that F is
coherent if it is finitely generated, and if for every open subset U ⊂X, and every homomorphism
OnX ∣U → F ∣U the kernel of the homomorphism is finitely generated.
On an affine scheme U = SpecA there is an equivalence of categories from A-modules to
quasi-coherent sheaves, taking a module M to its associated sheaf M̃ , where M̃ is defined as the
sheaf such that for any principal open subset D(f) ⊂X we have M̃(D(f)) =Mf (the localization
of M at f ∈ A). The inverse takes a quasi coherent sheaf F on U to the A-module F(U) of global
sections. Taking into account this equivalence of categories, a sheaf on an arbitrary scheme X is
quasi-coherent if any point x ∈X has an affine open neighbourhood such that the restriction of
15
-
F to that neighbourhood is a sheaf associated to a module (see [11] Chapter 5. Theorem 1.7.).
Then, a quasi-coherent sheaf F on a locally Noetherian scheme X is coherent if and only if the
above modules can be taken to be finitely generated (see [11] Chapter 5. Proposition 1.11).
Quasi-coherent sheaves, and therefore coherent sheaves, on any scheme form an abelian
category.
We move on to flatness of a morphism. It is an algebraic notion used to guarantee some
continuity of objects, or certain important invariants of objects, attached to the fibers, and it is
of great use in moduli theory.
Definition 8. Flat sheaves A module M over a ring R is flat if for every monomorphism of
R-modules A→ B the induced map M ⊗RA→M ⊗RB is again a monomorphism. A morphism
of schemes f ∶X → Y is flat if for every point x ∈X the local ring OX,x, regarded as an OY,f(x)-
module via the map f#, is flat. An OX -module F is flat over Y if the stalk Fx is flat when
regarded as a OY,f(x)-module via the composite map OY,f(x) → OX,x → Fx where the first map
is f#x and the second is the one making F an OX -module.
To see what we mean by continuity consider what we call a family of closed subschemes of
a given scheme X over a base scheme T . This is a closed subscheme Z ⊂ T ×X, together with a
restriction to Z of the projection map T ×X → T . The fibers of Z over t ∈ T are then naturally
closed subschemes of the fibers Xt of T ×X over T . Now, let T = SpecR be a non-singular,
one-dimensional affine scheme (think of Speck[t]), 0 ∈ T be a closed point and T ∗ = T −{0}. Let
Z∗ ⊂ AnT ∗ = AnZ×T ∗ be any closed subscheme (a family of closed subschemes of the affine n-space
AnZ over T∗), and let π ∶ Z → T ∗ be the projection. Consider now the fibers Z∗t = π−1(t). We ask
what the limit of the fibers is, when t → 0? The only reasonable answer is to take the closure
of Z∗ in AnT and take the limit of the schemes Zt when t → 0 to be the fiber Z0 of Z∗ over the
point 0 ∈ T . This construction can, however, yield some unexpected results, such as embedded
points in the limit fiber, even in simple cases. For examples of these pathological situations, as
well as a great geometrically intuitive introduction to the notion of flatness see [2] II §3.4. There
one finds the proof of the following proposition which illustrates the importance of flatness in a
very special, yet geometrically intuitive, case:
Proposition 1. Let T and Z be as above. Then the following conditions are equivalent:
(i) π is flat over 0
(ii) The fiber Z0 = π−1(0) is the limit of the fibers Zt = π−1(t) as t→ 0
(iii) No irreducible component or embedded component of Z is supported on Z0
16
-
One can generalize this statement to higher dimensional base schemes T which are reduced
and over a field, and when Z is of finite type over T (see [2] Lemma II-30). In fact, in this case,
the morphism π ∶ Z → T is flat at p ∈ Z if, under any embedding of a neighbourhood of p in
affine space AnT , the fiber Z0 = π−1(0) over π(p) = 0 ∈ T is (an open subset of) the limit of the
fibers Zt as t ∈ T approaches 0 along any one-parameter family.
For us, the more important property that flat families, or fibers of a flat morphism, have
under suitable assumptions is the invariance of the Hilbert polynomial of coherent OX -modules
(see Definition 12). In fact,
Proposition 2. Let T be a locally Noetherian and connected scheme, with the generic point ν
and a closed point t. Let f ∶ Z → T be a projective morphism (see Remark 6), and F a coherent
sheaf on Z that is flat over T . Then we have the equality
χk(t)(Ft) = χk(ν)(Fν)
For the proof of the case when T is the spectrum of a discrete valuation ring, as well as
the reference to the more general case, see [11] Chapter 5. Proposition 3.28. If moreover T is
reduced, we have the other implication i.e. if the Hilbert polynomial on the fiber Ft is indepen-
dent of t ∈ T then F is flat over T (see [6] III Theorem 9.9). If we consider the structure sheaf
OX in the previous statement, and the definition of the Hilbert polynomial, we can conclude
that for a flat family Z ⊂ PrT of closed subschemes of a projective space over a locally Noetherian
connected base T the dimension and the degree of the fiber Zt are independent of t.
Next, we define sheaves generated by their global sections, ample and very ample sheaves.
Definition 9. We say that an OX -module F is generated by its global sections at x ∈ X if the
canonical evaluation morphism F(X)OX,x → Fx is surjective. We say that F is generated by
its global sections, or globally generated, if it is true at every point x ∈ X. Equivalently, F is
generated by its global sections if and only if there exists a set I and a surjective homomorphism
of OX -modules O(I)X → F .
Let f ∶ X → SpecA be a scheme over a ring A. Let i ∶ X → PnA be an immersion. Let
OX(n) = i∗(OPnA(n)). This is an invertible sheaf on X and it depends on i. The sheaf OX(1) =
i∗(OPnA(1)) is called a very ample sheaf. An invertible sheaf L on a quasi-compact scheme X is
ample if for any finitely generated quasi-coherent sheaf F on X there exists an integer n0 ≥ 1
such that for every n ≥ n0 we have that F ⊗ L⊗n is generated by its global sections. Theorem
1.27 in [11] Chapter 5. says that very ample implies ample. Theorem 1.34 in the same chapter
says that a sufficiently high tensor power of ample is very ample for f if f ∶ X → SpecA is of
17
-
finite type and X is Noetherian or f is separated. We say that a line bundle L is f -ample, for
a projective morphism f ∶X → S if its restriction to any fiber Xs is ample.
As we will mainly be dealing with projective schemes over an algebraically closed field, we
will use the notions of (Weil) divisors, linear systems, as well as the correspondence between
divisors and invertible sheaves (equivalently line bundles) in this case. For an introduction to
the language of divisors see [6] II §6 or [11] Chapter 7. §7.1 and §7.2.
2.2 The Quot functor
Definition 10. Let S be a noetherian scheme. Let X be a finite type S-scheme and E a coherent
sheaf on X. For any locally noetherian S-scheme T let ET ∶= π∗X(E) be the pullback of E to
T ×SX =∶XT via the projection πX ∶ T ×SX →X. A family of quotients of E parametrized by T
is a pair (F , q) where F is a coherent sheaf on X ×S T such that its schematic support is proper
over T , F is flat over T , and q is a surjective OX×ST -linear morphism of sheaves q ∶ ET → F . We
say that two families (F , q) and (F ′, q′) are equivalent if and only if ker q = ker q′, or equivalently
if there is an isomorphism F → F ′ compatible with q and q′. Let (Sch/S) be the category of
all locally noetherian S-schemes. Then we define:
QuotE/X/S ∶ (Sch/S)o → Set
to be the functor sending an S-scheme T to the set of equivalence classes of families parametrised
by T . Since properness and flatness are preserved by base change ([11] Chapter 3. Proposition
3.16 (c) and Chapter 4. Proposition 3.3 (b)), for a morphism g ∶ T ′ → T of S-schemes we can
define QuotE/X/S(g) to be the map sending ET → F to ET ′ → g∗XF , where gX ∶ T ′×SX → T ×SX.
Definition 11. We define a special case of the Quot functor - the Hilbert functor. If we set
E = OX in the previous definition then the functor QuotOX/X/S associates to T the set of
all closed subschemes Y ⊂ T ×S X that are proper and flat over T . We denote this functor by
HilbX/S ∶= QuotOX/X/S . The special case of the Quot scheme that represents the Hilbert functor
is called the Hilbert scheme. We also denote HilbPnZ ∶=HilbPnZ /SpecZ.
Definition 12. For a coherent sheaf E over a projective scheme X equipped with a fixed ample
invertible sheaf L, the Hilbert polynomial of E with respect to L is a polynomial Φ(m) ∈ Q[t]
such that for m sufficiently large
Φ(m) = χ(E(m)) = ΣdimEi=0 (−1)i dimH i(X,E ⊗L⊗m)
where dimE is the dimension of the support of E . We refer to the leading coefficient of the
Hilbert polynomial as the multiplicity of E .
18
-
Serre’s vanishing theorem (see [6] III thm. 5.2) states that since L is ample, and E is coherent,
we have that for m sufficiently large all higher cohomology groups of E ⊗L⊗m vanish. Thus for
m sufficiently large we have Φ(m) = dimH0(X,E ⊗ L⊗m). For a proof that such a polynomial
exists see [8] Lemma 1.2.1. However, in the special case when X is a smooth projective curve,
and E is a locally free sheaf, or equivalently a vector bundle, we can write down the Hilbert
polynomial explicitly by using the Riemann-Roch theorem. First we recall the notion of degree
of locally free and coherent sheaves.
Definition 13. The Grothendieck group of X, denoted K0(X), is the free group generated by
classes [E], for E a coherent sheaf on X, modulo the relations [E]− [F]+ [G] = 0 for short exact
sequences 0→ E → F → G → 0.
There exists a homomorphism
(det, rk) ∶K0(X)→ Pic⊕Z (2.1)
which sends a locally free sheaf E to (det(E) ∶= ∧rkEE , rk(E)). We can extend this map to coher-
ent sheaves when X is smooth projective by taking a finite locally free resolution of the given
sheaf E (for the existence of the locally free resolution see [8] Proposition 2.1.10). Then by the
relations defining K0(X) we have [E] = Σi(−1)i[Ei], where Ei are the locally free sheaves in the
resolution of E . Therefore, we get a well defined line bundle det(E) ∶= det([E]). We recall that
the degree of a line bundle is defined to be the degree of the corresponding Weil divisor (for X
smooth the Picard group of isomorphism classes of line bundles and the group of Weil divisors
modulo linear equivalence are isomorphic - for proof see [6] II §6). Now we use the previous
definition of the determinant line bundle to define the degree of any coherent sheaf on a smooth
projective variety deg(E) ∶= deg(det(E)).
More generally we can define the rank of a coherent sheaf on a scheme X by rk(E) ∶= αd(E)αd(OX) ,
where αi are the coefficients of the Hilbert polynomial. Then we define deg(E) ∶= αd−1(E)−rk(E)⋅
αd−1(OX). The equivalence of this definition to the previous one in the projective non-singular
case follows from the Hirzebruch-Riemann-Roch theorem. Note that αdimX(OX) is exactly the
degree of X with respect to OX(1).
Now we return to a simple example of a Hilbert polynomial.
Example 5. Let X be a smooth projective curve of genus g. Fix a line bundle L = OX(1). For
a vector bundle E over X of rank n and degree d, the twist E(m) ∶= E ⊗ L⊗m has rank n and
19
-
degree d +mdeg(X)n. The Riemann-Roch theorem then gives
χ(E(m)) = h0(X,E(m)) − h1(X,E(m)) =
= deg(E(m)) + rank(E(m))(1 − g)
= d +mdeg(X)n + n(1 − g)
(2.2)
So E has Hilbert polynomial Φ(m) =mdeg(X)n+d+n(1−g) of degree 1 with leading coefficient
given by the rank.
Definition 14. Let X be a finite type scheme over a scheme S, and L a line bundle on X. We
have a decomposition:
QuotE/X/S(T ) = ∐Φ∈Q[t]
QuotΦ,LE/X/S(T )
where for any polynomial Φ ∈ Q[t] the functor QuotΦ,LE/X/S associates to an S-scheme T the
equivalence classes of families (F , q), such that at each t ∈ T the Hilbert polynomial of the
restriction FT ∣(XT )t , calculated with respect to the pull-back of L, is Φ.
Remark 3. We will take X to be projective over S and L a relatively very ample line bundle.
Then the Quot functor will be representable by schemes, but not in general.
Remark 4. From the previous definitions it is clear that we can think of the Quot scheme as
parametrising the coherent subsheaves of F up to equality rather then quotients of F up to the
equivalence given in the definition of the Quot functor. However, the quotient perspective will
be more useful in the rest of this text.
2.3 The Grassmannian
Example 6. A special case of the Quot functor is the Grassmannian functor defined as follows.
Quotr,OZ⊕nOZ/Z/Z=∶ Grass(n, r), a functor associating to T /Z the set of equivalence classes of
quotients ⊕nOT → F where F is a locally free sheaf on T of rank d, n ≥ r ≥ 1.
An elementary construction of the Grassmannian scheme is given in [17] 5.1.6 along with a
tautological quotient u ∶ ⊗nOGrass(n,r) → U , the proof of properness of π ∶ Grass(n, r) → SpecZ,
and a closed embedding Grass(n, r) ↪ P(π∗ detU) = PmZ , where m = (nr) − 1. It is constructed
by gluing together affine patches U I ∶= Spec(Z[XI]), where Z[XI] is the polynomial ring in the
variables xIp,q which we get as follows: let M be an r×n matrix and I ⊂ 1, ..., n with #I = r. MIis the Ith minor of M meaning a r × r minor given by columns indexed by I. Consider now the
r × n matrix XI which has XII given by the identity matrix 1r×r and the remaining entries are
the free variables xIp,q. Then Z[XI] is the polynomial ring in these variables. Let J be another
20
-
size r subset of {1, ..., n}. Let P IJ be the polynomial in the variables xI obtained by taking the
determinant of XIJ and let UIJ be the open subset of U
I where P IJ is non-zero. We define the
gluing map: ΘI,J ∶ U IJ → UJI via the ring homomorphism θI,J ∶ Z[XJ ,1/P JI ] → Z[XI ,1/P IJ ]
where the images of xJp,q are given by the entries of the matrix θI,J(XJ) = (XIJ)−1XI . In
particular θI,J(P JI ) = 1/P IJ , so the map extends to Z[XJ ,1/P JI ]. One checks that for any three
subsets I, J,K the cocycle condition θI,K = θI,JθJ,K is satisfied. Therefore these affine patches
glue to form a finite type scheme Grass(n, r) over SpecZ ([11] Chapter 2. Lemma 3.33.).
The universal quotient vector bundle and quotient map are constructed using the same affine
patches as in the construction of the Grass(n, r). The main idea is that on each patch, elements
of the Grassmannian are naturally represented by a unique matrix. Let U ∣UI = ⊕rOUI and
define the map qUI ∶ ⊕nO∣UI → ⊕rOUI via the matrix XI . Define the transition morphisms as
ρI,J ∶ ⊕rOUIJ = U ∣UIJ → ⊕rOUJI = U ∣UJI via the matrix (X
IJ)−1 ∈ GLr(U IJ). One checks that they
satisfy the cocycle condition for glueing sheaves ([11] Chapter 2. Exercise 2.8.).
We can show that the Grassmannian scheme Grass represents the functor Grass(n, r) de-
scribed at the beginning of the example as follows. We need to show the bijectionGrass(n, r)(Y ) ≅
HomZ(Y,Grass(n, r)) for any Y /Z. Take a morphism f ∶ Y → Grass(n, r). Then consider
f∗(u) ∶ OnY → f∗U . Since f∗ is right-exact this is an epimorphism so f∗(u) ∈ Grass(n, r)(Y ).
Therefore we get a map in one direction. Take now a surjection q ∶ OnY → F with F locally free of
rank r. This is given by choosing n ordered global sections of F which generate F at each point.
Let Y I be the open subset of Y where the global sections in a set I of degree I generate F i.e.
over Y I we have F ≅ OrY I
. Then over Y I we have that the map q is given by an r×n matrix, call
it M I with values in OY (Y I). Now define a map Y I → U I by sending xIp,q to the entries of the
matrix M I . This map will glue together to give a map Y → Grass(n, r) because the transition
matrix on Y I∩Y J is precisely (M IJ)−1 - switching from the basis of the I global sections to the ba-
sis of J global sections.This gives a map in the opposite direction inverse to the one given before.
Remark 5. We can generalise the previous construction as follows. Instead of considering the
locally free sheaf ⊕nOZ over SpecZ we can consider any locally free sheaf, or equivalently a vector
bundle, V over a scheme S. We get a functor Quotr,OSV /S/S =∶ Grass(V, r). It is representable by
a scheme Grass(V, d) and called the relative Grassmannian of V over S. If π ∶ Grass(V, r) →
S is the projection, and π∗V → U the universal quotient then the determinant bundle ∧rU
is very ample over S and it gives a closed embedding Grass(V, r) ↪ P(π∗ ∧r U) ⊂ P(∧rV ).
The representability of this more general functor can be obtained by observing that the group
scheme GLn,Z acts naturally on Grass(n, r) from the previous example, since it acts on the
21
-
free module ⊕nZ, and thus in turn on the locally free sheaf ⊕nOZ. The projective scheme
representing Grass(V, r) is then obtained by modding out Grass(n, r) by this action. Actions of
group schemes and the role of Geometric Invariant Theory in constructing moduli spaces will be
discussed in detail later. For now we mention one more generalisation of the Grassmannian. We
may replace the vector bundle V with a coherent sheaf E on S and obtain a functor Quotr,OSE/S/S =∶
Grass(E , r) which is still representable by a projective scheme over S. This is obtained from
the previous case by locally expressing a coherent sheaf as a quotient of a vector bundle.
2.4 Representability theorem of Altman and Kleiman
Definition 15. Let A be a ring and M an A-module. Let Tn(M) be the tensor product of
M with itself n times. Then T (M) ∶= ⊕d≥0T d(M) is a non-commutative A-algebra called the
tensor algebra of M . We define the symmetric algebra S(M) ∶= ⊕d≥0Sd(M) to be the quotient
of T (M) by the two-sided ideal generated by all expression x⊗ y − y ⊗ x for all x, y ∈M . As an
example note that for M a free A-module of rank rS(M) ≅ A[x1, ..., xr]. We define the exterior
algebra ∧(M) = ⊕d≥0 ∧d (M) as the quotient of T (M) by all expressions x⊗ x for all x ∈M .
Now, for a scheme (X,OX), and F a sheaf of OX -modules we define the tensor algebra,
symmetric algebra, and exterior algebra of F by taking the sheaves associated to the presheaf
which to each open U ∈ X assigns the corresponding tensor operation applied to F(U) as an
OX(U)-module. The results are OX -algebras and their components in each degree are OX -
modules.
Definition 16. Let V be a locally free sheaf on S. Then P(V ) ∶= Proj(Sym(V )) is called the
associated projective space bundle, where Sym(V ) is the symmetric algebra of V . S(V ) is a
graded OS-algebra, so Proj(Sym(V )) is defined as the unique S-scheme f ∶ Proj(Sym(V )) → S
such that for any affine open U ⊂ S we have f−1(U) ≅ Proj(Sym(V )(U)). Proj(Sym(V )(U))
is endowed with a sheaf O(1) = (Sym(V )(U)(1))∼. These sheaves glue to an invertible sheaf
OP(V )(1) ( see [11] Chapter 8. Remark 1.9.).
We will prove the following representability theorem by Altman and Kleiman:
Theorem 1. Let X be a closed subscheme of P(V ), L = OP(V )(1)∣X , W a locally free sheaf
on S, E a coherent quotient sheaf of π∗(W )(n), where π ∶ X → S and n ∈ Z. Let Φ ∈ Q[t].
Then the functor QuotΦ,LE/X/S is representable by a closed subscheme of Grass(V′, n′), where
V ′ =W ⊗OS SymrV , n′ = Φ(r).
Remark 6. In Grothendieck’s version of the theorem a weaker version of projectivity is used.
Recall that Grothendieck defines a morphism π ∶ X → S to be projective if there exists a
22
-
coherent sheaf E on S together with a closed embedding of X into P(E) = Proj SymOS E . We get
Altman and Kleiman’s notion of strong projectivity appearing in Theorem 1 by requiring that
E be a locally free sheaf on S. The generalization is simple and given in [17] 5.5.8. We should
mention here the strongest notion of projectivity in use: a morphism π ∶ X → S is projective in
the strongest sense if X admits a closed embedding into PnS , and π factors through this closed
embedding followed by the canonical morphism PnS → S.
2.5 Construction of the Quot scheme
Definition 17. Let F,G ∶ (Sch/S)o → Set be two functors, and let F be a subfunctor of G
i.e. there exists a natural transformation ν ∶ F → G which is injective when evaluated on any
object of (Sch/S). We say that F is a closed subfunctor (resp. locally closed) if for every T /S,
and any natural transformation γ ∶ hT → G from the functor of points of a scheme T , there
exists a closed (resp. locally closed) subscheme T ′/S such that the functor F ×G HomS(−, T ) is
represented by T ′ i.e. F ×G HomS(−, T ) ≅ hT ′ , where the fiber product is given by
(F ×G HomS(−, T ))(Y ) = {(H ∈ F (Y ), f ∶ Y → T ∈ hT (Y )) ∶ γY (f) = νY (H) ∈ G(Y )}
Remark 7. The previous definition just says that if G is represented by some T there exists
a closed (resp. locally closed) subscheme T ′ of T that represents F , because in that case
Hom(−, T ′) ≅ F ×GHomS(−, T ) ≅ F ×GG ≅ F . Moreover, in the case when F and G are moduli
functors, and G is representable by T , there exists a canonical choice for the map hT → G - the
map corresponding to the tautological quotient of G, which is just the map corresponding to
the identity on T .
We present the proof of Theorem 1 in several steps:
Step 1. Reduce to the case of QuotΦ,Lπ∗(W )/P(V )/S via the following lemma:
Lemma 2. (a) Tensoring by L−n gives an isomorphism QuotΦ,LE/X/S → QuotΨ,LE(−n)/X/S where
Ψ(m) = Φ(m − n)
(b) Let ψ ∶ E → G be en epimorphism of coherent sheaves on X. Then the corresponding natural
transformation QuotΦ,LG/X/S → QuotΦ,LE/X/S is a closed embedding.
(a) gets rid of the twist in the theorem of Altman and Kleiman. Furthermore, since we
assumed that E is a quotient sheaf of π∗(W ) and that X is a closed subscheme of P(V ) we
get an epimorphism of sheaves π∗(W ) → E . Hence, by (b) if we can represent the functor
23
-
QuotΦ,Lπ∗(W )/P(V )/S by a scheme T , then our original functor will be represented by a closed
subscheme of T .
Proof. (of (b)) We need the vanishing scheme V (ϕ) of a morphism of coherent sheaves ϕ ∶ F → F ′
on X/S, which is a scheme with the universal property that any morphism f ∶ Z → X factors
through V (ϕ) if and only if f∗ϕ ∶= f∗F → f∗F ′ = 0. We use the following result:
Proposition 3. Let S be a Noetherian scheme and π ∶X → S a projective morphism. Let F and
F ′ be coherent sheaves on X with F ′ flat over S. Then, the contravariant functor hom(F ,F ′),
which assigns to each T /S the set Hom(FXT ,F ′XT ) is represented by some Z = Spec(SymQ)
where Q is a coherent sheaf on S. In addition, the closed subscheme Z0 defined by the vanishing
of the ideal sheaf generated by Q is the closed subscheme where the universal homomorphism
vanishes (corresponding to the identity on Z).
For the proof see [17] Theorem 5.8. A corollary is that the vanishing scheme of a morphism
ϕ ∶ F → F ′ on X is given by f−1(Z0), where f is the image of ϕ ∈ hom(F ,F ′)(X) under the
bijection hom(F ,F ′)(X) ≅ hom(X,Z).
Back to the proof of (b): We want to prove that if a scheme E/S represents the functor
QuotΦ,LE/X/S , then there exists a closed subscheme G ⊂ E that represents QuotΦ,LG/X/S . By the
previous remark about closed subfunctors of moduli functors, this is equivalent to showing that
for a given T /S, and a family (F , q) ∈ QuotΦ,LE/X/S(T ), the functor
QuotΦ,LG/X/S ×QuotΦ,LE/X/S HomS(−, T ) is represented by a closed subscheme T′ ⊂ T . Given any
coherent sheaf (F , q) ∈ QuotΦ,LE/X/S(T ), by the Yoneda lemma (Nat(hA,G) ≅ G(A)), we get a
natural transformation hom(−, T )→ F ∈ QuotΦ,LE/X/S , therefore a commutative diagram (of sets):
QuotΦ,LG/X/S ×QuotΦ,LE/X/S HomS(−, T ) HomS(−, T )
QuotΦ,LG/X/S QuotΦ,LE/X/S
Applying all of the above to a scheme Y /S, we take a look at what the elements of
QuotΦ,LG/X/S(Y ) ×QuotΦ,LE/X/S HomS(Y,T ) are. Let ((F′, q′), f) be such an element, where F ′ is a
sheaf on XY . Then by the proof of Yoneda lemma (we send idT ↦ F , for any morphism of
schemes p we have Quot(p) is the pullback by f) we get f ↦ (f∗F , f∗q) by the map on the right
(where f is used both for f ∶ Y → T and the induced map f ∶XY →XT , since F is defined on XT ).
The map on the bottom, induced by our map of coherent sheaves ψ is composition (on the right)
by ψY (i.e. since q′ ∶ GXY → F ′, and ψ is epi, then ψY ∶ EXY → GXY is epi, so q′ ○ψY ∶ EXY → F ′
24
-
makes it a quotient of EXY ). Therefore we should have (F ′, q′ ○ ψY ) = (f∗F , f∗q). Now this is
true if kerψ ⊂ ker q′. So we get the set:
{((F ′, q′), f) ∶ f ∈ HomS(Y,T ),F ′ = f∗F , q′ = f∗q,kerψ ⊂ ker q′}(∗∗)
We need a closed subscheme T ′ ⊂ T such that a morphism Y → T factors through T ′ if and only
if f∗q∣kerψ = 0 thereby establishing a bijection between morphisms Y → T ′ and the elements of
the set (**). But then T ′ = V (f∗q∣kerψ), the vanishing scheme of f∗q∣kerψ, which exists by the
previous proposition.
Step 2. Let us now construct an injective natural transformation from the Quot mod-
uli functor to the Grassmannian moduli functor. To do this we will use the following result
that is based on Mumford-Castelnuovo regularity, which will be discussed and used again later
(Definition 45 and Lemma 11), and a flat base change result. For proof we refer to [17] 5.5.4.
Proposition 4. There exists an integer m depending only on the rank of V , the rank of W and
Φ s.t. for all r ≥ m, for all schemes T /S and for all T -flat coherent quotients F of EXT with
kernel G, the following facts hold:
1. πT∗(F(r)), πT∗(G(r)), πT∗(EXT (r)) are locally free are locally free sheaves of ranks deter-
mined by the rank of V , rank of W , r and Φ, and in particular the rank of πT∗(F(r)) is Φ(r).
Additionally, all higher direct image sheaves RiπT∗(F(r)),RiπT∗(G(r)),RiπT∗(EXT (r)) vanish,
2. In particular, the following is a commutative diagram of locally free sheaves on XT in which
the rows are exact and the vertical maps are surjective:
0 π∗TπT∗(G(r)) π∗TπT∗(EXT (r)) π∗TπT∗(F(r)) 0
0 G(r) EXT (r) F(r) 0
Fix r ≥m and consider a scheme T /S and a sheaf (F , q) ∈ QuotΦ,LE/X/S(T ). Twisting it by Lr
and applying πT∗ we get an epimorphism:
πT∗(q(r)) ∶ πT∗(EXT (r))→ πT∗(F(r))
i.e. applying the functor πT∗ to the exact sequence of sheaves 0 → G(r) → EXT (r) → F(r) → 0,
we get that the map above is surjective since the higher direct images vanish by the previous
proposition. Direct image functor is also left-exact (the right adjoint to the inverse image func-
tor), so it commutes with ker. Therefore if we have another element (F ′, q′) ∈ QuotΦ,LE/X/S(T ) s.t.
ker q = ker q′ then kerπT∗q = πT∗ ker q = πT∗ ker q′ = kerπT∗q′. So π∗ is a natural transformation:
QuotΦ,LE/X/S(T )→ QuotΦ(r),OSπ∗(π∗(W )⊗Lr)/S/S(T )
25
-
[q(r) ∶ π∗X(π∗(W )) = EXT (r)→ F(r)]↦ [πT∗(q(r)) ∶ πT∗(π∗X(π∗(W )⊗Lr))→ πT∗(F(r))]
On the right hand side we have a Quot functor that assigns to T /S quotients of the pullback
to T of π∗(π∗(W )⊗Lr) which is a sheaf on S. The fact that πT∗(q(r)) ∶ πT∗(π∗X(π∗(W )⊗Lr))→
πT∗(F(r)) is such a quotient follows from the fact that in the following diagram:
XT T
X S
πT
πX p
π
we have that πT∗(π∗X(π∗(W )⊗Lr)) ≅ p∗(π∗(π∗(W )⊗Lr)) (special case when X is PnS can
be found in [17] Lemma 5.4).
The fact that the sheaf πT∗(q(r)) is locally free of rank Φ(r) comes from the previous
proposition fact 1. Now we remark that the Quot functor on the right above is actually the
Grassmannian functor, and after applying the projection formula ([6] II Exercise 5.1. d))we get
QuotΦ(r),OSπ∗(π∗(W )⊗Lr)/S/S = Grass(π∗(π
∗(W )⊗Lr),Φ(r)) = Grass(W ⊗ Symr V,Φ(r))
We show now that this natural transformation is injective i.e. that we can recover q from
πT∗(q(r)). q(r) is the cokernel of the map G(r)→ EXT (r), by definition of G. Since the vertical
maps in the commutative diagram in Proposition 2. are surjective we can precompose with
the first one of them and we will not change the kernel i.e. q(r) is the cokernel of the map
π∗TπT∗(G(r))→ G(r)→ EXT (r). By the commutativity of the diagram this is the same map as
π∗TπT∗(G(r))→ π∗TπT∗(EXT (r))→ EXT (r) (2.3)
Now the second map is canonical and the first map is just the pullback of the inclusion of
the kernel of πT∗(q(r)).
Step 3. To prove thatQuotΦ,LE/X/S is a locally closed subfunctor ofGrass(W⊗Symr V ),Φ(r)),
i.e. that it is represented by a locally closed subscheme of the Grassmannian scheme, we need
to use flattening stratification.
Theorem 2. Let S be a Noetherian scheme and let F be a coherent sheaf on X/S, where X
is a closed subscheme of P(V ) for some locally free sheaf V on S. Then, the set I of Hilbert
polynomials of restrictons of F to the fibers of the map from X to S is finite. Moreover, for each
Φ ∈ I, there exists a locally closed subscheme SΦ of S such that the following three properties
hold:
26
-
1. Point-set: The underlying set ∣SΦ∣ consists of all points s ∈ S above which the Hilbert polyno-
mial of F is Φ.
2. Universal Property: Let S′ be the scheme theoretic disjoint union (coproduct) of the SΦ over
all Φ ∈ I and let f ∶ S′ → S be the natural surjection. Then, the sheaf f∗(F) on P(f∗(V ))
is flat over S′. Moreover f ∶ S′ → S has the universal property that a moprhism g ∶ T → S
factors through f if and only if g∗F is flat over T (we use g to denote the base changed morphism
g ∶XT = T ×S X → T ). The subscheme SΦ is thus uniquely determined by Φ.
3. Closure of Strata: Let the set I be given the total order by putting Φ < Φ′ if for sufficiently
large n ∈ Z Φ(n) < Φ′(n). Then, the closure of SΦ consists of the union of all SΦ′ for Φ′ ≥ Φ.
Proof. See [17] Theorem 5.13.
Again, as in a previous case of proving that a functor is a closed subfunctor (now locally),
given any T /S and (F , q) ∈ Grass(W ⊗ Symr V,Φ(r))(T ) we need to show that there exists a
locally closed subscheme T ′ → T representing
QuotΦ,LE/X/S ×Grass(W⊗Symr V ),Φ(r)) Hom(−, T )
Let us define T ′ as follows. Define G to be the kernel of
q ∶ πT∗(EXT (r)) = (π∗(E(r)))T → F
(again using π∗X ○ πT∗ = p∗ ○ π∗ from the fiber product square from Step 2.). Let h be the
composite map:
π∗T (G)→ π∗TπT∗(EXT (r))→ EXT (r)
Let J be the cokernel of h and define T ′ to be the subscheme obtained in the flattening strati-
fication for the sheaf J corresponding to the polynomial Φ. Let us show that T ′ is the scheme
we are looking for:
For any Y /S we have that by the universal property of the stratification that a morphism
f ∶ Y → T factors through T ′ if and only if f∗J is flat with Hilbert polynomial Φ (again f is
used for the base changed morphism f ∶ XY → XT ). Rephrasing the last property we remark
the following: f∗J = f∗(cokerh) = coker(f∗h), since both f∗ and coker are colimits. This gives
us, writing out h:
f∗J = coker(f∗π∗T (ker q)→ f∗π∗TπT∗(EXT (r))→ f∗EXT (r))
Because πT∗ and πY ∗ don’t have higher direct images on the relevant sheaves by the Mumford-
Castelnuovo regularity result stated before we have that f∗π∗T = π∗Y f∗ and f∗πT∗ = πY ∗f∗. This
gives us:
f∗J = coker(π∗Y f∗(ker q)→ π∗Y πY ∗f∗(EXT (r))→ f∗EXT (r))
27
-
On the other hand f lives in the image of the above functor if and only if f∗q comes from some
(F ′, q′) ∈ QuotΦ,LE/X/S(Y ), as can be seen from a similar diagram to a previous one:
QuotΦ,LG/X/S(Y ) ×Grass(W⊗Symr V ),Φ(r))(Y ) HomS(Y,T ) HomS(Y,T )
QuotΦ,LG/X/S(Y ) Grass(W ⊗ Symr V ),Φ(r))(Y )
By looking at equation (1) which gives us a way to recover an element of the Quot from an
element of the Grass, we have that there exists such a q′ if and only if
coker(π∗Y ker(f∗q)→ π∗Y f∗π∗T (EXT (r))→ f∗(EXT ))
is flat (then the q′ we need is equal to it). We can use the identities from before to get:
coker(π∗Y ker(f∗q)→ π∗Y π∗Y f∗(EXT (r))→ f∗(EXT ))
Since f∗ is right exact we can exchange ker(f∗q) and f∗ ker(q) without changing the cokernel.
Thus we get that the last sheaf is equal to f∗J .
So we can conclude that a morphism f ∶ Y → T factors through T ′ if and only if f∗J is flat and
has a Hilbert polynomial Φ which is true if and only if
((f∗F , f∗q), f) ∈ QuotΦ,LG/X/S(Y ) ×Grass(W⊗Symr V ),Φ(r))(Y ) HomS(Y,T )
thus establishing the needed bijection. From Theorem 2 we get the required locally closed sub-
scheme of Grass.
Step 4. To finish the proof of Theorem 1 we need to show that the QuotΦ,LE/X/S functor, as
defined in 10, is actually represented by a closed subscheme of the Grassmannian. To do this we
will check the valuative criterion for properness (see [11] Chapter 3. Theorem 3.25.). Namely,
if R is a DVR together with a given morphism SpecR → S and field of fractions K, we need to
show that the restriction map
QuotΦ,LE/X/S(SpecR)→ QuotΦ,LE/X/S(SpecK)
is bijective. So, take an element (F , q) in QuotΦ,LE/X/S(SpecK)
q ∶ EXK → F
where XK ∶= X ×S SpecK. Let j ∶ XK → XR be the natural map given by the inclusion of the
generic point SpecK in SpecR. Now let F̄ be the image of the composition of EXR → j∗EXK
28
-
with j∗q. Let q̄ ∶ EXR → F̄ be the epimorphism obtained by looking only at the image. If we
show that F̄ is flat over SpecR we will have that
(F̄ , q̄) ∈ QuotΦ,LE/X/S(SpecR)
and therefore we will have proven the surjectivity of the restriction map because applying j∗
makes the first map of the composition the identity map, and the second map q. But on a DVR
being torsion free is the same as being flat. Now, F is torsion free over SpecK since it is flat
over it, and hence j∗(F) is a torsion free sheaf over SpecR, thus so is F̄ .
Injectivity of the restriction map follows immediately from the valuative criterion for sep-
aratedness ([11] Chapter 3. Remark 3.12.) because we know that the scheme QuotΦ,LE/X/S is
separated as locally closed subschemes of separated schemes are separated (see [11] Chapter 3.
Proposition 3.9, and [17] 5.1.6 for separatedness of the Grassmannian).
Therefore, we have shown Theorem 1.
29
-
Chapter 3
Geometric Invariant Theory
In this section we will present another crucial tool in the theory of moduli spaces - Mumford’s
Geometric Invariant Theory.
3.1 Actions of algebraic groups
Definition 18. An algebraic group over a field k is a scheme G over k of finite type with
morphisms of k-schemes e ∶ Speck → G, m ∶ G × G → G and i ∶ G → G defining the identity
element, multiplication and inversion, and satisfying the usual axioms for groups. A more general
notion is that of a group scheme, which is the generalisation of the above definition to working
over an arbitrary ring R. We say that G is an affine algebraic group if the underlying scheme is
affine. A homomorphism of algebraic groups is a morphism of k-schemes which commutes with
the multiplication morphisms.
Remark 8. The functor of points hG ∶= Homk(−,G) of an algebraic group factors through the
category of groups i.e. for any k-scheme X Hom(X,G) is endowed with a group structure and
every map hG(f) ∶ hG(X)→ hG(Y ) is a morphism of group with this structure. One can show,
using the Yoneda lemma, that there is an equivalence of categories between the category of
algebraic groups and the category of functors F ∶ Sch/k→Grp such that the composition with
the forgetful functor Grp→ Set is representable.
Considering OG(G), the k-algebra of regular functions on G, the morphisms e,m, i above
correspond to k-algebra homomorphisms m∗ ∶ OG(G)→ OG(G)×OG(G), i∗ ∶ OG(G)→ OG(G)
and OG(G) → k, called comultiplication, coinverse and counit respectively. These define a
structure of a Hopf algebra on OG(G). There is a bijection between Hopf algebras and affine
algebraic groups ([12] II Theorem 5.1)
By a theorem of Cartier every affine algebraic group over a field k of characteristic 0 is
30
-
smooth ([12] VI Theorem 9.3).
Example 7. Let us see some examples of algebraic groups that will be used later.
1. The additive group Ga = Speck[t], whose underlying scheme is the affine line A1 and the
group structure is given by addition
m∗(t) = t⊗ 1 + 1⊗ t and i∗(t) = −t
For a k-algebra R we have Ga(R) = (R,+) which justifies the name.
2. The multiplicative group Gm = Speck[t, t−1] whose underlying scheme is A1 − {0} and
whose group action is given by multiplication
m∗(t) = t⊗ t and i∗(t) = t−1
For a k-algebra R we have Gm(R) = (R×, ⋅), hence the name.
3. The general linear group GLn over k is a subvariety of An2
cut out by the condition that the
determinant is non-zero. It is an affine variety whose coordinate ring is k[x11, x12, ..., xnn, t]/(tdet(xij)−
1). The morphisms are given by
m∗(xij) =n
∑k=1
xik ⊗ xkj and i∗(xij) = (xij)−1ij
where the (xij)−1ij is the (i, j)-th entry in the inverse matrix.
4. If V is a finite-dimensional vector space over k, there is an affine algebraic group GL(V )
non-canonically isomorphic to GLdimV , with the isomorphism given by choosing a basis.
For a k-algebra R GL(V )(R) = AutR(V ⊗k R)
Remark 9. A linear algebraic group is a closed subgroup of GLn. Thus, any linear algebraic
group is affine. The converse is also true, any affine algebraic group is linear (see [1] I Proposition
1.10.).
We want to consider actions of algebraic groups, and in particular we want to know when
they will provide us with "nice" quotients.
Definition 19. An action of an algebraic group G on a k-scheme X is a morphism σ ∶X×G→X
which satisfies the usual associativity rules with respect to the multiplication, identity and
inverse morphisms of the algebraic group. Let X,Y be two k-schemes with actions of G. A
morphism of k-schemes f ∶ X → Y is called G-equivariant if it commutes with the actions i.e.
f(g ⋅ x) = g ⋅ f(x). If the action on Y is trivial and the morphism is G-equivariant we call it
G-invariant i.e. f(g ⋅ x) = f(x).
31
-
As we mentioned before, we are interested in "nice" quotients, so let us define what we will
consider a "nice" quotients. It may happen that the orbit space X/G = G ⋅ x∣x ∈X admits a
scheme structure, though this does not happen often. Let us look at some weaker notions of
quotients.
3.2 Nice quotients
Definition 20. A cateogrical quotient for the action of G on X is a scheme Y and a G-invariant
morphism ϕ ∶ X → Y of schemes with the following universal property: every other G-invariant
morphism f ∶ X → Z factors uniquely through ϕ i.e. there exists a unique morphism h ∶ Y → Z
such that f = ϕ ○ h. Furthermore, if the preimage of each k-point in Y is a single orbit we say
that Y is an orbit space.
Equivalently, a categorical quotient is a scheme Y that represents the functor
X/G ∶ (Sch/k)o → Set, T ↦X(T )/G(T )
The morphism ϕ above is obtained as the image of [idX] in X(X)/G(X).
Remark 10. The categorical quotient has nice functorial properties: if ϕ ∶X → Y is G-invariant
and if there is a cover Y = ∪iUi such that ϕ−1(Ui)→ Ui is a categorical quotient for each i then
ϕ is a categorical quotient.
Example 8. Let Gm, as defined earlier, act on An by t ⋅ (a1, ..., an) = (ta1, ..., tan). In this case
there are two types of orbits: punctured lines through the origin, and the origin itself. The origin
is the only closed orbit and each orbit has the origin in its closure. Therefore, any G-invariant
morphism (i.e. constant on orbits) An → Y must be constant on An. Clearly, the categorical
quotient in this case is the structure map ϕ ∶ An → Speck since any other map f ∶ An → Z is
constant i.e. maps everything to a point z ∈ Z so we may choose the morphism h ∶ Speck → Z
to be the one sanding the one point to z ∈ Z to get f = h ○ ϕ.
This example illustrates how non-closed orbits can create problems, in that the categorical
quotient we constructed is far from an orbit space. If we were to remove the origin, and then
take the quotient of An − {0} by Gm with the same action as before, we would get Pn−1, which
is what we intuitively expect from this action. So we need a new notion that is closer to our
intuitive idea of a quotient.
The action of an algebraic group G on a k-scheme X induces an action on the k-algebra
O(X) of regular functions on X by
g ⋅ f(x) = f(g−1 ⋅ x)
32
-
and we denote the subalgebra of invariant functions by
O(X)G ∶= {f ∈ O(X) ∶ g ⋅ f = f for all g ∈ G}
If U ⊂ X is an open set that is invariant under the action of G i.e. for all u ∈ U and g ∈ G we
have u ⋅g ∈ U , then G also acts on OX(U) and we denote by OX(U)G the subalgebra of invariant
functions.
Definition 21. A morphism ϕ ∶X → Y is a good quotient for the action of G on X if
1. ϕ is G-invariant
2. ϕ is surjective, and U ⊂ Y is open if and only if ϕ−1(U) ⊂X is open
3. If U ⊂ Y is an open subset the morphism OY (U)→ OX(ϕ−1(U))G is an isomorphism
4. If W is any G-invariant closed subset of X then ϕ(W ) is closed in Y . If W1 and W2 are
disjoint G-invariant closed subsets then ϕ(W1) and ϕ(W2) are also disjoint
5. ϕ is affine i.e. the preimage of every affine open subset is affine.
If, moreover, the geometric fibers of ϕ are orbits of geometric points of X then ϕ is a geometric
quotient. Finally, ϕ is a universal good (resp. geometric) quotient if for any morphism Y ′ → Y
of k-schemes we have that the base change Y ′ ×Y X → Y ′ is a good (resp. geometric) quotient.
A good quotient is a categorical quotient (see [13] Proposition 0.1).
Example 9. Consider an action of Gm on A2k, where k is an algebraically closed field, given by
t ⋅ (x, y) = (tx, t−1y). The orbits of this action are
• conics {(x, y) ∶ xy = λ} for λ ∈ A1 − {0}
• the punctured x-axis
• the punctured y-axis
• the origin
The origin and the conics are closed orbits, while the punctured axes contain the origin in their
closure. Therefore the categorical quotient will identify all orbits except for the conics which are
parametrised by A1 − {0}. We may thus expect the morphism ϕ ∶ A2 → A1 given by (x, y)↦ xy
to be the categorical quotient. In fact, it is also a good quotient. Let us quickly check that fact.
ϕ is obviously surjective and G-invariant. Now let U ⊂ A1 be an open subset and consider
ϕ∗ ∶ OA1(U)→ OA2(ϕ−1(U))
33
-
For U = A1 we have ϕ∗ ∶ k[t]→ k[x, y] given by t↦ xy. Let us show that this is an isomorphism
on the ring of invariants. The action on O(A2) = k[x, y] is given by
t ⋅ (Σi,jaijxiyj) = (Σi,jaijtj−ixiyj)
as discussed above. Therefore the invariant subalgebra is
k[x, y]k∗ = {Σi,jaijxiyj ∶ aij = 0 for i /= j} = k[xy]
as we wanted. Now take a proper open subset U ⊂ A1. It is of the form U = A1 − {a1, ..., an}
and OA1(U) = k[t]f where f(t) = (t − a1)⋯(t − an). Then ϕ−1(U) is the non-vanishing locus of
F (x, y) ∶= f(xy) ∈ k[x, y] so we have OA2(ϕ−1(U)) = k[x, y]F . In particular,
OA2(ϕ−1(U))GM = (k[x, y]F )GM = (k[x, y]GM )F = k[xy]F ≅ k[t]f = OA1(U)
Furthermore, closed G-invariant subsets of A2 are either a finite union of orbit closures or the
entire space A2. Therefore it is enough to prove condition 4. for being a good quotient for
W1 = G ⋅ p1 and W2 = G ⋅ p2 disjoint. As we have the list of all orbits we know that for W1 and
W2 to be disjoint we must have one of the following two cases
• both p1 and p2 distinct and not lying on the axes of A2 so that their orbits are disjoint
conics (x, y) ∶ xy = αi, therefore ϕ(W1) = α1 /= α2 = ϕ(W2)
• one point lies on an axis, say p1 so ϕ(W1) = 0, and the other does not so ϕ(W2) /= 0
The condition that the morphism is affine is satisfied since any morphism of affine varieties is
affine.
Additionally, this morphism is not a geometric quotient since ϕ−1(0) is the union of three
orbits (all except the conics).
Lemma 3. Let G be an algebraic group acting on a scheme X and let ϕ ∶ X → Y be a good
quotient. Then:
1. G ⋅ x1 ∩G ⋅ x2 /= ∅ if and only if ϕ(x1) = ϕ(x2)
2. For each y ∈ Y the preimage ϕ−1(y) contains a unique closed orbit. In particular, if the
action is closed (i.e. all orbits are closed), then ϕ is a geometric quotient.
Proof. For the first part it is clear that ϕ(x1) = ϕ(x2) if G ⋅ x1 ∩G ⋅ x2 since we have said that ϕ
is G-invariant and thus constant on orbit closures. The other direction follows from property 4.
of the good quotient. For the second part, suppose that converse is true i.e. there are two closed
orbits W1,W2 in a preimage ϕ−1(y). But then the fact that we have two closed G-invariant
subsets that have the same image contradicts the property 4.
34
-
Moreover, it is easy to see that good and geometric quotients restrict for any open U ⊂ Y to
ϕ∣ ∶ ϕ−1(U)→ ϕ(U) to good and geometric quotients respectively of the action of G on ϕ−1(U).
3.3 Hilbert’s 14th problem and Nagata’s theorem
Remark 11. The story of constructing the nice quotients we’ve been discussing begins with the
14th item on the list of problems that David Hilbert proposed in 1900. Namely, any G-invariant
morphism ϕ ∶ X → Y , which is the first property that we ask of the categorical, and therefore
any "nice" quotient, induces a homomorphism ϕ∗ ∶ O(Y ) → O(X) whose image is contained in
the subalgebra of G-invariant regular functions O(X)G.
Definition 22. First off, a rational representation of an algebraic group G is a morphism
h ∶ G→ GL(V ), for a finite dimensional k-vector space V , of algebraic groups i.e. a morphism of
groups that is at the same time a morphism of varieties. This automatically gives as an action
of G on V , which we call a linear action. Moreover, we say that an action of G on a k-algebra
A is rational if the map a ↦ g ⋅ a is an automorphism of A for any g ∈ G, and every element
of R is contained in a finite-dimensional G-invariant linear subspace of A on which G acts by a
rational representation.
Now, a formulation of Hilbert’s 14th problem follows: For a rational action of an algebraic
group on a finitely generated k-algebra A, is the algebra of G-invariants AG finitely generated?
In general, this question has a negative answer. Nagata provided a counterexample (see
[14]), as well as a theorem proving that for G geometrically reductive Hilbert’s problem has a
positive answer (for proof see [15] Theorem 3.4.).
We give the definition of geometrically reductive groups, even though we will not prove
Nagata’s theorem, and will thus only care about the fact that the groups GLn,SLn and PGLn
are in fact geometrically reductive, so that we can apply the theory that follows to taking
quotients by their actions.
Definition 23. A linear algebraic group G is called geometrically reductive if, for any rational
representation h ∶ G→ GLn, and every point v /= 0 ∈ kn invariant by the induced linear action of
G on kn, there exists an invariant homogeneous polynomial f in n variables of degree ≥ 1 such
that f(v) /= 0.
In fact, there exist notions of a reductive group - a group whose radical is isomorphic to a
torus (i.e. direct product of multiplicative groups), and that of a linearly reductive group - a
group whose every rational representation is completely reducible i.e. expressible as a direct
35
-
sum or representations possessing no proper invariant subspaces. Mumford’s theory originally
used his proof that Hilbert’s 14th problem had a positive answer for linearly reductive groups
in all characteristics, and it was proven that linearly reductive is equivalent to reductive in
characteristic 0, but Nagata provided an example in positive characteristic of a reductive but
not linearly reductive group. This motivated the search for an appropriate weaker notion, and
the proof by Haboush in 1974, that geometrically reductive is equivalent to reductive in all
characteristics (see [13] Theorem A.1.0.).
3.4 Constructing the projective GIT quotient
Let X be a projective scheme with an action of an affine algebraic group G. A linear G-
equivariant projective embedding of X is a rational representation h ∶ G → GL(n + 1) and a
G-equivariant projective embedding X ↪ Pn. G then acts on Pn via the homomorphism h, and
on X by restriction. The action of G on Pn lifts to an action on the affine cone An+1 over Pn,
and since the projective embedding of X ⊂ Pn is G-equivariant there is an induced action of G
on the affine cone X̃ ⊂ An+1 over X ⊂ Pn. We have:
O(An+1) = k[x0, ..., xn] =⊕r≥0
k[x0, ..., xn]r =⊕r≥0
H0(Pn,OX(r))
and if X ⊂ Pn is the closed subscheme associated to a homogeneous ideal I(X) ⊂ k[x0, ..., xn]
then the affine cone X̃ = Spec(R(X)) where R(X) = k[x0, ..., xn]/I(X).
The homogeneous algebras O(An+1) and R(X) are graded by homogeneous degree and, since
the G-action on An+1 is linear it preserves the graded pieces, so that the invariant subalgebra:
O(An+1)G =⊕r≥0
k[x0, ..., xn]Gr
is a graded algebra and so is R(X)G = ⊕r≥0R(X)Gr . By Nagata’s theorem R(X)Gr is finitely
generated when G is reductive (as we assume in these notes). The inclusion of finitely generated
graded k-algebras R(X)G ↪ R(X) determines a rational morphism of projective schemes:
X ⇢ ProjR(X)G
whose indeterminacy locus is the closed subscheme of X defined by the homogeneous ideal
R(X)G+ ∶=⊕r>0R(X)Gr . This motivates the definition of semistable points as follows:
Definition 24. For an action of a reductive group G on a projective scheme X as in the
previous discussion, we define the nullcone N of X to be the closed subscheme of X defined by
the homogeneous ideal R(X)G+ . We define the set of semistable points on X to be Xss ∶=X −N .
We can thus define x ∈ X to be a semistable point if there exists a G-invariant homogeneous
polynomial f ∈ R(X)Gr for r > 0 such that f(x) /= 0.
36
-
By the previous definition we have that Xss is the domain of definition of the rational map
X ⇢ ProjR(X)G, and we call the morphism Xss → X//G ∶= ProjR(X)G the GIT quotient of
this action of G.
The following theorem will show that Xss → X//G is a good quotient, but from we know
that the presence of non-closed orbits in the semistable locus will prevent the good quotient
from being a geometric quotient. This motivates the definition of stability.
Definition 25. A point x ∈ X is stable if dimGx = 0 (the stabilizer is finite) and there is a
G-invatiant homogeneous polynomial f ∈ R(X)G+ such that x ∈ Xf = {x ∈ X ∣f(x) /= 0} and the
action of G on Xf is closed.
First, we’d like to prove that the set of stable points is open in X. We need the following:
Lemma 4. The dimension of the stabilizer subgroup viewed as a function X → N is upper-
semicontinuous, that is, for every n the set {x ∈X ∶ dimGx ≥ n} is closed in X.
Proof. Consider the graph of the action Γ = (pX , σ) ∶ G ×X →X ×X and the fiber product
P X
G ×X X ×X
ϕ
∆
Γ
where ∆ is the diagonal morphism. Then the k-points of P consist of pairs (g, x) such that
g ∈ Gx. The function on P which sends p = (g, x) ∈ P to the dimension of Pϕ(p) ∶= ϕ−1(ϕ(p)) is
upper-semicontinuous (see Grothendieck’s EGA [3] IV 13.1.3) i.e. for all n we have that {p ∈ P ∶
dimϕ(p) ≥ n} is closed in P . By restricting to the closed subscheme X ≅ {(e, x)∣x ∈ X} ⊂ P we
can conclude that the set {x ∈X ∣dimGx ≥ n}is closed in X for all n.
Lemma 5. The stable and semistable sets Xs and Xss are open in X.
Proof. The set of semistable points is open by definition, since it is the complement of the
nullcone, which is closed. To prove that the set of stable points is open consider Xc ∶= ∩Xfwhere the union is taken over f ∈ R(X)G+ such that the action of G on Xf is closed. Xc is open
in X, so we need to prove that Xs is open in Xc. By the previous lemma we have that the
function x ↦ dimGx is upper-semicontinuous, so the set of points in X with zero-dimensional
stabilizer is open in X, so Xs is open since it is the intersection of two open sets in X (Xc and
the set of points with finite stabilizer).
Now we come to the main theorem of this section, which will prove an important tool to
have in moduli theory.
37
-
Theorem 3. For a linear action of a reductive group G on a closed subscheme X ⊂ Pn, let
ϕ ∶Xss → Y ∶=X//G the GIT quotient. Then:
(i) ϕ ∶ Xss → Y ∶= X//G is a good quotient for the G-action on the open subset Xss of
semistable points in X. Moreover, Y is a projective scheme.
(ii) There is an open subscheme Y s ⊂ Y such that ϕ−1(Y s) = Xs and that the GIT quotient
restricts to a geometric quotient ϕ ∶Xs → Y s.
(iii) A k-point x ∈ X(k) is stable if and only if x is semistable, its orbit G ⋅ x is closed in Xss,
and its stabilizer Gx has dimension zero.
For proof see [15] Theorem 3.14.
Example 10. Let us look at an example. Consider the linear action of G = Gm on X = Pn given
by
t ⋅ [x0 ∶ ... ∶ xn] = [t−1x0 ∶ tx1 ∶ ... ∶ txn]
We have that R(X) = k[x0, ..., xn] graded by degree. We claim that the functions x0x1, ..., x0xngenerate the ring of invariants. Let f ∈ R(X), then
f = ∑m=(m0,...,mn)
a(m)xm00 ⋯xmnn
and for f ∈ G
t ⋅ f = ∑m=(m0,...,mn)
a(m)tm0−m1−⋯−mnxm00 ⋯xmnn
So we see that f is G-invariant if and only if a(m) = 0 for all m such that m /= ∑ni=1mi. If
m = ∑ni=1mi we can write
xm00 xm11 ⋯x
mnn = (x0x1)m1⋯(x0xn)mn
so if f is G-invariant we have f ∈ k[x0x1, ..., x0xn]. Hence
R(X)G = k[x0x1, ..., x0xn] = k[y0, ...yn−1]
Taking Proj we get the GIT quotient X//G = Pn−1. We can write down the rational morphism
ϕ in this case:
ϕ ∶ Pn =X ⇢X//G = Pn−1
Clearly the nullcone is given by
N = {[x0 ∶ ... ∶ xn] ∈ Pn ∶ x0 = 0 or (x1, ..., xn) = 0}
38
-
Therefore we have our semistable points
Xss = ∪ni=1Xx0xi = {[x0 ∶ ... ∶ xn] ∈ Pn ∶ x0 /= 0