local properties of distributions of stochastic functionals

203
Translations of MATHEMATICAL MONOGRAPHS Volume 173 Local Properties of Distributions of Stochastic Functionals Yu. A. Davydov M. A. Lifshits N. V. Smorodina f i ))= American Mathematical Society

Upload: others

Post on 11-Sep-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Local Properties of Distributions of Stochastic Functionals

Translations of

MATHEMATICAL MONOGRAPHS

Volume 173

Local Properties of Distributions of Stochastic Functionals Yu. A. Davydov M. A. Lifshits N. V. Smorodina

f i ))= American Mathematical Society

Page 2: Local Properties of Distributions of Stochastic Functionals
Page 3: Local Properties of Distributions of Stochastic Functionals

Selected Titles in This Series

173 Yu. A. Davydov, M. A. Lifshits, and N . V. Smorodina, Local properties of distributions of stochastic functionals, 1998

172 Ya. G. Berkovich and E. M. Zhmud', Characters of finite groups. Part 1, 1998

171 E. M. Landis, Second order equations of elliptic and parabolic type, 1998

170 Viktor Prasolov and Yuri Solovyev, Elliptic functions and elliptic integrals, 1997

169 S. K. Godunov, Ordinary differential equations with constant coefficient, 1997

168 Junjiro Noguchi , Introduction to complex analysis, 1998

167 Masaya Yamaguti , Masayoshi Hata , and Jun Kigami, Mathematics of fractals, 1997

166 Kenji Ueno , An introduction to algebraic geometry, 1997

165 V . V. Ishkhanov, B. B. Lur'e, and D . K. Faddeev, The embedding problem in

Galois theory, 1997

164 E. I. Gordon, Nonstandard methods in commutative harmonic analysis, 1997

163 A. Ya. Dorogovtsev, D . S. Silvestrov, A. V. Skorokhod, and M. I. Yadrenko,

Probability theory: Collection of problems, 1997

162 M. V. Boldin, G. I. Simonova, and Yu. N . Tyurin, Sign-based methods in linear

statistical models, 1997

161 Michael Blank, Discreteness and continuity in problems of chaotic dynamics, 1997

160 V . G. Osmolovskff, Linear and nonlinear perturbations of the operator div, 1997

159 S. Ya. Khavinson, Best approximation by linear superpositions (approximate

nomography), 1997

158 Hideki Omori, Infinite-dimensional Lie groups, 1997

157 V . B . Kolmanovskff and L. E . ShaYkhet, Control of systems with aftereffect, 1996

156 V . N . Shevchenko, Qualitative topics in integer linear programming, 1997

155 Yu. Safarov and D . Vassiliev, The asymptotic distribution of eigenvalues of partial

differential operators, 1997

154 V. V. Prasolov and A. B. Sossinsky, Knots, links, braids and 3-manifolds. An

introduction to the new invariants in low-dimensional topology, 1997

153 S. Kh. Aranson, G. R. Bel i tsky, and E. V. Zhuzhoma, Introduction to the

qualitative theory of dynamical systems on surfaces, 1996

152 R. S. Ismagilov, Representations of infinite-dimensional groups, 1996

151 S. Yu. Slavyanov, Asymptotic solutions of the one-dimensional Schrodinger equation,

1996

150 B . Ya. Levin, Lectures on entire functions, 1996

149 Takashi Sakai, Riemannian geometry, 1996

148 Vladimir I. Piterbarg, Asymptotic methods in the theory of Gaussian processes and

fields, 1996

147 S. G. Gindikin and L. R. Volevich, Mixed problem for partial differential equations

with quasihomogeneous principal part, 1996

146 L. Ya. Adrianova, Introduction to linear systems of differential equations, 1995

145 A. N. Andrianov and V. G. Zhuravlev, Modular forms and Hecke operators, 1995

144 O. V. Troshkin, Nontraditional methods in mathematical hydrodynamics, 1995

143 V. A. Malyshev and R. A. Minlos, Linear infinite-particle operators, 1995

142 N . V. Krylov, Introduction to the theory of diffusion processes, 1995

141 A . A. Davydov, Qualitative theory of control systems, 1994

140 Aizik I. Volpert, Vitaly A. Volpert, and Vladimir A. Volpert, Traveling wave

solutions of parabolic systems, 1994

139 I. V. Skrypnik, Methods for analysis of nonlinear elliptic boundary value problems, 1994

138 Yu. P . Razmyslov, Identities of algebras and their representations, 1994

137 F. I. Karpelevich and A. Ya. Kreinin, Heavy traffic limits for multiphase queues, 1994

136 Masayoshi Miyanishi, Algebraic geometry, 1994

135 Masaru Takeuchi, Modern spherical functions, 1994 (Continued in the back of this publication)

Page 4: Local Properties of Distributions of Stochastic Functionals
Page 5: Local Properties of Distributions of Stochastic Functionals

Translations of

MATHEMATICAL MONOGRAPHS

Volume 173

Local Properties of Distributions of Stochastic Functionals

Yu. A. Davydov M. A. Lifshits N. V. Smorodina

d//l^Ss= YJ& American Mathematical Society Providence, Rhode Island

Page 6: Local Properties of Distributions of Stochastic Functionals

EDITORIAL COMMITTEE

AMS Subcommittee Robert D. MacPherson

Grigorii A. Margulis James D. Stasheff (Chair)

ASL Subcommittee Steffen Lempp (Chair) IMS Subcommittee Mark I. Preidlin (Chair)

K). A. HaBbmoB, M. A. Jlwfywvnx, X. B. CMopczjima

JIOKAJILHLIE CBOHCTBA PACnPEJIEJIEHMM CTOXACTMHECKMX ^YHKUMOHAJIOB

HayKa • <£H3MaTjraT MocKBa, 1995

Trans la ted from the Russian by V. E. Nazaikinskii and M. A. Shishkova

1991 Mathematics Subject Classification. Primary 60-02, 60F17, 60F99, 60G15, 60G30, 60G50, 60H99.

ABSTRACT. This book deals with an important problem of probability theory, namely, the structure of distributions of functionals on trajectories of random processes. By using new methods (the stratification method, the superstructure method, and the method of differential operators), the authors examine whether distributions of functionals of Gaussian processes and Poisson measures have densities with prescribed properties. The absolute continuity of such densities is studied. These methods are used to prove local limit theorems for distributions of a wide class of stochastic functionals.

The book is intended for research mathematicians and graduate students working in probability theory and its applications.

Library of Congress Cataloging-in-Publication Data

Davydov, IU. A. (IUrii Aleksandrovich), 1944-[Lokal'nye svoistva raspredelenii stokhasticheskikh funktsionalov. English] Local properties of distributions of stochastic functionals / Yu. A. Davydov, M. A. Lifshits,

N. V. Smorodina ; [translated from the Russian by V. E. Nazaikinskii and M. S. Shishkova]. p. cm. — (Translations of mathematical monographs, ISSN 0065-9282 ; v. 173)

Includes bibliographical references (p. - ) and index. ISBN 0-8218-0584-3 (acid-free paper) 1. Limit theorems (Probability theory) 2. Distribution (Probability theory) 3. Functionals.

4. Stochastic processes. I. Lifshits, M. A. (Mikhail Antol'evich), 1956- . II. Smorodina, N. V. (Natal'ia Vasil'evna), 1959- . III. Title. IV. Series. QA273.67.D39 1998 519.2—dc21 97-44426

CIP

© 1998 by the American Mathematical Society. All rights reserved. The American Mathematical Society retains all rights

except those granted to the United States Government. Printed in the United States of America.

@ The paper used in this book is acid-free and falls within the guidelines established to ensure permanence and durability.

Information on copying and reprinting can be found in the back of this volume. Visit the AMS home page at URL: h t t p : //www. ams. org /

10 9 8 7 6 5 4 3 2 1 03 02 01 00 99 98

Page 7: Local Properties of Distributions of Stochastic Functionals

Contents

Preface vii

Introduction ix

Notation xi

Chapter 1. Preliminaries 1 §1. Random processes and their distributions 1 §2. Convergence of probability measures 4 §3. Measurable partitions and systems of conditional measures 9

Chapter 2. Methods for Studying Distributions of Functionals 13 §4. Stratifications of measures 13 §5. Superstructure 23 §6. Differential operators 29

Chapter 3. Gaussian Functionals 35 §7. Gaussian measures on linear spaces 35 §8. Smooth functionals 47 §9. Distributions of smooth functionals 52

§10. Convexity and the isoperimetric property of the Gaussian measure 57 §11. Convex functionals and their distributions 72 §12. Distribution of the norm 88

Chapter 4. Poisson Functionals 105 §13. The configuration space 105 §14. Differential calculus on the configuration space 111 §15. The Gauss-Ostrogradskii formula 116 §16. Smooth functionals 127 §17. Distribution of the norm of a stable vector 138

Chapter 5. Local Limit Theorems 145 §18. Strong convergence theorems 145 §19. Strong convergence of distributions of Gaussian functionals 150 §20. The local invariance principle 156 §21. The local invariance principle in the case of attraction to a stable

law 161 §22. The infinite-dimensional local limit theorem 167

Bibliographical Notes 171

Bibliography 175

Index 183

Page 8: Local Properties of Distributions of Stochastic Functionals
Page 9: Local Properties of Distributions of Stochastic Functionals

Preface

Studying distributions of stochastic functionals (that is, functional defined on trajectories of random processes) is a problem of great interest and importance in probability theory, in particular in such topics as limit theorems, mathemati­cal statistics, and approximation procedures. The characteristic function method, traditional in probability theory, proved to be ineffective in this problem, and so until the late 1970s there had been only separate results pertaining to a few specific functionals. Noticeable progress in recent years is due to newly developed meth­ods, such as the stratification method, the superstructure method, and the method of differential operators. For distributions of functionals over Gaussian processes and Poisson measures, these methods have provided comprehensive results concern­ing the absolute continuity and existence of densities with given properties (e.g., boundedness or prescribed differentiability). The new methods are also effective in proving local limit theorems for distributions of a wide class of functionals.

The goal of the present book is to provide an introduction to these problems and methods. The topics covered are mainly related to the authors' original research and have not been presented in monographs earlier. However, we are not striving for maximum generality and completeness, but rather focus on the main ideas and their applications to the most fascinating problems, so as to make the book available to the widest possible readership.

We point out that definitive solutions to many of the problems discussed here have yet to be found, and we hope that this book will inspire further studies in this direction.

Normally, each section is concluded with exercises and some additional infor­mation. This information is by no means complete, and is chosen according to the authors' personal tastes and in line with the general scope of the book. As usual, the exercises are intended to aid active assimilation of the main text; hints are given where necessary. The sections are numbered consecutively throughout the book; we adopt the system in which, say, Theorem 5 in §2 is referred to as Theorem 2.5.

A majority of the results presented in the book have been discussed at the sem­inar on random processes at St. Petersburg State University and the St. Petersburg Branch of the Steklov Mathematical Institute of the Russian Academy of Sciences. We are grateful to all participants of the seminar for their attention to the work, patience, and useful remarks. We are also keenly grateful to I. A. Ibragimov for his constant support of and interest in our work.

The Authors

Page 10: Local Properties of Distributions of Stochastic Functionals
Page 11: Local Properties of Distributions of Stochastic Functionals

Introduction

We study distributions of functionals defined on trajectories (sample paths) of random processes. Consider a process {£(t),t G T} whose trajectories with probability 1 belong to a function space X and whose distribution is given by a measure P on X. A functional / on the trajectories is a measurable mapping of X into R, and so our problem is to study the measure P / _ 1 , that is, the image of P under / .

Suppose that £ is the Brownian motion process on the interval [0,1], X = C[0,1], and P is the Wiener measure (this is one of the most interesting cases). For a long time, performing explicit calculations was the only available method to study the distribution P / - 1 . For example, it has been successfully applied to the functionals

f{x) = suprr(t), f{x) = sup|z(*)|,

and I

f(x)= f\x(t)\pdt 0

for p = 2. However, no explicit expression is available for the distribution of the last functional for p ^ 2, and even in the early 1970s it was not known whether the measure P / _ 1 has a density.

Apparently, the first deep results concerning the structure of distributions for a wide class of functionals was obtained by Tsirel'son [113]. His method, both geometric and analytical in nature, relied heavily on the Gaussian property and was new in probability theory. Later it has become clear that these results are closely related to the convexity of Gaussian measures, discovered by Borell and Ehrhard [129, 135].

Almost at the same time, in the fundamental paper [146] P. Malliavin consid­ered hypoelliptic problems. In our terms, he studied the question of whether the distribution P / _ 1 has a smooth density for the case in which f(x) = x(l) and P is the distribution of a diffusion process that satisfies a stochastic differential equation of a certain type.

In 1978 Davydov proposed the stratification method, which is based on different ideas and allows one to study the absolute continuity problem for distributions of a large variety of functionals, including nonsmooth functionals. This method was substantially improved by Lifshits [56], who applied it to the problem of the existence of a bounded density for P / _ 1 .

Recently, several other approaches to this problem have been proposed, in particular, the method of differential operators due to Smorodina [104], which is based on the abstract formalism of differential calculus and is a convenient, powerful

ix

Page 12: Local Properties of Distributions of Stochastic Functionals

x INTRODUCTION

tool to attack the problem on the existence of a density with given differentiability for the distribution P / _ 1 .

The stratification method, and especially its modification known as the "super­structure method" (Davydov [38]), is very useful in proving local limit theorems for P / - 1 when P n weakly converges to the limit measure P .

Our book provides a systematic exposition of the stratification method, the superstructure method, and the method of differential operators, as well as of their applications to the above-mentioned problems.

The book consists of five chapters. The first chapter is introductory. It contains preliminary information from

measure theory and the theory of random processes. In the second chapter the three methods are introduced and their applications

are illustrated by simple examples. The third chapter deals with Gaussian functionals. Some conditions for the

existence, boundedness, and smoothness of the distribution density are obtained by means of the stratification method and the method of differential operators. The properties of distributions of convex functionals are derived from the convexity of Gaussian measures. As an important example, the structure of the Gaussian norm distribution is studied in detail.

The fourth chapter deals with Poisson functionals. A special differential calcu­lus is constructed on the Poisson space and the general results obtained are used to study the distributions of smooth functionals and the norm of stable vectors.

The fifth chapter treats strong convergence of distributions. A number of local limit theorems for a wide class of Gaussian functionals and a local invariance prin­ciple for sums of independent random variables are proved by the superstructure method. The infinite-dimensional local limit theorem is also considered.

Chapters 3-5 are based on the methods presented in Chapter 2 and do not depend on each other.

Page 13: Local Properties of Distributions of Stochastic Functionals

Notation

Sets

.A0, A C , A~, dAy and C(A) are the interior, the complement, the closure, the boundary, and the linear span of a set A.

A€ is the e-neighborhood of A. A\B and Ax B are the difference and the direct product of sets A and B. AAB = (A U B) \ (A fl B) is the symmetric difference of A and B. L x is the orthogonal complement of a subspace L. N = {1,2, . . . } is the set of positive integers. N = N U {00} is the extended set of positive integers. Z is the set of integers. Z+ = { 0 , 1 , . . . } is the set of nonnegative integers. 0 is the empty set.

Spaces

c is the space of bounded sequences with the norm ||x|| = supfcGN \xk\. Co is the space of sequences tending to zero (with the norm inherited from c). C(T) is the space of continuous functions on a metric compactum T with the

norm ||a;|| = sup t G T \x{t)\. C[a, b] = C([a, b]) is the space of continuous functions on the interval [a, b], D[0,1] is the space of functions that have no second-kind discontinuities on the

interval [0,1]; this space is equipped with the Skorokhod topology. C°° is the class of infinitely differentiable functions on R1 . Cg° is the class of compactly supported functions from C°°. C[) C C°° is the class of functions bounded together with all derivatives. Ci(D) is the class of / times D-differentiable stochastic functionals. Io, I, I(G, II) are the classes of functions integrable with respect to a Poisson

random measure. H(X) is the tangent space at a point X e £(G), where X(G) is the space of

configurations on G (§13). L°(X, 25), or L°(X), is the space of measurable functions on a measurable space

(X,©). L*(X, P , 05, B) is the space of functions with finite norm ||/ | | = ( / x |/|« dP) 1 / q

defined on (X, 55) and with range in a normed space B. L«(X,P,B) , L«(X,P), and L«(X) are abridged notation for L ^ P , ® , ! * . 1 ) .

lq = L9(N) is the space of sequences with finite norm ||x|| = (YHZLi \xk\q) 9-£?(H,B) is the space of j-linear B-valued forms denned on H. £J '(H) = / ^ (HjR 1 ) is the space of j-linear real-valued forms defined on H.

xi

Page 14: Local Properties of Distributions of Stochastic Functionals

xii NOTATION

/ ^ ( H , B) is the space of j-linear B-valued Hilbert-Schmidt forms defined on H (B and H are Hilbert spaces).

£^(H) = £^(H, R1) is the space of j-linear real-valued Hilbert-Schmidt forms defined on H.

M and Ma are the classes of smooth functionals that satisfy the local invariance principle (§18).

Rfc is the fc-dimensional Euclidean space. R+ and R+ are the positive semiaxis and the positive octant. R is the extended real axis R 1 (containing the points "plus infinity" and "minus

infinity"). R T is the space of all functions on a set T. R°° = R N is the space of all sequences. V(X) is the class of lower semicontinuous convex functionals (§8).

W j is the space of smooth distributions on R 1 (§6).

W (D I So , . . . , Sj) or W (So, . . . , S/) is the space of I times D-differentiable functionals (§8).

W9(X, P , H) is the space of smooth H-valued functionals on a space X that is equipped with a Gaussian measure P (§8).

W<?(X,P) = W 9 ( X , P , R 1 ) . Z(X) is the space of finite signed measures on X with the topology of conver­

gence in variation. X ^ , X ^ , andX' p 0 are the spaces of linear measurable, affine measurable, and

centered measurable functionals for a Gaussian measure P . X(G) is the space of configurations on G (§13).

Measurability

95 x or 35 is the Borel cr-algebra of a topological space X. 35fc = 35Rfc is the Borel cr-algebra of Rfc. £0 is the cylinder algebra (§7). £ is the cylinder a-algebra (§7). a(s) is the minimal cr-algebra that contains a class $ of sets.

Measures

6X is the unit measure concentrated at the point x. A and Afc are the Lebesgue measures on R 1 and Hk. 5(X) is the class of Radon Gaussian measures on a space X. £o(X) C £(X) is the subclass of centered measures. fj,A is the restriction of a measure fj, to a set A. fji*(A) is the outer measure of the set A. \i* (A) is the inner measure of the set A. |/i|(A) is the variation of a signed measure \i on a set A. \\fj,\\ is the total variation of a signed measure \x. fj, < v means the absolute continuity of a measure \x with respect to a mea­

sure v. \x^v means the equivalence (mutual absolute continuity) of measures \i and v. I |/LXr — z->-11 is the distance in variation between measures \x and v. /j,n => fj, denotes the weak convergence of measures.

Page 15: Local Properties of Distributions of Stochastic Functionals

NOTATION xiii

/ in ™ \i denotes the strong convergence of measures (convergence in variation). Af(ay a2) is the Gaussian (normal) distribution in R 1 with mean a and variance

a2. $(r) is the distribution function for A/"(0,1). P is a probability measure. E p / is the integral (mathematical expectation) of a functional / with respect

to a measure P . Dp/ = E p / 2 — E^/ is the variance of a functional / with respect to a measure P . supp(P) is the topological support of a measure P . ATp is the linear support of a measure P .

Miscellany

det M is the determinant of a matrix M. (•, •) is the bilinear form defined on a pair of spaces in duality. (•, •) is the inner product on a Hilbert space. | • | is the norm on a Hilbert space. || • | |B is the norm on a normed space B. X(l) is the action functional (§7). crp(Z) is the admissibility of a vector I with respect to a Gaussian measure P

(§7). domD is the domain of an operator D (§6). dimL is the dimension of L. ker J is the kernel of a linear operator / . tr J is the trace of / . I* is the adjoint of I. 1 is the function identically equal to 1. 1A is the indicator function of the set A. Ell is the ellipsoid of concentration of a Gaussian measure (§10). ux is the modulus of continuity of a function x.

Page 16: Local Properties of Distributions of Stochastic Functionals
Page 17: Local Properties of Distributions of Stochastic Functionals

CHAPTER 1

Preliminaries

§1. Random processes and their distributions

A family of random variables {&, t e T} defined on a probability space (fi, #, P) will be called a random process with parameter set T. The process value at the point t will be denoted by £t or £(£); to indicate the dependence on u> e fi, we sometimes write £(£,u;) or &(u;). We assume that the trajectories of the process belong to some topological space X C R T with probability 1. We also assume that the mapping S: ft —> X given by the rule u> —> £( • ,u>) is ^-measurable and Q5x-measurable, where Q5x is the minimal cr-algebra containing the open subsets of X. The measure P 2 _ 1 defined on Q3x, that is, the image of the measure P under the mapping 2, is called the distribution P^ of the process £.

Any measurable mapping / : X —> R 1 defines the random variable w —> /(£ (w))> which is called a functional of £. Obviously, the distribution of this random variable is just the measure P ^ / - 1 and hence is completely determined by the distribution of the process itself. Therefore, it is reasonable to ignore the specific nature of the measure P^ and study the distribution P / _ 1 assuming that P is an arbitrary probability measure on X. In this context, / is called a stochastic functional. Usually, X is one of the following spaces:

• the space C(T) of continuous functions on a compact set T; this space is equipped with the uniform norm;

• the space D(T) of right continuous functions that have limits at each point of the interval T; this space is equipped with the Skorokhod topology (for the definition and properties of this space see [4, Ch. 3]);

• the space LP(T,T, r) of equivalence classes of functions whose pth power (p > 1) is integrable with respect to the measure r; this space is equipped with the norm||/i | |p = ( / T | ^ d r ) 1 ^ .

The properties of the distribution P^, which characterizes the process as a whole, are the key to studying distributions of stochastic functional. Since P^ is a measure on the topological space X, we recall some properties of such measures.

Measures on a topological space. Suppose that X is a Hausdorff space, 3 C Q5x is an algebra of Borel sets, and P : 3 —• R+ is a monotone set function. Then P is called a Radon function if for all B G 3 one has

(1.1) P{B} = sup{P{Z}, Z C B, Z e 3, Z is compact}.

The function P is said to be regular if it satisfies condition (1.1) with compact sets replaced by closed sets. We say that P is dense if (1.1) holds for B = X. Obviously, if P is a Radon function, then it is regular and dense. If P is finitely additive, then the converse is also true: the regularity and density of P imply (1.1).

l

Page 18: Local Properties of Distributions of Stochastic Functionals

2 1. PRELIMINARIES

If P is countably additive, then it will be called a measure. By definition, we consider measures with finite values. Occasionally, we use infinite measures (with values in R + ) and signed measures (charges) with values in R1 . If P is countably additive and satisfies one of the above properties, we speak of a Radon measure, regular measure, or dense measure, respectively.

For a given P , one can construct the following two functions on the cr-algebra of all subsets of X:

P*{£} = inf{P{Z}, Z G 3 , 5 C Z } ,

P*{£} = sup{P{Z}, Ze3, BDZ}.

If P is a measure, then P* and P* are referred to as the outer and the inner measure, respectively.

A function h that maps a subset of X to a measurable space (Y,il) is said to be T?-measurable (where P is a measure denned on a cr-algebra 3) if there exist a 3-measurable function g: X —> (Y,il) and a set Z G 3 of full measure (that is, P { £ } = P{X}) such that h = g on Z.

A set B G X is said to be ^-measurable if its indicator function h(x) = 1# (X) is P-measurable.

Let us study conditions under which a function P defined on an algebra 3 extends to a Radon measure defined on 93x- We have the following theorem.

THEOREM 1.1. 7/3 contains a base of the topology, P is finitely additive and regular, and P* is dense, then there exists a unique Radon measure that is defined on 2$x and is a continuation o / P .

REMARK. If P is a Radon measure on 3, then P is dense and hence so is P*; thus, we have the statement of the theorem.

The proof of this and some other properties of measures on topological spaces can be found in [16, Ch. 1].

The set of points a ;GX such that any neighborhood of x has a positive measure is called the topological support supp(P) of the measure P . The topological support is closed. If X has a countable base or P is a Radon measure, then the topological support is the smallest closed set of full measure: P{supp(P)} = P{X}.

Measures on a linear space. Let X and X' be two linear spaces in duality. Thus, there is a bilinear form (x,x'), defined on X x X', such that

• for each nonzero x G X there exists an x' G X7 with (x, x') ^ 0; • for each nonzero x' G X' there exists an x G X with (x,x') ^ 0. We say that the form ( • , • ) brings X and X' into duality. The following

example is typical: X is a locally convex Hausdorff space and X7 is the space of continuous linear functionals on X.

In the space X we obtain the cylinder algebra €o generated by linear functionals of the form ( • ,#') and the cylinder a-algebra <£ = a(£o) that is the minimal cr-algebra containing Co-

We can make X a locally convex Hausdorff space by introducing the set of seminorms

(1.2) qx,(x) = \(x,x'}\.

The corresponding topology is called the weak topology since it is the weakest topology in which all functionals of the form (•, x') are continuous. When speaking

Page 19: Local Properties of Distributions of Stochastic Functionals

§1. RANDOM PROCESSES AND THEIR DISTRIBUTIONS 3

about the topological properties of X, we usually mean the weak topology. By Q5x we denote the Borel cr-algebra of the weak topology. We have £ C 2$x and, in the most interesting cases, <£ = <Bx; however, sometimes the Borel <r-algebra is wider than the cylinder cr-algebra.

In the dual space X' one can also introduce the weak topology by using the set of seminorms {|(x, - ) |} .

Suppose that a measure P is defined on a measurable space (X, (£). A vector a G X is called the barycenter (or the Pettis mean value) of the measure P if

(1.3) (a,*7) = [ (x,x?)P(dx) Jx

for all x ' ^ X ' . If a = 0, then the measure P is said to be centered. A linear operator K: X' —> X is called the correlation (covariation) operator

of the measure P if

(1.4) (Kx\ yf) = J ( > , x') - (a, x')) «*, y') - (a, y')) P(cfc) Jx

for any x\y' G X'. Note that in the class of Radon measures there is virtually no difference between

measures defined on (£ and on Q5x, since, by Theorem 1.1, each Radon measure defined on <£ can be uniquely extended to 93x-

The complex-valued function defined on X' by the relation

(1.5) V P ( Z ' ) = / exp{i(x,x')}P(dx) Jx

is called the characteristic functional of the measure P .

THEOREM 1.2. Let P and Q be two Radon measures on 03 x with equal char­acteristic Junctionals ipp = V>Q- Then P = Q.

PROOF. Since the characteristic functionals coincide, we obviously have the same joint distributions with respect to the measures P and Q for any finite set of linear functionals of the form (•,#'). This precisely means that P and Q coincide on Co and hence on €. It remains to use the fact that a Radon measure can be uniquely extended from € to QSx- D

The linear support Np of a measure P is defined as the minimal closed affine subspace of X for which P{ATP} = 1. Obviously, Np D supp(P). If P is a Radon measure and the topological support of P is an affine subspace, then Np = supp(P).

Gaussian processes and measures . The distribution A/*(a, a2) with density

(1.6) p(r) = (2TT)-1 /2CJ-1 exp{-(r - a)2/2a2}

is called a nondegenerate Gaussian (or normal) distribution on R1 . The distribution 6a concentrated at a point a is called the degenerate Gaussian

distribution A/^a, 0). The parameters a G R 1 and a2 > 0 coincide with the mean value and the variance of A/'(a,cr2), respectively. The characteristic distribution function of Af(a, a2) has the form

(1.7) (p(u) = exp{iau — o~2u2/2}.

Page 20: Local Properties of Distributions of Stochastic Functionals

4 1. PRELIMINARIES

The distribution A/"(0,1) is called the standard Gaussian distribution; in what follows its distribution function will be denoted by 3>(r).

A random variable with Gaussian distribution is called Gaussian. The notions of a Gaussian random variable and a Gaussian distribution function can naturally be generalized to multidimensional and infinite-dimensional linear spaces. The required generalization can be obtained by using the following construction. Let (X, X7) be a pair of spaces in duality. A measure P defined on the measurable space (X, C) is said to be Gaussian if the distribution with respect to P of each linear functional (• , x') is Gaussian.

A random process {£*,£ G T} is said to be Gaussian if the joint distribution of any finite set {£tl, • • • » 6 n } *s a Gaussian measure on R n (in particular, the distribution £t belongs to the family Af(a, cr2)). The notions of a Gaussian process and a Gaussian measure are related as follows: under the natural choice of duality, the distribution P^ of a Gaussian process £ is a Gaussian measure in the space X of trajectories. In Chapter 3 we consider numerous examples of such measures and their most important properties.

§2. Convergence of probability measures

Weak convergence. The theory of weak convergence is considered in detail in several textbooks (e.g., see [4, 24]). Here we present some basic definitions and results that are not commonly known but will be useful for our further goals.

A sequence of (probability) measures {P n } defined on the cr-algebra 2$x of Borel subsets of a metric space X is said to be weakly convergent to a measure P if for any continuous bounded function ft on X we have

/ hdPn-> / hdP. Jx Jx

We denote weak convergence by the symbol =>. The following theorem contains equivalent definitions of weak convergence [4, §§2, 5].

THEOREM 2.1. The weak convergence P n => P is equivalent to each of the following properties.

1) I ImP n {F} < P { F } for any closet set F\

2) l imPn{G} < P{G} for any open set G\

3) l imPn{A} = V{A} for any set A with T?{dA} = 0; 4) for any P-almost everywhere continuous functional f: X —> R 1 the distri­

butions P n / _ 1 weakly converge to P / _ 1 .

We have an even more general statement. Let hn and h be measurable mappings of X to a separable metric space Y.

Let Xo be the set of x G X for which the relation hn(xn) —> h(x) is violated for some sequence (xn) that converges to x. Then (see [4, Theorem 5.5]) the relations P n => P and P{X 0 } = 0 imply P n / i ; ! => P / r 1 .

This statement readily yields the following proposition.

PROPOSITION 2.1. Let Y = R1 . Assume that hn and h are bounded by the same constant. 7 / P n = ^ P and P{Xo} = 0, then

(2.1) / hndPn-> [ hdP. Jx Jx

Page 21: Local Properties of Distributions of Stochastic Functionals

§2. C O N V E R G E N C E O F PROBABILITY MEASURES 5

PROOF. Suppose that c > 0 is a constant that majorizes hn and h. Let Vn = Pn/i" 1 and V = PA - 1 . By our assumptions, Vn => V. The function ip(t) = —cl(_oo)C](£) +tl(_CjC)(t) +cl[C)OC)(t) is continuous and bounded on R1 . Therefore,

/ <pdVn^ I <pdV.

Since JR 1 ipdVn = JRi tVn {dt) = / x hn dPn and / R 1 ipdV= J x / idP, we arrive at (2.1). •

The following theorem is very useful.

THEOREM 2.2. Suppose that X zs separable. Let {/#} fee a family of real func­tions that are uniformly bounded on X and equicontinuous at each point of a set Y of P-full measure. 7 / P n => P , then

[ UdPn-> [ UdP Jx Jx

uniformly with respect to fi.

PROOF. Let c > 0 be a constant that majorizes the functions /#. For any e > 0 and x e y , we can find an open ball Vx centered at x such that

(2.2) supuju(Vx)<e.

By reducing the radius of this ball, we can ensure that P{dVx} = 0. Obviously, the family [Vx)x^y is a cover of Y. Since X is separable, we can take a countable subcover (T4), Vk = VXfc. Let us choose N so that P(V) > 1 — e/c, where V = Ux T4, and consider the functions

N

Mx) = ^r/fo(xk)iUk(x), k=\

whereC/fc = 1 4 \ U , t i ^ . Then we have

TV

I UdPn- I UdPn\< Tuh(Vk)Pn{Uk} + Pn{Vc}.c<e + cPn{Vc}. \Jx Jx I k=1

We also have a similar inequality with P n replaced by P . On the other hand, since P{dUk} = 0, we have

r ~ N r ~ lim / fodPn = yZMxk) lim Pn{Uk}= / UdP.

n^°°Jx i n-*°° Jx

The set Vc is closed, and therefore,

l i m P n ( l / c ) <P{VC) <e/c. n

Thus,

/ UdPn- f , Jx Jx

,dP lim sup n ^

Since £ is arbitrarily small, we obtain the desired statement. •

< 2 ( e + c l i m P n { y c } ) <4e.

Page 22: Local Properties of Distributions of Stochastic Functionals

6 1. PRELIMINARIES

This theorem can be used to study the convergence of integrals of functions assuming values in Banach spaces.

THEOREM 2.3. Let h be a measurable mapping of a separable space X to a Banach space (B, || • ||). We assume that h is bounded with respect to the norm and continuous P-almost everywhere. IfPn => P, then

[ hdPn- [ hdP I Jx Jx

0.

P R O O F . Let us consider the family {hy*}) y* E Ball*, of functions hy*(x) = (h(x)y y*), where Ball* is the closed unit ball in B*. It follows from the assumptions of the theorem that this family is uniformly bounded and P-almost everywhere equicontinuous. By Theorem 2.1, we have

sup / (h(x)yy*)Pn(dx)- / (h(x),y*)P{dx)\ reBalll \Jx JX \

= sup ( / hdPn- J /idP,y*\ = / hdPn- [ hdP 2/*GBallJ | \ JX JX ' I II JX JX

Just as in the finite-dimensional case, the weak convergence of measures in a Banach space implies the convergence of their characteristic functionals.

THEOREM 2.4. LetPn andP be measures in a separable Banach space (X, ||-||), and let ^n and ip be their characteristic functionals. IfPn => P , then V>n(#*) —> ip(x*) uniformly with respect to x* e Ball* for any ball Ball* C X*.

PROOF. We see that the family of functions {exp(z( • ,#*)),#* G Ball*} is uniformly bounded and equicontinuous. Therefore, the desired statement follows from Theorem 2.2. •

Note that, in contrast with the case of R n , the converse statement does not hold in the infinite-dimensional case.

Convergence in variation. Let jubea finite charge (signed measure) on ©x-The measure \ji\ defined by the relation

|/x|(A) = sup J2\lM{An)\,

where the least upper bound is taken over all countable partitions of X, is called the variation of \x.

The value ||^|| = M(X) is called the total variation of \x. The space of all finite charges [x on <8x equipped with the norm ||/x|| is a Banach

space and is denoted by Z(X). The convergence with respect to the norm of this space is called convergence in variation or strong convergence and is denoted by the symbol ™.

If the charge \i is absolutely continuous with respect to the measure v and if h = d/i/dv, then \\/j,\\ = fx\h\du = ||/I||LI(X,I/)- Therefore, in the absolutely continuous case (that is, for \xn <^ v and j[i « i/) the convergence \xn ^> \x is equivalent to the convergence of the densities hn = d/j,n/dv to h = d/x/du in the norm of L1 (X, v). (This property is generalized in Problem 2.4.) The total variation cannot be made larger by applying a mapping.

Page 23: Local Properties of Distributions of Stochastic Functionals

§2. CONVERGENCE OF PROBABILITY MEASURES 7

THEOREM 2.5. Let f be a measurable mapping of (X,2l) to (Y, 2)), and let u = fif-\ Tten ||i/||<||Ai||.

PROOF. If sets An form a partition of Y, then the sets f~l(An) form a partition of X. Therefore,

5>CA»)I = I M / - 1 ^ ) ! = > (rVn) ) i < IIMII, n n n

and the desired inequality follows. •

THEOREM 2.6. Le t (X ,a , ^ ) = (Xi,2ti,/Ai)x(X2,a2 ,M2). Tften ||JK|| = ||/zi|| • llwll-

PROOF. We set v\ — \\i\\ and v2 = |^2|. Obviously, \i\ <C i/i and \x2 <^ i/2. Suppose that h\ = d\i\jdv\ and /i2 = d^ldvi- One can readily verify that /ii x ^ C i i x i/2 and

So2)(^) = f"WMl)l XGXl' y€X2-Therefore,

llMi x /x2|| = / M z ) M y ) K (dx)u2 {dy) 7 X i x X 2

i /Xi «/X2 •

Let X be a linear topological space, and let \i\ and \x2 be two measures on Q5x-It follows from the preceding two theorems that ||^i x ^2 | | < UMIII ' llM2||- Indeed, the convolution \i\ */J,2 is equal to ^ / _ 1 , where \i = \i\ x \i2 and f{x\,x<i) = x\ + x2> Therefore

I I M 1 * M 2 | | = I I M / - 1 I I < I H I = IIMIII-IIM2| | . Let us recall the definition of the mixture of measures. Let (Y,2),i/) be a

measure space, and let {ny}yeY be a family of charges on (X, 21) such that for each A e 21 the function t/ —> My(^) is measurable. The charge

ji(A) = y My(^)^ (dy)

is called the mixture of charges {fiy} with respect to the measure v. From this definition and from the definition of total variation we readily obtain

(2.3) Ml < jhUKdl/).

The strong convergence of measures implies the weak convergence. Although the converse is obviously false, we have the following statement.

THEOREM 2.7. Let P n => P and Q n => Q. Then

(2.4) | | P - Q | | < I 5 i | | P n - Q n | | .

Page 24: Local Properties of Distributions of Stochastic Functionals

8 1. PRELIMINARIES

P R O O F . Suppose F is a closed set and 8 > 0. We can choose a small e > 0 so that P{d(Fe)} = Q{d(Fe)} = 0 and Q{F£} < Q{F} + 8. Then

P{F} - Q{F} < P{F£} - Q{F£} + 8

= lim (Pn{F£} - Q„{Fe}) + 5 < Ihn" | |P n - Qn|| + 8. n n

Since 5 is arbitrarily small, we have

P { F } - Q { F } < f f i n ~ | | P n - Q n | | . 71

To pass from closed to arbitrary sets, it suffices to use the regularity formula

P{A} = sup {P{F}, F C A, F is closed}.

This formula holds for all A e Q5x provided that X is a metric space. •

PROBLEM 2.1. Let P n , n e N, be a sequence of Gaussian measures in a separable Banach space X, and let an be the barycenter of the measure P n . If Pn => Poo? then an —> a^ in the weak topology generated by the duality (X, X*).

PROBLEM 2.2. Suppose P i and P 2 are finite measures, Q = (Pi + P2V2, zi = dP i /dQ, and z2 = dP2/dQ. Then

| | P i - P 2 | | = E Q | z i - z 2 | = 2 E Q | l - z 2 | ,

where EQ denotes the integral with respect to the measure Q.

PROBLEM 2.3. Suppose that P i and P 2 are probability measures, Q is a prob­ability measure with respect to which P i and P 2 are absolutely continuous (for example, Q = (Px + P 2 ) /2) , zx = dP i /dQ, and z2 = dP2/dQ. The Hellinger distance p (P i ,P 2 ) between P i and P 2 is defined by

P2(PuP2) = \J(V^-V^)2dQ.

Verify that this distance is well defined, i.e., that p (P i ,P 2 ) is independent of the choice of Q, and prove the inequalities

2p 2 (P i ,P 2 ) < ||Pi - P 2 | | < 2v /2p(Pi,P2) .

PROBLEM 2.4. Let ()Un,n e N) be a sequence of finite measures, fin ^> fM^. Prove that for an arbitrary measure v the density of the continuous component of fin with respect to v converges (in the metric of LX(X, v)) to the density of the absolutely continuous component of ^oo with respect to v.

PROBLEM 2.5. Let (£n) be a sequence of random elements of a separable space X. Suppose that the sequence (P n ) of their distributions weakly converges, P n => P . Let {/#} be a family of functions defined on X, equicontinuous P-almost everywhere, and satisfying

sup I/0(a)I < H(x) X

for some function H and all d. If the family of random variables (H(£n)) is uni­formly integrable, then

dP uniformly with respect to d.

Page 25: Local Properties of Distributions of Stochastic Functionals

§3. MEASURABLE PARTITIONS 9

This statement is a generalization of Theorem 2.2.

§3. Measurable par t i t ions and systems of conditional measures

Suppose that (X, 05) is a measurable space, P a probability measure on 05, and T a partition of X into disjoint subsets 7.

We intend to define measures P 7 on the elements of V so that P can be rep­resented as a mixture of the P 7 . This representation enables us to study the properties of P by analyzing measures of relatively simple structure.

By X / r we denote the quotient space generated by T, that is, the set whose elements are the elements of I\ Let n: X —> X / r be the canonical projection; it takes each x G X to the element 7 G T that contains x.

We equip the set X / r with the cr-algebra 05x,r of sets A C X / r such that 7T~1(A) G 05. Then on the measurable space ( X / r , 05x,r) we can consider the measure Pr{^4} = P{7r~1(^)}> which will be called the quotient measure.

Finally, let us consider the cr-subalgebra 05r C 05 that consists of the sets 7r_1(^4), A G 05x,r- The elements of 05r are 05-measurable subsets of X that respect the partition T, that is, entirely contain each element of the partition with which they have a nonempty intersection.

A system of probability measures {P 7 } , 7 G T, defined on 05 is called a system of conditional measures for P with respect to T if for any B G 05 the function 7 —• Py{B} is 05x,r-nieasurable and

(3.1) P{BrHr- 1 ( i l )} = y P 7 { B } P r ( d 7 )

for any A G 05x,r-For A = X / r formula (3.1) becomes the well-known total probability formula

(3.2) P{B} = f P 7 { £ } P r ( d 7 ) . Jx/r

It is just this formula that allows us to say that P can be represented as a mixture of the measures P 7 ; moreover, the quotient measure is used as the weight.

Let us show how the term "a family of conditional measures" is related to the notion of conditional probability, which is more frequently used in probability theory. The conditional probability of an event B with respect to a cr-algebra V is defined as the D-measurable function such that

P{BD} = [ P{B I V}(x)P (dx), D G V. JD

By comparing this definition with (3.1), we see that the function Pn^(B) is a version of the conditional probability P{B | 05 r}.

The natural question of whether for all X, P , and V there exists a family of conditional measures is not simple. The following example shows that sometimes the answer is negative.

Suppose that X = R z , P is a distribution on X corresponding to a sequence of independent identically distributed random variables, and T is the partition of X into classes of translation-equivalent sequences, that is, x ~ y if and only if for some integer n and for all integer k we have Xk = yn+k- If the distribution of each random variable is not concentrated at a single point, then in the above situation

Page 26: Local Properties of Distributions of Stochastic Functionals

10 1. PRELIMINARIES

there does not exist a family of conditional measures. Indeed, a typical equivalence class 7 e X / r is an infinite sequence of distinct elements of X, and the conditional measure P 7 on this class must be finite and translation-equivalent. Obviously, these two properties contradict each other.

Nevertheless, under some restrictions on X and T it is possible to prove the existence of a conditional measure.

Suppose that X is a complete separable metric space and 03 = 03x is the Borel cr-algebra. Let T be a partition of X into the preimages of points for some measurable mapping of (X, 03) to a complete separable metric space. This partition will be called measurable.

THEOREM 3.1. If T is a measurable partition of the space X and P is a probability measure on (X, 03), then there exists a family of conditional measures {P7}7 € X / rS for Pr-almost all 7 the conditional measure is concentrated on the corresponding element of the partition, P7{7r - 1(7)} = 1.

The proof of Theorem 3.1 can be found in [78, Ch. VI, §46]. Now let us consider the properties of a family of conditional measures. The

most important property is uniqueness.

PROPOSITION 3.1. 7 /{P 7} and {P 7 } are two families of conditional measures for P with respect to a measurable partition T, then the measures P 7 and P 7 coin­cide for Pr-almost all 7.

PROOF. For B e 03 and A e 03 r , formula (3.1) implies

/ P 7 { £ } P r ( d 7 ) = / P 7 { £ } P r ( d 7 ) . J A J A

Since A is arbitrary, for a given B we have P 7 { # } = P 7 { # } for Pp-almost all 7. Hence, for each countable system of sets, the measures P 7 and P 7 coincide for Pr-almost all 7. Recall that we consider measures on the Borel cr-algebra of a separable metric space.

In this cr-algebra there is a countable class of sets such that the values of a measure on the set of this class uniquely determine the measure. Therefore, for 7 from a set of Pp-full measure we have P 7 = P 7 . •

PROPOSITION 3.2. If g is an integrable function on the space (X, 03, P ) , then

(3.3) / g(x)P (dx) = f [ g{x)¥1 (dx) P r (dx). Jx. . /x/r Jx

P R O O F . For g = 1#, (3.3) can be reduced to (3.2). One can readily pass to step functions, nonnegative functions, and arbitrary integrable functions. •

Now let us show how one can study the images of measures under mappings by using conditional measures.

Let / : (X, 03) —» (Y,il) be a measurable mapping.

PROPOSITION 3.3. For Aeiiwe have

(3.4) P / " 1 ( -A)= / P ^ - H ^ P r ^ ) . Jx/v

Page 27: Local Properties of Distributions of Stochastic Functionals

§3. MEASURABLE PARTITIONS 11

PROOF. Formula (3.4) is the special case of (3.2) for B = f~l{A). •

The measures P 7 / _ 1 ( - ) defined on (Y,il) are called the conditional distribu­tions of the functional / with respect to the measure P and the partition T.

We also need an analog of formula (3.4) for the density of the distribution of a functional. Let / z b e a measure on (Y,it). We assume that there exists a jointly measurable function p: (7,y) —> p^y) such that for Pp-almost all 7 G X / r the function p7(-) is a version of the density of the measure P 7 / _ 1 with respect to the measure \i on the cr-algebra it:

dP f-1

(3.5) ^ ( y ) = f_g_(y).

PROPOSITION 3.4. The function

(3.6) P(y)= [ P7(y)Pr(d7) Jx/r

is a version of the density of the measure P / _ 1 with respect to the measure \x on the a-algebra it.

PROOF. For A e il we use (3.4), substitute (3.5), then change the order of integration and use the definition of p:

p/- 1 (A)= [ p^ri(A)pr(d1) 7x/r

= / / P^(y)^(dy)Pr{dj) = / p{y)n{dy). Jx/r J A J A

This shows that p is the density of P / _ 1 with respect to \x. •

Relation (3.6) allows us to obtain some information about the density of the distribution of functionals / from the information about the density of conditional distributions p 7 . In the subsequent chapters, formula (3.6) serves as one of the main tools.

Now let us use the terminology of conditional distributions to show that the functional / is independent of the cr-algebra 05 r .

THEOREM 3.2. Let f: (X, 05) —> (Y,ii) be a measurable functional with values in a separable metric space Y. If T is a measurable partition o/X, then for f to be independent o / S r it is necessary and sufficient that P 7 / _ 1 = P / _ 1 for Pp-almost all 7 G X / r .

PROOF. First, we verify the sufficiency. Let

A* e s x , r ' ^2 = n~\A2) e 05 r.

We apply formula (3.1) to B\ and A2. Then we use the assumptions of the theorem and, finally, factor out the constant coefficient from the integral:

P{B1B2} = P{B1n7r-1(A2)}= f P1{f-1(A1)}Pr(d1) JA2

= [ Pif-'iA^Pridj) = P{f-1(A1)}Pr{A2} = P{B1}P{B2}. JA2

Page 28: Local Properties of Distributions of Stochastic Functionals

12 1. PRELIMINARIES

The relation P{B1B2} = P{Bi}P{B2} shows that the a-algebra <Br and the cr-algebra generated by the functional / are independent.

The necessity of the assumptions of the theorem follows from the same chain of relations:

/ P 7 { / - 1 ( A 1 ) } P r ( d 7 ) = / P { / - 1 ( A 1 ) } P r ( d 7 ) . JA2 J Ai

By literally repeating the proof of Proposition 3.1, we prove that the measures P 7 / _ 1 and P / _ 1 are equal for Pp-almost all 7. •

Now we assume that two measures P and Q, Q c P , are given on 2$. Obviously, then Qi <C Pr- Let us find the relation between P 7 and Q 7 .

THEOREM 3.3. For Vr-almost all 7 we have Q 7 <C P 7 and

(3.7) ^ l ( z ) = W(x) d P 7

W dP W dP £(*(*))

for P 7 -almost all x.

PROOF. It suffices to verify that the measures

*(*)-fw[^w i - l

P 7 (dx)

satisfy the following relation of the form (3.1):

Q{B n Tr-'iA)} = J Q7{£}Qr (*y)

for B G 93 and A e 93x,r. First, we express Q in terms of P :

Q{Bn<ir-l(A)}= [ g(x)P(dx), J x

where g(x) = l{Bnn-i(A)}(x)^§(x). Applying (3.3), we obtain

/ g(x)P (dx) = [ f g(x)P1 (dx)Pr (d7) Jx Jx/r Jx

lB(x)-^(x)P1(dx)Pr(d1) -II = JAQy{B}^h)Pr(d1) = J^{B}Qr(d7). D

Page 29: Local Properties of Distributions of Stochastic Functionals

CHAPTER 2

Methods for Studying Distributions of Functionals

§4. Stratifications of measures

General outline of the method. A stratification of a probability measure is a construction intended for studying distributions of functionals of a random process with the use of the total probability formula. The main idea is to con­sider measurable partitions of the probability space such that the elements of these partitions have a simple structure. In what follows, these elements are always finite-dimensional sets.

In the traditional approach, the total probability formula is commonly used in the form

P { 4 } = / p { A | e = r}P€(dr),

where f is a random variable with distribution P^ defined on a probability space X. However, the conditional measure P{ • | £ = r} is often no easier to study than the original measure P . No wonder that this formula proves useless in many cases, and one has to use partitions of quite different type, for which the elements, rather than (as in the traditional approach) the quotient space, are finite-dimensional.

Let us outline the subsequent considerations. The distribution of a random process is treated as a Borel measure P on a topological space X whose points are the process trajectories. Let / : (X, <8x) —> (Rm ,2$m) be a measurable functional ranging in the Euclidean space R m . We shall study its distribution, that is, the measure P / - 1 on Q5m, in three stages.

First, we choose a partition T of X for which it is easy to study the conditional measures P 7 . To this end, the partition elements are chosen to be sets of simple geometric structure, like lines, rays, segments, smooth curves, or multidimensional planes. These elements will be called strata. (In most cases, the elements are one-dimensional, so we speak of "fibers" rather than "strata.") The strata are chosen so that / will be a smooth function with nondegenerate derivative along each stratum.

Second, we apply the mapping / to the conditional measures P 7 . This is a comprehensively studied finite-dimensional problem for which the results are well known; we present them in the sequel. At the second stage of our investigation, we obtain some information about the conditional distributions P 7 / _ 1 , the images of P 7 under / .

At the third stage, we estimate the quotient measure P r and use formula (3.4) to obtain some information about P / _ 1 from the information obtained at the second stage. By using (3.5) instead of (3.4), we can study the properties of the density of the distribution P / _ 1 .

Our immediate goal is to construct a wide class of partitions suitable for our scheme and study the related conditional measures.

13

Page 30: Local Properties of Distributions of Stochastic Functionals

14 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

Admissible semigroups. Let (X, 2$x,P) be a topological space equipped with a Borel measure. A measurable mapping G: X —» X is said to be admissible i f P G " 1 < P .

A family of mappings {Gc}, c G R + , will be called an admissible semigroup if 1) Go is the identity mapping; 2) GCl o GC2 = GCl+C2; 3) for any c G R™, the mapping Gc is admissible and one-to-one; 4) for each x € X, the mapping c i—> Gc(x) is one-to-one. We say that two elements x\, x2 G X are equivalent if

GCl(xi) = GC2{x2)

for some Ci, c2 G R + .

PROBLEM 4.1. Prove that this is an equivalence relation on X.

Let T be the partition given by this equivalence relation, and let n: X —> X / r be the canonical projection. The equivalence classes 7r_1(7), 7 G X / r , will be called the orbits of the semigroup G. This is natural, since the orbits are invariant under each mapping Gc( •) .

The following two statements allow us to introduce a useful real parametrization of the orbits.

PROPOSITION 4.1. Let x\ ~ x2. Then there exists a unique c(x\,x2) G R m

such that GCl (xi) = GC2(x2) implies c\ = c2 + c(xi, £2).

PROOF. Let GCl(#i) = GC2(x2) and GC3(xi) = GC4(x2). Then c i - c 2 = C3-C4. Indeed, by the definition of an admissible semigroup, we have

^c2+c3(^2) = GC3 o GC2(x2) = GC3 o GCl(xi) = GC3+Cl(xi)

= GCl o GC 3(xi) = GCl o GC4(x2) = GC4+Cl(x2).

Moreover, property 4) of an admissible semigroup implies

C2 + C3 = C4 + c i ; C2 — c\ = C4 — C3.

Thus, c(#i,:r2) = ci — c2 depends only on x\ and x2. •

PROPOSITION 4.2. Let 7r_1(7) 6e an orta* o/{G c}, and to z G 7r -1(7). 7%en Jx\ y \-^> c(xyy) is a one-to-one mapping of TT~1{^) onto a X-measurable set Cx C R m . This mapping intertwines the action of{Gc} on 7r-1(7) with the action of the semigroup of translations on Cx:

(4.1) JxGc(y) = Jx(y)+c.

PROOF. Let us verify that the mapping is one-to-one. Suppose that

c(x,y') =c(x,y")\

GCl(x) = Gc/(y'); ci = c' + c(x,y')\

GC2(x) = Gc»{y"y, c2 = c" + c(x,y').

Then

Gc>+C"{y') = GC" o Gc>(y') = GC" o G c / (x) = Gc>l+Ci+C(x^)(x)

= G c /+ C 2(x) = Gc> o GC2{x) = GC'+C"(y")'

Page 31: Local Properties of Distributions of Stochastic Functionals

§4. STRATIFICATIONS OF MEASURES 15

Since the mapping Gc>+C"(-) is one-to-one, it follows that y' = y'\ and thus Jx(-) is one-to-one.

Now let us prove (4.1). Suppose that GCl(x) = GC2(y) and Jx{y) = c{x,y) = C\ — C2. T h e n

Gc+Cl (x) = Gc o GC1 (x) = Gc o GC2 (y) = GC2 (Gc(y))>

and hence, Jx(Gc(y)) = c — c\ — C2 = Jx(y) + c. It follows from (4.1) that Cx is measurable, which completes the proof. •

PROBLEM 4.2. Prove that choosing a different initial point x in the same orbit results only in a linear change of Jx and in a translation of the set Cx:

(4.2) JXl(>) = JX2(') + c(xux2),

(4.3) CXl = CX2 +c (x i ,x 2 ) .

In addition to the algebraic and set-theoretic properties l)-4), we impose the following topological condition of continuity:

5) Jx is a homeomorphism of n"1^) and Cx if vr_1(7) is equipped with the topology inherited from X.

Let Am be the Lebesgue measure on R m . Then we can define a measure A7 on Q5x by setting

(4.4) \1(B) = Xm(jx(Bmr-1(7))).

By condition 5), this is well defined. Obviously, A7 is concentrated on 7r_1(7). Note that A7 is independent of the reference point x G 7r-1(7) used in (4.4) by

virtue of (4.2).

PROPOSITION 4.3. The measure A7 is Gc-invariant. If B e Q5x and c G R + , thenG-l{B) G Q5X and

(4.5) A7(B) = A7(GC"1(B)).

PROOF. Since the orbit 7r_1(7) is Gc-invariant, we have

(4.6) Gc-1(B)n7r-1(7) = Gj 1 ( ^n7r - 1 (7 ) ) -

Relations (4.1) and (4.6) imply

JX(GC-X(£) 0 7 ^ ( 7 ) ) = J s ( G j 1 ( B n 7 r - 1 ( 7 ) ) ) = J x (BfHr- 1 ( 7 ) ) - c

Therefore, since the Lebesgue measure is invariant, we obtain

A7(G"1(^)) = xrn{Jx{G;1(B)n7r-1(i))) = xm(Jx(Bn7r-1(1))-c) = Xm(jx{Bn7r-1^))) = X1(B). •

Calculation of conditional measures. Now let us additionally assume that X is metrizable, complete, and separable and that the partition into Gc-orbits is measurable. Then the system of conditional measures {P 7 } is well defined (see Theorem 3.1). We set P c = P G " 1 . Since the semigroup G is admissible, we have P C < P .

Let pc = dPc/dP be the corresponding density.

Page 32: Local Properties of Distributions of Stochastic Functionals

16 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

THEOREM 4.1. Let the partition Y of a complete separable metric space X into orbits of an admissible semigroup {Gc}, c G R + , be measurable. Then for P r -almost all 7, the conditional measure P 7 is absolutely continuous with respect to the invariant measure A7, and the density is given by

(4-7) ^(Gu(x)) = K^\pa(Gu(x))]-1

for P7-almost all x. The normalizing factor K1 is constant on 7.

REMARK. In this theorem X can be replaced by any open set F c X , since by changing the metric, we can make this set a complete separable space. In this case, we must assume that the mapping Gc takes V to V. Prior to proving the theorem, let us present a criterion for absolute continuity of a measure on R m in terms of properties of its translates.

Let \i be a finite measure on (Rm,Q5m). By *iu, u G R m , we denote the translate of \i given by \iu(A) = p,(A — u).

LEMMA 4.1. Let U C R m be a set of positive Lebesgue measure, Xm(U) > 0, and let \iu <C \i for almost allu eU. Let us choose a representative of the function p(u,v) = dp,u/dfj,(v) jointly measurable in (u,v).

Then \x <C Am, and for {i-almost all v G R m the density M(-) = dfj,/d\m(') satisfies the relation

(4.8) M(v + u) = M(v) dfiu,

simultaneously for Xm-almost allu eU.

P R O O F OF LEMMA 4.1. Suppose that v is a nonzero absolutely continuous measure concentrated on — U. Then the convolution p, * v satisfies

(4.9) fM * u(A) = [ p{A- u)v (du) = [ iiu(A)v (du) = [ ^~u(A)u (-du). J-u J-u Ju

We can rewrite the relation \iu <C \x as \x <C pTu. Therefore, it follows from (4.9) that p, <C \x * v. On the other hand, \x * v <C Am, since v <C Am.

Thus, fi < jLt* v < Am. Let M(-) be the density of \i with respect to Am. Then the function (tx, v) i-> M(v)/M(v + u) is a measurable representation of the density dp,(v)/dp,~u.

Indeed, after the linear change w = v + u we obtain

f M{v) _ t t / , N f M{w-u) ,_ . f M(w-u)nr, , x m / , , / w/ NM (dv) = / —»\r/ x P>(dw) = / —,\r/ x M(w)Xm (dw) JAM(v + uf v ; JA+U M(w) ^ ; y A + u M(w) K J v ;

= / M(ti/ - u)Am (dw) = / M(v)\m {dv) = p(A). JA+U JA

Another measurable representation of the density dp,(v)/dpTu can be con­structed from the density p(u, v) mentioned in the conditions of Lemma 4.1. Name­ly, by a linear change of variable we prove that the function p(u, v + u) may serve as such a density:

/ p(u, v + u)pTu (dv) = / p(u, w)p, (dw) = /j,u(A + u) = p,(A). J A JA+U

Page 33: Local Properties of Distributions of Stochastic Functionals

§4. STRATIFICATIONS OF MEASURES 17

It is easy to verify that any two measurable representations of dfj,(v) / dfj,~u coincide onC/x R m almost everywhere with respect to the measure Am x \i.

By equating the two representatives, we obtain

(4.10) p(u,v + u)= M{V)

M(v + u)'

Hence, for ^-almost all vy (4.10) holds for Am-almost all ueU. •

P R O O F OF THEOREM 4.1. First of all note that the quotient measures (P c)r and the families of conditional measures {(P c)7} corresponding to the measure P c

and the partition T satisfy the relations

- l (4.11) (P c)r = P r ; (P c ) 7 = P-yG;

The first relation readily follows from the fact that the orbits are Gc-invariant: if A € Q3x,r, then

(Pc)r(A) = P ' O r " 1 ^ ) ) = P ( G - 1 T T - 1 ( A ) ) = P ( T T - 1 ( A ) ) = J>r{A).

The second relation in (4.11) can be obtained by straightforward verification of (3.1): for any B e 05 and A G 2$x,r, we have

/ P 7 G c - 1 (S) (P c ) r (d 7 ) = / P 7 (G c" 1 (S))Pr(d7) = P(G;1(B)n*-1(A)) J A J A

= P(G-1(Bnn-l{A))) = Pc(Bn7r-1(A)).

It follows that the measures P-yG"1 satisfy the definition of conditional mea­sures for the measure P c with respect to the partition T.

Furthermore, by applying Theorem 3.3 to the measures P c and P and by taking into account the fact that the quotient measures coincide, from (3.7) we obtain the following expression for the density of the conditional measure:

Let us use the parametrization from Proposition 4.2. By this representation, the space (X, 05, A7) is equivalent to some subset of (Rm , 2$m, Am), and moreover, the isomorphism Jx takes the semigroup Gc to a semigroup of translations. One can apply Lemma 4.1 to the measure fi = P1J~l, since the condition \xc <C \x is equivalent to V^G~X <C P 7 and, by (4.11), the latter relation is equivalent to (P c ) 7 <C P 7 , which was established in Theorem 3.3.

By Lemma 4.1, we have P 7 J~l < A m , which is equivalent to P 7 <^\mJx = \1. Thus, we have proved the absolute continuity of the conditional measure. Let

us calculate the corresponding density. Let us apply formula (4.8) to the measure \x = P1J~l. We choose some v that satisfies (4.8). Without loss of generality, we can assume that v = 0; otherwise, we could redefine the reference point x by taking x' = Gv(x) as a new reference point on the orbit.

Taking into account the relation between the semigroup of translations and the semigroup Gc, we rewrite (4.8) in the form

d\m K J d\m K J

- 1 T - l dPyGu Jx

Clir<yJx • ( « )

Page 34: Local Properties of Distributions of Stochastic Functionals

18 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

By setting K7 = dP7J~1(0)/dAm , we obtain

dP^J, - l l°x

d\« (u) = K1

dlr^Gr., J. - 1 T - l 7 ^ U ux

uJr^Jx f-(«) Now, returning from R m to the space (X, Q3x,A7), we can substitute u =

JxGu(x) into the result and rewrite it as follows:

dP

dA7

(Gu(x)) = Kn dP^G, - l

7v-'u d P 7

(<?«(*)) - 1

Finally, by substituting (4.12) into the right-hand side, we obtain the desired formula (4.7). D

REMARK. The statement and the above proof of Theorem 4.1 remain valid if we abandon the topological properties (metrizability, completeness, and separability of X) and only assume that there exists a unique system of conditional measures for the partition of X into orbits.

Calculation of conditional distributions. Following the general scheme of the stratification method, we first calculate conditional measures and then study their images, that is, conditional distributions. Since the conditional measures obtained in Theorem 4.1 are concentrated on finite-dimensional sets, we need special tools for studying transformations of measures in R m . In what follows we present related analytical results. Let Am be the Lebesgue measure in R m , A = A1.

THEOREM 4.2. Let A be a bounded open set in R1, f: A —> R 1 a X-measurable function, and B a X-measurable subset of A. Suppose that f is differentiate X-almost everywhere on B.

Under these conditions, Xsf~l <^ A if and only if

(4.13) X{c e B | f'(c) = 0} = 0.

/ / (4.13) holds, then the density p = dXsf~l/dX has the form

(4.i4) P(u)= 53 i/'wr1-c G / _ 1 { ^ } n 5

{If f'{c) does not exist, we set | / ' (c) | = oo.)

We shall prove the m-dimensional generalization of this theorem.

THEOREM 4.3. Let A be an open set in R m , / : A -» R m a Xm-measurable function, and B a Xm-measurable subset of A. Suppose that f is Frechet differen-tiable X-almost everywhere on B.

Under these conditions, A^ f~l <C Am if and only if

(4.15) Am{c G B | det Df(c) = 0} = 0,

where Df is the matrix of first-order partial derivatives of f. If (4.15) holds, then the density p = dX^f'1 /dX has the form

(4.16) p ( u ) = ^ \det DficT1. cG/ - 1 {^}nB

Page 35: Local Properties of Distributions of Stochastic Functionals

§4. STRATIFICATIONS OF MEASURES 19

PROOF. Let B C Bbe the subset of points of density of B where the derivative Df is denned. Then Am{£ \ B} = 0. Furthermore [111, Ch. Ill, §3.1], there exist Am-measurable disjoint sets Bn such that B = \JnBn and the restriction of / to each Bn satisfies the Lipschitz condition. We extend / from Bn to a Lipschitz function defined on R m . Then [111, Ch. Ill, §3.2] for any Am-measurable nonnegative function g we have

/ g(c)\detDf(c)\Xm(dc)= f ( £ g(cj)\m(du).

By taking g = l0n, where 6n = {c e Bn: detDf(c) = 0}, we obtain Am(/(0n)) = 0. Hence, condition (4.15) is necessary for the measure A ^ / - 1 to be absolute continuous. Next, we can use the above relation for the function

9(c) = l{f-i(A)nBn} I det Df(c)\~\

where A is an arbitrary Am-measurable set. We obtain

Xm{f-1(A)DBn}= f ( ] T | d e t D / ( c ) | - 1 ) A - ( ^ ) . JA cef~l{u}

By summing over n, we arrive at (4.16):

\1Ef-1(A) = \™{f-1(A)nB} = Y/\™{f-1(A)nBn}= [ P(u)\m(du). n

The above theorems can be generalized to the case in which / is applied to some measure other than Am.

PROPOSITION 4.4. Let A, B, and f satisfy the assumptions of Theorem 4.3, and let \x be a finite measure such that \i <C Am. / / condition (4.15) holds, then M B / - 1 <C Am and the density p = d/j,Bf~1/d\Tn has the form

(4.17) p(u)= £ \detDf(c)\-l-^(c). cef~l{u}nB

In particular, for ra = 1, formula (4.13) implies \xsf~l <C A and

(4.i8) P(«)= Yl i / ' w r 1 ^ ) . cef~1{u}nB

The following theorems give sufficient conditions under which the image of a measure continuously depends on the mapping.

THEOREM 4.4. Let fn, n E N, be convex functions defined on a bounded in­terval A c R 1 , and let n be a Borel measure such that fj, <C AA- Suppose that

foe (c) ¥" 0 and /oo (c) = lim fn (c) n

for X-almost all ce A. Then

(4-19) M / n 1 - ^ - 1 -

The space C^^(A) of continuously differentiable functions on a bounded inter­val A C R 1 will be equipped with the norm | | / | | i = sup A ( | / | + | / ' | ) -

Page 36: Local Properties of Distributions of Stochastic Functionals

20 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

THEOREM 4.5. Let V be a family of measures on the a-algebra 2$A such that the densities of all \x GV are equicontinuous. Suppose that f G C*(A) and f'^0 almost everywhere. Then

lim sup sup { H M / - 1 - W_ 1 H 111/ - 9\\i <6}=0.

6-0 ^V g

PROBLEM 4.3. Construct an example showing that closeness of / and g in the C1 metric cannot be replaced by closeness in C°.

Absolute continuity of the distribution of a functional. By way of il­lustration, we show how the stratification method allows one to prove the absolute continuity of the distribution of a functional.

PROPOSITION 4.5. Let {Gc}, c G R1 , be an admissible semigroup defined on the space (X, 93x,P), and let f: (X, 93x) —• R 1 be a measurable functional. Sup­pose that for P-almost all x G X the derivative of f along the orbit is defined and

(4.20) DGf(x) = Kmc-'lfiGcix)) - f(x)} ? 0. c—>0

ThenPf-1 ^X1.

PROOF. Let T be the partition of X into orbits of Gc. Let us consider an orbit 7 G X / I \ By Theorem 4.1, the corresponding conditional measure P 7 is absolutely continuous with respect to the invariant measure A7. By (4.20), we can apply Proposition 4.4 to the measure n = V1 and to the restriction of / to 7. Thus, P 7 / _ 1 < A and, by Proposition 3.3, P / " 1 < A, as desired. •

For a given functional / , it is not always easy to satisfy (4.20) by choosing only one semigroup Gc. More often, the desired semigroup can be constructed locally, that is, in a neighborhood of x for P-almost all x G X. Furthermore, it is convenient to replace condition (4.20), which simultaneously deals with / and G, by somewhat stronger conditions imposed on / and G separately. Let us present the corresponding definitions and results.

Let X be a linear space. Let B c X be a Borel subset with respect to the original topology r x and let H C X be a subset equipped with a topology rH. We say that a functional / : X —• R 1 is (B, H)-smooth if for each x £ B there exists a direction I G H such that

(4.21) Dtf(x) = lim c~l[f(x + cl) - f(x)] ^ 0 c—>0

and the function (y, h) h-> Dhf{y) is defined and ( r x x rH)-continuous in some (T X x rH)-neighborhood of (x,Z). By M(B,H) we denote the class of (B,H)-smooth functionals.

Let a Borel measure P and a seminorm || • || be given on X. We say that the measure P has a ( 5 , H)-sufficient class of admissible mappings if for any x e B and / G H there exist an ordered system of rx-neighborhoods Va of x, a system of P-admissible semigroups Ga defined on Va, and a system of vector fields la: VQ —> H such that:

1) for P-almost all y G V a , one has

(4.22) ]imc-1\\G^{y)-y-da(y)\\=0; c—»U

Page 37: Local Properties of Distributions of Stochastic Functionals

§4. STRATIFICATIONS OF MEASURES 21

2) the mapping y *-> la(y) is continuous on Va f)B with respect to r x and rH ;

3) TH-]imala(x) = I

THEOREM 4.6. Let ( X , r x ) be a topological space, P a Borel measure on X, and || • || a seminorm on X. Le£ 5 and # be subsets o /X equipped with topologies r x and T^, respectively, and let / : (X, 2$x) —> R 1 6e a measurable functional. Suppose that:

1) tte measure P /ias a (B, H)-sufficient class of admissible mappings; 2) £/&e functional f satisfies a local Lipschitz condition on B with respect to || • ||; 3)feM(B,H); 4) ei£/ier £/&e topology r x Aas a countable base, or IP is a Radon measure. Then VBf~l < A.

PROOF. Under the assumptions of the theorem, one can readily construct a cover of B by rx-open sets Wa such that Proposition 4.5 can be applied to the measure P a = P\vanB- By taking a countable subcover {Wn} C {W^}, we obtain

oo

P B / " 1 < 5 ^ 2 - n P n / " 1 < A . D n = l

REMARK. If we use linear semigroups G"(x) = x + d a , then condition 2) can be omitted.

The first assumption in the statement of the theorem deals only with the mea­sure P , and the other two only with the functional / . This simplifies their verifica­tion in specific cases.

Bounds for the density of a sum of independent variables. We shall illustrate the proposed methods by a well-known problem from the theory of sum­mation of random variables.

Let &, i > 1, be independent identically distributed random variables. Set Sn = X)r=i&- If the distribution of Sn has a density, we denote this density by pn(')- Gnedenko proved the following uniform local limit theorem [26; 51 , Ch. IV].

THEOREM 4.7. Suppose that E& = 0 and D& = a2 > 0. Then

1 - r 2 / 2 0 (4.23) sup \pn(ray/n)ay/n -=

if and only if pN is bounded for some N.

In contrast with a majority of limit theorems, Gnedenko's condition is imposed on the distribution of the sum of several terms rather than on the distribution of a single term. Therefore, it is natural to ask whether it is possible to find a condition on the distribution of an individual summand guaranteeing (4.23). A partial answer is given by the following theorem.

THEOREM 4.8. / /

(4.24) Epj(0 = fp\+e{r) dr < oo

for some e > 0, then pn is bounded for n > (1 + £)£ - 1 .

Page 38: Local Properties of Distributions of Stochastic Functionals

22 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

PROOF. We use the stratification technique. Let P be the measure generated by the random vector (£1 , . . . , £n) on the space X = Rn. By x we denote elements of X, and by x\,..., xn the components of x. Then

2 = 1

We construct a partition of X as follows. Let ao = 0 and ak = 2 / e_1 , k G N. We set X/c = {x G X: minp^Zi) G {ak,ak+i}}.

For x G Xfc we define

/ (a) = max{i: p^Xi) G (afc,afc+i]}.

In Xfc we introduce the partition generated by the following equivalence rela­tion:

( x,y GXfc,

x~y:= I I(x) = I(y),

{ Xj = yj for j ^ I(x).

Since X = U & L o ^ ' w e °btain a partition of the entire space X into one-dimensional sets, each being parallel to one of the coordinate axes. We denote this partition by C/, its elements by u, and the quotient and conditional measures by P ^ and {Pu}> respectively.

One can readily calculate the density of the conditional measure. Suppose that x eu G yik/U and I(x) = i. Then

(4.25) pu(x) = -j^(x) = cicP^xi),

where the normalizing factor is

i - l

p,{y)\(dy)\ '{y\ak<Pi(y)<ak+i}

(4.26) ck

We must study the density pn of the image of P under the mapping on i x H-> X^=ix* e ^ • The t°*al probability formula (3.6) implies

P«(0 = / P c ( * » ) ^ ^ ( r )

Since each class u lies on a straight line parallel to some coordinate axis and the mapping Sm along each of these lines acts as a translation, it follows that the measure PuS'm1 is a translate of Pu. Therefore, by (4.15), we obtain

(4.27) /

dP 9 _ 1

r CLA

= / snppu(x)Pu (du) < y]P{Xk}ckak+1 J xeu ,_n

Page 39: Local Properties of Distributions of Stochastic Functionals

§5. SUPERSTRUCTURE 23

We also note that

P{X/J = P{x | ak < minpJxi) < afc+i}

= P{x | ak < minp1 (xi)} - P{x | afc+i < m i n ^ (xi)} i i

= ([ P1(y)Hdy)) -( f Pl(y)X(dy))n

\J{v\Pl(v)>ak} / W{l/|Pi(l»)X»*+i} /

= Pn{Pl(0 > ak} - Pn{Pl(0 > ak+1}.

Taking into account the elementary estimate

a71 - bn < n\a - b\an~\

from (4.17) we obtain oo

sup pn(r) < y2nc^1Pn-1{p1(0 > ak}ckak+l.

By the Chebyshev inequality, we have

/ Pl(y)\(dy)<a^ fp\+e(y)dy. J {y\Pi(y)>a>k} J

Since ak = 2fe, we finally obtain

sup pn(r) <n p\+£(y)dy(l + Y"ak+1al{n~l)£) r € R i Mi v t^i J

= 2n / p11+£(y)dy(l + 2 ^ 2 f c ( 1 - ^ - 1 ^ )

^ R 1 fc=o

= n ( l + 2 ( l - 2 ^ 1 - ( n - 1 ^ ) - 1 ) / p} + £ (y)dy<oo. •

Note that we divided the space X into countably many parts and constructed different partitions for different parts. This scheme is typical for virtually all appli­cations of the stratification method to density estimates.

§5. Superstructure

If one deals with proving absolute continuity or verifying strong convergence, the stratification method often requires much more information about conditional distributions than is actually needed to solve the problem. The superstructure method, presented in this section, is easier and more effective in these problems. The use of this method is related to an extension of the space of sample functions.

The idea of the method. Let (X, 21, P) be a probability space, and let / be a functional. We study the absolute continuity of the distribution P / _ 1 . The idea of the superstructure method is to choose an auxiliary family of measures Q£ and functionals F£ with the following two properties.1 First, the measures Q£F~1 converge in variation to P / _ 1 as e —> 0. Second, the measures Q£Ff1 are

1 In general, for different e, these measures and functionals may be defined on different spaces.

Page 40: Local Properties of Distributions of Stochastic Functionals

24 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

absolutely continuous, and the absolute continuity can be verified by the simplest version of the stratification method.

First, we consider a somewhat simplified situation. Assume that we are given a family {Gcyc G [0, a]} of transformations of the space X such that the action of these transformations on P is small for small parameter values:

(5.1) PG-1™? as c - » 0 .

We use these transformations to construct the families {Q£} and {F£}. To this end, on the direct product (Ye, 2)e) = ([0,e], 2$[o,e])x (X, 21), e G [0, a], we introduce the measure Q£ = ^A x P and the functional F£: Y€ —• R, Fe(c, x) = f(Gcx).

Note that since {x \ (c,x) G F~l(A)} = G ^ / " 1 ^ ) ) , we have

Q£F£-\A) = (±A x P ) ^ " 1 ^ ) } = \f^{Gc\r\A))}dc

for ,4 G S31. Hence, by (5.1),

iip ri_QeF-iii = |i rP /-id c_i TPG-1/-1^! I I € Jo € Jo II

< - / \\P-PG-1\\dc->0 as e - > 0 . £ Jo

Thus, for small e the measures Q^i7^-1 are close in variation to the measure P / _ 1 . At the same time, by considering the partition of the space into fibers "parallel" to the segment [0, e] and using the Fubini theorem, we see that

CfFe-l = \f \<p?P(dx)t e Jx

where ipx(c) = / (G cx) , c G [0,e]. This relation means that the measure Q£F€~l

is a mixture of measures A^"1, that is, of images of the Lebesgue measure under the restrictions of / to the "orbits" c -* Gcx, c G [0,e]. We can study the absolute continuity of the measures Ay?"1 by the methods developed in §4. Thus, if, say, for P-almost all x the function c •-• ipx(c) has a nonzero derivative almost everywhere on [0,e], then by Theorem 4.2 the measures Ay?"1 are absolutely continuous for P-almost all x. It follows that the measures Q£F~l are absolutely continuous, and eventually, P / - 1 is absolutely continuous.

Theorems on absolute continuity. By slightly changing the above argu­ment, we obtain the following result.

THEOREM 5.1. Suppose that a family of mappings {Gc, cG [0,a]} of the space X satisfies property (5.1), and moreover, for P-almost all x there exists a number ex > 0 such that the derivative <px(c) exists and is nonzero for almost all c G [O,^]; here, as above, ipx{c) = f{Gcx).

Then P / " 1 < A.

PROOF. We set Xn = {x \ ex > 1/n}. The representation Xn = {x \ g(x) = 1/n}, where

nl/n

g{x)= / iR \{ 0 }(<^( c))d c> Jo

implies that the sets Xn are measurable. Furthermore, the sets Xn form an in­creasing sequence and P{X%} —> 0 as n —> oo. By P n we denote the restriction

Page 41: Local Properties of Distributions of Stochastic Functionals

§5. SUPERSTRUCTURE 25

of P to the a-algebra 21 n Xn. Consider the measure Q n = nX x P n on the direct product of [0,1/n] by Xn. As before, let F(c, x) = f{Gcx). Then, in a similar way, we obtain

WQnf-'-Pf-'^ln f nVnG-lf-1dc-n[ " P / " 1 * ! II Jo Jo II

<n f n\\PnG:l-P\\dc<n [ " l I P G ^ - P H d c + P ^ } . Jo Jo

Thus, QnF'1 ™ P / " 1 as n -> oo. Note that

Q n F _ 1 = n / A y ^ P t d x ) .

By the construction of Xn and by the assumptions of this theorem and The­orem 4.2, the measures A^"1 are absolutely continuous for x G Xn. Hence, QnF'1 < A. It follows that P / " 1 < A. D

Theoretically, the structure of / may vary strongly on different parts of the space X. Therefore, it may be very difRcult to find a suitable unified family of mappings (Gc). To make the above argument effective, we must "localize" it. The price we have to pay for this is imposing some additional conditions on the space X and the mappings Gc.

THEOREM 5.2. Suppose that (X,d) is a separable metric space, 21 = 95x, and P is a probability measure on 21. Suppose that for P-almost all x ^ X , there exists an open neighborhood V and a family {Gc, c G [0, ax]} of mappings of the space X such that

1)PG-1V-^P a s c ^ O ; 2) Gcx —» x with respect to P as c —> 0; 3) for P-almost all y G V, there exists an ey > 0 such that ^ ( c ) exists and is

nonzero for almost all c G [0,6^]; here, as before, ipy(c) = f(Gcy). Then P / _ 1 < A.

REMARK. If there is a family of mappings (Gc) such that condition 1) is satis­fied and condition 3) holds for P-almost all y G X, then Theorem 5.2 remains valid even without the separability assumption (see also Problem 5.2).

PROOF. By Xo we denote the set of x described in the statement of the theo­rem. By assumption, P{X 0} = 1. The corresponding open neighborhoods form a cover of X0 . The separability of the space allows us to choose a countable subcover (Vk). Since

prl<^vkr\ k

it suffices to prove the absolute continuity of each measure Pykf~1. To this end,

we need the following statement.

LEMMA 5.1. Let P be a probability measure on a Borel a-algebra of a metric space X. Suppose that a family of measurable mappings {Gc: X —> X} possesses properties 1) and 2) from Theorem 5.2. Then for any measurable set A one has

(5.2) PAG^^PA as c - • 0.

Page 42: Local Properties of Distributions of Stochastic Functionals

26 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

PROOF. First, we assume that P{dA} = 0. The inequality

sup | P { J 4 H G-^B)} - P{A n B}\ < sup \P{A nB}- PiG-^A n B)}\ BeVl BG21

+ sup \P{AnG;\B)} - P{G;\A) nG^(B)}\ Beoi

implies (5.3) \\PAG;1 - PA\\ < WPG;1 - P | | + PiAAG-1^)}.

For each e —> 0, we have

P{AAG^(A)}= [ \lA(x)-lA(Gcx)\P(dx)

< P{x I d(xy Gcx) >e} + P{A£ n (Ac)e},

since the condition d(x, Gca;) < e implies 1A(X) = 1A(GCX) for x £ A£ D (^l0)^. Thus, we obtain

i E i P ^ A G " 1 ^ ) } <P{A£n(Ac)£},

and P{^ie H (Ac)e} -» P { ^ ~ n (Ac)"} = P{9A} = 0 as £ -+ 0. Therefore, limP{>4AG~1(A)} = 0. Inequality (5.3) and condition 1) of the theorem readily

prove (5.2). The general case follows from the fact that for an arbitrary Borel set A and for

all e > 0, there exists a set B e 21 such that P{dB} = 0 and P{AAB} < e (see Problem 5.1).

Indeed,

HP^G-1 - P * | | < \\PAG;1 - PBG;1]] + \\PBG;1 -PB\\ + \\PB - PA\\

<2e + \\PBG;1-PB\\.

Now, using the special case, we obtain (5.2) from the last inequality. •

Now let us prove the theorem. By Lemma 5.1, the measure Pyk satisfies the assumptions of Theorem 5.1. Theorem 5.2 is thereby proved. •

To use the stratification method, we need some families of mappings (Gc) that possess the group structure and whose action on the measure P is quasi-invariant, that is, PG~l ~ P . In the superstructure method, the assumption about the group structure is not necessary, and the quasi-invariance property is replaced by the assumption that the measures P G " 1 and P are close in variation for small c. Moreover, in the stratification method one must take care that the partitions into orbits be measurable. Here this is not necessary.

Application to semistable processes. The following application illustrates fairly well the possibilities of the superstructure method.

Recall that a process £ = (&,£ G R+) is said to be semistable of order a > 0 if for all a > 0 the processes (£at) and (aa£t) have the same finite-dimensional distributions.

We assume that the trajectories of the process £ lie in the space C[0, oo) with probability 1. (In this connection, see Problem 5.3.)

Page 43: Local Properties of Distributions of Stochastic Functionals

§5. SUPERSTRUCTURE 27

By P we denote the distribution of the process £ in C[0, oo) and consider the functional

f(x) =supx(t), x € C[0, oo). [0,1]

Since the trajectories are semistable, we readily see that £o = 0 with probability 1. Therefore, the distribution P / _ 1 is concentrated on the half-line [0, oo).

THEOREM 5.3. ppose that the distribution of the random variable £i has a density. Then the distribution P / _ 1 has the following structure: at the zero point it may have an atom, and on (0, oo) it is absolutely continuous.

Note that the mapping S: C(R+) —> C(R 1) defined by the relation

(5.4) Sx(t) = e-tax(et), teR\

takes a semistable process £ of order a to the strictly stationary process rj = S£ and that this correspondence is one-to-one (see Problem 5.5). Let p = P / _ 1 { 0 } . Since the process £ is semistable, we readily obtain

P{sup f (*) = 0} = P{sup£(TY) = 0} = P{sup£(t) = 0}. [0,T] [0,1] [0,1]

for all T > 0. Therefore,

p = P{sup£(£) = 0} = P{sup£(£) < 0} = P{sup77(s) < 0}. [0,1] R+ R1

So, the fact that the distribution P / _ 1 has an atom of value p is equivalent to the fact that with probability p the trajectories of the stationary process rj lie under the x-axis. For 0 < p < 1, this is possible only if the process rj is not ergodic. Thus, we have the following assertion.

COROLLARY 1. / / the semistable process £ is generated by an ergodic stationary process for which the distribution of rj(0) is absolutely continuous and not concen­trated on (—oo,0], then the distribution o / P / - 1 is absolutely continuous.

Let us point out another useful result.

COROLLARY 2. Suppose that the distribution of the variable £(1) is absolutely continuous. Then the distribution of the random variable

ff(0=sup|«0l [0,1]

is also absolutely continuous.

The proof readily follows from the fact that |£|, as well as £, is semistable. Here we do not need the ergodicity of the corresponding stationary process, since the inequality P{sup [01j \£(t)\ = 0} > 0 would imply P{f (1) = 0} > 0, but this contradicts the absolute continuity of the distribution of £(1).

P R O O F OF THEOREM 5.3. Let iri(x) = x(l) and x e C[0,oo). We set

Xi = {x\ f(x) = m(x)}t X2 = X f n {x | f(x) > 0}, X3 = {x I f(x) = 0}.

Obviously, the atom at the origin of the distribution P / _ 1 equals P{Xa}, and the restriction of P / _ 1 to the half-line (0, oo) coincides with the measure

Page 44: Local Properties of Distributions of Stochastic Functionals

28 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

Px uX f *, which is equal to Px f l + P x / l. By assumption, P7r1

l <C A. Therefore,

px1/"1 = px1^r1«P 7 rr1«A-Thus, it remains to show that P xJ~l <C A. To this end, we use Theorem 5.2.

In the space X = C[0, oo), we consider the mappings

(5.5) GcX{t)^^±^>, teR+, c€[0,l].

Since £ is semistable, we readily find that the measure P is invariant under these mappings,

PG;1 = P. Hence, assumption 1) in Theorem 5.2 is trivially satisfied.

It follows from the continuity of the trajectories that Gcx —> x as c —> 0 for each x G X; thus, assumption 2) is also satisfied. Obviously, the set X2 is measurable (and even open, since / and TTI are continuous). Therefore, we can use Lemma 5.1, whence it follows that P x G~l ™ P x as c —> 0.

Now we note that for any x G X2 and for all sufficiently small c we have

<pX(c) = f(Gcx)= SUP f ^ a ) = ' 1 + C ^ ' t€[0,l] l 1 + C)

and hence, ip'x{c) = —af(x)(l + c ) ~ a _ 1 . Since / (#) > 0 for x G -X"2> it follows that <pf

x(c) ^ 0 for all sufficiently small c. Thus, by applying Theorem 5.1 to the measure P x , we can prove that Px f~l is absolutely continuous. •

We have used the orbits {Gcx,c G [0,1]} of the mappings (5.5), which are "pieces" of the orbits of the mappings Gs:

(5.6) Gsx(t) = e-sax(tes), s G R1 , * G R + .

Although the family {Gs} is a group, the partition T into orbits of this group is not measurable (see Problem 5.5). Thus, one can hardly use T for studying the distributions P / _ 1 within the framework of the stratification method.

The advantages of the superstructure method are obvious as long as we study strong convergence. These problems are treated in Chapter 5.

PROBLEM 5.1. Let X be a metric space, P a finite measure on 2$x, and £ the algebra of P-continuous sets. Prove that cr(£) = <8x-

PROBLEM 5.2. Show that the space X in Theorem 5.2 may not be separable if the measure P is assumed to be dense.

PROBLEM 5.3. Verify that the statement of Theorem 5.3 remains valid if we assume that almost all trajectories of the process f lie in D[0, oo), that is, with probability 1 they are right continuous and have left limits at each point.

PROBLEM 5.4. Let (&), t e R1 , be a measurable process stationary in the narrow sense. Suppose that the random variable

/ h((t)dt, J a

Page 45: Local Properties of Distributions of Stochastic Functionals

§6. DIFFERENTIAL OPERATORS 29

where ft is a measurable function and — o o < a < 6 < o o , is well defined. We also assume that h(£a) i1 M&>) with probability 1. Prove that the distribution of r is absolutely continuous.

{Hint: in the space X = L°(R1) consider the family of translation mappings

Gcx(t) = x(t + c), ce [0,1], teR1.

Use Theorem 5.1 and the fact that P G " 1 = P , where P is the distribution of £ inX.)

PROBLEM 5.5. Prove the following properties of the mapping S: C(i?+) —> CiR1) defined in (5.4).

a) If £ is a continuous semistable process, then rj = 5(£) is a continuous strictly stationary process.

b) The mapping S takes the group of mappings (Gc,c G R1) defined by (5.6) into the group of translations

Vcx(t) = SGcS~lx(t) = x(t + c), c G R 1 .

c) The partition of the space C(i?+) into orbits of the group (Vc), as well as the partition of the space C(i?+) into orbits of the group (Gc), is measurable. (Hint: assuming the contrary, prove that if we identify the orbits with R1 , then the conditional measures on the orbits must be invariant under translations; this is impossible for probability measures.)

§6. Differential operators

Let / : ft —> R1 be a random variable on a probability space (0 ,$ , P) . In this section, we present a rather general approach to obtaining sufficient conditions for the density of the distribution P / _ 1 to be smooth. In the simplest case fi = R 1

and dP(x) = p(x) dx, the sufficient conditions consist in smoothness of p and / and the nonvanishing of / ' .

In what follows, we show that the same holds for a quite general situation. Namely, for the existence of a smooth density it suffices to require that the random variable / and the measure P be differentiate in some generalized sense and that the "generalized" derivative of / does not vanish too often.

Main definitions. Let (0,$) be a measurable space, and let R n be the space of all possible mappings fi —> R1 .

A partial mapping D: R Q —> RQ with domain dom D will be called a differen­tial operator (of the first order) if

1) the set domD is closed with respect to smooth superpositions, that is, for any n > 1, any C°°-function ip: R n —> R, and any # i , . . . ,#n G domD, we have <p(9u-->9n) edomD]

2) the conventional differentiation rule for the superposition is valid: n

D<p(9u • • • ,9n) = ^dupigu • • >,9n)Dgi. 2 = 1

The usual differentiation operator and the operator of differentiation along a vector field are differential operators in the sense of this definition.

We shall consider examples in which domD C L°(fi,3r) and a probability mea­sure P is fixed on a measurable space (0 ,$) . In this situation, we additionally

Page 46: Local Properties of Distributions of Stochastic Functionals

30 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

assume that the mapping D does not distinguish functions coinciding P-almost everywhere.

We define spaces of l-times D-differentiable functionals Ci(D) by the formulas C0(L>) = R", Ci(D) = dom£>, and Q(L>) = D^id-^D)). Obviously, Q(L>) is the domain of the operator Dl = DoDo--oD.

Let L be a linear functional with domain domL c R f i. A linear functional DL with domain dom(DL) = {g G domD | Dg G domL} defined by (DL)(g) = —L(Dg) will be called the D-derivative of L.

Obviously, dom (DlL) = {g G dom (Dl) \ Dlg G dom L} and {DlL)(g) = (-l)lL(Dlg), J = 1,2,.. . .

We mostly consider the class of linear functionals defined by signed measures,

Ln(9) = / 9d/J,, domLfj, = Li (M). JQ

In what follows, we do not distinguish between a measure and the linear functional corresponding to this measure, and write fj, instead of LM.

Let ft be a nonempty set. Each surjective mapping £: ft —• ft defines a o-algebra # = ^~1(Sr) on the set ft.

By using £, we transfer various objects (differential operators, linear functionals (in particular, measures), etc.) to the measurable space (ft, #). If D is a linear differential operator with dom D C R f i, then the differential operator {Do£)(go£) = Dgo£ is defined on the set dom £>o£ = {go£\ g e dom D}\ if L is a linear functional with domL C R n , then the linear functional (Lo( ) (^o( ) = L{g) is defined on the set domL o £ = {g o £ | # G domL}, and so forth. The surjectivity of £ guarantees that these notions are well defined.

Now let us introduce spaces of smooth measures. In accordance with the theory

of distributions, by W x (I > 1) we denote the class of distributions on R 1 whose Ith derivative is a finite charge. If a probability measure belongs to the class W x , then it is absolutely continuous with respect to Lebesgue measure and its distribution function has I—1 absolutely continuous bounded derivatives; the Zth-order derivative is defined almost everywhere and is a function of bounded variation. Accordingly, for I > 2 the density of such a measure has I — 2 absolutely continuous derivatives, and its (I — l)st-order derivative is a function of bounded variation (for details, see [28]).

For example, if a distribution belongs to the class W x , then it is absolutely continuous and its density is a function of bounded variation (hence, the density is bounded).

Now let us define some analogs of the classes W x in the most general situation. In this case, the role of distributions is played by D-differentiable linear functionals.

Let S C R n be a linear set. We say that a linear functional L belongs to the class W (S) if domL D 5 and L is defined on S by the formula L(g) = fQgdfj,, where \x is a measure. The measure \x will be called the representing measure of L. Note that this measure is not uniquely determined in general. More generally, let 5o , . . . , Si be linear subsets of Co(L>),..., Q ( D ) , respectively, such that DSj C Sj-i (j = 1 , . . . ,Z). We say that a functional L belongs to the class

W*(S0 > . . . ,SO = W(L> | S 0 , . . . ,S j ) ifD*L G W°(Sj), 3 = 0 , . . . , / . Then the

Page 47: Local Properties of Distributions of Stochastic Functionals

§6. DIFFERENTIAL OPERATORS 31

relations

(6.1) Dgdfj,j-1 = - gd/j,jy g G Sj9

where fij is the representing measure of D JL, are naturally called formulas of integration by parts. We point out that in this case the formulas of integration by parts readily follow from the definition of the classes W .

The fact that the measure P (treated as a linear functional) belongs to one of

the classes W shows how differentiable this measure is.

Let us prove a transformation property of W under a mapping of spaces.

PROPOSITION 6.1. If L e Wl{D | 5 0 , . . . , Si), then L o £ G Wl(D o ^ | 5 0 o

PROOF. Let g e Sj. Then

[(D o f )i(L o fl](0 o 0 = ( - i ) i (L o 0((2? o ty(g o 0 ) = ( - l ) ' ( L o 0 ( ^ o 0

= (-l)jL(D^g) = &L(g) = I gdN = f (g o 0 d(N o fl. JO, JQ

Thus, ^ £ is the representing measure of (D o f )J'(L o £). D

Sufficient condition for smoothness. Let / : Ct —> R1 be a random vari­able on a probability space (fyfoP). We consider the measure V = P / _ 1 (the distribution function / ) as a distribution on R1 . We present sufficient conditions

under which it belongs to the classes W x . We need some auxiliary notation. For Z = 1,2,... and i = 1 , . . . , / , we set

l(l,i) = {k = (ku...,kl)ezl+\^2jkj = l-i}.

We also set |k| = k\ H h fc/. If D is a differential operator and g G Q+i(.D), then by Si,i(D, g), i = 0 , . . . , / , we denote the set of functions of the form

where k G / (M) , </?i G Cg0, and </?2 6 C£° vanishes in a neighborhood of zero.

THEOREM 6.1. Let f: O —> R 1 6e a random variable. Assume that for some differential operator D and some I > 1, the following conditions hold:

1) feCl+1(D) and P { D / ^ 0 } = 1;

2) P G W'(£> | So, • • •, Si) for some Si D SM(L>, / ) , i = 0 , . . . , Z; 3) /or eac/i i = 0 , . . . , / and eacft k G / ( / , i), £/&e integral

(6.2) | |£>/|- (-lkl |D 2 / | f e l • • • \Dl+1f\k' d\fn\

is finite (here \Xi is the representing measure of the functional DlP on Si).

ThenPf'1 eW[.

The proof is based on the following result.

Page 48: Local Properties of Distributions of Stochastic Functionals

32 2. METHODS FOR STUDYING DISTRIBUTIONS OF FUNCTIONALS

LEMMA 6.1. Suppose that there exists a constant c > 0 such that

(6.3) | E ^ ) ( / ) | < c s u p M .

for any (p £ C§°.

ThenV = Pf~1eWl1.

PROOF OF LEMMA 6.1. Let us transform the expression under the modulus sign on the left-hand side in (6.3):

*Pil)tf) = f <P{1) dP = (-l)l(<p,VM),

where (y>, T>W) is the value of the Zth derivative of V (in the sense of distributions) on the element (p. Now, from (6.3) and from the Riesz theorem about the repre­sentation of a linear functional by an integral, it follows that the functional V^ is a finite charge. •

P R O O F OF THEOREM 6.1. Let us verify the assumptions of Lemma 6.1. We choose an auxiliary sequence of C°°-functions ij)n: R 1 —> [0,1] such that

a) tfn(r) = 0 for r e [ - £ , £ ] ; b ) W r ) = l f o r r ^ [ - i , I ] i

c) supn>r \rjipn (r)\ < oo for each j = 1,2, —

PROBLEM 6.1. Prove that such sequences exist.

Let us consider the integral f (pW(f)ipn(Df)dP. By using the representa­tion (pW(f) = D{yti-Vf)(Df)-1, j = 1 , . . . , / , the differentiation rules, and the definition of the derivative of a linear functional, we successively transfer the dif­ferentiation from (p to -071 and P . As a result, we obtain

P l

/ ^ W n ( i ) / ) d P = ( - l ) ^ O T ^ ( / ) 2 Ak(D2f)k>

J i=0 kel(l,i) Ikl

... (D'+V)*' 53^-k(Dfy-1-^(Df)}. i=o

Here A^ and Aj^ are absolute constants whose explicit form is unessential. By using the representing measures of the functionals .D*P, we obtain

r i M

(6.4) J t=ok€/(/,t) j=o

x j\u)^\Df){Df)J-l-M{D2f)kl • • • {Dl+lf)k> W .

Now let us pass to the limit as n —> oo on the right-hand side in (6.4). First, let us show that the terms with j ^ 0 tend to zero. Indeed, the func­tions ipn (Df)(DfY are bounded uniformly with respect to n, and the measure of the set of points at which ipn ^ 0 tends to zero as n —> oo. This passage to the

Page 49: Local Properties of Distributions of Stochastic Functionals

§6. DIFFERENTIAL OPERATORS 33

limit is valid, since the integrals (6.2) are finite. Similarly, we can pass to the limit in the terms with j = 0, thus obtaining

/>>(/)dP = (-l)'£ £ AkA0M (6.5) i=o ke/((,»)

x J' <p(f)(Df)-l-M(D2f)k>... (D'+1/)*' dW.

Obviously, the desired bound (6.3) follows form the identity (6.5). •

REMARK. Let us apply Theorem 6.1 in the simplest case 1 = 1. We need / to be differentiate twice (condition 1)) and P to be differentiate once (condi­tion 2)). Furthermore, we need the finiteness of the integrals f D2f(Df)~2 dP and J(Df)~1dfjbiy where /J,I is the representing measure of the functional DP. Under these conditions, V has a density of bounded variation. Our further goal is to find differential operators that can be used in Theorem 6.1 for some (interesting for applications) classes of stochastic functionals. Chapters 3 and 4 partially deal with this problem. In Chapter 3, we construct such operators for Gaussian functionals, and in Chapter 4, for functionals of sample functions of a Poisson random measure.

Page 50: Local Properties of Distributions of Stochastic Functionals
Page 51: Local Properties of Distributions of Stochastic Functionals

CHAPTER 3

Gaussian Functionals

§7. Gaussian measures on linear spaces

Basic characteristics. The definitions of a Gaussian measure and its char­acteristics, such as the barycenter, the correlation operator, and the characteristic functional, were given in §1.

Let (X, X') be a pair of linear spaces in duality. We denote the class of all Radon measures defined on (X, (£) by £(X) and the subclass of centered Radon Gaussian measures by GoPQ.

PROPOSITION 7.1. Each measure P G £(X) has a barycenter and a self-adjoint correlation operator continuous in the weak topology.

PROOF. Let X be the completion of X in the weak topology, i: X —> X the natural embedding, and P = P i - 1 the extension of P to X. We choose a compact set A C X such that P{X \ A} < 1/3 and consider the convex closed hull A of the compact set i(A) in X. Since X is complete, it follows that A is compact in X [87, Ch. III]. Now for any functional x' G X' satisfying the condition

(7.1) sup|(£,x7) | < e , xeA

we have

(7.2) P{x G X: \(x,x')\ >e}< P { X \ A} < P { X \ z ( A ) } < 1/3.

Let N{OL,G2) be the distribution of the functional (•,#') with respect to the measure P . Then, (7.2) implies the estimate

( ,3 ) # ( £ i 2 ) . . ( ^ ) , | .

In particular, $((e — a)/a) > 1/2, and hence, a < e\ similarly, we obtain a > —e. Thus, (7.1) implies |a| < e. This argument shows that the linear functional

(7.4) A:x' ^ a = EF(-,x /)

defined on X ; is continuous in the Mackey topology corresponding to the pair (X, X ;) (that is, in the topology of uniform convergence of functionals on convex compact subsets of X). The Mackey topology is the strongest topology on X ; for which the space of continuous linear functionals coincides with X [87, Ch.III]. In particular, the functional A is generated by some element a G X:

(7.5) (a, a/) = A(x') = Ep(-,z') = Ep(.,z').

Thus, a is the barycenter of P . Let us show that a = i(a) for some a G X, which implies that a is the barycenter of P .

35

Page 52: Local Properties of Distributions of Stochastic Functionals

36 3. GAUSSIAN FUNCTIONALS

The linear space i(X.) is P-measurable with P{z(X)} = 1. Since a Gaussian measure is symmetric with respect to its barycenter, we also have P{2a—i(X)} = 1. Therefore, the intersection of the spaces i(X) and 2a — i(X) is not empty: for some zi,£2 € X, we have i{x\) = 2a — 2(^2). Thus, a = (x\ + #2)/2, which proves the existence of a barycenter for P .

Now let us choose a y ' G X ' and consider the functional /C: X' —> R 1 given by

(7.6) K{x') = EP(. - a,x')(. - a,y') = EF(- - a,x'){- - a,y').

If x' satisfies (7.1), then it follows from (7.3) and the expression for the Gaussian 1 /9

density that a = (Ep(- — a, x')2) < 2e. By using the Holder inequality, we obtain

(7.7) \K{x')\2 < a2Ep(- - a,y')2 < 4£2EF(- - a,y')2.

Hence, /C, as well as A) is continuous on X' in the Mackey topology, and for some element Ky' of the space X we have

{Ky\x') = £(*')' = EF(. - a,x')(- -a,y').

The operator K: X' —» X thus obtained is the correlation operator of the measure P . Obviously, K is linear and self-adjoint. To construct the correlation operator of the measure P , it suffices to verify the inclusion i^(X') C i(X.) and then set K = i~lK. Thus, let us take some x' G X' and verify that / = Kx' G i(X). By comparing the characteristic functionals, one can easily prove that the translate of the measure P by a vector I coincides with the measure

Pi(dx) =expUx-a,x')- -(l,x')\l?(dx).

Thus, P/ < P , and hence, Pi{i(X)+l} = P{i(X)} = 1. Therefore, the intersection of the spaces i(X)+Z and i(X) is not empty. It follows that / G z(X) and the operator K is well defined by Kx' = i~lKx' = i~1l.

Obviously, K is self-adjoint and continuous. •

PROPOSITION 7.2. The characteristic functional of the Gaussian measure P with barycenter a and correlation operator K has the form

(7.8) ¥>POB7) = Ep-exp{z(•,£')} = exp{i(a,z ') - -(Kx',x')\.

PROOF. By the definitions of a barycenter and a correlation operator, the linear functional (-,a/) has the distribution N((a, a/), (Kx\x')). Therefore, (7.8) follows from the one-dimensional formula (1.7). •

Spaces of measurable functionals. The definition of a Gaussian measure implies that all functionals of the form (•,£') are square integrable. Therefore, X' C L 2 (X,C,P) . The closure of X ' in L 2 (X ,£ ,P ) is called the space X > of measurable linear functionals. The elements of Xp are called measurable linear functionals. A functional z G Xp treated as a random variable on (X, (£, P) will be denoted by (-,2).

By adding constants to measurable linear functionals, we obtain the space Xp1 = £{Xp, l ( - ) } of measurable affine functionals. The spaces Xp and Xp1

Page 53: Local Properties of Distributions of Stochastic Functionals

§7. GAUSSIAN MEASURES ON LINEAR SPACES 37

inherit the Hilbert structure from L2(X, (£, P ) . By (•, •) we denote the correspond­ing inner product and by | • | the L2-norm (it may well happen that \xf\ = 0 for some nonzero x' G X').

Let us define a linear operator 7*: X' —> Xp1 by setting

(7.9) 7V(- ) = ( - ) x ' ) -Ep(- 1 x / ) .

The closure of the linear manifold 7*(X') in Xp1 is called the space of centered linear Junctionals and is denoted by Xp 0.

PROPOSITION 7.3. Let P e G(X). Then the following assertions hold. 1) The operator 7* is continuous on X ' in the Mackey topology of the pair

(X, X'), where X is the completion ofX. 2) The adjoint operator I: Xp1 —> X exists and is defined by the formula

(7.10) (Iz9y') = (ziryt)

for ally' £X' andzeXg. The operator I is continuous in the weak topologies of X and Xp1. The kernel

of I coincides with the one-dimensional space of constants in Xp1. 3) The correlation operator K of the measure P can be represented in the form

K = II*. In particular, for x\y' G X' one has

(7.11) (Kx',y') = (H*x',y') = (/V,/*<,')•

4) The space Xp is separable.

PROBLEM 7.1. Prove statements l)-3). (Hint: to prove 1), use the estimates from the proof of Proposition 7.1.)

P R O O F OF STATEMENT 4). Suppose that A c X is a compact set and P{A} > 0. For M > 0, we set

X'(M) = {x' e X': sup \(x,x')\ < M}.

Since each continuous functional x' is bounded on A, we have (JA/>O X'(M) = X'. To prove that I*(X'(M)) is separable, we construct a sequence of finite-dimensional spaces Ln C Xp that approximate 7*(X /(M)).

We set Lo = {0} and proceed by induction. At the nth step, we choose an x'n e X'(M) so that

2 inf \I*x'n - z\ > an = sup inf | / V - z\ zeLn-i x'GX'(M) z€Ln-i

and set Ln = C{I*x'n,Ln-i}• Let us show that a = l i m n - ^ an = 0. Suppose that a > 0. Then

inf \I*x'n-z\>oj2>c/2 zELn-i

for all n 1. Hence,

P { A } < lim P { max | ( - , < > | < M } < lim (AT(0,a2/4){[-M,M]})m = 0, m->oo l < n < m m—oo

which contradicts the choice of A. Hence, a —> 0, which means that the spaces Ln approximate the set I*(X'(M)). It follows that 7*(X/(M)) is separable in Xp . Therefore, the spaces 7*(X/), Xp 0, and Xp are also separable. •

Page 54: Local Properties of Distributions of Stochastic Functionals

38 3. GAUSSIAN FUNCTIONALS

PROBLEM 7.2. Let P , Q G </(X) and Q < P . Prove that X'Q c X'P, X'g c X'^, and

(7.12) sup |* |p / | z | Q < oo.

Admissible t ransla t ions and directions. Each vector I G X generates a one-parameter transformation group on X, which is called a translation group:

(7.13) Glc(x) = x + cl.

A vector I G X is called an admissible translation for a measure P if the measure Pl = P ( G i ) " 1 is absolutely continuous with respect to P . A vector I defines an admissible direction for P if the entire group Gl is admissible, that is, for each c G R 1 the translation cl is admissible.

By Hp we denote the set of all vectors that define admissible directions for P . This set is a vector space (prove this fact!) and is invariant under measure translations: Hp = HP* for all I G X. The space Hp is often called the core of the measure P .

For a Gaussian measure, the space Hp is quite large. Therefore, in specific applications, we always have a large supply of admissible directions.

The following theorem relates admissible translations, directions, and centered linear functionals.

THEOREM 7.1. Let P G £(X). Then the operator I defined in (7.10) is a linear isomorphism between the spaces Xp 0 and H p .

If z G X p 0 and I = Iz G H p , then

(7.14) ^ ( s ) =«*>{<*,*>-±|*|2}.

The set of admissible translations of the measure P coincides with Hp , that is, each admissible translation of a Gaussian measure defines an admissible direction.

PROOF. First, note that the operator J is one-to-one on the subspace Xp 0 c Xp1, since the kernel of J is the space of constants, which is orthogonal to Xp 0 .

Suppose that I is an admissible translation. Let us show that there exists a z G Xp 0 such that Iz = 1. For each x' G X', by \ixi and vxi we denote the distributions of the functional (-,#') with respect to P and P/, respectively. These distributions are Gaussian; namely, \xx> = A/*((a,x'), (Kx\ #')) and vx> = A/*((a-|-i,x;), (Kx', x')).

Since P* <C P , it follows that for any e > 0 there exists a 8 > 0 such that P{A} < 6 implies P*{i4} < e for any A. In particular, if B G R 1 is an arbitrary measurable set, then /JLX'{B} < 6 implies vX'{B} < e. The latter inequality means that the parameters of \ix> and vx> satisfy

sup \lKl,x,)\(Kx',x,)-ll<1<™-

By (7.11), it follows that

sup K ^ x O M / V r 1 <oo . x'GX'

Page 55: Local Properties of Distributions of Stochastic Functionals

§7. GAUSSIAN MEASURES ON LINEAR SPACES 39

Hence, the linear functional (/, •) admits a continuous extension from I*(X') to Xp 0 . Therefore, for some z G Xp 0 and all x' G X7, we have

(i1x,) = {z,ra/) = (iz,xr)t

whence it follows that I = Iz. Now let I = Iz, z e Xp 0. To show that I is an admissible translation, it suffices

to verify that P ' coincides with the measure

Q(dx)=exp{(x,z)-±\z\2}p(dx).

Let us prove that the characteristic functionals of these measures coincide. By Proposition 7.2, we have

Ep* exp{i(-9x')} = EPexp{z(- + /,#')}

= exp{ - hlCx\rf) +i(a + l,x')\.

Suppose that TZ is the correlation matrix of the joint distribution of the func­tionals (•,#') and (-,z) with respect to P . Then

E g e x p ^ . , ^ ) } = E p e x p j ^ - , ^ ) + (-,*') - ^ | 2 }

= exp{i(a,z ') - i | z | 2 } (de t f t ) - 1 / 2

x / / exp j i r i H-r2 - - ( 7 ^ _ 1 r , r ) | d r i d r 2

= exp{z(a,x /) - -\z\2 - -[Ku + 2K12(i) +K22(-i)2]}-

Into this equation, we substitute the correlations

Ku = (Kx\x'), Tl22 = \z\\

Ki2a = EP(- ,z)(--a ,a; /) = ( z , / V ) = (Iz,x') = (l,x').

After some calculations we obtain

EQexp{i(-,x')} = expli(a,x') - -(Kx\x') +i(l,x')\,

which coincides with (7.15). By Theorem 1.2, the coincidence of measures follows from the coincidence of the corresponding characteristic functionals. Therefore, Q = P*.

Thus, we have proved that the set of admissible translations coincides with I*(X'Pfi). D

The case of a centered measure P must be considered separately. In this case, Xp = Xp 0, and by Theorem 7.1, there is an isomorphism between the space Hp of admissible translations and the space Xp of measurable linear functionals.

Let us introduce two convenient notions. Let z G Xp 0 and I = Iz e Hp . Then the number

(7.16) ap(0 = \z\ - 1

Page 56: Local Properties of Distributions of Stochastic Functionals

40 3. GAUSSIAN FUNCTIONALS

is called the admissibility of the translation Z, and the number

(7.17) 1 (0 = |*|2

is referred to as the action functional If I G X \ Hp , then we adopt the natural convention that o~p(l) = 0 and 1(1) = oo.

PROBLEM 7.3. Let P , Q G <?(X), P < Q, and Q < P . Prove that H P = H Q

and 0 < ^ {ap(l)/aQ(l)} < rop {ap(l)/aQ(l)} < oo.

(Use the result of Problem 7.2.)

PROPOSITION 7.4. Le£ P G £o(X); let supp(P) and N P be, respectively, the topological and the linear supports of P with respect to the weak topology. Then

1) supp(P) = N P = p | Kerx'; x'GKer J*

2) z/ Hp is the closure of Hp in X ; £/&en Np = Hp ;

3) P { N P } = 1.

PROOF. We write U = f)x'eKeri* K e r z ' - F i r s t> l e t u s s n o w t n a t H p = U- If a; G Hp , then x = Iz, z G Xp, and

(x,x/) = (^,x /) = (^/*x /) = 0

for x' e Ker/*. Therefore, x G Kerx' and Hp C U. Since E/ is closed, we have Hp C U. Conversely, if x £ H p , then, by the Hahn-Banach theorem, there exists a vector x' G X' such that (x, x1) = 1 and (•, x') = 0 on Hp . Therefore, x' G Ker /*, and hence, x $. U. It follows that Hp D U.

Thus, Hp = C/. If X is finite-dimensional, then the verification of statement 1) is straightforward. In the general case, only the inclusion supp(P) C U is obvious. Let x G U. Let us choose a basic neighborhood

V = {y G X I max \(y - x,z-) | < e] 3 x

and verify that P{V} > 0. We define a mapping / : X —> R m by setting f(y) = {(y, a;^)}. By applying statement 1) to the finite measure P / _ 1 and the point f{x))

we see that f(x) G supp(P / _ 1 ) . Therefore,

P{V} = P / - 1 { / ( ^ ) } = Pf-'iv e R m | max ^ - f(x)i\ < e} > 0.

Hence, x C supp(P), and consequently, U C supp(P). Thus, we have U = supp(P) = H p . Since P is a Radon measure and, by definition, U is a linear set, it follows that (see §1) P{supp(P)} = 1, the linear support ATP exists, and Np = supp(P). This implies the desired equations in l)-3). •

REMARK. Since, by Proposition 7.3, 4) and Theorem 7.1, the space Hp is separable, it follows from Proposition 7.4 that the support of any measure P G £o(X) is a linear separable subset of X.

This statement remains valid if the Radon measure P is defined on a wider (j-algebra 05 generated by some local convex topology compatible with the duality between X and X'. However, if P is not a Radon measure, then the support need not be linear or separable.

Page 57: Local Properties of Distributions of Stochastic Functionals

§7. GAUSSIAN MEASURES ON LINEAR SPACES 41

Par t i t ions into parallel lines. Let us study partitions of a space with Gauss­ian measure and calculate the corresponding conditional distributions. We use the notation introduced in §3.

Let I G Hp and let T be the partition of X into lines parallel to Z, that is, orbits of the group (7.13). On each line 7 = {y G X \ y = x + cl,c e R1} G X / r , we define the Lebesgue measure A7 by setting

^j{y I y = x + cl,ci < c < c2} = c2 - c\.

By Theorem 7.1, there exists a linear measurable functional z G Xp 0 such that Iz = I. On 7 we introduce the one-dimensional Gaussian measure

(7.18) P7 {dy) = (2n)-^\z\ exp { - '^M-)xi (dy).

Note that the functional z is defined and linear everywhere on some linear space L G £ with P{L} = 1. Since I is an admissible translation, we have I G L. Therefore, the measure P 7 is well defined on the set 7r(L) = {7 | 7 c L}, and P r M L ) } = 1.

THEOREM 7.2. The system {P 7 } is the system of conditional measures for P with respect to T.

PROOF. According to (3.1), for C G £ and A G £x,r we must verify the relation

(7.19) P{C H n-\A)} = J P7{C}Pr (dry).

We shall only verify (7.19) for some special sets. Let / = {/J, 1 ^ i ^ n, and 9 = te'}» 1 ^ 3 ^ m> be two systems of centered functional, f^g^ G Xp 0 with (fuz) = {9j,z) = 0. We also choose three Borel sets Z G 93(R1), F G ©(R"), and G G <8(Rm), and set

C = {x G X I (x,z) G Z, {(xji)} G F } , A = TT({X G X | {(x,9j)} G G}).

Let us verify (7.19). In view of orthogonality, the functional z is independent of / and g. Therefore,

P f C r i T r - 1 ^ ) } = P { x 6 X | (x,z) G Z, {(xji)} G F, {<*,#)} G G}

= P{x € X I (x,z) G Z j P f t f o / i ) } G F, {{x,9j}} G G}

= 7V(0,|z|2){Z}P{{(x,/ i)} G F, {(x,9j)} G G}.

On the other hand, the definition of P 7 yields

P 7 {C} = l{<x, / i>}6FA/-(0,|z|2){Z}.

Thus, the right-hand side of (7.19) is equal to

pr{.4n {71 {(x,m eF,xe 7}}7V(o,|z|2){z} = p{x G r I {(xji)} G F, {{x,9j)} e G}M(o, \z\2){z}.

We have verified (7.19) for sets A and C of special structure. The general verification of (7.19) is left to the reader as a simple exercise in measure theory. •

Page 58: Local Properties of Distributions of Stochastic Functionals

42 3. GAUSSIAN FUNCTIONALS

Examples of Gaussian measures.

PROBLEM 7.4. Let X = X' = R 1 and P = Af(a,a2) G G(X). Construct the spaces Xp, Xp1, and X p 0 . Define the operators I* and I. Verify (7.14) by straightforward computation. Do the same for X = X7 = R n , n > 1.

We present some examples of Gaussian measures on infinite-dimensional spaces.

EXAMPLE 7.1. Let T be an arbitrary set, X = R T the space of all functions on T, and X' the linear space spanned by the coordinate functionals nt{x), i e T , given by nt{x) = x(£), x G X.

We treat X as the space of trajectories of a Gaussian process {xt(uj),t G T} and a Gaussian measure P on X as the distribution of this process. To specify P , it suffices to specify the mean value (barycenter), that is, a function a(-) G X and a symmetric function K: T x T —> R 1 satisfying the positive semidefiniteness condition

n y ^ K{U,tj)viVj > 0

for all n > 0, v G R n , and tu • • •, U e T. Then it follows from the classical Kolmogorov theorem on consistent systems

of finite-dimensional distributions ([18, Ch. 5, §5.1], [55, Ch. Ill, §4]) that there exists a unique measure P on (X, £) such that the coordinate functional 7T* has the distribution J\f(a(t),K(t,t)) with respect to P and

EP(TTS - a{s))(nt - a{t)) = K{S)t).

If T is not countable, then the Borel cr-algebra 05 is wider than the cylinder a-algebra £, and not each Gaussian measure is a Radon measure.

PROBLEM 7.5. Construct the operator I for Example 7.1. Verify that II*7rt = K(t,*). By using this fact, prove that the function K(t,*) is an admissible transla­tion for the measure P .

EXAMPLE 7.2. Let us consider the most important special case of the previous example. Suppose that X = R°° is the space of sequences and X' is the space of finite sequences. The duality between X and X ; is defined in the standard way,

oo

(x,a/) = ^xtx't. t=i

Let a(t) = 0 and

Then the measure P is the distribution of a sequence of independent A/"(0,1)-distributed random variables.

By applying Theorem 7.1, we obtain Hp = l2; that is, square summable se­quences are admissible translations for P .

EXAMPLE 7.3. Let X be a separable Hilbert space with inner product (•,•). We set X7 = X and use (•, •) to define the duality between X and X'.

In this case £ = 05 (each open set of X can be represented as a countable union of open balls, an open ball as a countable union of closed balls, and a closed ball as a countable intersection of half-spaces belonging to <£). To construct a Gaussian

Page 59: Local Properties of Distributions of Stochastic Functionals

§7. GAUSSIAN MEASURES ON LINEAR SPACES 43

measure P on (X, 2$), we choose several objects, namely, a vector a G X, a basis {ej}<j?=1 in X, a sequence {^•}^11 of independent random variables with distribution A/"(0,1), and a sequence {GJ}^=1 of nonnegative numbers such that YLT=i aj < °°-

We define P to be the distribution of the X-valued random vector

oo

3 = 1

This method permits one to obtain an arbitrary Gaussian measure on X. PROBLEM 7.6. Prove that X p 0 is a Hilbert space and the functionals fj(>) =

(7~1(- — a, ej) form a basis in this space. Derive the following formulas for the operators /*, 7, and K:

3 3 3 3

K(^2X3ej) =^2^X3e3' 3 3

Use the relation Hp = / (Xp 0) to obtain the following representation for the space of admissible translations of the measure P :

H P = {l G X | ] T |(/, e3)\V"2 < oo} C X. 3

The following theorem proves useful when one tries to find the space Hp in various situations.

THEOREM 7.3. Suppose that the correlation operator of the measure P admits a factorization K = J J*, where J is a linear operator that acts from a Hilbert space L into X and J* is the adjoint of J . Then Hp = J(L).

REMARK. For L = Xp 0 and J = 7, where I is introduced in (7.10), we obtain just the first statement of Theorem 7.1.

PROOF. Let Lo be the closure of J*(X') in L, and let LQ- be the orthogonal complement of Lo. The mapping x: I*{xf) —> J*(x') is an isometry between 7*(X') and J*(X'); indeed,

(JV, J V ) L = (JJ*x',y') = (Kx',y') = <J/V,y') = ( /V,/V).

The isometry x extends to an isometry between Xp 0 and L0. Let us show that I = JH. Indeed,

JKZ = JxPx' = JJ*x' = II*x' = Iz for z = I*x' G Xp 0. Hence, by Theorem 7.1 we have

H P = I(XJ>,0) = JxiX^o) = ^(Lo).

On the other hand, for I G LQ" and x G X7 we have (Jl^x'} = (/, J * X ' ) L = 0. Therefore, Jl = 0 and J(L) = J(L0) = H P . •

We consider some examples to illustrate how this theorem can be used.

Page 60: Local Properties of Distributions of Stochastic Functionals

44 3. GAUSSIAN FUNCTIONALS

EXAMPLE 7.4. A space with a reproducing kernel. Let P be a Gaussian mea­sure on X = R T with correlation function K(s,t) (see Example 7.1). Let L be the Hilbert space with reproducing kernel if, that is, the space of functions on T obtained by completing the linear span of the functions Ks = K(s, •) with respect to the inner product (Ks,Kt) = K(sit).

We define an operator J*: X' —» L by setting J*(ns) = Ks. Then J is an embedding of L in X, and K = J J*. By Theorem 7.3, the space L is just the space of admissible translations for P (cf. Problem 7.5).

EXAMPLE 7.5. Let X be the space of bounded measurable functions defined on a metric compactum T. We define the dual space X ' as the space of finite signed measures Z(T). The duality is given by the formula

</,"> = j /(<)"(*), / e x , uex'.

Let P € £o(X). The correlation operator K: X ' —> X is an integral operator of the form

(7.20) Kv(s)= f K(a,t)v(dt)>

where the kernel K(s,t) = fx f(s)f(t)'P(df) is a positive semidefinite function on T2 . Let L be the completion of the linear span of the functions

/„(•) = J K(-,t)v{dt), vex',

with respect to the inner product

( / « , , / M ) L = / K(s,t)v{ds)ii(dt). JT2

Then K can be treated as a linear operator from X' to L, and the embedding J : L —> X is the adjoint of K. Therefore, for J* = K we have K = J J*, and by Theorem 7.3, the admissible translations of the measure P are just the elements of L.

EXAMPLE 7.6. The Wiener field. Let P be the centered Gaussian measure on X = C(T), T — [0, l ] d , corresponding to the correlation function

d

(7.21) K(8, t) = Y[ min(5i, U), 5, t G [0, l]d.

To describe admissible translations, we construct a factorization (7.20) of the operator K different from that considered in the preceding example. We set L = L2(T) and define operators J : L —> X and J*: X ; —> L by setting

r d

Jh(s) = / h(t)Xd (ctt), [0, s] = JJ[0,8i], se T; (7.22) [0,sl *=i

J*v(t)= [ v(ds). JT\[Qtt]

Page 61: Local Properties of Distributions of Stochastic Functionals

§7. GAUSSIAN MEASURES ON LINEAR SPACES 45

PROBLEM 7.7. Verify that the operators J and J* are adjoints of each other and K = JJ*.

Thus, formula (7.22) provides the general form of admissible translations, and moreover, ap(Jh) = (JT \h\2)~x is the admissibility.

The one-parameter case (d = 1) is of particular importance. Here the measure P defined on C[0,1] is the distribution of the Wiener process (Brownian motion), that is, the Gaussian random process w(£), t G [0,1], defined by the conditions Ew(t) = 0 and Ew(s)w(t) = min(s, t). This measure is called the Wiener measure. By applying formulas (7.22) and (7.14) to P , we obtain the following classical result.

THEOREM 7.4. Let P be the Wiener measure on C[0,1], For a function I G C[0,1] to be an admissible translation of P , it is necessary and sufficient that 1(0) = 0, I is absolutely continuous, andV G L2[0,1]. If I is an admissible translation, then l(l) = ar2(l) = f*l,2(s)ds and

(7.23) ^ > ) = exp | j f ' I'(s) dw(s) - \ j * lf2(s) d s } .

EXAMPLE 7.7. Semistable Gaussian processes (fractional Brownian motion). Let us consider the centered Gaussian process wa(t) on R+ with correlation function

K(s,t) = l[sa + ta-\s-t\a}.

We can choose the parameter a in the interval (0,2]. Let P a be the Gaussian measure on X = C[0,1] corresponding to the restriction of wa(t) to the interval [0,1]. The correlation operator has the integral form (7.20). Straightforward calcu­lations show that we have the factorization K = J J*, where J: L = L2(RX) —> X is the integral operator with kernel

fat) = C/3{(s - *)-%,.](*) + K« - *)-' - (-tr^-ocoiW}. / ? = ^ , cfi = {J°J(l-r)-f>-{-r)-'i)2dr + j\l-r)-Wdr} ^

Note that the operator J*: X7 —» L2(RX) is also an integral operator with kernel j*(syt) =j(t,s).

Thus, the admissible translations for P a have the following general form

(7.24) l(s) = op [ h(t)[(s - t)-? - (-t)-*3] dt + cp [ h(t)(s - t)-*dt, J-oo JO

where h G L2(R J) is called the fractional derivative of order 1 — /? of the function /. For a = 1 and (3 = 0, we arrive at the distribution of the Wiener process, and

(7.24) is reduced to the statement of Theorem 7.4. Let us give (without proof) a very useful sufficient condition for the function I

to be representable in the form (7.24). This is a differentiability condition in terms of the Fourier transform.

Page 62: Local Properties of Distributions of Stochastic Functionals

46 3. GAUSSIAN FUNCTIONALS

PROPOSITION 7.5. Suppose that Z(0) = 0 and the Fourier transform I of the function I belongs to L2(R1 , \u\l+a \ (du)). Then I can be represented in the form (7.24) and

[ \h\2dt< f \T{u)\2\u\1+adu.

EXAMPLE 7.8. Stationary Gaussian processes. Let us consider a Gaussian process (&), t ^ R 1 , with zero mean, correlation function K(s,t) = K(t — s), and spectral measure £ ,

K{s,t)= J e^-s^B(du).

Let P be the measure on L2[0,1] corresponding to the restriction of £ to [0,1] (in this example, we deal with L2-spaces consisting of complex-valued functions). The correlation operator K of the measure P , which is the integral operator with kernel K(*,*), can be factored as follows: K = J J*, where (verify this!)

J : L 2 ( R 1 , 5 ) ^ L 2 [ 0 , 1 ] ,

Jh(s) = [ e~isuh{u) B(du), s G [0,1],

rg(u)= f eisug(s)ds, ueR1. Jo

Therefore, I is an admissible translation for P if and only if

(7.25) l(s)= f e-isth(u)B(du)

for some h G L2 (R1, £ ) . Admissible translations are of the simplest form if the process £ has a spectral

density, B(du) = b(u) du, and if

(7.26) ci(l + \u\)~2n < b(u) < c2(l + M ) " 2 n ,

where n is an integer. In this case, equation (7.25) has a solution if and only if l,l^l\... , ^ n _ 1 ) are absolutely continuous and l^ £ L2[0,1], that is, I belongs to the Sobolev space W J . The measure of admissibility is equivalent to the reciprocal of the Sobolev norm:

/ n \ - l / 2

For the proof of these statements, the reader is referred to [88, Ch. II, §3]. Note that the description of admissible translations remains the same if condition (7.26) is satisfied only at infinity rather than on the entire line. This follows from Problem 7.3 and the statement that the measures corresponding to stationary processes whose spectral densities coincide outside some neighborhood of zero are equivalent [52].

EXAMPLE 7.9. Gaussian white noise. Let T be a set, X an algebra of subsets of T, X the linear space of simple finitely additive signed measures defined on X, and X ; the space of simple X-measurable functions of the form

n

x'(t) = J^ca^W; ci e R\ Ai e X.

Page 63: Local Properties of Distributions of Stochastic Functionals

§8. SMOOTH FUNCTIONALS 47

The duality between X and X' is defined by the relation n -

(x,x') = y2cix{Ai} = / x'{t)x{dt).

Let m be a countably additive measure on (T, X). The centered Gaussian measure P on X with correlation operator

(Kx\y') = J x\t)y\t)m{db)

is called the Gaussian white noise with variance m. The signed measure of a given set A is J\f(0) ra(A))-distributed with respect to

P . The signed measures of disjoint sets are independent. Therefore, the white noise can be considered as the distribution of a random finitely additive signed measure with independent values.

PROBLEM 7.8. Prove that the core H P c X has the form

H P = {r{t)m(dt) G X | r(-) G L2(T,X,m)}.

§8. Smooth functionals

Spaces of smooth functionals and differential operators. Suppose that (X, X') is a pair of spaces in duality, P G £o(X) a centered Gaussian measure, Hp C X the core of P , and / : X —> R 1 a stochastic functional. We define the notion of smoothness of / with respect to P , examine the smoothness of P itself, and derive a useful formula of integration by parts. Then we use these results to study the smoothness of distributions of functionals.

Recall that Hp is a separable Hilbert space with norm \l\2 = J (I) defined in (7.17). Let H C Hp be a finite-dimensional subspace. A functional / is said to be m times continuously differentiate along H (we write / G Cg(X)) if for any x G X the function h i-» f(x + A), h G H, is m times continuously Prechet differentiable on H with respect to the Hilbert structure inherited from Hp . By f£ we denote the ith derivative at a point x G X. We treat this derivative as a polylinear form, / J J G £2(H, R 1 ) , and use the explicit notation

f$(x)[hu...,hi]i XGX, /i.-GH,

whenever necessary. For the derivative, we use the Hilbert-Schmidt norm

l^ ) W| 2 = E l / H ) K , . . . , e Q i ] | 2)

a

where {e^} is an orthonormal basis in H. Let us choose a multi-index q = (g(0),. . . ,tf(ra)), q(j) > 1. A functional / G

L^(°)(X, P) is said to be Hp-differentiable of order q if for any increasing system of finite-dimensional subspaces H n /* H and the corresponding orthogonal projections Qn: Hp —> H n , there exists a sequence of functionals fn G C g n convergent to / in L^°)(X,P) such that the sequence of mappings

X^tiJ\x)[Qn(-),...,Qn(-)}

Page 64: Local Properties of Distributions of Stochastic Functionals

48 3. GAUSSIAN FUNCTIONALS

converges in L^')(X, P , ££(H P , R1)) for each j = 1,2,... , ra. The limits of these sequences are called the derivatives of f along Hp and

denoted by f^\ These derivatives are Hilbert-Schmidt polylinear forms, f^\x) G / ^ ( H p j R 1 ) , and are independent of the choice of the sequences H n and fn (the proof is left to the reader). In particular, the first derivative fW can be treated as an element of the space H p = / ^ ( H p j R 1 ) .

By W 9 (X,P) we denote the space of q times Hp-different iable functionals. In some problems, only the local integrability of the derivatives is required. A functional / is said to belong to the class W ^ X , P) if for some sequences of open sets 4 / X and functionals fn G W9(X, P) , we have fn = f on An for all n. One can verify that in this case the sequences of derivatives /A converge P-almost everywhere. Just as above, we denote the corresponding limits by f^\

In quite a similar way, one can define the spaces C H ( X , 5 ) , W9(X, P , £ ) , and WioC(X, P , B ) of differentiate functionals with values in an arbitrary Hilbert space B.

PROBLEM 8.1. Suppose that X is equipped with a norm that makes X a Banach space and a functional f is m times Prechet differentiate with respect to this norm. Prove that if | | /^ (0II G L«0>(X,P), 0 < j < ra, then / G W*(X,P).

Let us consider a measurable mapping V: X —> H p , which will be called a vector field (there is a vector V(x) assigned to each point i G X ) .

We define a differential operator D on the space VVfc * (X, P) by setting

(8.1) Dg(x) = (gM(x),V(x)), x e X.

Let us show how the operator D acts in the scale {W9}. By d x m we denote the multi-index (d , . . . , d) of length m.

PROPOSITION 8.1. Suppose that d > 1, q = (go>---><Zm) is a multi-index, and Si = (d_ 1 + ^ \ ) _ 1 > 1. Consider the multi-indices s = (so, • • • , s m _i ) and dxm = (d, . . . ,d) of length m. Suppose that V G W d x m ( X , P , H P ) . Then the following assertions hold:

1)D: W 9 ( X , P ) - » W S ( X , P ) ;

2) ifqi>d/(d — i), 1 < i < m, then the iterated differentiation operator

Dm: W 9 ( X , P ) ^ L 1 ( X , P )

is well defined] 3) similar statements are valid for the spaces W^oc.

PROOF. Let us estimate Dg for g G W9(X, P) in terms of moments. Using the Cauchy and Holder inequalities, we obtain

(d—so)/d / p \ so/d

[ \Dg(x)\s°I>(dx)< [ \gW(x)\9°\V(x)\8°P(dx) Jx Jx

y»(x)\«J>(dx)) ^\V(x)\dP(dx)j

Similar estimates can be obtained for the derivatives of the functional Dg; to this end, instead of the inequality \Dg\ < \g^\ \V\ we use the inequality

\Dg^(x)\<2l(im^+i\g^(x)\){v^Xi\V^(Lx)\).

Page 65: Local Properties of Distributions of Stochastic Functionals

§8. SMOOTH FUNCTIONALS 49

As a result, we obtain (Dg)^ G L S i (X,P) , 0 < i < m — 1, and hence, Dg G W S (X,P) .

The second statement is proved by applying the first statement m times. •

Integration by parts and smoothness of the Gaussian measure. Our immediate goal is to define the derivative of P with respect to D and derive the corresponding formula of integration by parts. First, we obtain a finite-dimensional result.

LEMMA 8.1. Suppose that V: R n —> R n is a C 1 vector field, P is a standard Gaussian measure on R n , and q>l. If |V|, \V^\ G L 9 ( R n , P ) , then

(8.2) / ({x,V(x))-trV^(x))P(dx) = 0.

PROOF. Let n = 1. Integrating by parts, we obtain

/ {x,V(x))P{dx) = -^= [ xV{x)e~x2/2dx

= -£= [°° V'{x)e-X^2dx= f tvV^(x)P(dx).

By applying the one-dimensional result to the restrictions of the scalar projec­tions of V on lines parallel to the coordinate axes, we obtain the multidimensional result. •

LEMMA 8.2. Let H C Hp be the n-dimensional subspace spanned by the or-thonormal system {lj = Kxp 1 < j < n,Xj G X '} , where K is the correla­tion operator of the measure P . Let V: X —> H be a vector field such that V G C J I ( X , H ) D > V ^ ( X , P , H ) , q>\. Then

(8.3) / ((x,K-lV(x)) -txV{l\x))¥(dx) = 0. Jx

PROOF. First we note that the integrand is well defined for all x G X; if v(x) = ^2j vj(x)lj> t n e n

(x,K~lV{x)) = Y^vjtofaK-Hj) = 5 > , - ( * ) < * , 4 ) . 3 3

Moreover, for x G H we have

{x,K-1V(x)} = (x,V(x))H,

so that (8.3) is a generalization of (8.2). By applying (8.2) to each n-dimensional plane of the form y + H and then integrating with respect to y, we get (8.3). •

LEMMA 8.3. If the assumptions of Lemma 8.2 are satisfied, then

(8.4) / (g^{x),V(x))P{dx) = I g(x) \(x,K-lV{x)) - t r F ^ ( x ) ] P(dx) J x «/x

for each functional g G C^(X) n W«''9 '(X, P ) , q' = q/(q - 1).

Page 66: Local Properties of Distributions of Stochastic Functionals

50 3. GAUSSIAN FUNCTIONALS

PROOF. First, assume that g and gW are uniformly bounded on X. Then the vector field V\{x) = g(x)V(x) satisfies the assumptions of Lemma 8.2. We apply (8.3) to V\ and use the identity (verify it!)

tvV?\x) = g(x)tiV^(x) + (ff(1)(s),V(aO).

By substituting this expression into (8.3), we obtain (8.4). To prove the statement in the general case, one approximates the functional g by bounded functionals and then passes to the limit in (8.4). •

We can treat (8.4) as a formula of integration by parts. We set

(8.5) Sv{x) = t r l / ( 1 )(x) - (x,K-lV{x))

and define the derivative of P with respect to D by the formula

(8.6) DP{dx) = Sv(x) P(dx).

Then we can rewrite (8.4) in the canonical form

(8.7) / (Dg)(x) P(dx) = - J g(x)(DP) (dx).

So far, this formula has been proved only for a finite-dimensional field V. In the general case it also holds, but the expression (8.5) for the density of the derivative of P cannot be interpreted literally, since each of the two terms on the right-hand side in (8.5) taken separately need not be well defined. A generalization to the infinite-dimensional case is given by the following theorem.

THEOREM 8.1. Letd be an even number, q = (d,d), qf = (d/(d—l),d/(d—l)), and V e W9(X, P , H P ) . Then there exists a function 6V G Ld(X, P) such that (8.7) holds for each functional g £ Wq (X, P) and the measure (8.6).

P R O O F . Let gn and Vn be approximating sequences for g and V, respectively. By (8.4), we have

(8-8) / (g^(x))Vn(x))P(dx) = - [ gn(x)6n(x)P(dx), Jx. Jx

where 6n(x) = trVn (x) - (x,K~ lVn{x)). Obviously, the left-hand side of (8.8) converges to the left-hand side of (8.4). Let us show that 6n is a Cauchy sequence in Ld(X, P) . To this end, we need the following estimate [1].

LEMMA 8.4. Suppose that d is even, V: R n —> R n is a continuously differen-tiable field, and P is the standard Gaussian measure on R n . Then there exists a constant c(d), independent of n and V, such that

(8.9) / ((x,V(x))-trVM{x))dP{dx)<c(d) [ (\V(x)\d + \V^(x)\d)P{dx).

Since 6n(x) — 6k{x) = trU(x) - (x, U(x)) and U = Vn — 14, it follows that 6n

is a Cauchy sequence in L d (X ,P) . We write 8y = limn_>oo<5n. Using the Holder inequality, we can easily justify the passage to the limit on the right-hand side in (8.8). Thus, (8.8) yields (8.7) as n - • oo. •

Page 67: Local Properties of Distributions of Stochastic Functionals

§8. SMOOTH FUNCTIONALS 51

Let us generalize Theorem 8.1 to the case of iterated differentiation. Suppose that m > 1 and d > m is even. For a vector field V, we define functions a o , . . . , am : X —> R 1 by setting

(8.10) a0 = 1, Oi+i = (af \ V) + ai«v ,

where <5K is defined in (8.5). The reason for introducing these functions is that

(8.11) Di P(dx) = ai(x) P(dx),

which is a generalization of (8.6). Let us justify (8.11). Needless to say, the sequence ^ is not well defined by (8.10) for all fields V. But it is well defined at least for m times continuously differentiate fields. To state the theorem, we need spaces of measures differentiable with respect to D on some definite function classes Sj. These spaces, W(D | So, • • • j Sm), were defined in §6. For example, the statement of Theorem 8.1 means that

P e W\D I L ^ X , ? ) , W ^ x 2 ( X , P ) ) .

THEOREM 8.2. Let m > 1, and let d> m be even. Consider the multi-indices q = dx (m + 1) and qi = (d/(d - i)) x (i + 1). If V e W 9 ( X , P , H P ) , then P eWfi | L 1 (X ,P ) ,W 9 1 (X ,P ) , . . . ,W < ? - (X ,P ) ) . For each i = l , . . . , r a , we can choose a representing measure for the functional DlP in the form ai(x)P(dx), where ai eLd^(X,P).

PROOF. For i = 1, everything has been proved in Theorem 8.1. For i > 1, the proof is similar. Instead of (8.8), we pass to the limit in the chain of relations

/ Din9n(x)P(dx) = ..- = ( - 1 ) ' / Dirj9n(x)ajtn(x)P(dx)

= . . . = (-1)* / gn(x)aiin(x) P(dx). 7x

Here Dn is the differential operator generated by a finite-dimensional field Vn that approximates V, a i > n , . . . , amjn is the sequence of the form (8.10) corresponding to the field Vn, and gn(x) is a smooth functional that approximates g G W9 i(X, P) .

To justify passing to the limit, we again use Lemma 8.4. In the limit, we obtain the following formula of integration by parts:

(8.12) / Dig(x)P{dx) = (-1)* [ g(x)ai(x)P{dx). Jx Jx.

This relation just means that ai(x) P{dx) is a representing measure for DlP. Sometimes it is useful to express the density of the highest derivative of P , that

is, the function a , i > 1, in terms of the density a\ = 6y of the first derivative. It follows from (8.10) that a can be represented as a linear combination of the expressions

i

(8.13) {6v)h{D6v)*...{&-%)*, ft >0, Y,rti=L

3=1

For example, a2 = 6^ + D6y and a3 = 6^ + S6V{D6V) + D28v. •

Page 68: Local Properties of Distributions of Stochastic Functionals

52 3. GAUSSIAN FUNCTIONALS

§9. Distributions of smooth functionals

Absolute continuity of distributions. Theorem 4.6 provides a general cri­terion for absolute continuity of the distribution of a stochastic functional in terms of admissible semigroups. Since a Gaussian measure has a large supply of admissi­ble linear groups (7.13), we can specify the general theory developed in §4 for the Gaussian case.

Suppose that (X, X7) is a pair of spaces in duality, P G £(X), Hp C X is the core of the Gaussian measure P , and TH is the discrete topology in Hp . Let B G Q5X. By (4.21), a functional / : X -> R 1 belongs to the class M{B, H P ) if for each x G B there exist a direction I G Hp and a neighborhood V C X such that the function y —> Dif(y) is continuous on V fl B and Dif(x) ^ 0.

PROPOSITION 9.1. Suppose that B G 2$x> / : X —> R 1 is a measurable func­tional, and f G «M(i?,Hp). Then P B / _ 1 <£. A. In particular, ifP{B} = 1, then the distribution of f is absolutely continuous.

PROOF. Let us verify the assumptions of Theorem 4.6. Obviously, the admissi­ble semigroups {G\ I G H p } form a (S , Hp)-sufRcient class of admissible mappings. Thus, the first assumption of the theorem is satisfied. By virtue of the remark fol­lowing Theorem 4.6, the second assumption is satisfied and we need not verify it. The third assumption of Theorem 4.6 is included in Proposition 9.1. Finally, the fourth assumption is also satisfied, since P is a Radon measure. •

By way of example, let us consider the distribution of an integral functional. Suppose that (T,/x) is a standard Borel space with measure, X = L2(T, ^ ) , and / : X —> R 1 is an integral functional of the form

(9.1) f(x) = J q(x(t)) n(dt).

Suppose that the kernel q satisfies the local Lipschitz condition and the de­rivative q' exists, is continuous, and does not vanish on some set S c R 1 with A{RX \ S} = 0.

Let P G £/(X) satisfy the nondegeneracy condition

(9.2) | l V | 2 = D P ( . , j / ' ) > 0

for any nonzero y' G X' = X. We also assume that the measure P is concentrated on bounded functions, that is, P{L°°(T,^)} = 1.

PROPOSITION 9.2. Under the above assumptions, P / _ 1 <C A.

PROOF. We set B0 = {x G X | n{t \ x(t) e S} = /J,(T)} C X. For each M > 0, we introduce the set B = B(M) = [x G X | H Hoo < M}f)B0. We intend to apply Proposition 9.1 to B, / , and P . To this end, we must verify that / G M(ByTij>). Indeed, for x G BQ and / G Hp C L°°(T,/x), we have

(9.3) Dtf(x) = JV (*(*))/(*) M o ­

using this expression and Lebesgue's dominated convergence theorem, we see that

Dif(>) is continuous. On the other hand, for x G So we have q'(x(t)) ^ 0 for

ju-almost all t eT. Therefore, the expression (9.3) is nonzero for some / G X' = X.

By Proposition 7.4, condition (9.2) implies that Hp is dense in X; therefore, one

Page 69: Local Properties of Distributions of Stochastic Functionals

§9. DISTRIBUTIONS OF SMOOTH FUNCTIONALS 53

can choose an I G Hp so that Dif(x) ^ 0. Hence, / G ,M(i?,Hp), and by Proposition 9.1, we have P B ( M ) / - 1 ^ A.

It remains to note that, by assumption,

J im P{B(M)} = P{L°°(T,») D B0} = P{B0}.

But it follows from (9.2) that P{B0} = 1. Therefore, letting M —> oo, we obtain P / " 1 < A. •

Similar assertions can be proved for integral functionals on the space of con­tinuous functions, for the integral functional with the kernel q(x(t),t), and for an integral functional whose kernel is a smooth function only on a small part of the real axis (in this case, it is necessary to require that P-almost all sample functions pass through this part). For the corresponding results see [34, 40, 58].

PROBLEM 9.1. Let X = C[0,1], and let P be a Wiener measure on X. We define an integral functional / by setting

f(x)= [ q(x(t))dt. Jo

Suppose that the derivative q'(*) exists, is continuous, and does not vanish on some open set whose closure contains zero. Prove that P / _ 1 <C A.

Est imates for the dis t r ibut ion density. In Proposition 9.1, the assumption that the derivative of the functional is nondegenerate, that is, Dif(x) ^ 0 for an appropriate /, is most important for the absolute continuity of the distribution. We develop this approach and obtain some estimates for the distribution density in terms of the probability of the sets on which the differential D.f is close to zero. In particular, these bounds prove useful in finding conditions for the density to be bounded.

Suppose that P G 5(X), B G £ X , {#n}> {En} C <£x a r e t w 0 covers of the set B, {ln} C Hp is a sequence of admissible translations, a(ln) is the admissibility (7.16) of Zn, and / : X —> R 1 is a measurable functional. We set

8n = inf [sup{c G R 1 | x + cln e En} - inf{c G R 1 | x + cln G En}],

which means that 6n is the minimum length of the interval cut by the set En on a straight line parallel to ln.

Suppose that 1) Bn c En\ 2) En cuts a convex set on each line parallel to ln\ 3) the functional / is monotone on Bn along ln and its derivative satisfies the

estimate

(9.4) inf \Dlnf(x)\ > dn. xeBn

THEOREM 9.1. Under the above assumptions, we have

dPef-1 (9.5) ess sup BJ (v) < 2 ^ / i + l ^ ^ P ^ } veR1

x m a x ^ - V G n r ' a + iainPitfn}!1/2)}.

Page 70: Local Properties of Distributions of Stochastic Functionals

PROOF. Let Yn be the partition of X into lines parallel to ln. By P n and {P™} we denote the corresponding quotient measures and families of conditional measures. Since B c \Jn Bn, it follows that

n n J*-/ln

By using formula (4.18) and the estimate (9.4), we see that the nth term in (9.6) does not exceed

r d P n

(9.7) d~l SUP -TT^Pn(d7) . 7 x / r n Bnn7T-1(7) «A 7

Here A™ is the invariant measure (4.4) on the straight line 7r-1(7):

\"{y 12/ = x + c/n, cG [ci,c2]} = c2 - c i .

To estimate the density of the conditional measure, we use the following ele­mentary result.

LEMMA 9.1. Suppose that N = A/*(a, a2) is a Gaussian measure m R1 , p = dAf/dX, U is an interval in R1 , and X(U) > 8. Set p = infc€c/ \c — a\. Then

(9.8) M{U) < exp{-p2/2<72},

(9.9) supp(c) < 2y/emax{6~\a-1,pa~2}J\f{U}. ceu

We use this lemma in the following situation "embedded" in the space X: M = P™, a7 is the barycenter of the measure A/*, and

U = 7r-1(7)f iEn , a = a(ln), 6 = 6n,

p = p1 = inf {|c|,c | a1 + cln G En}.

By using (9.9), we obtain the upper bound

2x/i<C1 / m^{8-\a{ln)-\p1a{lny2}PVi{En}Pn{d^

for (9.7). Let us split the interval into two parts according to the conditions

P^ln)-1 < \2\nP{En}\1'2 + 1, p^ln)-1 > \2lnP{En}\^2 + 1.

In the first domain of integration, we use the total probability formula for P{En} and obtain the majorant

2 ^ - 1 m a x { ( 5 - 1 , a G n ) - 1 ( | 2 1 n P { E n } | 1 / 2 + l )}P{E n } .

In the second domain of integration, we use the estimate (9.8) for P™{En} in the form

P^{En} < exp{- / 02/2 (r(/n)2}.

Page 71: Local Properties of Distributions of Stochastic Functionals

§9. DISTRIBUTIONS OF SMOOTH FUNCTIONALS 55

We substitute this estimate into (9.7) and, since the function v —> v exp{—v2/2} is monotone for v > 1, obtain the following majorant for (9.7):

2 ^ C 1 m a x { ( 5 - \ a ( U - ^ | 2 1 n P { E n } | 1 / 2 + i)}e-(|2inP{JE;n}|+i)/2

= 2d"1max{(!)-1 ,a( /n)-1( |21nP{En}|1 /2 + l )}P{E n } .

The sum of these two estimates yields (9.5). •

To use Theorem 9.1, one must choose the cover Bn and the translations ln so that the estimates of dn in (9.4) and of a(ln) are not too small. Then one must extend Bn so that condition 2) preceding Theorem 9.1 is satisfied. This extension is taken as En.

Numerous examples of calculating the admissibility of measures, given in §7, allow one to apply this scheme to a wide class of Gaussian measures.

PROBLEM 9.2. Let X, P , / , and q be the same as in Problem 9.1. Suppose that the kernel q is monotone on some intervals (—a, 0) and (0,a) and satisfies the estimate \q'(r)\ > b(r) for almost all r G (—a,a). What conditions on b guarantee the boundedness of the density of the distribution P / _ 1 ?

For the history of this problem, see the bibliographical notes in the end of the book. Sakhanenko [93] proved that the function b(r) = \r\~n exp{—7r2r~2/8} guarantees the boundedness of the density, whereas this is not the case for b(r) = | r | -9exp{-7r2r-2 /8}o(l) .

Smoothness of dis tr ibut ions. Suppose that / : X —> R 1 is a stochastic functional defined on a space X with a Gaussian measure P G GoPQ. We study the smoothness of the distribution of / with respect to P . More precisely, we obtain the conditions under which the distribution P / _ 1 belongs to the class of measures Wx introduced in §6. To verify whether a distribution is smooth, we need to apply the general Theorem 6.1 in the Gaussian situation. In §8, we introduced the tools, namely, the spaces W 9 (X,P) of smooth functionals and W 9 ( X , P , H p ) of smooth vector fields.

Thus, suppose that V: X —> Hp is a vector field, D the differential operator (8.1) corresponding to V, and 6y(x) the density of the measure .DP, defined in Theorem 8.1.

We use the notation / ( / , i) introduced in Theorem 6.1 for the set of nonnegative integer multi-indices k = (&i, . . . , fc/) such that Y^j=\J^o ~ I — i- We write |A;| = fci+ ••• + *«.

THEOREM 9.2. Suppose that m is a positive integer, d > m + 1 is even, and the following three conditions are satisfied:

l)Ve W d x ( m + 1 ) ( X , P , H P ) and f G W S (X ,P) , where s = (s0, • • • ,*m+i) and Si = d/(d-i)\

2)

(9.10) P{x G X | Df(x) = 0} = 0;

• x |£>"H-i/ | f cm

x\Di-Hv\(3i){x)P{dx)

(9.11)

3) the integral

[ [\Df\-m~W x |£>2/|fcl x

x\6v\*x~

Page 72: Local Properties of Distributions of Stochastic Functionals

56 3. GAUSSIAN FUNCTIONALS

is finite for alii = 0 , . . . , m and any multi-indices k G /(ra, i) and ft G /(z, 0). U t e n P / ^ e W r .

P R O O F . Let us more precisely describe the sets of functions on which we con­sider the operator D. We introduce the multi-indices qi = (d/(d — i)) x (i + 1), 1 < i < ra, and we set S0 = LX(X,P) and Si = W * ( X , P ) , 1 < i < ra. Then, by Theorem 8.2, the measure P is smooth, that is,

P e W m ( £ > | S 0 , S i , . . . , S m ) .

This smoothness is one of the conditions under which Theorem 6.1 can be used. Next, we must verify that the integrals (6.2) are finite. This is a direct consequence of the fact that the integrals (9.11) are jfinite and that the measure fj,1 = DlP = ai(x)P(dx) can be represented as a sum of the products (8.13). Obviously, the other assumptions of Theorem 6.1 are also satisfied. Therefore, P / ^ e W i . •

The first assumption of Theorem 9.2 comprises the requirements imposed on the smoothness of / and V. The second assumption means that the derivative of / is nondegenerate along V. The third assumption combines quantitative requirements imposed on smoothness (the terms | ^ / | , 2 < j < m + 1), nondegeneracy (the factor \Df\ raised to a negative power), and compatibility of V with P (the factors including 6y).

Note that, by Proposition 8.1 and Theorem 8.1, the first assumption of the theorem provides that all the expressions in (9.11) are well defined. Therefore, the third assumption is purely a moment condition.

To illustrate the above arguments, we consider the simplest special case m = 1.

COROLLARY. Suppose V e W d > d (X,P,H P ) , / e WM/(d-i) ,d/(d-2)(x,p) , (9.10) holds, and the integrals

f [\Df\-2\D2f\](x)P(dx), f [\Dfn6v\](x)P(dx)

are finite.

Then P / _ 1 G W1 # Thus, the distribution density of the functional f is of bounded variation] in particular, the density is bounded.

It follows from the corollary that, to apply Theorem 9.2, we must require the functional / to be at least two times differentiate along V.

To use Theorem 9.2 for the proof that the distribution of / is smooth, we must be able to construct an appropriate vector field V. If all moments of the (m + l)st derivative of / are defined, then V can be readily constructed. This is shown in the following theorem.

THEOREM 9.3. Suppose that m > 1, / G f|d>i W d x ( m + 2 ) ( X , P ) , and

(9.12) E | /(D|-2m-6 = / \fW(x)\-*™-*p(dx) < oo

for some 6 > 0. ThenPf^eW™.

Page 73: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND THE ISO PERIMETRIC PROPERTY 57

PROOF. Let us verify that the assumptions of Theorem 9.2 are satisfied for the functional / and the vector field V(x) = f^(x). We see that the derivatives of / and V have the required smoothness and integrability. Furthermore, in view of the identities

Df(x) = (/«(*), V(*)) = (/(1)(*),/(1)(*)) = l/(1)(*)|2,

(9.10) follows from (9.12). Let us verify that the integrals (9.11) are finite. Under the assumptions of our

theorem, it is convenient to couple all factors of the form |.DJ'/I> 2 < j < m + 1, with the compensating factors | . D / | ~ \ since the functionals \D^f\ \Df\~1 have all moments. For example,

\D2f\ \Df\~1 = 2\fW(x)[fll\x),fllHx)]\\Df\-1 < 2 | / ( 2 )(x) | .

The factors |.DJ<$y|> 0 < j < m — 1, also have all moments. It remains to consider the first factor. After the compensating factors have been canceled out, the first factor acquires the form

\Df\ * =\Df\-m = i/(1)r2m-

Therefore, the Holder inequality and condition (9.12) ensure that (9.11) is finite. •

§10. Convexity and the isoperimetric property of the Gaussian measure

The main goal of this and the next section is to obtain some information about the distribution of a convex functional defined on a space with Gaussian measure. This problem is simplified by the fact that the Gaussian measure is, in a sense, convex. This property is closely related to the famous isoperimetric theorem stating that of all sets of the same measure, a half-space has the least "surface area." Following Ehrhard, who proposed a unified approach to all these problems, we first consider the properties of the standard Gaussian measure on R m , then extend the results to arbitrary Gaussian measures, and finally arrive at a qualitative description of the distribution of a convex functional.

Symmetrization of the Gaussian measure on Rm . In this subsection, P m

stands for the standard Gaussian measure on R m , that is, the Gaussian measure with respect to which the functionals of the form (*,e), where e G R m is a unit vector, have the distribution A/"(0,1). By P& we also denote the projection of the measure P m on a fc-dimensional linear subspace L C R m (k < m). By II(e,r) = {x G R m | (x, e) > r } , we denote half-spaces.

Suppose that 1 < k < m, L is an (n — fc)-dimensional subspace of R m and e G R m is a unit vector orthogonal to L. Let us define the Gaussian k-symmetrization with respect to L in the direction e. This is a mapping that assigns a set A' to each open or closed set A c R m . The set A' is defined as follows. Let x G L. Then the following properties hold:

1) if Pk{A C\{x + L-1)} = 0, then A! n (x + L-1) = 0 ;

2) if Pk{A H (x + L x )} = 1, then A' n (x + L-1) = x + L-1;

Page 74: Local Properties of Distributions of Stochastic Functionals

58 3. GAUSSIAN FUNCTIONALS

3) if 0 < Pk{A n(x + L-1)} < 1, then either

A! D (x + I / ) = n(e, a) n (a; + L x )

(if A is open), or

A' n (x + L-1-) = i r (e, a) n (x + i / ) (if A is closed).

Here the number a is determined from the equation

Pk{An(x + L±)} = Pk{U(eya)n(x + L±)}.

We write A' = S[A] = 5(L,e)[A] for the symmetrization. The symmetrization of a set A can be described as follows. We take a plane

x + L1- parallel to the fc-dimensional space L-1, calculate the Gaussian measure of the trace of A on x + L-1, and replace this trace by the fc-dimensional half-space of the same measure with boundary hyperplane orthogonal to e. The union of all the half-spaces thus constructed gives S[A].

EXAMPLE 10.1. If m = k = 1, L = {0}, and e = - 1 , then for each set A, its symmetrization is the ray 5(L, e)[A] = ( — oo, $ _ 1 ( P I { J 4 } ) ) of the same measure, where $(r) = Af(0, l ){(-oo,r]} .

If e = 1, then the symmetrization yields the ray

S(L,e){A} = ($-l(l-P1{A}))+oo)

directed rightward. For k = m > 1 and L = {0}, we obtain almost the same result: the sym­

metrization of an open set A is the half-space

S({0},e)[A) = n ( e , * - 1 ( l - *>m{A})),

also directed rightward. The reader will have no difficulty in verifying the simplest properties of sym­

metrization.

PROPOSITION 10.1. Let S = 5(L, e) be a Gaussian symmetrization. Then the following properties hold.

(i) IfAcB, then S[A] c S[B] (monotonicity). (ii) / / Ai is an increasing sequence of open sets and A = [jAi, then S[A] =

\JiS[Ai] (lower continuity). (in) 5(L,e)[Ac] = 5(L, —e)[A]c (compatibility with the operation of taking the

complement). (iv) S[A] is invariant along the space (L + C(e))L, and S[A] + ce C S[A] for

c > 0 (the invariance along L-1-). (v) If h e L, then S[A + h] = S[A] + h. In particular, ifMcLisa linear

subspace and A is invariant along M, then S[A] is also invariant along M (the invariance along L).

(vi) If B is invariant under L-1, that is, B = B + L1-, then

pm{BnA} = pm{BnS[A]}.

In particular, Pm{A} = Pm{S[A]} (measure preservation).

The most remarkable property of symmetrization is that it reduces the surface of a set.

Page 75: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND THE ISOPERIMETRIC PROPERTY 59

Let Vr be the closed ball of radius r centered at the origin, and let Ar = A + Vr

be the r-neighborhood of a set A.

THEOREM 10.1. Let S = 5(L,e) be a k-symmetrization in R m . Then

(10.1) S[Ar] D S[A]r

for any closed set A and any r > 0.

The inclusion (10.1) is called the reduction of the surface and S is said to reduce the surface. This terminology becomes clear if we pass to the measure P m

in (10.1) and use the property of measure preservation under symmetrization. Then we obtain

(10.2) Pm{Ar} = Pm{S[Ar)} > Pm{S[A]r}, Pm{A} = Pm{S[A}}.

Hence, Pm{Ar\A}>Pm{S[A}r\S[A]},

that is, the Gaussian measure of the layer of width r around A is greater than that of the layer of the same width around S[A].

Let us outline the basic steps in the proof of Theorem 10.1. First, we prove (10.1) for the simplest symmetrizations (k = 1). Then we represent a fc-symmetrization via superpositions of 1-symmetrizations.

The inclusion (10.1) readily extends from the components of the superposition to its result.

In particular, we cover the case of m-symmetrization, which transforms each set into a half-space.

By applying inequality (10.2) to m-symmetrization, we obtain the famous isoperimetric property of a half-space with respect to the Gaussian measure: the measure of the r-neighborhood of a half-space is less than or equal to the measure of the r-neighborhood of any measurable set of the same measure as the half-space.

We present the properties of Gaussian symmetrizations according to the scheme shown in Figure 1, on the next page.

P R O O F OF THEOREM 10.1. We start with the case in which m = 1, L = {0}, e = 1, and A is a closed interval in R1 . Let q = P i {A} be the measure of A. Closed intervals of measure q form the one-parameter family At = [£,$(£)]> t G [—oo,$_1(l — q)]y where the function t —• s(t) is determined by the condition Px{At} = q. Let us minimize the measure qr(t) = P i{(A t )

r } with respect to t. Inequality (10.1) implies that qr attains its minimum values at the extreme values of £, namely, t\ = —oo and t<i = $ _ 1 (1 ~~ tf)> f° r which At becomes a ray. Note that for any t we have

S(L,-l)[At] = Atl, S(L,l)[At] = At2.

By differentiating qr with respect to t, we obtain

(10.3) «/r(*)=p(*)

where p is the standard density (1.6) with a = 0 and a = 1. A simple analysis of (10.3) shows that qr(t) is a unimodal function with maxi­

mum at the point to = — 5(to), which corresponds to the symmetric position of the

\p(s(t) + r) _ p(t - r)

p(s(t)) p(t)

Page 76: Local Properties of Distributions of Stochastic Functionals

60 GAUSSIAN FUNCTIONALS

Surface reduction for an interval

in R1

Convexity of the function

Surface reduction for a system of intervals

1 in Rl 1

Convexity preservation

by 1-symmetrization in R m

1 Surface reduction

for an open set in R1 J V

1 \ | Surface

reduction by 1-symmetrization

in R m

The 2-symmetrization in R m is the limit of superpositions

of 1-symmetrizations

Convexity preservation

by 2-symmetrization in R m

Surface reduction

by 1-symmetrization in R m

The 2-symmetrization in R m is the limit of superpositions

of 1-symmetrizations

Convexity preservation

by 2-symmetrization in R m

1 /

Surface reduction

y 2-symmetrization in R m

/

4r

1 Theorem 10.1 Surface reduction

by fc-symmetrization in R m

1 For fc>3, 1 the fc-symmetrization

can be represented as a superposition

of 2-symmetrizations

Convexity preservation

by fc-symmetrization in R m

1 Theorem 10.1 Surface reduction

by fc-symmetrization in R m

1 For fc>3, 1 the fc-symmetrization

can be represented as a superposition

of 2-symmetrizations

Convexity preservation

by fc-symmetrization in R m

I Ehrhard inequality

in R m

1 Ehrhard inequality in a Gaussian space

Convexity of the normed distribution

function of the convex functional $ _ 1 F

FIGURE 1

Page 77: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND THE ISOPERIMETRIC PROPERTY 61

segment Au and with minima at t\ and £2- Thus, we have proved formula (10.1) for a closed interval.

Now let {A{, 1 < i < n + 1 } be a family of disjoint segments numbered from the left to the right. By induction over n, we shall prove (10.1) for the set A = UI^ / A% and for both symmetrizations S+ = S'({0}, 1) and 5_ = S'({0}, —1) in R1 . We have already proved the base clause (n = 0).

Without loss of generality, we can assume that (Aj)r n (Ak)r = 0 for j ^ k\ otherwise, we can prove the inclusion (10.1) for shorter intervals, for which the left-hand side of (10.1) remains unchanged and the right-hand side is smaller.

We set J = |jr=2 Ai and replace the extreme intervals A\ and An+\ by the rays 5_[Ai] and 5+[^4n+i] of the same measure so that

n+l S-[A} = S-[{jAi]=S-[S4Al}UJUS+[An+1}],

S-[Ar] = S-[S-[{Ai)r] U / U S + [ ( A n + 1 ) r } ] .

Then we substitute the already known inequalities

s^Atf) D s-iAtY, s+[(An+1y} D s+[An+1]r

into (10.4), thus obtaining

(10.5) S-[Ar] D S- [5_[i4i]r U / U S+IA^Y].

Let I = S- [Ai]r U Jr U 5 + [A m + i ] r . The complement of / is a union of n closed intervals; therefore, by the induction assumption, the inclusion (10.1) is satisfied for Ic, and we obtain

S. [5-[i4i] U J U 5 + M = S. [ ( ( / c r ) C ]

= 5+[(/c)r]Cc(5+[^DC = (5_[/])-r. Thus,

S- [S. M U J U 5 + [Am + 1]] r C S- [I]. By comparing this inclusion with (10.4) and (10.5), we obtain

S-[Ar] D S-[I] D S- [5_[i4i] U J U 5 + [A m + 1 ] ] r = 5-[i4] r.

Thus, we have proved (10.1) for unions of finitely many intervals in R 1 (obvi­ously, it makes no difference whether the intervals are open or closed). Since any open set in R 1 is a union of a countably many intervals, we can use the fact that the symmetrization is continuous (Proposition 10.1 (ii)) to pass in (10.1) from finite systems of open intervals to any open set in R1 .

To consider an arbitrary closed set A, we use the following standard argument. Let V® be the open ball of radius e. Then

A=f](A + V% Ar=f)(A + V°ey,

(10.6) £>° S[Ar) = f] S[(A + 2?°)1 Df]S[A + V°e]

r = S[A}r. e>0 e>0

Now let us prove (10.1) for 1-symmetrization in R m . Let e G R m be a unit vector, L = £ { e } x a hyperplane, and S = 5(L,e) the corresponding symmetriza­tion.

Page 78: Local Properties of Distributions of Stochastic Functionals

62 3. GAUSSIAN FUNCTIONALS

For x G L by Rx = {y : y = x + pe, p G R 1 } we denote the one-dimensional affine subspace in which the symmetrization is performed. We verify (10.1) in these subspaces, that is, for each x we prove the inclusion

(10.7) S[Ar]nRxDS[A]rnRx.

We rewrite the left-hand side as S[Ar n Rx] and the right-hand side as

S[A]r n Rx = | J (S[A] nRy)rnRx.

Each term in the union can be represented as the one-dimensional neighborhood

(io.8) (S[A] n Ry)r nRx = s[A n Ry] + (vr n Rx-y).

By applying the one-dimensional inclusion (10.1) to the set A fl Ry + x — y C Rx, we obtain

S[(A + i?y)r n Rx] D s[A n Ry] + (pr n i?x_y).

Comparing this expression with (10.8), we get

S[(A n Ry)r n Rx] D (S[A\ n Ry)

r n i?x.

By taking the union over all y G L, we obtain

5 [^ r n^] DS[A]rnRx,

which proves (10.7). Thus, we have shown that 1-symmetrization reduces surfaces in R m .

PROBLEM 10.1. Let k < m. Assume that fc-symmetrization reduces surfaces in Hk. Prove that then /c-symmetrization also reduces surfaces in R m . (For k = 1, this statement has just been proved; the argument can be transferred literally to the case k > 1.)

Now we must study 2-symmetrizations. We represent them as a limit of super­positions of 1-symmetrizations. We start with the space R2 .

Let {e^}, j = 0 ,1 ,2 , . . . , be the sequence of unit vectors in R2 determined by the following relations:

eo = (0,l); ei = (l,0); (e,-, e^-i + e0) = 0; ( e j , e 0 ) < 0 .

Then en —> — eo and for j > 1, the angle between ej and — eo is half the angle between ej_i and eo.

Let Sj = S(C{ej}-L^ej) be the 1-symmetrization in R2 corresponding to ej. We set Qj = Sj o Sj-i o • • • o So-

LEMMA 10.1. For any j > 0, t, tf e R+, a closed set A c R2 , and x e Qj[A], we have

(10.9) x + teo+tej €Qj[A].

Page 79: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND T H E I S O P E R I M E T R I C P R O P E R T Y 63

PROOF. We proceed by induction on j . For j = 0, the statement of the lemma follows from the invariance of symmetrization (see Proposition 10.1 (iv)). Let us pass from j to j + 1. We write hj = ej + eo. For r > 0, we construct the segment

A?> = {y '• V = Are0 + (1 - A)rej, 0 < A < l } .

Consider the line Ra = {y \ y = ahj + pe^+i, p G R 1 } , which is one of the subspaces in which the symmetrization Sj+i is performed. The line Ra contains the set Qj[A] fl Ra obtained as a result of the previous symmetrizations. From all points of this set, we draw cones with generators eo, e^, thus obtaining the new set

B = {y : y = x + te0 + i!e$\ x G Qj[A] n Ra; t,t' G R+} C Qj[A].

Let us find the traces of B on the lines Rp with 0 > a. The vectors eo and ej form the same angles with ej+i. Therefore, B fl Rp is a translated neighborhood of the original set Qj[A] fl Ra:

Bf]Rp = (Qj[A] n Ra) + Ajtr + (/? - a)/ij.

By applying the inclusion (10.1) to the one-dimensional set B fl Rp inside i?/?, we obtain

Sj+i[B D #0] D 5 j + i [Qj-[i4] D i2a + (/? - aj/ij] + Aj>r

= {u : u = x + te0 + ^e^; x G Sj+i [Qj[A] D i2a]; t, i! G R+} fl ify.

Now we take the union over /3 > a, thus obtaining the inclusion

Sj+![B] D {y 11/ = x + te0 + ^e,-; x G 5[Oj[i4] n Ra]\ t,t'e R + } .

In particular, for x G 5j+i [Qj[^4] H iiQ] = Qj+i[i4] fl i2a we have

y = x + te0 + ^ej G 5 i + i [5 ] C 5 i + i o Q5[A] = Qj+1 [A], D

Lemma 10.1 has the following geometric interpretation: for each x G QjfA], the set Qj[A] contains the cone with vertex x and generators eo and ej. Since the angle between eo and ej tends to 7r, we see that for large j the set Qj[A] looks like a half-plane whose boundary is orthogonal to e\. This is just the half-plane obtained by applying the 2-symmetrization Q = 5({0},ei) to A. The relation

Q[A] = lim Qj[A] j - > o o

suggests that it is possible to pass from 1-symmetrization to 2-symmetrization. The following lemma contains the precise statement.

LEMMA 10.2. Let Qj, Q be the above-defined transformations of subsets o/R2 . Then for any e G (0,1/2) there exists a large number p(e) such that

(10.10) lim Qj [A] nVr = Q[A] n Vr j^oo

for r > p(e) uniformly with respect to a closed set A with e < P2{A} < 1 — e.

The convergence of sets is understood in the Hausdorff metric

d(A, B) = sup cGR 2

inf \c — b\ — inf \c — a\ beB' ' aeA' '

Page 80: Local Properties of Distributions of Stochastic Functionals

64 3. GAUSSIAN FUNCTIONALS

PROBLEM 10.2. Derive this lemma from the previous lemma.

By using the inclusion (10.1), which has already been proved for Sj, we obtain

QMV = Sj [Qj-i[A]Y C Sj [Qj-i[AY]] C - c S , o V i » - o S0[Ar] = Qj[Ar].

By passing to the limit with respect to j and using (10.10), we arrive at the inclusion

Q[A]r C Q[Ar).

This proves (10.1) for 2-symmetrizations in R2 .

PROBLEM 10.3. Prove (10.1) for an arbitrary 2-symmetrization in R m . Use the result of Problem 10.1.

The following two lemmas allow us to pass from low- to arbitrary-order sym-metrizations.

LEMMA 10.3. Let M i , M2, and M 3 be pairwise orthogonal subspaces o / R m . Set Li = Mi + M 2 and L2 = M 2 + M3. Then the Gaussian symmetrizations Si = 5(Li,e) and S2 = S(L2 ,e) satisfy

SioS2 = S2oS1 = S(M2 ,e) .

PROOF. Le tH = (Mi+M 2 +M 3 +£{e}) - L . The invariance of symmetrizations (Proposition 10.1 (iv)) implies that

Si [A] = Si [A] + (Li + £{e}) x = Si [A] + M 3 + H,

S 2[A]=S 2[^] + M i + H

for any closed set A. Since Mi C Li, it follows from Proposition 10.1 (v) that

SioS2[A] = SioS2[A] + M1.

Consequently,

Si o S2[A] = Si o S2[A] + M 3 + H = (Si o S2[A] + Mi) + M 3 + H

= Si o S2[A] + (Mi + M 3 + H) = Si o S2[A] + (M2 + £{e})x .

Thus, the set SioS2[^4] is invariant with respect to (M 2 +£{e}) ± . By definition, the same is true of S(M 2 , e)[A]. Moreover, both sets are invariant under translations along e (see Proposition 10.1 (iv)). Therefore, the intersection of either set with the affine subspace N x = x + M f is a half-space. Let us compare the measures of these half-spaces. By Proposition 10.1 (vi), we have

Vk{S1 oS2[A] n N x } = Pfc{5i[i4] n N , } = P f e { A n N x }

= Pk{S(M2,e)[A)nNx}, A; = d imN x .

Since the measures coincide, it follows that the half-spaces also coincide,

Si o S2[A] n N x = 5(M 2 , e)[A) n N x .

By taking the union over x € M2, we obtain

S1oS2[A} = S{M2,e)[A}. D

Page 81: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND THE ISOPERIMETRIC PROPERTY 65

LEMMA 10.4. Let Q = 5(L, e) be a k-symmetrization in R m for m > 3 and k>2. Then there exist 2-symmetrizations Qi, Q2,..., Qk-i such that

(10.11) Q = QioQ2o.'oQk_1.

PROOF. Let he (L + ^{e})1- be a unit vector. We set Li = (C{h} + ^{e})1-and L2 = L + C{h}. Then the assumptions of Lemma 10.3 are satisfied for Mi = (L + C{h} + C{e})^, M 2 = L, and M 3 = £{h}. Hence,

g = 5 (L i , e )o5(L 2 , e ) .

Note that Q\ = 5(Li,e) is a 2-symmetrization, whereas 5(L2,e) is a (A: — ^-sym­metrization.

In a similar way, we can represent S(Jj2,e) as a superposition, etc. At each stage, we extract a new factor Qj so that the order of the remaining symmetrization decreases until it becomes equal to 2. •

Now we can prove Theorem 10.1. We have already proved this theorem for all symmetrizations in R 1 and R2 , as well as for fc-symmetrizations in R m , k = 1,2 (see Problem 10.3). Let us consider the symmetrization Q of order k > 3 and represent it in the form (10.11). Then we have

Q[Ar] = Qi o Q2 o • • • o Qk_, [Ar] D Q i o . - o Qfc_2 [Qfc_! [A]r]

D • • o Oi [(Q2 o • • • o Qk-MW] 3 Qi [Q2 o . •. o Q,_1[,4]]r = (Q[A})r.

The proof of Theorem 10.1 is complete. •

Convexity and the isoperimetric property of the Gaussian measure on Rm .

THEOREM 10.2 (isoperimetric inequality). Let P m be the standard Gaussian measure on R m , and let A c R m be a Borel subset. Then

(10.12) 4T 1 (Pm{Ar}) > $ " 1 (Pm{A}) + r

for any r > 0.

PROOF. First, suppose that A is closed. We take any m-symmetrization S and apply Theorem 10.1 in the form (10.2). Then the sets S[A] and 5[^4]r are half-spaces, and since the symmetrization preserves the measure, we have

Pm{S[A]} = Pm{A}.

Hence, Pm{S[A]r} = * ( * - ' ( P B { i l } ) + r ) .

We substitute this relation into (10.2) and obtain (10.12), since $ is monotone. To prove (10.12) for an arbitrary Borel subset A) we use the regularity of the

measure P m and approximate A by closed sets Ai C ^4, Pm{Ai} —> P m {A}. By applying (10.12) to Ai and taking the limit as i —> 00, we prove the theorem. •

THEOREM 10.3 (convexity is preserved under symmetrization). Let A be an open or closed subset of R m , and let S be a Gaussian symmetrization in R m . / / A is convex, then so is S[A].

Page 82: Local Properties of Distributions of Stochastic Functionals

66 3. GAUSSIAN FUNCTIONALS

PROOF. First, we consider the 1-symmetrization S = 5(L, e), where e is a unit vector in R m and L = C{e}±. For x G L, we set Rx = x + C{e}. The set A D Rx

is a ray or a segment, and the set S[A] C\ Rx is a ray. We must verify that

(10.13) S[A] n R1XlHl_l)X2 D 7(S[A] n RX1) + (1 - 7)(5[A] n RX2)

for xi,a;2 G L and 7 G [0,1]. This inclusion is equivalent to the convexity of S[A\. Let us introduce some notation for the related segments and rays. Let

A n RXj = Xj + [aj, bj\e, j = 1,2,

S[A] fl RXj = Xj + [CJ, oo]e, i = 1,2.

By the definition of symmetrization, we have $(bj) — $(dj) = 1 — 3>(CJ).

In this notation, (10.13) is reduced to the following inequality for the function

no 14) *"* 7 & 1 + ^ " 7 ^ " ^ 7 a i + ^ ~ 7^a2^ ~ ~ 7 l C l ~ ^ ~ 7^C2

= 7 $ _ 1 ($(&i) - *(ai)) + (1 - 7 ) ^ _ 1 ( * ( M - *(aa)).

In turn, (10.14) is an immediate consequence of the following simple lemma, which we present without proof.

LEMMA 10.5. The function g{a,b) = $-1(<I>(6) — $(a)) i>s concave in the do­main {a < b}.

Thus, we have proved Theorem 10.3 for 1-symmetrizations. For 2-symmetriza-tions, we prove the theorem by passing to the limit, as described in Lemma 10.2. Finally, for higher-order symmetrizations, the convexity preservation follows from Lemma 10.4. •

The following theorem gives precise meaning to the statement that the Gaussian measure is concave.

THEOREM 10.4 (the Ehrhard inequality). Suppose that P m is the standard Gaussian measure on R m , A, B C R m are nonempty convex subsets, and 7 6 [0,1]. Then

(10.15) r ^ P ^ K l - ^ } ) > 7^-1(Pm{^4}) + (1 - 7 ) ^ _ 1 ( P - W ) .

PROOF. In R m + 1 , consider the subsets A! = A x {1} and B' = B x {0}. The set

C={ye R m + 1 I y = pa + (1 - /?)&; aeA\beB\(3e [0,1]} is concave, and then

C n (R m x {7}) = (<yA + (1 - i)B) x {7}.

First, assume that A and B are open. Consider the ra-symmetrization S(C{e}) /i), where e = (0 ,0 , . . . , 0,1) G R m + 1

and h is a unit vector orthogonal to e. By Theorem 10.3, the convexity of the set S[C] is equivalent to the concavity of the function

/ ( 7 ) = S " 1 (Pm{C n (R™ x {7})}) = 4T 1 (PmbA + (1 - 7)B})

on the interval [0,1]. Hence,

/ ( 7 ) > 7 / ( l ) + ( l - 7 ) / ( 0 ) ,

Page 83: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND THE ISOPERIMETRIC PROPERTY 67

which coincides with the desired inequality (10.15). Now let A and B be arbitrary convex sets. Then we apply our theorem to

their interiors A0 and B° and use the fact that P m {A 0 } = Pm{A} and P m { £ 0 } = Pm{B}. An arbitrary convex set need not be a Borel set, but it necessarily belongs to the complement of 05 with respect to the measure P m , so that all measures involved in (10.15) are well defined. •

Let us discuss how one can extend the statements of Theorems 10.2 and 10.4 to the case of an arbitrary Gaussian measure on R m .

PROBLEM 10.4. Let P be a Gaussian measure on R m with mean value a and correlation operator K: R m —• R m . Then P is the image of the standard Gaussian measure P m under the afBne transformation

G(x) = a + Kx'2x.

To generalize the isoperimetric inequality (10.12), we give a new interpretation of the notion of a neighborhood of a set. Whereas for the standard Gaussian measure we have Ar = A+Vr, for an arbitrary measure P we introduce the ellipsoid of concentration Ellr = Kxl2Vr and set Ar = A + Ellr. The new definition of a neighborhood is related to the transformation G by

(G{A))r = G{A) + Ellr = G(A) + Kx'2Vr = a + Kl'2{A + Vr) = G(Ar).

With regard to the result of Problem 10.4, we can reformulate the isoperimetric inequality as follows.

THEOREM 10.2'. Suppose that P is a Gaussian measure on Rm, A e 93 (R m ) , and r > 0. Then

(10.16) $ _ 1 (P{A + Ellr}) > $ - 1 (P{^}) + r.

Since G preserves convexity and commutes with the summation of sets, we obtain the following modification of the Ehrhard inequality.

THEOREM 10.4'. Suppose that P is a Gaussian measure on R m , 4 , 5 c R m

are convex subsets, and 7 € [0,1]. Then

(10.17) $~l (P{ 7A + (1 - -y)B}) > 7 $ - 1 (P{^4» + (1 - 7 ) $ - 1 (P{#}) •

Convexity and the isoperimetric property of a Gaussian measure on an infinite-dimensional space. To extend the above results to Gaussian measures on an arbitrary linear space X, we shall use the following construction.

Let {fi}iZi be a countable set of functional from X7. We define a projection F m : X -> R m by setting Fm(x) = { ( x , / i ) , . . . , <z , / m )} .

If A C X is a compact set, then Fm(A) is compact in R m , and hence, the set Am = F^lFm{A) C X is a cylinder, Am G £0.

We set Aoo = f1m=i Am-

PROPOSITION 10.2. Let E = a{fi} be the minimal a-algebra with respect to which all functionals fi are measurable. Then A^ is the smallest element of E that contains A. Moreover, A^ = A + /C, where K = D S i ^ e r /*•

Page 84: Local Properties of Distributions of Stochastic Functionals

68 3. GAUSSIAN FUNCTIONALS

PROOF. First, note that the sets Am form a decreasing sequence and each of them contains A. Therefore, A^ D A. Each element of E is invariant under the addition of /C. Thus, A^ = A^ + /C D A + /C. Let us prove the opposite inclusion.

Let x G AQQ. By the definition of Am and Aoo, there exist elements xm G A such that Fm(xm) = Fm(x). In other words, for i < m we have (xm,fi) = (x,fi). Consider the limit points of the sequence xm. Since A is compact, it follows that there exists at least one limit point, I. Then for any i and e > 0, there exists an m > i such that

whence it follows that

\(x-lJi)\ = \(xm-lJi)\<e.

Therefore,

(z> fi) = {Uh), x-lelC, x = l + {x-l)eA + !C.

We see that A^ = A + K. Finally, if C e E and A c C, then

C = C + /CDA + /C = Aoo. •

In this proposition, we did not consider the measure. It turns out that by an appropriate choice of the system of functionals, one can ensure that the measures of A and A^ coincide.

PROPOSITION 10.3. Suppose that A c X is a compact set and P is a measure defined on the cylinder a-algebra (£. Then P*{A} = P{Aoo} for some countable set {fi} C X' . IfP can be extended to a Radon measure on 93x, then P{A} = P{Aoo}.

P R O O F . By the definition of the outer measure P*, we have

P*{A} = inf {P{£} , E e Co, A c E}.

Let us choose Ej so that A C Ej and

P*{A} < P{Ej} + 1/j.

We set oo

E=f]Ej. i= i

Then A c E and P*{A} = P*{E} = P{E}. Since E G £, we have E G E = a{ft} for some countable set of functionals {fi}. By Proposition 10.2, we have A^ C E. Therefore,

P * { A } < P * { A o o } = P { A o o } < P { £ } = P * { A } .

It follows that P*{A} = P{Aoo}. If P admits a Radon extension, then P = P* on compact sets. Thus, P{A} =

P*{A} = P{A00}. D

Now we are in a position to state and prove infinite-dimensional analogs of Theorems 10.2 and 10.4. In what follows, P G £o(X) is a centered Radon Gaussian measure defined on € = £(X).

Page 85: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND THE ISOPERIMETRIC PROPERTY 69

To generalize the isoperimetric inequality (10.16), we must generalize the ellip­soid of concentration in Theorem 10.2' to the infinite-dimensional case. We present two definitions and prove that they are equivalent.

DEFINITION 1. The set

Ell = p| {xex\ \{x,x')\ < i } c x {x /GX / |EP(-,rc /)2<l}

is called the ellipsoid of concentration of the measure P .

DEFINITION 2. The set Ell/ = / ( P i ) , where the operator / : X'p -» X is defined in Proposition 7.3 and V\ is the unit ball in the Hilbert space Xp, is called the ellipsoid of concentration of the measure P .

PROPOSITION 10.4. The ellipsoid Ell/ is compact and Ell/ = Ell.

PROOF. Since P is a Radon measure, it follows that Xp is separable. Hence, Vi is compact in the weak topology of Xp . The operator / is weakly continuous; hence Ell/ = I(V\) is compact in X.

Let us show that Ell/ = Ell. Let x = Iz, z eVi. Then

\(x,a/)\ = \{Iz,x')\ = \(z,I*x')\ < |*| | / V | < Ej/2(. ,z ')2

for any x' G X, and therefore, x G Ell. Thus, we have shown that Ell/ C Ell. Conversely, let x G X \ Ell/. We show that x G X \ Ell. We separate the compact set Ell/ from the point x by a continuous functional x' G X' so that (x,xf) > 1. However, for z G Vx we have Iz G Ell/ and (Iz,x') < 1. Set z = J V / | / V | G Vx. Then

| / V | = (I*x\I*x')/\I*x'\ = ( s , l V ) = (Iz,xf) < 1.

Hence, x G {y \ \(y,x')\ > 1} C X \ Ell. The proof is complete. •

For A G 05, we set Ar = A + r Ell.

THEOREM 10.5 (the infinite-dimensional isoperimetric inequality). Let P G £(X), A G S3(X), and r > 0. Then

(10.18) S - H P * ^ } ) > S - ^ P ^ } ) + r.

PROOF. We shall successively verify (10.18) for the cases in which A is a cylin­der, a compact set, and an arbitrary Borel set.

The proof is based on the fact that a finite-dimensional projection takes the ellipsoid of concentration of the measure P to the ellipsoid of concentration of the projection of P . Let T = {/i,..., / n , . . . , fm} C X' be a set of functional such that / i , . . . , fn are orthonormal and / n + i , . . . , / m are degenerate with respect to P ; furthermore, let F: X —> R m be the projection generated by T. Next, let P = P F " 1 be the projection of P , and let Ell C R m be the ellipsoid of concentration o fP :

n

MI = {c e Rm | Y,cf < i; cn+i = • • • = cm = o}-

Page 86: Local Properties of Distributions of Stochastic Functionals

70 3. GAUSSIAN FUNCTIONALS

LEMMA 10.6. Let A G <8(Rm), and let A = F " 1 ^ ) G Co(X) be a cylinder. Then

(10.19) F(rEll) = rElI,

(10.20) F(A + r Ell) = A + r Ell,

(10.21) i4 + r Ell = F _ 1 ( i + r l ) .

PROOF. Suppose that x G rEll; then (x,fn+j) = (7z,/n+j) = (z,I*fn+j) = 0, since the / n +j are degenerate for j > 1. Furthermore,

2 = 1 2 = 1 2 = 1

<N-|E(*,/i>/i|<r(El(x,/i>|2)1/

2 = 1 2 = 1

It follows that 53SLi \(x>fi)\2 < r2, and hence, F{x) G rEll and F(rEll) C rEll. Conversely, let c G rEll. We set x = J27=ic^^*fi ^ -^- ®ne c a n re^dily verify that x G rEll and F(x) = c. Thus, rEll C .F(rEll), which completes the proof of (10.19).

Since the projection F is linear, we obtain (10.20) as follows:

F( A + r Ell) = F( A) + F(r Ell) = A + r Ell.

It follows that ^ + r E l l c F - 1 ( A + rEiI).

Let us prove the opposite inclusion. Let x G F~l{A + rEll). Then for some I G A and d G rEll, we have F(x) = F(6) + F(d). Thus, F (z - d) = F(l) G A. Therefore, x — d £ A and x = ( x - d ) + d G A + r Ell. The proof of Lemma 10.6 is complete. •

Now let us derive inequality (10.18) for A being the cylinder from Lemma 10.6. By applying Theorem 10.2' to P and A, we obtain

^ ( P ^ + rEU}) > ^ - 1 ( P { i } ) + r .

Now by (10.21) we have the estimates

P{A + rEll} = P F " 1 ^ + rEll} = P{A + rMI} > $($-l(P{A}) + r)

= ^ ( r ^ p r ^ i j j + r) = $($-l{P{A}) + r).

Since $ is monotone, this inequality is equivalent to the statement of the theorem. Now let A be a compact set. We shall use the construction of Propositions 10.2

and 10.3, that is, construct a system of functional such that P{Aoo} = P{A} and P{(A + r Ell)oo} = P{A + r Ell}. (The latter relation can be ensured, since the set Ell is compact by Proposition 10.4, and hence, so is A + r Ell.)

By Proposition 10.2, we have

Aoo + r Ell = (A + K) + r Ell = (A + r Ell) +K = {A + r Ell)*, = f]{A + r Ell)m.

Page 87: Local Properties of Distributions of Stochastic Functionals

§10. CONVEXITY AND THE ISOPERIMETRIC PROPERTY 71

Obviously, (A + r Ell)m = Am + r Ell. An application of inequality (10.18) to the cylinder Am yields

^ ( P f t A + rEll)™}) = $ - 1 ( P { ^ l m + r E l l } ) > * - 1 ( P { i 4 m } ) + r .

By letting m —» oo, we obtain

^ ( P K A + rEUJoo}) > S-^PtAoo}) + r .

Now the theorem (inequality (10.18)) follows from Proposition 10.3. Finally, let A be an arbitrary Borel set. We choose compact sets An c A so

that P{^4n} —> P{^4}. By applying the isoperimetric inequality to An and then passing to the limit, we prove the theorem for A. •

THEOREM 10.6 (the infinite-dimensional Ehrhard inequality). Suppose that P G (?(X), i , 5 G 2J(X), 7 G [0,1], and A and B are convex sets. Then

(10.22) ^" 1 (P*{ 7 A + (1 -i)B}) > 7*"1(P{i4}) + (1 - 7 ) $ _ 1 ( P { £ } ) .

PROOF. Let C = jA + (1 - i)B. We choose compact sets A' c A and B' <zB so that P{.A'} > P{A} - £ and P{B'} > P{B} - e. Then C" = 7 A' + (1 - j)B' is also a compact set.

We use Proposition 10.3 and choose a system of functionals T = {fi} C X7

such that P{A'} = P{A,OQ}, P { £ ' } = P { B ^ } and P{C"} = P { C ^ } . Without

loss of generality, we can assume that the functionals fi are P-orthogonal and their variances are either 0 or 1 (this can be achieved by a linear change of the fi).

By applying Theorem 10.4 to the finite-dimensional compact sets F^A') and Fm(Bf) and the Gaussian measure P i ^ 1 , we obtain

*- 1 (P{Cj n}) = * - 1 (PF- 1 {F m (C" )} )

= S" 1 {PF-^jF^A) + (1 - 7 ) i W ) »

> 7 * - 1 ( P F - 1 { ^ ( i 4 ' ) } ) + (1 - ^ ( P F - ^ i ? ' ) } )

= ( P K J ) + (1 - ^ ( P ^ } ) . By letting m —> oo, we obtain

* _ 1 ( P { ^ } ) > T ^ - ^ P K o o } ) + (1 - ^ ^ ( P ^ o o } ) -

Taking into account the fact that A^ B'^, and C ^ have the same measure as A\ B\ and C", respectively, we obtain

S - ^ P f C ' } ) > 7 *- 1 (P{ i4 / }) + (1 - 7 )*" 1 (P{B / }) .

The substitution of the definitions of A', B\ and C yields

S" 1 (P*{<?» > $ _ 1 (P{<?» > 7 * " 1 (P{A} - e) + (1 - 7 ) $ - 1 (P{#} - e).

Letting e —* 0, we arrive at (10.22). •

The following corollary of Theorem 10.6 is often used in applications.

Page 88: Local Properties of Distributions of Stochastic Functionals

72 3. GAUSSIAN FUNCTIONALS

COROLLARY (Anderson's inequality). Let C G 5J(X) be a convex centrally sym­metric set, and let P G £o(X). Then

P{C}>P{C + l}

for any I G X.

PROOF. We set A = C + Z, B = C - I, and 7 = 0.5 and use Theorem 10.6. Then the set

1A + (l-1)B = 1C + (l-7)C + l{21-l) = (C + C)/2 = C

satisfies the estimate

S - ^ P I C } ) > (^(PiA}) + $ " 1 ( P { £ } ) ) / 2 .

Since C is symmetric and P is centered, we have

B = -A, P{B} = P{A}, fc-1 ( P { C » > S" 1 {P{A}). •

§11. Convex functionals and their distributions

A functional / : X —> (—00, +00] is said to be convex if

/ ( / to + (1 - 0)y) < pf(x) + (1 - /3)/(y)

for any x, y G X and /? G [0,1]. If the opposite inequality holds and the range of / is [-00, +00), then we obtain

the definition of a concave functional. The multiplication by —1 takes convex functionals to concave functionals and vice versa; therefore, in what follows we consider only convex functionals.

EXAMPLE 11.1. Let {ft}teT be a family of linear functionals defined on X, and let {et}teT be a family of real numbers. Then the functional

(11.1) f(x) = sap{ft{x) + et} teT

is convex.

Let us show that under very general topological assumptions on X and / , a convex functional / can be represented in the form (11.1). Recall that a functional / : X —> R is said to be lower semicontinuous if

f(x) < Urn f(y)

for all x G X. Equivalently, one can require all the sets {x G X | f(x) < r} to be closed. We

denote the class of convex lower semicontinuous functionals on X by V(X).

PROPOSITION 11.1. Let (X, X7) be a pair of spaces in duality, and let f G V(X). Then there exists a family {(*,/*), ft £ X'} of continuous linear functionals and a family of numbers {et} such that

(11.2) / (x) = sup((x , / t ) + e<). teT

Page 89: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 73

PROOF. The set F = {(x,r) \ f(x) < r} c X x R 1 is convex and closed, since / is convex and lower semicontinuous. Let T = X x R 1 \ F. By the Hahn-Banach theorem, for any point t = (xt, rt) G T, there exists a functional (/t°, e£) G X ' x R 1

that separates t from F. This means that e® < 0 and

(11.3) l<(x t , / ° )+ne? ,

but

for all x G X. We set

/t = - r f / e ? , et = (e?)"1. Then

( x , / t > + c t < / ( x ) for all x G X . Hence,

/ ( z ) >sup( (x , / t ) + e t). teT

On the other hand, let us choose an x G X and consider a sequence tn = (#, r n ) , where rn —> / (#) . Then (11.3) yields

(z , / t n ) + e*n >^n.

Therefore, sup((x,/,) + e*)> n E ( ( x , / t n ) + e t J > / ( x ) . D t€T n->oo

One can readily verify the converse statement: if a functional / admits the representation (11.2), then / G V(X).

Note that, generally speaking, the class V(X) depends on the topology with respect to which the semicontinuity of functionals is defined. However, (11.2) shows that for all topologies compatible with the duality (X,X') , the class V(X) is the same.

It is often convenient to have a representation (11.2) with a countable indexing set T. The following result shows that this is possible if we neglect a set of measure zero. If a representation (11.2) is given, then we write

f(T']x) = sup((xJt) + et) teT'

for V C T.

PROPOSITION 11.2. Suppose that P G GoPQ, f G V(X), and a representation (11.2) is given. Then there exists a countable subset 7\ c T such that f(x) = f{T\\x) for P-almost all x G X .

PROOF. First, let us consider the case when miter &t > — oo> and assume that P { / < oo} > 0. Then

(11.4) P{sup(x , / t ) < o o } > 0 . teT

It is well known that under condition (11.4) there exist a set Xo C X of full measure and a metric d on the set T such that the space (T, d) is separable and the function (#, / . ) : T —• R 1 is continuous on (T,d) for i G X o [114, 115, 155]. We replace d by the stronger measure di(£i,£2) = d(^i>^2) + \eti -~et2\- Then the space

Page 90: Local Properties of Distributions of Stochastic Functionals

74 3. GAUSSIAN FUNCTIONALS

(T, d\) remains separable, but the functions {x) /.) + e. are continuous with respect to d\.

Let T\ be a countable d\-dense subset of T. We choose x G Xo, t G T, and a sequence {tn} C Ti such that di(tn,t) —> 0. Then

/ ( T i ; x ) > lim «x, / t n> +c t n ) = (x,/ t> + e t. n—>oo

By taking the upper bound with respect to £, we obtain the desired inequality

/(T i ;x)>/(r )a ;) = /(x).

Now let us consider the general case, allowing inf*GT et = — oo. Let Tn = {t G T\et> -n} and fM(x) = f{Tn\x). In T n , we choose countable subsets Tf such that /<n)(-) = / (Tf ; - ) , P-almost surely. Let 7\ = U n

T f • Then

f(x) = sup/<*>(*), / ( T i ; x ) = sup / ( ?? ;* ) n n

for any x G X , which implies that f(x) = / (Xi;z) , P-almost surely. It remains to consider the degenerate case P { / < oo} = 0. Since P is a Radon

measure, it follows that for any M > 0 and e > 0 there exists a compact set A C {x | / (#) > M} such that P{A} > 1 — e. By choosing a finite subcover of the cover of A by the open sets {x \ (x,ft) + et > M}, we obtain a finite set T ' c T such that

P { max ((£, ft) + et) > M} > P{A} > 1 - e.

This proves our proposition. •

The dis t r ibut ion of a convex functional. Let P G £(X) and / G V(X). The following results characterize the distribution of / with respect to the measure P . Consider the distribution function

F(r) = P{x | f(x) < r } .

PROPOSITION 11.3. The function $>(r) = $ _ 1 (F ( r ) ) is concave.

PROOF. Let us choose n , r2 G R 1 and 7 G [0,1]. Set r3 = 77*1 + (1 - 7)r2 and Ai = {x G X I /(a;) < 7^}, a = 1,2,3. Then the sets Ai are closed and satisfy

A 3 D 7 i i + ( l - 7 ) 4

since / is convex. By Theorem 10.6, we obtain

S - ^ P M a } ) > 7 * _ 1 ( P M i } ) + (1 " 7)* _ 1 (P{^2}) ,

which is equivalent to the desired inequality

* ( 7 n + (1 - 7>2) > 7 * ( n ) + (1 - 7 ) ^ 2 ) . •

Page 91: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 75

THEOREM 11.1. The distribution function F has the following properties. (i) F is everywhere continuous except possibly at the separation point

(11.5) r0 = inf{r | F(r) > 0} = inf{r | P { / <r}> 0}.

(ii) F is absolutely continuous on the interval (r0 ,+oo). (iii) F is differentiate everywhere on (ro, +oo) except on an at most countable

exceptional set A C (ro, +oo). (iv) The derivative F' is positive and continuous everywhere on (ro,+oo) \ A;

it has one-sided limits and jumps downwards at each point r G A. (v) F' is the distribution density of the functional f, that is,

P{x | f{x) € A} = f F\r)\(dr) J A

for any Ae<B(B}). (vi) For any r > ro, the variation of F' on [r, oo) is bounded. In particular, F'

is bounded on [r, oo).

PROOF. The assertion of the theorem follows from the representation F = $ o \I>, where # is the concave function defined in Proposition 11.3. The derivation of (i)-(vi) from elementary properties of convex functions on R 1 is left to the reader. •

The following examples show that the unpleasant "singularities" allowed by Theorem 11.1 can indeed occur. Specifically, F can have a jump at ro; F1 can have jumps on a countable set A; F' can be unbounded in any neighborhood of ro and hence need not be a function of bounded variation on (ro, +oo).

EXAMPLE 11.2. Let rjn be independent A/*(0, l)-distributed random variables, an = (21nn + 2 In Inn ) - 1 / 2 , and £n = anr\n. Then

(11.6) P { s u p f n < o o } = l,

(11.7) P{supfn = l } > 0 .

Indeed, simple calculations show that the sum

OO OO • Ayf \ °°

S(M) = £ P { £ n > M} = 5 3 (1 - * ( - ) ~ J2(lnn)-^2(n\nn)-M2

n = l n = l ^ W n / / n = l

is finite for M > 1 and infinite for 0 < M < 1. Now it follows from the Borel-Cantelli lemma that the event {£n > 1} almost

surely occurs only finitely many times. Thus, (11.6) holds. It also follows from the Borel-Cantelli lemma that P{sup£n > 1} = 1.

Note that for sufficiently large n > N we have

lnP{£n < 1} > - P { £ n > l}/2.

Thus,

P{sup£n < 1} = exp{ f > P { £ n ^ !>} > exp { ^ l n P { ^ n < 1} - ^ H > 0, n = l ^ n = l ^

which proves (11.7).

Page 92: Local Properties of Distributions of Stochastic Functionals

76 3. GAUSSIAN FUNCTIONALS

Let P be the Gaussian measure on R°° corresponding to the distribution of £n, and let / : R°° —» R be the functional denned by the formula f(x) = sup n # n . Then P and / satisfy the assumptions of Theorem 11.1. The distribution P / _ 1

has the separation point ro = 1 and an atom at the point ro. The same reasoning allows us to study the following situation.

EXAMPLE 11.3. Let rjn and an be taken from the previous example. Set Cn = rjn - a'1 =rjn- (21nn + 2 In Inn)1 /2 . Then

P { 0 < SUpCn < 0 0 } = 1, P { s u p C n = 0 } > 0.

Thus, the distribution of the functional / = sup £n has an atom and, in contrast with Example 11.2, all variables have the same variances.

PROBLEM 11.1. Let X = R1 , P = ^ ( 0 , 1 ) , and r0 6 R1 , and let A c (ro,+oo) be a countable set. Construct a convex function / : X —> R 1 such that ro is a separation point of the corresponding distribution function F and F' jumps downwards at each point of A and is unbounded in any neighborhood of ro.

THEOREM 11.2. Suppose thatP £ Go(X), f e V(X), and inf t G T{eJ > -oo in the representation (11.2). Moreover, let Ep|(*,/t)|2 = \I*ft\2 = &2 for some a > 0 and for all t e T. Then the distribution F = P / _ 1 is absolutely continuous and has the density

(11.8) F\r) = q{r) exp{-r2/2<72},

where <?(•) is a nondecreasing function on R 1 continuous everywhere except possibly for the separation point ro defined in (11.5).

PROOF. First, let us we prove the theorem for the case in which the indexing set T in the representation (11.2) is finite. Let T = {1,2 , . . . , n} . Then we have the explicit expression

n

F;(r) = ^ P { max((., / , ) + es) < r | (•, / t ) + et = r}p(r - e t),

where p is the Gaussian density (1.6) with parameters (0, a2). Let us show that each term in this sum has the form (11.3). Let t be fixed. Let kst = Ep(',/s)(*,/t)<r~2

be the correlation coefficient. Then for each s we have the representation

fs = kst ft +nS)

where I*ft and I*ns are orthogonal and, by virtue of the Gaussian property, inde­pendent. Therefore,

P { max((., f8) + es) < r \ (-, /t> + et = r }

= P |max(A; 5 t ( - , / s )+n s + es) < r | (-,/t) =r-etj

= P J max(ns + es + kst(r - et)) \ < r.

Since kst < 1, it follows that the last probability is monotone with respect to r, whence we obtain the representation (11.8) for a finite T. In particular, (11.8)

Page 93: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 77

implies the inequality

(11.9) q{r) < ( X / ^ C J - 1 ) - 1 (l - ${r/a))~l < oo.

If Ai = [^i, ^2] and A2 = [53,&i] are intervals in R 1 and 62 < #3, then from (11.8) we obtain the estimate

(11.10) * < £ > , < ^ A 2 > ^ ( 0 , < T 2 ) { A 1 } - ^ ( 0 , C T 2 ) { A 2 } -

Now suppose that the set T is countable. We represent it as the union of an expanding sequence of finite sets Tn. Then / ( T n ; x) —> f(x) for each x G X. Hence, we have the weak convergence of distributions

F n = P / - 1 ( T n ; . ) = ^ P / - 1 = F .

We have already proved the desired estimates for the distributions Fn. By (11.9), they are uniformly absolutely continuous on any ray (—00,r]. Hence, F is also an absolutely continuous distribution. By passing to the limit in the estimate (11.10) for F n , we obtain the estimate of the same form for F. By letting the lengths of the intervals A; tend to zero, we see that the density F has the form (11.8).

Finally, let us consider the general case. By Proposition (11.2), it can be reduced to the case of a finite T, which has already been considered.

It follows from (11.8) that F' cannot jump downwards. Theorem 11.1 (iv) prohibits other types of jumps for F'. Thus, F' is continuous.

The proof of Theorem 11.2 is complete. •

Let X = R1 , P = Af(0,1), and f(x) = x - l . Then

F'{r) = (27T)"1/2 exp{-(z + l ) 2 /2} .

Obviously, the conclusion of the theorem (about the monotonicity of q) does not hold, since the condition inf{e*} > 0 is violated. Theorem 11.2 can be generalized to the case e_ = inf{et} > —00. Instead of (11.8), here we can guarantee the representation

F'(r) = q{r)exp{ - (r - e_)2/2<j2}.

Let X = R1 , P = A/*(0,1), and f(x) = sup(z, -x) = \x\. Then F'(r) = lr r > 0 | (2/7r)1/2exp{—r2/2}. This example shows that F' may have a jump at the separation point ro-

It follows from Example 11.3 that we cannot generalize Theorem 11.2 to the case miteT{^t] = —00.

Uniqueness of the extremal functional. Let / G V(X) and let a represen­tation (11.2) be given. For x G X we construct the set of extremal indices

Mx = {teT\f(x) = (xift)+et}.

The set Mx may be empty, but more often Mx is a singleton if we identify P-a.s. coinciding functionals. More precisely, we have the following statement.

THEOREM 11.3. Let P G <?o(X) and f e V(X). Then there exists an X1 c X with P{A"i} = 1 such that I*fs = I*ft, (x,fs) = (x,ft), and es = et for any x G Xi and s,t € Mx.

Page 94: Local Properties of Distributions of Stochastic Functionals

78 3. GAUSSIAN FUNCTIONALS

P R O O F . Suppose that s.teT and I*f3 ^ I*ft. Then (zj*fa) < (z,I*ft) for some z G Xp. In Xp, we choose small neighborhoods V 3 I*f8 and Vt 3 I*ft such that

sup(;v) < inf (*,-)• vs

yt

We need the following auxiliary lemma. •

LEMMA 11.1. There exists a set Xs,t c X of full measure such that for each x G Xsj one of the sets Mx n I*~l{Vs) or Mx n I*~l(Vt) is empty.

PROOF. Consider the functionals

9s{x) = f({u G T | I*fu e Vs}-x), gt(x) = f({u G T | 7*/u € ^ } ; z ) .

Note that if for some x both sets mentioned in the lemma are not empty, then f(x) = 9s(x) = 9t(x)> Therefore, we can set Xsj = {x \ gs{x) ^ 9t{x)}- Let us prove that P{X8j} = 1.

We set I = Iz and consider the partition of X into lines parallel to the vector I. It suffices to verify that for any line 7 = {x \ x = #o+cZ, c G R 1 } the set jn(X\X8j) consists of at most one point. Let us choose 7 and a point X\ G 7 fl (X \ Xs,t). Let us verify that x + cl G XS)t for c > 0. Indeed, by the definition of X\ we have 9s(xi) = 9t(xi)> Using the definitions of 2, /, #s, and #*, we obtain

g8(xi +cl) <gs(xi) + c sup ((/,/„)) < # s(zi)+csup(2,-) < &(xi) + csup(*,-) i*Uevs vs vt

< 9t{xi) + c inf ((/, /M)) < gt(xi + d ) . *• Ju€ Vt

Thus, ^ (^1 + cl) > g8[x\ + c/), and our lemma is thereby proved. •

Let us continue the proof of Theorem 11.3. We have an open cover of the set of pairs {(I*f8,I*ft) I s,t G T, I*f8 ^ I*ft} by sets of the form V8 x V*. Since Xp is separable, it follows that this cover has a countable subcover; we denote the corresponding indices by sn and tn. Then we apply Lemma 11.1 and set X\ = iVp n p | n XSnitn. Let us prove that X\ is the desired set. By Lemma 11.1 and Proposition 7.4 (iii), we have P{Xi) = 1. If x G Xu s,t G MXi and I*f8 ^ 7*/t> then 7*/s G Vr

Sn and 7*/t G Vtn for some n, and it turns out that s G MxDl*~l(V8n) and t e M x n f _1(Vin). Thus, x G A'3ni tn. Moreover, x e Xi. This contradiction shows that I*fs = I*ft, that is, fs — ft G Ker7*. Since x G Np , it follows from Proposition 7.4 (i) that (x, / s) = (x , / t ) . Finally, by the definition of the set Mx

we have e8 = f(x) — (x, / s) = f(x) — (xyft) = et. The proof of the theorem is complete. •

COROLLARY. Suppose that X c R T is some space of trajectories, X ' = C{irt}, t G T (see Example 7.1), P G <?o(X), and f(x) = supteTx(t). Furthermore, suppose that P{x(t) = x(s)} = 0 for s ^ t G T. Then almost surely the equality x(t*) = sup i G T s(t) is attained at no more than one point t* G T.

We can strengthen Theorem 11.3 by considering a wider set of extremal indices M° D Mx, namely,

M° = {teT\ inf f({u I \I*fu - Fft\ < e},x) = f(x)}.

Then almost surely 7*/s = I*ft and (x, f8) = (x,ft) for s,t e M£, but we need not have e8 = et.

Page 95: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 79

PROBLEM 11.2. State and prove an analog of Theorem 11.3 for P G £(X).

The separat ion point and absolute continuity of dis tr ibut ions. Let P G £(X) and / G V(X). Let us study the absolute continuity of the distribution F = P / _ 1 . Theorem 11.1 shows that F <C A if and only if F does not have an atom at the separation point TQ defined in (11.5). In particular, if ro = —oo, then F <C A. Therefore, the location of the separation point is of particular interest.

If we assume that / is a continuous functional, then we can give a descriptive geometric interpretation of the separation point.

PROBLEM 11.3. Let P be a Radon (not necessarily Gaussian) measure on a topological space X, and let / : X —> R 1 be a functional continuous on supp(P). Then r0 = infxGsUpp(p) f{x).

There is a natural question: how to verify the continuity of a convex functional? The following problem shows that continuity is equivalent to local boundedness.

PROBLEM 11.4. Suppose that P G <?(X), / : N P -> (-oo,+oo] is a convex functional, No C Np is the set of points of continuity of / , and N = No n {x \ f(x) < oo}. Prove that

1. if V C X is open and sup / < oo, then V D Np C N\ vnNp

2. the set N is convex and open in Np; 3. if y G N P \ N, then inf / = +oo;

* X 2y-N 4. if P{x | f(x) < oo} = 1 and N is not empty, then / < oo on Np and

N = NP.

In connection with these problems, let us again consider the functional / and the Gaussian measure P G £o(R°°) from Example 11.2. In this case, we have Np = R°° and infNP / = —oo, but ro = 1. Therefore, in the statement of Problem 11.3 we cannot omit the assumption that / is continuous. The same P and / can be considered in the Banach space of bounded sequences equipped with the norm ||x|| = sup n |x n | . Then the functional / is continuous, but P is not a Radon measure.

There is one more way to find the points at which the distribution P / _ 1 may have an atom. Let / G V(X) and let a representation (11.2) be given. We addi­tionally assume that for P-almost all x G X there exists &t GT such that

(11.11) f{x) = (xjt) + et.

A typical example of this situation is given by the functional f(x) = sup t G T x{t) defined on the space of continuous functions C(T), where T is a compact set.

Set

To = {t G T | | /* / t |2 = EP |(., ft)\

2 - E&<.f ft) = 0}, ( 1 L 1 2 ) e° = sup(e, + E P ( . , / i » .

ter0

The following theorem shows that if there exists an atom of the distribution P / _ 1 , then this atom is at the point e°.

THEOREM 11.4. Suppose that P G <7(X), / G V(X), and f satisfies con­dition (11.11). Then the distribution P / _ 1 is concentrated on [e°,+oo] and is absolutely continuous on (e°,+oo).

Page 96: Local Properties of Distributions of Stochastic Functionals

80 3. GAUSSIAN FUNCTIONALS

PROOF. We assume that P G £o(X). For each t G To, we have

P / - 1 { ( - o o , e , ) } = P{x | f{x) < et} < P{x \ (xjt)+et < et} = 0.

Therefore,

P / - 1 { ( - o o ) e ° ) } = P / - 1 { ( J ( - < x > , e t ) } = 0 . ter0

Let us prove the absolute continuity. Let {zn} be a basis in Xp . We set T(n,m) = {teT\ \(znyI*ft)\ > m " 1 } . Obviously, T \ T0 C \Jn^T{n,m). By the assumptions of the theorem, we have

{x G N P | f(x) G A} = {x G N P | f(T\T0;x) G A} C \J{x \ f(T(n,m)\x) G A}

for any Borel set A C (e°,+oo). Therefore, it suffices to verify the absolute con­tinuity of the distribution of the functional /(T(n,ra);-) . For brevity, we assume that T = T(n,m) for some n and m. Set I = Izn. Let us estimate the partial derivative

DJ(x) = lim c"1 (f(x + cl) - f{x))

for any x G X . Let us choose a t satisfying condition (11.11). Then

f(x + cl) > (x + cl, / t ) + et = / (*) + c(/, /t> = / ( a ) + c(zn, /* f t) .

Similarly, f(x — cl) > f(x) — c(zn)I*ft). Therefore, if Dif(x) is defined, then

(11.13) | A / ( x ) | > | ( ^ > / * / t ) | > m - 1 .

Now let us consider the partition of X into lines 7 parallel to I. The restriction of / to each 7 is a convex function. By inequality (11.13), the situation in which this function has a closed interval of points of minimum rather than one point is impossible. Taking into account the absolute continuity of the conditional measures P 7 (see Theorem 7.2), we see that Proposition 4.4 can be applied to these measures, and therefore, P 7 / _ 1 <C A. By (3.4), it follows that P / " 1 < A. •

COROLLARY. Suppose that f satisfies condition (11.11) and e° < r$. Then Pf-1 « A.

To prove this, it suffices to compare Theorem 11.1 (i), (ii) with Theorem 11.4. Let us again consider Example 11.2, in which — 00 = e° < ro = 1 and the

assertions of Theorem 11.4 and its corollary are not satisfied since condition (11.11) is violated.

PROBLEM 11.5. Let X = C[0,1] and f(x) = suptG[01j x(t). Construct a mea­sure P G 0o (X) for which e° = r0 = 0 and P / - 1 { 0 } > 0.'

Theorem 11.4 suggests that it is degenerate linear functionals ft, t G T, that generate atoms in the distribution Pf-1. It is natural to assume that if To is empty, then P / - 1 <C A. Although this is not the case (see Examples 11.2 and 11.3), we have the following weaker result.

PROPOSITION 11.4. Let P e Go(X) and f e V(X). Suppose that we have a = infter \I*ft\ > 0 and e_ = inf^^e* > —00 in the representation (11.2). Then P / - 1 <C A, and the density F' = dPf~l/d\ is bounded.

Page 97: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 81

PROOF. Let us choose an r £ R 1 and set

P = inf (et - r)/at > ( - | e_ | - r)/a > -oo.

Then the functional

g(x) = sup ((x, ft/at) + (et - r)/at - p) ter

satisfies the assumptions of Theorem 11.2. Moreover,

{x | f(x) e [r - e, r + e}} = {x | sup((x, ft) + et - r)/a e [-e/a, e/a]}

C {x | g(x) e [-p - e/a, -p + e/a]}.

Taking (11.8) and (11.9) into account, we obtain the estimate

P / _ 1 { [ r - e, r + e]} < Pg-^-p - e/a, -p + e/a}} < 2ea-1(V2^)-1(l-$(-p + e/a)ylexp{-(\p\-£/a)2/2}

for e < a\p\. Therefore, the distribution F = P / _ 1 is absolutely continuous, and

n r ) s [ M l _ * ( ^ ) r ^ _ < n ± i ^ f } for almost all r G R1 . This estimate is uniform on all rays (—oo,ri]. By Theo­rem 11.1 (vi), F' is bounded on the entire axis. •

Example 11.3 shows that one cannot omit the condition e_ > —oo in the statement of the theorem.

Probabil i t ies of large values of a convex functional. In many applica­tions, it is important to know the behavior of the tail of the distribution of the stochastic functional P { / > r} as r —> oo. For a wide class of convex function­a l that "grow linearly at infinity," the tails of the corresponding distributions are similar to the "tails" of a Gaussian measure in R1 .

THEOREM 11.5. Let P € £o(X) and f £ V(X). Let e_ = i n f ^ e * > -oo and a2 = sup t G T \I*ft\2 > 0 in the representation (11.2). Suppose that P { / < oo} = 1. Then

(11.14) P{z | f(x) >r} = exp{-(r + df /2a2 + o{r)}

as r —> oo for some d G (—oo, —e_].

Let us clarify the statement of the theorem. The condition e_ > —oo excludes functional of "superlinear growth," for example, f(x) = x2 for X = R1 . The condition a > 0 excludes the degenerate case / = const.

The asymptotics (11.14) means that, neglecting a constant translation by d, the tail of the distribution of / corresponds to the tail of the distribution A/*(0, a2) , that is, to the tail of the distribution of the linear functional whose variance is the maximum of the variances of all functionals on the right-hand side in (11.2).

LEMMA 11.2. Let ^(r) = $ _ 1 ( F ( r ) ) . Then there exists a finite limit d = limr_^oo(cr^(r) - r ) .

Page 98: Local Properties of Distributions of Stochastic Functionals

82 3. GAUSSIAN FUNCTIONALS

PROOF. By Proposition 11.3, the function cr^(r) — r is concave. Therefore, the limit d exists. It suffices to prove that d ^ ±oo. First, let us obtain a lower bound. Suppose that po £ R 1 is chosen so that F(po) > 0, and hence, \I>(po) > —oo. We choose an arbitrary r > 0 and apply Theorem 10.5 to the set A = {x \ f(x) < po}, thus obtaining

^(P^A + rEll}) > S - ^ P ^ } ) + r = tf(po) + r.

Let us prove that A + r Ell C {x \ f(x) < po + fa}. Indeed,

f(y) = sup[{(xjt) + et)+r(ljt)] < / (*) + supr|(J,/ t>|

< p0 +sup r\(lzjt)\ < po + rsup(zJ*ft)

<Po + r\I*ft\<po + ra

for y = x + rl, x G A, and I = Iz e Ell. Therefore,

*(po + r<r) = $ _ 1 ( P { x | f(x) < p0 + w } ) > $ _ 1 ( P * M + r Ell}) > #(p0) + r.

By setting p = po + ra , we rewrite this inequality in the form

* ( p ) > * ( p o ) + ( p - p o ) M

or

(11.15) a#(p) - p > a^(po) - po-

We have proved that the function cr^(-) — • increases. Therefore, d > —oo. Now let us obtain an upper bound for d. For any t 6 T, we set of = Ep/ t

2. Then

F{r) = P{x | / (x) <r}< P{x \ ft(x) + et < r} = $ ( ( r - et)/<7t).

Therefore, *( r ) < inf(r - et)/at < (r - e_)/cr.

Hence, <j\I>(r) — r < —e_,

and it follows that d < —e_ < +oo. The proof is complete. •

COROLLARY (Borell's inequality). Let mF be the median of the distribution f; that is, let F(mF) = 0.5. Then

(11.16) F(mF + r) > * ( r / a )

for any r > 0.

P R O O F . It suffices to set po = mF and p = raF + r in (11.15), and note that * ( m F ) = 0. D

PROBLEM 11.6. Suppose that P n , n E N, are Gaussian measures with bary-center an in a Banach space (X, ||« ||). Prove that if P n => Poo, then \\an—a^W —> 0.

n (Inequality (11.16) allows us to cut off the parts of the measures P n that are far away from zero.)

Page 99: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 83

PROOF OF THEOREM 11.5. Taking into account the definition of the function $ and the result of Lemma 11.2, we obtain

P{x | f(x) > r} = 1 - *(*( r ) ) = exp{-# 2 ( r ) /2 + o{V{r))}

= exp{-(r + d)2/2a2 + o{r)}. D

Theorem 11.5 covers some particularly important special situations, two of which will be considered shortly.

PROPOSITION 11.5. Let A be a convex closed absorbing subset of X. Let I: X —• [0, oo] be the action functional defined in (7.17). Then

(11.17) P{X \ rA} = eXp{-J( .4)r2( l + o(l))/2}, 1(A) = ^ 1 ( 1 )

as r —> oo.

PROOF. Consider the Minkowski functional

fix) = inffr I x/r G A}. r>0

Since A is convex, so is / . Since A is closed, / is lower semicontinuous. Let TTA C X' be the polar of the set ^4,

7TA = {fa e X ' I SUp(x,/a) < 1}.

xeA

Then the representation (11.2) of / takes the form f{x)= sup {xja).

faenA

By applying Theorem 11.5 to / and taking into account that {x | f(x) > r} = X \ r\A, we obtain

P{X \ rA} = exp{-(r + d)2 /2a2 + o(r)},

where a2 = sup /aG7rA \I*fa\2. It remains to express a in terms of the action functional. Consider the set A' = I~1(A)i which is a convex closed absorbing subset of

Xp , and the set K'A = {u G Xp | s\xpzeA,(u,z) < 1}, which is the polar of A'. Then, by the definition of the action functional and the polar, we have

1{A) = inf 1 (0 = inf \z\2 = ( sup \u\)~2.

On the other hand, by the definition of a and by the relation I*(ITA)~ = 7r^, we obtain

a2 = sup \I*fa\2 = sup \u\2.

fatKA U^^A

Therefore, a~2 = X(J4), and the proposition is proved. •

Note that if the set A is convex and closed but not absorbing (for example, a half-space) or closed and absorbing but not convex, then (11.17) need not hold.

Page 100: Local Properties of Distributions of Stochastic Functionals

84 3. GAUSSIAN FUNCTIONALS

PROPOSITION 11.6. Suppose that (X, || • ||) is a Banach space, X ' is the dual space o /X, P E GoPQ, and K is the correlation operator of the measure P . Then

P{x | ||a:|| >r} = exp{- r 2 /2a 2 ( l + o(l))}

as r —> oo, where

a2= sup EP(;f)* = \\K\\. {/ex'|||/||=i}

PROOF. The functional in question (the norm) admits the following represen­tation of the type of (11.2):

||x|| = sup (xjt). Il/tll=l hex'

Therefore, Theorem 11.5 readily implies the desired asymptotics. We only need to justify the relation between K and a:

pT|| = sup \\Kf\\ = sup <#/,<?) = sup E p ( . , / ) ( . ^ ) = sup E P | ( . , / ) | 2 . • ll/ll=i ||/||=i ||/||=i ll/ll=i

11*11=1 IIPII=I

Estimates of the distribution density. Let p be the density of the standard Gaussian distribution A/*(0,1). Elementary calculations show that for r > 0 one has

(11.18) p(r) = r[l - Qir^/Sir),

where

dv. S(r) = JQ e x p | - ^ - ^ j

The function S is monotone increasing from 0 to 1. In what follows, we derive a relation, similar to (11.18), between the density and

the distribution function of a convex functional. By combining this relation with various estimates for the distribution function (for example, with Theorem 11.5), one can obtain lower and upper bounds for the density.

Let P , / , or, and d satisfy the assumptions of Theorem 11.5, and let F and mF

be, respectively, the distribution function and the median of the distribution P / _ 1 . Just as in Lemma 11.2, we set \I>(r) = ^~1(F(r)).

THEOREM 11.6. For any r > mF one has the estimates

(1U9) * ( r ) £ - y i < F'(r) < <r + <W-FM aS(V(r)) ~ w ~ a 2 ( r - m i ? ) 5 ( * ( r ) ) '

In particular,

(11.20) lira F'(r)/r[l - F(r)] = a'2.

Page 101: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 85

PROOF. We treat the distribution P / _ 1 as the image of the measure W(0,1) under the convex mapping Q = \J/_1 = F _ 1 $ . Since Q is convex, it follows from Lemma 11.2 that

(11.21) (r - mF)/9(r) = (Q(*(r)) - Q(0))/*(r) < Q'(*(r))

< lim Q'(0) = a. (3—»oo

We substitute (11.21) and (11.18) into the equation

J ^ ( r ) = p ( * ( r ) ) / Q , ( * ( r ) )

and obtain tt(r)[l-*Qg(r))] , tt2(r)[l-$(tt(r))]

* S ( * (r)) " ^ ; " (r- roF)5(tf (r)) *

Recall that $(\I/(r)) = F(r) and that Lemma 11.2 implies the inequality \l>(r) < (r + d)/cr. After the substitution, we obtain (11.19). The limit relation (11.20) follows from (11.19) and Lemma 11.2. •

For some r, the function F may have only one-sided derivatives. They also satisfy the estimates (11.19).

We point out that one can estimate the density only for sufficiently large values of the argument (r > mF). It turns out that near the separation point r$ the density may be either very small or very large. The corresponding examples are given in the following section.

By combining the right-hand side of (11.19) with BorelPs inequality (11.16) and the representation (11.18), we obtain the following estimate.

COROLLARY. For r > mF, one has

(11.22) F'(r)< {T+fFr exp{ (r — mFY\J2/KG \

For example, if f is an affine functional, then d equality.

Appendix

1. Suppose that P G £o(X), / G V(X) is the functional defined by (11.2), and {gt = (., ft) + e t ; t G T} is the family of affine functional that defines / . The limit

6(g) = lim sup (gs - gs'){x), £^°s)s

feT(g,e)

where T(g,e) = {s G T \ Ep(gs—g)2 < £2}, is called the Ito-Nisio oscillation [140] of the family {gt} and the affine functional g.

If the set T(p, e) is empty for small £, then we set 8(g) = —oo. It is well known that 6(g) coincides with some constant [140]; thus, the oscil­

lation can be treated as a nonnegative number rather than a random variable. Suppose that P { / < oo} = 1, e_ = miter et > - co , and the family {gt} forms

a convex set in X'. Then we have the representation

(r - mF)2 1

2 < T 2 J*

—mF and (11.22) becomes an

r0 = sup [r + 6(rl)/2] rGR1

Page 102: Local Properties of Distributions of Stochastic Functionals

86 3. GAUSSIAN FUNCTIONALS

for the separation point of the distribution of / . In particular, if et = 0, then for r ^ O w e have 6(rl) = —oo, and hence,

r0 = S(0)/2.

Example 11.3 shows that we cannot omit the condition e_ > — oo, since in this case 6(rl) = —oo for all r, whereas r$ = 0.

The convexity of {gt} is essentially not a restriction, since in (11.2) we can always replace this family by its convex hull. The formulas for recalculating the oscillations in terms of the convex hull can be found in [5]; see also [54, 66].

In terms of oscillations, one can sharpen the estimate for the tail of the distri­bution of / given in Theorem 11.5. Namely, under the assumptions of this theorem, we have

d = sup{EP£ - % ) / 2 ; g G Xj>, DPg = a2}.

In particular, if et = 0, then

d = - - sup {%), \g\=a}.

Thus, the parameter d is determined by the oscillation on the functionals of maximum variance (see [48]; previously, an estimate for d in terms of oscillations was proved in [162]).

2. Suppose that P G £o(X), / G V(X), and the assumptions of Theorem 11.4 are satisfied. We additionally assume that e° < e~ = miter ^t (for example, et = 0). Consider the set

U = {ft/\I*ft\,t€T\To}cX'.

The distribution F = P / - 1 is absolutely continuous if and only if at least one of the following three conditions is satisfied:

(a) P{ sup (x, x') = oo} = 1; x'eu

(b) the closed convex hull of the set I*U C Xp contains zero;

(c) sup(e* - e-)/at = +oo. ter

The following examples completely clarify the role of each of these conditions. L e t X = C[0,l].

A. Let f(x) = supx(£) and let P be the distribution of a Wiener process. Only condition (a) is satisfied. The absolute continuity of F is due to the large amplitude of oscillations of the process near the singular point To = {0}.

B. Let f(x) = supx(t) and let P be the distribution of a continuously differen­tiable process & for which E£t = 0, £o = £i = 0, and £Q = f i ¥" 0- Only condition (b) is satisfied.

C. Let f(x) = sup(x(£) + y/t) and let P be the distribution of a continuously differentiable process £t for which E& = 0 and £o = 0 (see Lifshits [59], where some additional information about the absolute continuity of F without the assumption that e° < e~ is given).

3. Let P G </(X) and let f(x) = sup t e T (x , / t ) G V(X), P { / < oo} = 1. We introduce the following notation:

• K(sit) = Ep(-,/s)(•,/*) is the correlation function;

Page 103: Local Properties of Distributions of Stochastic Functionals

§11. CONVEX FUNCTIONALS AND THEIR DISTRIBUTIONS 87

• q(s) = sap{\K(8,t)/K(a,8)-l\y, teT

• 9t = ft- rw ' ,fs is the prediction error of ft for a given / s ; K{s,s)

• ps(-) is the distribution density of / s , that is, the Gaussian density with parameters (0,K(s,s)).

If for some s e T we have q(s) < 0.5, then the density F' of the distribution F = P / _ 1 satisfies the estimate

F'(r) < (l-qWr'Eripsdr-supgtXl + qis))-1)). T

The condition q(s) < 0.5 implies that

miK(t,t)>K(s,s)/4>0.

Therefore, this estimate is a sharper version of Proposition 11.4 (Weber [164]).

4. Suppose that P G £o(X) and B is a subset of the Hilbert space Xp of measurable functionals. Let us define some entropy characteristics of B. By N(B, e) we denote the cardinality of the minimal e-net for B. The function H(B,e) = In N(B,e) is called the metric entropy. The expression

* ( £ , « ) = / H1/2{Bye)de Jo /o

is called the Dudley integral. The function

* ( B , p, S) = sup V(B H {y | \y - z\ < p}, 5) 2€X' p

can be used as a local entropy characteristic. Consider a convex functional /(•) = sup£GT(-, ft). We set B = {/*/*> t G T } c

Xp and seek the entropy bounds for the tail of the distribution of / . If

( 7 2 = s u p | r / t | 2 = s u p E P / , 2 > 0 teT teT

and # ( £ , a) < oo, then for r > 0 we have

P { / > r } < l - $ ( ( r V - 2 X i ( r ) ) 1 / 2 ) ,

where X l ( r ) = mfp>0{4V2a-2*(B,p,p/2)r + H(B,p)}. If a < 1, then for r > 4-v/2*(S, 1) + 1 we have the estimate

2

P { / >r}< 4.8exp { - r- + X 2 ( r ) } ,

where X a(r) = infp>0{2/o2r2 + 4%/2#(£,p,p)r + H(B,p) + Inp} [46-48].

A similar estimate can be written out for the density of the distribution F = P / _ 1 . Specifically, if r > 2 + 19* (B, 1) and * ( B , r" 1 / 2 ) < y/2, then

F ' (r) < y l r 1 / 2 * 1 / 2 ^ , ^ 1 ^ ) ^ + l l n ^ r - 1 / 2 ) ! ] 1 / 2

2

x exp { - y + 6>/2¥(J3,T--1/2^ + 9* 2 (B, 1)},

where A is an absolute constant (Lifshits [64]).

Page 104: Local Properties of Distributions of Stochastic Functionals

88 3. GAUSSIAN FUNCTIONALS

§12. Distribution of the norm

Throughout this section, (X, || • ||) is a Banach space, S and V are the unit sphere and the unit closed ball in X, respectively, a is the barycenter of a Gaussian Radon measure P G G(X), f(x) = ||z||, and F = P / _ 1 is the distribution of the norm with respect to P . Since the norm is a convex functional, we can use the results obtained in §11 to study F. However, the properties of the distribution of the norm are very important in proving limit theorems [8, 9, 11, 84]; this is why we pay special attention to this case. In particular, estimates of the density F1 and the higher derivatives of F , if they exist, are of interest.

The absolute continuity of F.

PROPOSITION 12.1. Suppose that the measure P is not concentrated at a single point. Then each of the following three conditions is sufficient for the absolute continuity of F with respect to A G R1:

(i) OGNp; (ii) a G H p ;

(iii) the space X is strictly convex, that is, each point x G S is an extreme point ofV.

PROOF. If 0 G N p or X is strongly convex, then there exists at most one point x G N p such that ||#|| = infy€NP IMI- By Theorem 11.1 (ii) and Problem 11.3, we see that to verify that F <C A, it suffices to prove that P{x} = 0. By assumption, we have P{x} < 1. Since P{Np} = 1, it follows that there exists a point y G N p \ { X } . Let g G X7 be a functional that separates the points x and y. Since x,y G Np and (x,g) ^ (y,<7), we see that the distribution of g is nondegenerate, that is, P 0 - 1 =N{(a,g))a

2)) a > 0. In particular, P{x} < P{^"1(^(x))} = 0. If a G H p , then 0 G H p = Hp + a = Np; thus, the second condition implies

the first. •

This statement need not hold if P is Gaussian but is not a Radon measure (see Example 11.2 and the comment on Problem 11.4).

If 0 G Np and strong convexity is not assumed, then, generally speaking, Theorem 11.1 provides the strongest possible statement about the distribution of the norm. This is confirmed by the following result.

PROPOSITION 12.2. Suppose that X = R2 , P = Af(0,1) x A/*(1,0) G G(X), tp: R 1 —> (0, +oo) is a convex even function, and

lim tp'(t) G (0, +oo), lim (tp(t) - tV'{t)) G [0, +oo). t—>+oo t—»+oo

Then there exists a norm £/(•) on X such that PC/ - 1 = A/*(0,1)<^-1.

PROOF. We define the unit ball of the norm U by

Elementary calculations show that V\j is symmetric and convex. We set U(x) = inf{r > 0,2 G rVu}- Then U is a norm, and U(x) = (p(t) for x = (t) 1). •

Page 105: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION OF THE NORM 89

PROBLEM 12.1. Use Proposition 12.2 to construct a norm [/(•) on R 2 such that the distribution of U has an atom at the separation point and the density of U is unbounded at that point and has jumps at count ably many points. Find a complete description of the class of distributions {PC/ - 1 , U is a norm on R 2 } .

PROBLEM 12.2. Let X be a finite-dimensional normed space. Prove that a necessary and sufficient condition for F <C A is that

sup{||p||,tf e N P n [x + eV)} > inf{||»||,y € N P }

for x e Np and e > 0. Construct a measure and a norm on l2 which prove that this condition is only

necessary but not sufficient for absolute continuity in the infinite-dimensional case. (Hint: the unit ball of the new norm can be taken in the form {x e l2 | x = 72/ + (1 - i)zy 0 < 7 < 1, 2/1 = 0, ZT Vl < 1. I*i I = 1. £ ~ nzl < 1}; furthermore, one can set Np = {x e l2 | x\ = 1}.)

The boundedness of F'. By Theorem 11.1 (vi), Fl is uniformly bounded on R-1 \ [ 0) o + £)• However, to estimate the rate of convergence in the central limit theorem, we would like to have a uniform estimate of F' on R1 .

PROPOSITION 12.3. Suppose that (X, || • ||) is a finite-dimensional space and P{a} = 0. If 0 e Np , then the density F1 of the distribution F is uniformly bounded.

PROOF. Without loss of generality, we can assume that Np = X. Since 0 is the separation point of F , it suffices to verify that F' is bounded on the interval [0,1]. It follows from the condition Np = X that the density of the measure P with respect to the Lebesgue measure Am, m = dimX, is bounded. By passing to the radial system of coordinates, we obtain

where \x\ is the Euclidean norm on X and | 5 m _ i | is the surface area of the unit Euclidean sphere in X. This estimate shows that sup[01] F' < oo, and moreover, l im r_0 F'{r) = 0 for m > 1. ' •

The most typical cases covered by this result are centered (a = 0) and nondegen-erate (X = Np) measures. For the case of noncentered measures, see Problem 12.1.

In the infinite-dimensional case, the condition 0 € Np is no longer sufficient for F' to be bounded. Here we must impose additional conditions either on the measure (essentially, these conditions imply closeness to the finite-dimensional case) or on the norm (by requiring the norm to be smooth and sufficiently "uniformly convex"). However, these additional conditions mostly serve to exclude pathological cases rather than prevent one from considering any specific measures arising in applications.

An example in which F' is unbounded. We shall construct an example of a Gaussian measure for which the distribution density F' is unbounded near zero (other forms of unboundedness are prohibited by Theorem 11.1 (vi). First, we prove the following elementary lemma, which contains the key point of our construction.

Page 106: Local Properties of Distributions of Stochastic Functionals

90 3. GAUSSIAN FUNCTIONALS

LEMMA 12.1. Suppose that £&, k < n, are independent A/'(0,1)-distributed random variables, pn is the distribution density of the random variable Mn = sup fc<n |&|, 0 6 [0,1], andr = (21nn - lnlnn + 0 ) 1 / 2 . Then

Pn(r)>0.131n 1 / 2 n and P{M n > 2r} < 0.321n3/2n- n " 3

for n > 3.

PROOF. We set 3>+(r) = P{|£i| < r} = 2$(r) — 1. The following formulas are obvious:

P{Mn <r} = c^(r), Pn(r) = n^r)^1^),

&+(r) = 2$'(r) = (21nn/7r)1/2n-1exp{-6>/2} )

K'Hr) = [1 - (1 - M r ) ) ] " " 1 > exp{-2(l - * + ( r ) ) ( n - 1)}.

Moreover,

(1 - $+(r))(n - 1) < 2&(p)n/p < max (21nn/7r(21nn - lnlnn) 1 / 2) < 0.625

for p = (2 Inn - In Inn)1 /2 . Therefore,

Pn(r) > (21nn/7r)1/2exp{-0/2 - 1,25} > 0.13(lnn)1/2.

Furthermore,

P{M n > 2r} < P{M n > 2p} < nP{|&| > 2p} = n( l - *+(2p))

< n*'(2p)/p = n(n" 4 In2 n) / (2Inn - l n l n n ) 1 / 2 ^ ) 1 / 2

< ln 3 / 2 n- n _ 3 m a x (lnn/(27r(21nn - ln lnn))) 1 / 2

< 0 . 3 2 1 n 3 / 2 n - n - 3 . D

REMARK. The sharper estimates in [22] show that

P{(M n - an)/bn < r} -> exp( -exp( - r ) )

as n —> oo. Here

an = (21n(2n))1/2 - (lnln(2n) +ln(47r))/(2(21n(2n))1/2),

6n = (21n(2n))"1/2.

Therefore,

suppn(r) ~ 6~1sup{exp(—exp(—r))exp(—r)} = e~1(21n(2n))1/2

r>0 r>0

as n —> oo.

Now let us construct the above-mentioned example. Consider a family of inde­pendent random variables ^ m , l < m < o o , l < f e < n m < o o . Suppose that £/cim

is A/ O, a^J-distributed. We specify the lengths of the series nm and the variances a^ later. We write Mm = maxfc |ffejm| and M = sup fcm |& i m | = sup m M m . Let F be the distribution function and F' the distribution density of the random vari­able M (by Proposition 12.1, the density exists), and let Fm be the distribution density Mm.

Page 107: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION O F T H E NORM 91

PROPOSITION 12.4. There exist sequences am, ym —> 0, and n m —> oo such that P{limm M m = 0} = 1 and jF'(ym) -> oo.

PROOF. We proceed by induction. Set

<T\ = 1, n\ = 3, yi = a\ (2 In n\ — ln lnni) 1 / 2 .

Suppose that o^ nj, and yj are already defined for j <m. We set

Vm = 41"myi

and choose n m so large that

nm > max{n m _i + 1, exp (P{sup Mq < y m / 2 } - 1 ) } . q<m

Finally, we define 0m = ym(21nnm - l n lnn m )~ 1 / 2 .

The construction is complete. Now let us prove the desired properties. By Lem­ma 12.1, we have

P { M m > 2 y m } < 0 . 3 2 1 n 3 / 2

Tim y^m)

Therefore, ^2mP{Mm > 2ym} < oo. Since ym —> 0, we have Mm -» 0 almost surely.

Now let us estimate F'. Since the Mm are independent, we have

F'(ym) > P{sup Mq < ym/2}Fm(ym)P{sup Mq < ym/2}. q<m q>m

By construction, P{sup Mq < ym/2} > ( lnn m ) " 1 .

q<m

By Lemma 12.1, one has

KiiVm) > 0.13 l n 1 / 2 n m • a" 1 = 0.13 ln 1 / 2 n m (2 I n n m - In I n g 1 ' 2 ^ .

Using the second assertion of the lemma, we obtain

P{sup Mq > ym/2} < J2 V{Md > 22/J <*>m q>m

< 0.32 ^ ( l n 3 / 2 n q ) n q3 < 0 . 3 2 ^ n " 2 < 0.13.

q>m n>3

Therefore, ~P{supq>mMq < ym/2} > 0.87. By combining all these estimates, we obtain

F'{ym) > 0.13- 0.87[(21nnm - l n l n n m ) / l n n m ] 1 / 2 y - 1 > O.ly"1 ^ oo. D

Let us arrange all the variables {£fc,m} in a single sequence. Let P be the dis­tribution of this sequence in the space Co- Proposition 12.4 shows that P G £/o(co)> and if f(x) = max*; \xk\ is the norm on Co, then the density of the distribution P / _ 1 is unbounded.

Sufficient conditions for the distribution density of the maximum of a Gaussian sequence to be bounded provide a natural addition to the last example.

Let £&, 1 < k < oo, be a sequence of independent random variables with distribution A/*(0, cr^), a^ > 0. By N(e) we denote the number of indices k for

Page 108: Local Properties of Distributions of Stochastic Functionals

92 3. GAUSSIAN FUNCTIONALS

which ak > e. We introduce the metric entropy H(e) = ln7V(£) and the Dudley integral V{a) = f° Hl/2{e)de.

PROPOSITION 12.5. Suppose that for small a the Dudley integral is finite and

(12.1) lim a"99(a) = 0

for some 0 > 0. Then the distribution density of the random variable M = supfc | ^ | is uniformly bounded on R 1 .

Note that condition (12.1) is not too restrictive. Indeed, for this condition to hold, it suffices to have the estimate H(a) = o(cr2e~2), which, in turn, is close to a necessary condition for M to be finite.

PROOF. Suppose that F is the distribution function of M and ro is the sepa­ration point of F. Since the Dudley integral is finite, it readily follows that ro = 0. By Theorem 11.1 (vi), it suffices to examine the behavior of the density F'{r) as r ->0.

Let us choose small numbers r and A and estimate the probability of the event that M belongs to the interval V = [r — A,r]. To this end, we divide the positive integers into three parts:

Ti = {fe | ak > r 1 ^ } ; T2 = {k\ake [rA\r1/2]}, where Ax = 6~l (by the definition of the Dudley integral, we have A\ > 1); T3 = {k\ak<rA^}.

Consider the corresponding maxima Mi = supfcGT. |£fc|. Then

(12.2) P { M eV}< P{Mi eV} + P{M2 G V}P{M1 < r} + P{M 3 G V}.

Let us estimate each summand. We have

P{M1 € V} < £ P{ |&| 6 V; |fc| < r, t* k) feeTi

< N^1'2) max.Af(0 ( ak){V U -V} • (M(0, ak){[-r, r}}) /cETi

2 „ , wo, wo. / 2 i/oN^7,172)-1

N(r^2)-l

Therefore,

P{Mi eV}< iV(r1 /2)22"w( r l / 2)A < max n22~nA = 2A n>0

for r < 0.25. In a similar way, we obtain the inequality

(12.3) P{Mj <r}< r ^ 1 ' 2 ) / 2 .

We divide the most important region T2 once more into very narrow regions so that each of them contains variables with almost equal variations. Namely, we have T2 = i m , m , where

T2<m = {k \ak e [r1 /2(l + r ^ ~ * V 1 / 2 ( l + r y l 8 ) 1 - m ] } ,

m = 1,2,... ,m m a x ( r ) , and A2 = 2A\ + 1.

Page 109: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION OF THE NORM 93

One can readily obtain the following power-law estimate of the number of new regions:

m m a x ( r ) < 2 y l 1 | l n r | r - A 2 .

Therefore, in view of (12.3), we have

P { M 2 G F } P { M i < r } < 2 A 1 | l n r | r i V ( r l / 2 ) / 2 - A 2 m a x P ( M 2 j m = sup |&| G y ) . m l fcGT2>m >

To estimate the distribution density of M2,m, we need the converse of Lem­ma 12.1.

LEMMA 12.2. Let £&, 1 < k < n, be independent random Affinal)-distributed variables such that a < o^ < (1 + ln~ n)a. Then the distribution density PM of M = max/c |£fc| satisfies the estimate

SWPPM(V) < H0a~1ln1^2n. v>0

We omit the elementary proof and only note that for r < min(l; \I>(1)~2) this lemma can be applied to M2,m, since the number of the & in T2,m has the bound

n < N(rAl) < exp { # 2 ( l ) r " 2 A l } ,

and the difference between the maximum and minimum variations for the variables occurring in that region does not exceed

1 + rM < 1 + In"1 n.

By Lemma 12.2, we have

P{M2 G V}P{M1 <r}< 220A1\\nr\rN^/2^2-6A^2A.

Now let us examine the last summand in (12.2). Consider the functional / = M3. Let i*3 be is its distribution function. To estimate the density, we apply the estimate (11.22). In our case, the parameters of the functional satisfy the estimates d = 0, 771F3 < 8\ /2#(rA l) , and a<rAl. By condition (12.1), we have

mFz < 8\ /2*(rA l) < 8^(0.001)(rA l)* < O.lr

for small r. Substituting the bounds for d, m ^ , and a into (11.22) and additionally assuming that A < O.lr, we obtain

for any v G V. Now that all terms in (12.2) have been estimated, we obtain

Ihn" sup P{V}/A. • r —° 0<A<0.1r

Let us extend the above result to the case of a Gaussian process (we have proved the proposition for a sequence of independent random variables). Consider an arbitrary family of centered Gaussian variables {£t,£ G T}. We set ^(£1,^2) =

(Ete,-^)2)1/2-The function d is called the natural semimetric on T. Without loss of generality,

one can assume that (T, d) is a complete metric space, that is, to distinct t there

Page 110: Local Properties of Distributions of Stochastic Functionals

94 3. GAUSSIAN FUNCTIONALS

correspond distinct & and the family {&} is closed under passage to the limit. We introduce the following entropy characteristics:

N(e) is the minimum number of balls of radius e in the space (T, d) which cover T;

H(e) = \nN(e) is the metric entropy; 9(e) = JQ # 1 / 2 ( O - ) da is the Dudley integral. If 9(e) < oo, then (T, d) is a compact space. This is just the case that we

shall consider. Along with the entropy, we need one more characteristic, which is specially adapted for estimating the Gaussian measure of a small ball. Let S be a set in a Hilbert space. For e > 0 we define the e-dimension v(e) of S as the maximum length of a chain £ i , . . . , &/ of elements of E such that |£i| > e and the distance from & to £({£i , . . . ,&-i}) is at least e.

Suppose that P is the distribution of the field £, that is, the Gaussian measure on the Banach space of continuous functions X = C(T, d) with respect to which each coordinate functional 7rtl,t2

: X —> R2 has the same distribution as the vector ( & I J & 2 ) - Obviously, P G £o(X). In the Hilbert space L 2 (X,P) , we consider the set E of one-dimensional coordinate functionals 7rt: x(-) —» x(t). The ^-dimension u(e) of S has a simple probability meaning. Namely, u(e) is the maximum length of a chain of time instants t\,..., tu that satisfy the "renewal" condition

D(6J6 l,...,6<_1)>e2. The left-hand side (the conditional variation) is nonrandom for a Gaussian

field. It is the squared error of the linear prediction of £ti horn known values of the preceding elements of the chain.

The following theorem deals with the distribution of the random variables M = suPteT 161- Obviously, this distribution coincides (for a reasonable definition of M) with the distribution of the norm ||a;|| = sup t G T \x(t)\, x G X, with respect to the measure P . As usual, let F(r) = P{x \ \\x\\ < r} be the distribution density of the norm.

THEOREM 12.1. Suppose that either conditions (i) and (ii), or conditions (hi) and (iv) are satisfied:

(i) iim 9(a)a~e < oo for some 0 > 0; <j—>0

(ii) lim In u(a)/u(a^) = 0 for some C > 0; (7—>0

(in) lim N(a)ae < oo for some 8 > 0;

(iv) lim v(p) > 0. a—»0

Then the density F' is uniformly bounded on R 1 except possibly a set of measure zero.

COROLLARY. Let T C R n . Suppose that the field £t is not identically zero and satisfies the Holder condition

d(tut2)<c\tl-t2\a

for some c, a > 0, and all t\,t2 G T. Then the distribution density of the random variable M is bounded.

PROOF. The assumptions of the corollary imply that conditions (iii) and (iv) are satisfied. •

Page 111: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION OF THE NORM 95

P R O O F OF THE THEOREM. We only outline this proof. We need to verify the assumptions of Theorem 9.1 for the situation in question. To this end, we construct the corresponding sets and directions. We set an = 2~n and divide T into the "belts" Tn = {t G T \ an > E ^ > crn + i}. Then we cover each belt by finitely many "caps" Tn^-

We construct the caps as follows. Let £ i , . . . , £I/(<T„/3) be the sequence occurring in the definition of v = v(an/3). We choose an orthonormal system {en^,k < v} in the space L n c L 2 (X,P) generated by coordinate functionals 7r t l , . . . , 7rt|/. For k G [v + l,2i/], we set en^ = —£n,k-v> For k G [l,2i/], the cap Tn^ is defined by the relation

Tn,k = {tGTn\ ECtCn,fc > Vn/ZV^}. Let us construct the families of sets {E} and {B} and of admissible translations

{/} considered in Theorem 9.1. We define an admissible translation ln^ G X by setting ln,k{t) = E&ent/c.

We set Bn,k = {x e X. \ max|x(-)| = max|x(*)|}. The set Bnjk must be

extended so that the sections of the extended set by lines parallel to the vector ln^ will be sufficiently long segments. Let z b e a small number. We set

En,k = {yeX\y = x + csign(x)Zn,fc, x G £n,/c, 0 < c < < } ,

where sign(x) is the sign of the function x at the point of maximum modulus. We leave it to the reader to verify that the sum (9.5) is finite. •

The boundedness of F' in a uniformly convex space. The function

S(p) = 1 - sup{||x + 2/||/2; ||*|| = ||y|| = 1, ||x - y\\ > p}, 0 < p < 2,

is called the modulus of uniform convexity of the norm || • ||. If 8{p) > 0 for all p > 0, then the space (X, || • ||) is said to be 6-uniformly

convex. We need some geometric properties of a uniformly convex space which show that the norm varies sufficiently fast along any straight line in X.

LEMMA 12.3. Let xo and I be vectors in a 6-uniformly convex space X. Let (p(c) = \\xo + cl\\. Suppose that infcGRi (p(c) = <p(0). Then

(12.4) \c\< rtcWH-^l - * ) ,

(i2.5) v(C)\>m(i-mMi-m

for all c ell1. Here 6*{u) = inf{p | 6(p) > u} is the inverse function of 6. By ip' in (12.5)

one can understand any one-sided derivative of the convex function cp.

PROOF. For any c ^ 0, there exists a c' such that <p(c) = ^(c7). By the choice of the initial point and the uniform convexity, we have

d)i\y ¥ > ( 0 ) < ^ + | ) < ¥ > ( c ) 1-8

<p{c) <f(c) l-6[\c\

<p(c)jy Hence,

Page 112: Local Properties of Distributions of Stochastic Functionals

96 3. GAUSSIAN FUNCTIONALS

This implies (12.4). To obtain (12.5), it suffices to substitute (12.4) into the estimate

l¥/(c)|>(v>(c)-¥>(0))|c|, which holds since (p is convex. •

The following lemma characterizes the important class of "radial" Gaussian densities.

LEMMA 12.4. Letp(c) be a density of the form

(c-a)2' (12.6)

inR 1 . Then

(12.7)

,Kc) = A | c r - 1 e x p { - ^ 0 ! }

p(c) < min {'

2<X2

-7710I—m/2|c|m—1 o I r ( m / 2 ) ' (TTT1^

Note that the uniform estimate for p depends on neither m nor a.

P R O O F . The normalizing factor A has the obvious bound

A-[/w-,«^{-^}*]"1s[jf^--I.{-^}*] 1 ol—?n/2_—m

- 1

= [ f0°(2u)rn^-1amexi>(-u)du r(ra/2) '

This yields the first part of (12.7). Now let us obtain the uniform estimate. To be definite, we assume that a > 0.

If c < 0, then p(c) < p{\c\). Therefore, it suffices to estimate p(c) only for c > 0. By differentiating £>(•), we find the point of maximum

<* ( 2, ^ <*2Y

Let c > c*. Then

0 > ^ ( c ) = c- 1 ( m - 1 ) c(c — a) (m-l)(i-l)-^.

\ c c*/ a2

Taking into account the fact that cc* > c\ > a2(m — 1), we obtain

0 > - ( c ) > - 2 ( 7 2 ( c - c * ) . V

By integrating this estimate, we arrive at the inequality

\1'

Finally,

p{c) > exp I - ° J* U(c*).

1 > / p(c)dc> / exp< 2 f ^ ' P ( c * ) »

p ( c * } - [ / e x p { - ^ } H =^72- D

Page 113: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION OF THE NORM 97

THEOREM 12.2. Let X be a uniformly convex space with

(12.8) S(p) > A /

for some A > 0, /? > 1, and p G (0,2]. Suppose that P G £(X) and dimNp > /?. T/ien £/ie distribution density of the

norm with respect to P is bounded.

PROOF. Let m > /?, and let 2 1 , . . . , zm be an orthogonal system in Xp 0. We set Ik = /2fc G X and construct a partition T of the space X into m-dimensional planes 7 = {#7 + ]CfcLi cfc'fc> ck G R 1 } . We choose the initial points x1 so that

||x7|| = inf| |x||.

Let us choose some 7. By virtue of the choice of z and /, the conditional measure P 7 is an m-dimensional Gaussian measure with unit correlation. Within 7, we construct one more partition, 6 , into lines passing through x7 . Let 6 = {x1 + ci, c G R 1 } be one of these lines. We normalize the direction I G £{{h}) by the condition I = Iz, \z\ = 1. Then the conditional measure P# on 6 has the form

Pe(dc) = pe(c) dc = A9\c\m-1 exp j - {° ^ \ dc.

Let us consider the conditional distribution of the norm with respect to P#. Since the function fo(c) = ||x7 + d|| is monotone both for c > 0 and for c < 0, it follows that the distribution F$ = ~Pef~

l has the density

(12.9) *e(r) Pe{c)

M M * - ) I / J ( C ) I ' £

For r > ||x7||, this sum contains exactly two terms. We apply Lemma 12.3 to the function /#. The substitution (12.5) yields

\m\ > 1 - . - i 1 --1//3

We can apply (12.7) and (12.4) to the density p$:

^ ( c ^ m i n J A ^ c p - 1 ; ^ }

< min |Amrm-1 | | i | |1-m I A"11 1 - ll*i -|(m-l)//3

' 7T1/2/'

where Am = 2 1 - m / 2 /T(m/2) . By combining these estimates with (12.9), we find that

F'e{r) <2min\Amr ,m—11 lA~m / / 3( 1 -m/P-1

1/0-1 -iA-i/^1_My,p-ywi/a}.

Page 114: Local Properties of Distributions of Stochastic Functionals

98 3. GAUSSIAN FUNCTIONALS

Depending on the value of 1 — ||x7 | | /r, we use the first or the second expression for the minimum and obtain

/~\/2A \ (l-/3)/(l-m)

Fi(r) <4n'^2^-^j \\l\\~e^r^ = ^ ( m > / 3 ) | | Z | | ^ A - V - 1 .

Let us average this estimate first with respect to 9 and then with respect to 7. Using (3.6), we see that

(12.10) F'(r) < A(mt0){ inf H ^ r V 1 ^ " 1 ;

that is, the density of the distribution F = P / _ 1 is bounded on any finite interval. By Theorem 11.1 (vi), the density is uniformly bounded on R1 . •

The strong convexity of the norm alone without the power-law estimate (12.8) does not guarantee the boundedness of F' [153],

The spaces Lp, 1 < p < 00, can serve as an example of strongly convex spaces in which Theorem 12.1 does not work. In this case, condition (12.8) holds with (3 = max{p; 2}. For a Hilbert space, (3 = 2 and A = 0.25.

Now we shall find out how the position of the barycenter of P affects uniform estimates of the density of the norm. We already have the estimate (12.10), which is independent of the barycenter. However, this estimate is not uniform.

Suppose that the measure P is centered, Pa{*} = P{* — &}> and Fa = P a / " 1

is the distribution of the norm with respect to the translated measure.

PROPOSITION 12.6. IfX. satisfies the assumptions of Theorem 12.1 and P G So(X), then

(12.11) sup F'a{r) < A(P, X)(|M| + l ) " " 1

rGR1

for all a G X. Here A(P,X) is a constant to be defined in the proof

PROOF. By Proposition 11.3, the function * a (^ ) = ^ _ 1 (^a(^)) is concave. Therefore, the derivative

*'r>-y(tffi.(r)))-(»>"<<M^} is nonincreasing. Let r > ||a|| + 1. Then it follows from the inequality ^f'a{r) < ^ ( | | o | | + 1 ) that

(12.12) F'a{r) < F'a{\\a\\ + 1) exp { - *M + * I M ± ! ) j .

Let us consider two cases. 1) Fa(||a|| + 1) > 1/2. ThenO < *0( | |a | | + l) < ¥ 0 ( r ) andF^(r) < ^ ( | | o | | + l) .

2) Fa(||a|| + 1) < 1/2. Then we use the estimate

Fa(\\a\\ + 1) = P{| |x - a|| < ||a|| + 1} > P{||x|| < l } .

Now inequality (12.12) implies the estimate

« r,<* (W + 1)„,{l£*ibUiffi}.

Page 115: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION O F T H E NORM 99

By substituting r = \\a\\ + 1 into (12.10), we obtain

^(rJ^P.XXHall + l)'-1, where

*(P,X) = X M { ( J S 1 ) ) MI}-V.»P{=!!t!^w^iLoij. Q

The smoothness of F. We show that if the functional f(x) = \\x\\ is both uniformly convex and smooth, then its distribution function F is smooth. Specifi­cally, we prove that the distribution of / belongs to one of the classes W x . Recall that the inclusion P / _ 1 G W j means that the derivatives F^l\ . . . , F ^ exist, the functions F, F^\ . . . , F^M~^ are absolutely continuous, and F^M^ is a function of bounded variation.

We consider the derivatives of / as R1-valued polylinear forms. We denote the value of the zth-order derivative at a point x G X by f^(x)[xi,... >Xi\. The norm of this derivative is denoted by 11/^0*0 II a n d is defined by the formula

| | /W(s) | |= sup | / w ( s ) [ » i , . . . , * i ] | . Il*ill=-=IM=1

We say that the norm / has bounded ith-order derivatives if

(12.13) \f{iHx)[xu...ixi]\<c\\x\\1-if[\\xj\\ 3=1

for some c > 0 and for any x ^ 0 and Xj G X. This condition is equivalent to the boundedness of | | /^ (0 l l o n the u r n t sphere

in X. It is well known that the spaces Lp satisfy this condition if p is even or p > i.

THEOREM 12.3. Let the norm have bounded derivatives of orders 1,2,... , M + l and satisfy condition (12.8) of power-law uniform convexity. Suppose that P G £(X) and dimNp = oo. Then P / " 1 G W f .

PROOF. Theorem 9.3 provides sufficient conditions for the distribution of a M

functional to belong to W x . To use these conditions, we need to verify the inte­grability of higher-order derivatives and to obtain a lower bound for | / ^ | .

We start by estimating the higher-order derivatives. Under the assumptions of the theorem, we have an estimate for | | / ^ | | , but we need to verify the integrability of the Hilbert-Schmidt norm | / ^ | . Hence, we must establish the relation between the two norms. It is given in the following lemma, which is of interest in itself.

LEMMA 12.5. Let L: X* —> R 1 be an i-linear form, and let P G £o(X). Then

\L\2= [ |L [x 1 , . . . l x i ] | 2 P(dx i ) . . .P (dx < ) .

We omit the elementary proof, which includes treating the finite-dimensional case and passing to an infinite-dimensional X in the limit. We are interested in the following obvious consequence of this lemma:

iii2<(^w2P(dx)y iiiii2

Page 116: Local Properties of Distributions of Stochastic Functionals

100 3. GAUSSIAN FUNCTIONALS

Since, by Proposition 11.6, the norm is integrable with respect to the Gaussian measure, it suffices to verify the moment properties of the function | | /^( ') | |» 2 < i < M + 1. Prom the boundedness condition (12.13), we obtain the estimate

ll/WWHciixii1-*. The variable |/<*>(-)| has all moments, since

(12.14) / \\x\\-dP(dx)<oo Jx

for all d > 1. Let us prove this statement. First, we consider the case in which the measure P

is centered. The support of P is infinite-dimensional; therefore, in X' we can choose continuous functionals / o , . . . , fd that are orthonormal in L2(X, P) . By continuity, we have

By passing to the joint distribution of the functionals fj (that is, the standard Gaussian distribution in R d + 1 ) and then to the polar coordinates, we obtain

/ \\x\\-dP(dx)<c;d/2 f ('£\(x,fj)\

ayd/2p(dx) «/X t /X 7=o

= (27r)-(d+1)/2crd/2 f V V e x p j - j \ dr < oo.

We have verified inequality (12.14) for a centered measure. To verify this inequality in the general case, it suffices to estimate the expression

/ \\x + a\\-dP(dx)= f r-ddP{\\x + a\\ <r} (12.15) 7 x Jo

= d P{||x + a|| <r}r~d-ldr Jo

for P G GoPQ. By the corollary of Theorem 10.6, we have

P{||a? + a|| < r } < P { | | a ; | | < r}.

Returning to (12.15), we obtain

/ ||x + a | | - d P ( d x ) < / | |x | | -dP(dx) < oo, 7x Jx

thus proving inequality (12.14) for a measure with an arbitrary barycenter. It remains to obtain a lower bound for f^\ This bound implies the main

assumption of Theorem 9.3, that is, the inequality

(12.16) / |/<1>(x)r<2Af+e> P(dx) < oo ./x

for some e > 0. Let us take a positive integer m > (/? — 1)(2M + e) and a system of vectors

{lj, 1 < j < ra} C Hp C X orthonormal in the sense of the Hilbert structure on Hp defined by the formula \l\2 = 1(1). Let us consider the truncated gradient

Page 117: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION OF THE NORM 101

v(x) = E j l i f(1)(x)llj]l3' T h i s i s t h e projection in H P of the vector fW on the linear span of the system {lj}. Therefore, \V\ < | / ^ | , and for (12.16), it suffices to obtain a lower bound for |V|.

We shall use the notation introduced in the proof of Theorem 12.2. Suppose that x = x1 + cl e 0 G 7 / 9 . Then

771 771 n

WW = j2f(1)mA2 > I E / ( 1 ) ( « v>lM = I/(1)WWI2 = \m\2. j=i 3=1

There are two lower bounds for |/g(c)|. If f$(c) > 2fg(0), then the convexity of / and (12.4) imply

(12.17) W(c)i > /*(*)-/*(Q) >IM> > \c\ '- 2|c| _ 26*(1) ~ 2 '

But if f$(c) < 2f$(0), then by the definition of 6 we have

\\ct MO) < fe{c) 1-6 (IMII

\fe(c) < fe(c) - fe(0)6 ( Mil \

Let us estimate \f'B{c)\ with the help of inequality (12.8):

I / 'Ml > ^ ( c ) - / ^ ( ° ) > /g(0)g(||ci||/2/g(0)) I M n~ \c\ ~ |c| ~ Hf)'"'(?)

By integrating this estimate with respect to the radial conditional measure, we obtain

/_~ \m\-(™^Mc)dc < (M)"(2M+£)

(3-i - (2M+e) rcx)

+ A/e(0)i-^Blj j J Jc\V-M™^Pe(c)dc.

Let us estimate the last integral. We write Am — 21 m / 2 / r ( m / 2 ) and apply (12.7) to pe:

/

OO pi

( c | ( l - W 2 M + £ ) ^ ( c ) dc<Am |c |(l-/3)(2M+£)+m-l dc + L -oo J-l

We denote the right-hand side by i4(ra, M, £, /?). It is finite by the choice of m. We integrate over the set of radial rays 0 in the space 7 and obtain

j ]V(X)\-^^p,(dX) < (mf my(2M+£)

• / 11/11 \/3l-(2W+e)

This estimate is uniform with respect to 7 C X except for the factor / (# 7 ) . Let us obtain an integrable estimate for this factor.

If a7 is the barycenter of the measure P 7 and /?i = (ft - 1)(2M + e), then by the definition of x 7 we have / (x 7 ) = ||x7 | | < ||a7|| and

/(a;7)*<(l + |K||)1+*.

Page 118: Local Properties of Distributions of Stochastic Functionals

102 3. GAUSSIAN FUNCTIONALS

Since the power-law function with exponent 1 + /?i > 1 is convex, we can use the Jensen inequality to obtain

(l + IKH) 1 *^ f(l + \\x\\)1+^P^dx).

The integration with respect to the quotient measure gives

/ / ( s 7 ) f t P r ( d 7 ) < / (1 + | |* | | )1 + / ? 1 P7(dx). Jx/v Jx fx/v

By Proposition 11.6, the right-hand side is finite. By combining the above estimates, we see that

/ \V(x)\^2M^P{dx)<oo. Jx

This implies (12.16). Now the application of Theorem 9.3 is justified, and therefore,

we have P / " 1 e W f . •

Appendix

1. Suppose that (X, || • ||) is an infinite-dimensional separable Banach space. For each e > 0, one can construct a norm | • | on X and a measure P € Go(X) with the following properties:

a) ( l - e ) | | x | | < | x | < ( l + e ) W for all x G X; b) if F(r) = P{x | |x| < r } , then the density F' is unbounded. If X is a Hilbert space, then one can choose the norm | • | so that it is infinitely

differentiate and its derivatives are bounded in the sense of (12.13) [151, 152]. By comparing this statement with Theorem 12.2, we see that the norm | • |

cannot be uniformly convex of a power-law order. 2. There exist a sequence of independent Gaussian random variables {£&}, 1 <

k < oo, such that the distribution function F of the random variable M = supk | ^ | has a bounded derivative F\ whereas F" (the derivative of the distribution density) is unbounded [77].

For any function <p: (0, oo) —> [0, oo) such that limr_>o (fix) = 0, there exists a sequence {£&} such that [75, 84]

Fir + e)- F(r) sup — Y~T — = +oo. r,e>o W\e)

3. Suppose that X is a Hilbert space, P £ £/(X), a is the barycenter of P , a\ > a\ > . . . are the eigenvalues of the correlation operator, and F is the distribution function of the Hilbert norm with respect to P . Then

supF'(r) <c(aua2)-^(j2°l) + N l }

where c is an absolute constant. It is impossible to use only one eigenvalue in this estimate.

For the Hilbert space and the spaces lp, there are different, more complicated estimates of the dependence between F' and the correlation operator and the po­sition of the barycenter [80, 84].

Page 119: Local Properties of Distributions of Stochastic Functionals

§12. DISTRIBUTION OF THE NORM 103

4. Suppose that {£&}, 1 < fc < oo, is a sequence of independent random •A/"(0, <J|)-distributed variables and that the variances a\ decrease regularly:

01 = bu °k = (lnk)-1/2bki k>2,

where {bk} is a monotone sequence that tends to zero. Let &(•): R+ —> R+ be a monotone function with b(k) = bk. We denote the inverse of b by 6*.

If for some a G (0,1/2) and any (3 > 1 we have

llmlnb* (t)/(b* (fit))" = 0,

then the distribution density of the random variable M = max^ |£/c| is bounded [82-84].

Although this statement is close to Proposition 12.5, it neither follows from nor implies it. Indeed, the Paulauskas condition is based on the fact that the sequence <7fc decreases regularly, whereas (12.1) implies a restriction on the rate of decrease of ak.

5. Suppose that X is a Hilbert space and P G GoPQ is the measure constructed in Example 7.3. Then the random variable Q = \\x\\2 = ]T\ <T?£? is a quadratic form of independent A/"(0, l)-distributed variables. Let F(r) = P{x | Q(x) < r}. There exists extensive literature dealing with the properties of F\ for example, see [23, 50, 110, 117]. It follows from Theorem 12.3 that F has derivatives of all orders. If Gj > 0, then each derivative is uniformly bounded on R+.

Now we present an estimate for F' that can also be used for quadratic forms whose coefficients have alternating signs. Let

Q = J2aiSh F = P{x\Q(x)<r}. 3

We set

u{8) = sup sup i > ( r ) | V a * = 1, V \ad\ < oo, V V 4 = s)

for 6 > 0. Then lim6^Qu(6) = (20T)"1 . Moreover, if 1/3 < 6 < 1/2, then

u(6) < c0 - ci ln(l - 26), c0 = 0.77, cx = 0.43.

For 6 = 1/2, the unpleasant singularity on the right-hand side is primarily due to the hyperbolic form Q = (f jf - £2)/V% whose distribution density is unbounded [89].

6. Suppose that (X, || • ||) is a separable Banach space, P G GoPQ> and F(r) = P{# | ||z|| < r} . Then there exists a P-dependent constant c such that

1 - F{r) > c~l exp{-c(r + e)e}[l - F(r - e)]

for all e > 0 and r > e. If r > max{l; e}, then

F(r + e) - F(r - e) < c(r + l)e[l - F(r - e)].

If the norm has a bounded distribution density, then the last inequality holds for all e > 0 and r > e.

These statements can readily be derived from Proposition 11.3 and Theo­rem 11.6, respectively, (see [2, 40]; both inequalities occurred for the first time in [2]).

Page 120: Local Properties of Distributions of Stochastic Functionals
Page 121: Local Properties of Distributions of Stochastic Functionals

CHAPTER 4

Poisson Functionals

§13. The configuration space

Let G be a separable metric1 space. Let us define some notions related to the space of configurations in G. Informally speaking, a configuration is a locally finite subset of G with repetitions, that is, with multiple points. The main difference from the standard definition of configurations is that we allow repetitions.

Let 05 = 05 G be the Borel <r-algebra and 05o C 05 the ring of bounded Borel sets. A sequence #i , #2, • • • (possibly finite) of points of G is said to be locally finite if any bounded set contains only finitely many terms of this sequence.

Two locally finite sequences {x^ and {yi} are said to be equivalent if there is a bijection a between their indexing sets such that yi = xa^) for all i.

An equivalence class of locally finite sequences is called a configuration in G. We denote the space of all configurations in G by X = X(G). Elements of X

are usually denoted by X, V, or Z. A sequence {xi} from an equivalence class X is called a representing sequence of X. In a natural way, we define the membership relation x € X (a, point belongs to a configuration), the multiplicity of a point in a configuration, the intersection X fl V ( 7 C G), and the number \XC\ V\ of points in this intersection. In the last two cases, we count the points with their multiplicities.

Furthermore, let E be the minimal possible cr-algebra on X(G) with respect to which all mappings of the form X \-^> \X nV\, X e X, where V G 05o, are measurable.

If G is a bounded set, then we have the simplest configuration space in which all configurations are finite and X(G) is naturally identified with the set {0} U IJ^Li Gn/7rn. Here Gn/7rn is the quotient of G n with respect to the permutation group 7rn. Note that in this case the <j-algebra £ is the union of the cr-algebras 7fn05(Gn), where Wn: G n —> Gn/nn is the natural projection. In the general sit­uation, when the space G is unbounded, for any bounded Gi C G we have the natural bijection X <-> Y U Z between 36(G) and the set

oo

(13.1) X(G \ Gi) U ({0} U ( J G?/7rn) . 7 1 = 1

Let us define the canonical bijection as t that relates any two representing sequences s = {xi} and t = {y;}, syt e X. This bijection is uniquely determined by the following two conditions:

1) as t takes 5 to t\ that is, yi = xa j ^ for each i\

1The metric structure on G is inessential to our considerations. What really matters is the assumption that singletons are measurable and a class of "bounded" subsets of G is reasonably defined.

105

Page 122: Local Properties of Distributions of Stochastic Functionals

106 4. POISSON FUNCTIONALS

2) as t preserves the order at each point of the configuration; that is, if i < j and Xi = xj9 then o~st(i) < crst(j).

Now let us give the definition of a function / : X —> M, where X is a configu­ration and M is a set. First, we define the representation of / on each representing sequence of X, and then impose the consistency condition on these representations. A sequence i »-> / ( s , i) G M is called the representation of / on a sequence 5 = {#*}. Furthermore, f(s,i) is called the value of / at the element Xi of the representing sequence 5. The representations of / on representing sequences s and t is said to be consistent if f(tyi) = / ( s , o~s t(i)) for any i. We say that a function / is defined on the configuration X if a system of consistent representations of / is given on all representing sequences of X. The meaning of this definition is obvious: if a point x G G belongs (with multiplicity k) to the configuration X, then / assigns to x an ordered sequence of k elements of M. If X does not have multiple points, then we have the ordinary definition of a function. In the general case, we can assume that we take several copies of each multiple point and assign a value of the function to each of these copies. Obviously, each function / is uniquely determined by its representation on any representing sequence.

Poisson random measures. Let II be a nonnegative measure on 2$ such that II is finite on ©o- A family of random variables {v(A), A e*Bo} is called a Poisson random measure of intensity II if for any A G 2$o the random value v(A) has the Poisson distribution with parameter 11(^4), that is,

n\

for n = 0 , 1 , . . . , and for any finite family {Ai\ of disjoint sets Ai G Q5o, the variables v{Ai) are independent.

A Poisson random measure can be realized on the configuration space (£, £) . First, by setting

(13.2) P({X \\XnV\= k}) = c -n^ f f lp ! ,

we define a set function P on the sets of the form {X \ \XC\V\ = fc}, where V G 2$o and k > 0.

One can prove that this function can be extended to a probability measure P on £ (e.g., see [53]).

We define a family of random variables {v{V), V G 2$o} on the probability space (£, E, P) by setting

v(v,x) = \xnv\. One can readily show that the family {v{V), K G ©o} is a Poisson random measure. We also note that v is indeed a random measure: for a fixed X G £, v(')X) is the measure on (G,<8) equal to J2xex m(#)£c> where m(x) is the multiplicity of the point x. The measure P is also called a Poisson measure of intensity II.

Along with i/, we often consider the centered measure v = v — II, which satisfies EV(V) = 0 for any V G Q50.

Page 123: Local Properties of Distributions of Stochastic Functionals

§13. THE CONFIGURATION SPACE 107

As was already mentioned, for bounded G it is natural to identify the space X(G) with the set {0} U {U^Li G n /7r n }. Let us show that in this case we have

oo 1

(13.3) ^^'(iw + E W -7 1 = 1

To this end, it suffices to verify that (13.2) is satisfied for the measure P defined by (13.3). Indeed, let F c G . Then

0 0 1

p({x \\xnv\ = k}) = e~n^ ]Tcn(n(^))/c(n(G\v))n~k-^ n=k

k °°

E n=k

e_nr^(U(V))k

k\

for k > 0 (for k = 0, (13.2) is obvious).

Stochastic integrals. Stochastic integrals provide the most important exam­ples of functionals on the configuration space. We need the following definitions.

Let (B, || • ||) be a separable Banach space, <p: G —> B a measurable func­tion vanishing outside some bounded set (we denote the class of such functions by Io(G,B)), and v a Poisson random measure of intensity II defined on the proba­bility space (36, S ,P) . The random variable

(13.4) X h- Y, V(x) = f <p(x)v(dx,X) xex ^G

is called the stochastic integral of (p with respect to v and is denoted by f (pdv. If ip e L^G, ] ! ) , then, by definition, we set

(pdu= / ipdv— I ipdU.

Now let us calculate the mean value of the stochastic integral.

PROPOSITION 13.1. For any ip e I 0 (G,B) n L ^ G . I I ) , one has

E / </? dv = ip dU.

PROOF. Let Gi = {x E G | <p(x) ^ 0}. Since the values of a random measure on disjoint sets are independent, it follows from (13.3) that

^ , = e - n ^ ) V i / ytiP{xi)U{dx1)...U(dxn) ti n! J^ t[

= e-n<G l

Note that (13.4) gives a function defined on the entire configuration space X. Obviously, this is possible only for <p £ I Q ( G , B ) . In what follows, we extend the

Page 124: Local Properties of Distributions of Stochastic Functionals

108 4. POISSON FUNCTIONALS

class of integrable functions. The price we have to pay is that the integral will be defined on a subset of P-full measure in X rather than on the entire X.

A function </?: G —• B is said to belong to the class I(G,II) if for any e > 0 there exists a bounded set V G 2$o such that

(13.5) p j x e £ | II Iiplv,dA >e\ <e

for any bounded measurable subset V C G \ V. Likewise, (p is said to belong to I(G, II) if y? is locally integrable (that is, integrable on each bounded set) and for each e > 0 there exists V G Q3o such that

P J X G £ | J ftplv,<E/^ >e\ <e

for any V G 5$o with V C G \ V. Now let us choose an increasing sequence {Vn} such that Vn G Q50, Vn | G, and (13.5) is satisfied for V = Vn and e = 1/n. In this case, the sequence of random vectors / cplv dv is a Cauchy sequence in probability. The corresponding limit is called the stochastic integral and is denoted by f ipdv. In a similar way we define fipdv for <p G I(G,II) . One can readily show that these integrals are independent of the choice of the sequence Vn. We also note that in this case convergence in probability, as well as the weak convergence of the corresponding distributions, automatically follows from convergence almost everywhere (e.g., see [16, Ch. V, §2]).

PROPOSITION 13.2. LetB be a Hilbert space, and let ip be a function such that \\<p\\ G L 2 ( G , n ) . ThenipeI(G,U).

PROOF. Let V e 550. Then

\flv(pdpf=[( J2 < x)> E v(aOW" /v>dJ J J Vo-cYnu *c vm/ / \\JV II

= e -n(v)

-n(v)

xexnv xzxnv °° 1 / f n \ II f "2

E ^T / E (V(»i). ¥>(*i))n(dri) • • • n(dx„) - / ip dU ^in-\Jv» C~±x ) \\Jv

JT±(n[ MfdYlMV))-1

n ( n - 1) / (¥>(aj),v(v))n(daf)n(c^)) - / v ^ n Jv J \\Jv

+

= / IMI2<*n. JV

This identity together with Chebyshev's inequality readily implies (13.5). •

The notion of a stochastic integral proves useful for the description of infinitely divisible distributions. Specifically, suppose that B is a separable Banach space and u G B is an infinitely divisible random vector. (Just as in the one-dimensional case, the infinite divisibility means that for each n the random vector u can be represented in the form u = u\ -\ \-un, where the Ui are independent identically distributed random vectors; here and in what follows the equality of vectors is understood in the sense that their distributions coincide.)

Page 125: Local Properties of Distributions of Stochastic Functionals

§13. T H E CONFIGURATION SPACE 109

Infinitely divisible random vectors admit the Levy-Khinchin representation-, namely, any such vector u can be uniquely represented as the sum of three inde­pendent terms,

(13.6) u = u0 +ui +^2,

where UQ € B is a constant vector, u\ is a centered Gaussian vector, and

(13.7) u2= xdv+ I J{\\x\\<l\ U

xdv. {|MI<i} J{\\x\\>i}

Here v is a Poisson random measure2 of intensity II on B. (The measure II is called the Levy measure of the vector u). Note that II satisfies the conditions II({0}) = 0 and Jmin(| |x| |2 , l)dII(x) < oo.

Let us consider two important classes of infinitely divisible random vectors.

Stable vectors. A random vector u is said to be stable if the following asser­tion holds for independent random vectors u\ and u2 whose distributions coincide with the distribution of u. For any a, b > 0, there exists a number c and a non-random vector x such that the distributions of the vectors au\ + bu2 and cu-\- x coincide. If x = 0, then u is said to be strongly stable. The distribution of a stable random vector is also called stable.

A vector u is said to be stable with exponent a (a-stable) if the numbers a, 6, and c satisfy the relation aa + ba = ca.

PROPOSITION 13.3. Each stable random vector is a-stable for some a G [0,2]. The case a = 2 corresponds to Gaussian vectors] in this case, the strong stability means that the mean value is zero.

The proof can be found in [145]. Note that if a vector u is a-stable, then

u = ^17^(^1 + * • * + tin) + x = ^((ui+nl/a~lx) + *' * + K + n 1 / a " 1 x) )

(here the Ui are independent copies of u and x = x(n) G B), and hence, each stable vector is infinitely divisible.

We denote the unit sphere by 5 = {x G B | ||x|| = 1} and introduce the coordinates (r, 6) in B \ {0}, where r = r(x) = \\x\\ G (0, oo) and 6 = 6(x) = zll^H-1 G S.

THEOREM 13.1. Let u be an a-stable vector with a ^ 2. Then u\ = 0 in the representation (13.6), and the Levy measure II has the following form in the coordinates (r,8):

dr (13.8) n(dr,d0) = ^ 7 r ( d 0 ) ,

where n is some finite measure on S, called the spectral measure.

The proof of this theorem can also be found in [145].

2When this measure is defined, the metric on B must be chosen so that the sets bounded away from zero are bounded.

Page 126: Local Properties of Distributions of Stochastic Functionals

110 4. POISSON FUNCTIONALS

Processes wi th independent increments . A process £(£), t G R+, is called a process with independent increments if the random variables £(£o)>£(£i) _

£(£o)> • • • >£(£/c) — £(^/c-i) are independent for alHo < h < ''' < tk in R+-If the distribution of £(t + h)— £(h) depends only on h) then the process is said

to be homogeneous. In what follows, we assume that the process takes values in Rd , d>l.

The following theorem describes the structure of an arbitrary stochastically continuous process with independent increments.

THEOREM 13.2. If £(t) is a separable stochastically continuous process with independent increments, then there exist a Gaussian process £i(t) with trajecto­ries continuous with probability 1 and a Poisson random measure v of intensity II independent of £i on R+ x R d such that

(13.9) £{t) = &(*) + / / xv(dsdx) + / / xv(dsdx). ^0<s<t ^0<s<t

\x\<l \x\>l

The measure II is called the Levy measure of the process £. It satisfies the condition

JJ\Q min(|#|2, l)TL(dsdx) < oo

[0,t)xKd

for any t G R+. We also note that for homogeneous processes the measure U(ds dx) has the form dslli(dx).

PROBLEM 13.1. Let X\ C X be the set of configurations that have no multiple points, and let the measure II vanish on singletons in G. Prove that X\ G E and P { * i } = l.

PROBLEM 13.2. Let II be some finite measure in a Banach space B. Consider the probability measure expll = e~n(B) YH?=o^k/k]- (the power is understood in the sense of convolution). Prove that expll coincides with the distribution of the random variable / x dv, where v is a Poisson random measure of intensity II.

PROBLEM 13.3 (change of variables in the stochastic integral). Let g: G —> Gi be a mapping of G into some set Gi . A subset A c Gi is said to be bounded if g~1(A) is a bounded set. Let X\ be the set of configurations in Gi (with respect to the class of bounded sets). We set Q(X) = {g(x),x G X} and thus define a mapping Q: X —> X\. Furthermore, suppose that P is a Poisson measure of intensity II on E and v is the corresponding Poisson random measure. Prove that P i = ~PQ~l is a Poisson measure of intensity I I ^ - 1 on £(E) and

/ <p(g)dv= / ydvi

(where v\ is a Poisson random measure of intensity II# x).

PROBLEM 13.4. Choose a metric on B so that on the space £(B) correspond­ing to this metric one can define a Poisson measure P whose intensity has the form (13.8).

Page 127: Local Properties of Distributions of Stochastic Functionals

§14. DIFFERENTIAL CALCULUS ON THE CONFIGURATION SPACE 111

§14. Differential calculus on the configuration space

The local structure of the configuration space. We make some additional assumptions about the structure of the space G. Namely, suppose that S is a complete separable metric space and I C R m (m > 1) is an open set (also treated as a metric space with respect to the Euclidean metric). We set G = S x / and equip this space with some metric3 consistent with the direct product structure and the Borel cr-algebra 2$G-

On the configuration space 3£(G), we introduce the structure of a "plane Hilbert manifold along J"; namely, for each configuration X £ X(G) we consider the Hilbert space 'H.(X) of square integrable mappings X —> R m ,

(14.1) H(X) = {h: X - R m | \h\2H(x) = £ \h(x)\2

Rm < oo}.

The space H(X) has the meaning of the tangent space along I at a point X G X. Indeed, for h G H(X), we can define the configuration X + h e X that consists of the points x + h(x), x G l In this formula, the summation "along / " is understood as follows: we represent each point x G G in the form x = (0, r) , 0 G S, r G i", and, by definition, set x + a = (0, r + a) for a G R m such that r + a e I.

The family

Q = Q(V) = {Q(X): H(X) -> H{X) \ X G X}

of mappings Q(X) of the form h i-> lXnyh is called the orthogonal projection defined by a Borel set V. The orthogonal projections corresponding to the bounded sets V are said to be finite-dimensional (in this case, all the spaces H\z(X) = Qli(X) are finite-dimensional).

Furthermore, a mapping K: X \-> K(X) G H(X) is called a vector field if it is measurable in the following sense. For any bounded open set V C G, the mapping X i—» X + Q(V)K(X) is E-measurable (as a mapping X(G) —> X(S x R m ) ) . Note that if K\ and K<i are two measurable vector fields, then X i—> (Ki(X),K2(X))H( . is a measurable function. We also use the fields of

multilinear Junctionals L: X i-> L(X) G £ J (H(X)) , /?eWs of multilinear Hilbert-Schmidt Junctionals L: X i-> L(X) G £ / 2 JH(X) ) , and/JeZds of trace class operators L: X i—> L(X) G £(i)(H(X)). A j-linear functional L is said to be measurable if the function X i-> L(X)(i i r i (X), . . . , i£j(X)) is measurable for any jf-tuple i f i , . . . , Kj of measurable vector fields.

The topology of the configuration space. First, suppose that G is a bounded space. Then all configurations on G are finite and, as was already noted, X(G) is naturally identified with {0} U U^Li Gn/7rn, which defines the natural topology on X(G).

If G is unbounded, then X(G) bears no "canonical" topology. Let us consider some topologies which will be used in this case.

3Even the case in which S is a singleton is meaningful, but this is insufficient for studying the distributions of functionals of stable random vectors (§§16, 17).

Page 128: Local Properties of Distributions of Stochastic Functionals

112 4. POISSON FUNCTIONALS

1. The coarse topology. The coarse topology w on X is the weakest topology in which the mapping

X •"> S ^ is continuous for any continuous nonnegative function </? with bounded support.

It is well known [53] that (X,w) is a complete separable metrizable space. A sequence {Xn} converges to X in w if the sequence of measures Y2xexnnv ^ weakly converges to Y^xexnv ^ for any bounded V. If G is bounded, then the coarse topology coincides with the natural topology. For our goals, the coarse topology is too poor, that is, the supply of ^-continuous functions is too small.

2. The O-topology. This topology describes the properties of the space X "along J." A set il c X is said to be O-open (il G O) if for any configuration X G il and any bounded open subset V C G there exists an e > 0 such that X + h G il for any h G Hv(X) with |h| < e. In other words, a set i l is O-open if for any configuration in il we can slightly "perturb" (along / ) the points of this configuration that lie in a bounded open set V so that the perturbed configuration still remains in il.

Let us consider a useful property of the class of open sets.

PROPOSITION 14.1. Suppose that il G O, X G il, and V c G is a bounded open set. Then the sets

{hettv(X)\X + heil},

{h G Hv(X) I {X n V) + h C V, X + h G il}

are open in Hy (X).

We leave the proof to the reader. We shall mainly use the O-topology. For il C £(G) , we denote by il~ its

O-closure and by dil its boundary in the space X(S x R m ) D X(G). Obviously, il~ C £(S x J~). If i l is an open set and il~ C 3£(G), then il is called an inner open set.

Let us make some remarks about the O-topology. A sequence {Xn} converges to X in O if and only if there exist a finite-dimensional projection Q = Q(V) and a number no such that for any n > no the terms of this sequence have the form Xn = X + /in , where hn G tiy{X) and \hn\ —> 0.

The O-topology is so rich that it fails to have a countable basis (even locally). Moreover, this topology does not take into account any properties of the space S. For example, an O-open set need not be E-measurable. Nevertheless, the O-topology is well adapted for studying objects that do not have good properties "along S."

3. Poisson topologies. The topologies w and O are, in a sense, extreme topologies: w is the weakest and O is the strongest "reasonable" topology. To get rid of their disadvantages, we define "intermediate" topologies, which will be called Poisson topologies. Although they are defined not on the entire configuration space but on some subset of P-full measure, this is sufficient for our goals.

First, we give some auxiliary definitions. By setting

(14.2) v(X) = Xn(G\V),

Page 129: Local Properties of Distributions of Stochastic Functionals

§14. DIFFERENTIAL CALCULUS ON THE CONFIGURATION SPACE 113

we assign a mapping v: X(G) —» X(G \ V) to each measurable set V C G. Thus, each configuration X is represented in the form X = (X C\ V) U v(X).

Two configurations X,Y e X are said to be close to each other if there exists a bounded set V C G such that X and Y coincide outside V. A subset Xo of the configuration space is said to be saturated if Xo contains the empty set and if for each configuration l G l o > all configurations close to X belong to Xo- (In particular, all finite configurations belong to a saturated set.)

If Xo is saturated and V C G is bounded, then we set v(Xo) = {X e Xo \ X fl V = 0} C Xo. Then the mapping iy: (Y,Z) i-> Y U Z is a bijection of X(V) x v(Xo) onto Xo. Now if r is a topology on £o> then by v(r) we denote the induced topology on the subspace v(Xo). Furthermore, by r(V) we denote the topology induced by r on X(V) (note that X(V) is naturally identified with the set {XeXo\xn(G\V) = 0}).

Now we can give the definition of a Poisson space. Let Xo be a E-measurable saturated set. A topological space (Xo,r) (and its

topology r) is called a Poisson space with respect to a measure P if (1) Xo is a set of full measure, that is, P{3£o} = 1; (2) the measure P is dense on (3Co,r); (3) r is a Lindelof topology (that is, each open cover contains a countable sub-

cover) and is measurable (r C E); (4) for any open bounded set 7 c G , the topology r(V) is not stronger than

the natural topology p(V) on X{V)\ (5) r is not stronger than O (r C O ) ; (6) for any open bounded set V C G, the mapping iy is continuous with respect

to the topologies (r(V) x V(T),T).

There is a rather general method for constructing Poisson topologies. One considers an additive mapping of Xo into an appropriate linear topological space B and defines r as the preimage of the topology of B. Let us describe this construction in more detail.

Suppose that B is a separable Banach space (for our goals this is sufficient) and Xo is a measurable saturated set of full measure. A mapping A: Xo —> B is said to be additive if for any measurable bounded set V C G we have

(14.3) Apf) = A(XnV) + A(v(X)), X e X0.

Let us additionally assume that the mapping A is surjective, that is, A(3£o) = B. furthermore, let x be the strong topology in B, and let r = A~1(x). Obviously, r is a Lindelof topology and the measure P is dense on Xo- For r not to be stronger than 0) it suffices to require that A be 0-continuous.

Furthermore, it is obvious that the continuity of iy is equivalent to the conti­nuity of the composite mapping A o iy. Since (A o iv)(Y, Z) = A(Y) + A(Z), we see that the latter continuity follows from the continuity of addition in B and from the definition of r .

As an example of a measurable additive mapping, we take the stochastic integral A = \ gdv (or / gdv). This integral can be viewed as a function defined P-almost everywhere on X. Let us show that by varying A on a set of zero probability we can ensure that the domain Xo of this function is saturated and A is additive. Let g € I(G,II). Let Xo be the set of X G X for which Jly gdv is convergent (for some sequence {Vn} chosen in advance). Obviously, if X G Xo, then configurations close to X also belong to XQ. By passing to the limit, we obtain the additivity

Page 130: Local Properties of Distributions of Stochastic Functionals

114 4. POISSON FUNCTIONALS

relation (14.3). Of course, the reasoning remains valid for integrals of the form JgdV.

Differentiation in the configuration space. Suppose that il G O fl E and V C G is a bounded open set. A measurable function / : i l —> R 1 is said to be I times Hy-differentiable (f G C^ (il)) if for any X G il the mapping h H-> f{X+h) is Z times continuously differentiate in a neighborhood of zero in Hy(X) (obviously, this mapping is then differentiate in the same sense on the entire set {h G Hv(X) \ X + h G il, (X fl V) + h C V}). We denote the jth-order derivative at a point h by fy{X,h) and omit the argument h ii h = 0. By definition, the derivative is a multilinear functional on Hy(X) .

The notion of differentiability means that the function varies smoothly if the configuration points belonging to V are slightly varied along the real coordinate.

Let us consider the simplest situation, in which one deals with the configuration space on an open bounded set G C R m . Then X(G) can be represented in the form {0}U|J^Li Gn/nn, and one can readily show that a function / is Hc-differentiable if and only if the restriction of this function to each set Gn/nn is differentiate, that is, the function / o 7fn is differentiate in the usual sense on the set Gn C ( R m ) n

(here 7fn is the natural projection). The notion of the derivative in this case is also transparent. Namely,

(f{1)(X))(xi) = [di(foWn))(xu...)xn)

for the configuration X = { # i , . . . , xn}.

The classes C^( i l ) of differentiate vector fields K: X ^ K(X) G HV(X) are defined almost similarly. The difference is that the mapping h H-» f(X + h) is replaced by the mapping h H-> T(X,h)K(X + /i), where the transport operator T(X, h): H(X + h) -* H(X) is defined by the natural formula

[T(X,h)g](x) = g(x + h(x))> xeX, ge H(X + h).

Suppose that il G O fl E and / : i l —> R 1 is a measurable function. We say that / is I times differentiate ( / G C^(i l ) ) if / G C^ (il) for any bounded open subset V C G and there exist "globalizing" measurable vector fields fW : X i-> f^\X) G Cj(H(X)) (j = 1 , . . . , /, X G il) such that for any V and any X G il we have

/ {? ' 'TO w = /0)mW' w whereQ = Q(V), j = l , . . . ,Z.

We say that / is / times differentiable up to the boundary ( / G C^(i l~)) if there exist an open set ill D il~ in X(S x R m ) and a function g G C^( i l i ) such that / = g on il.

The differentiability of a vector field K: X H-> K(X) G H(X) is defined in a similar way. Moreover, we can readily generalize the definition of differentiability to functions ranging in some Banach space B. In this case, the derivatives are multilinear B-valued functionals.

Let us introduce the notation for the normalized derivative. We set

(14.4) n(X;f) = / ( 1 ) P O / | / ( 1 ) ( X ) | H ( x ) G H(X)

for/eC^CU).

Page 131: Local Properties of Distributions of Stochastic Functionals

§14. DIFFERENTIAL CALCULUS ON THE CONFIGURATION SPACE 115

Suppose that I > 1, V is a bounded open set, and Q = Q(V). We say that a vector field K G Cy (ii) belongs to the class C ^ ^(ii) if there exists a bounded open set V\ such that V\ C V and Q{V\)K = K.

Let us give an example of a differentiable function. Suppose that </? G Io(G, R1) and for any 0 G S the function (p(0, •) belongs to C^(7) . We set

/(X) =£>(*). xex

One can readily show that / G C^l\X) and

(f^(X))(x) = ^\x)

for all x G X (here we naturally identify a linear functional on H(X) with an element of H(X) and write

(14.5) {fu)(X))[hlt • • •,h^ = X) ¥>W)(*)[M*), • • •.M*)l

for all j = 1 , . . . , Z). Formulas (14.5) remain valid if (p ranges in a Banach space B; moreover, the

composite function differentiation formula

(14.6) (gof{X))W(x)=gW{f(X))V>W(x)

holds for any Frechet differentiable function g: B —> R1 . By passing to the limit, one can readily prove formulas (14.5) and (14.6) for the case in which the functional (p belongs to one of the classes I(G, II) or I(G, II).

Let us introduce one more useful notion of differentiability of functions. Suppose that q = {q(0)) #(1) , . . . , tf(0)> Q(j) > 1, j = 0 , . . . , / , is a multi-index

of length / + 1, V G G is an open set (not necessarily bounded), and Q = Q(V). For Y G v(X) (where v is defined by (14.2)), by P y (respectively, Ey) we de­note the conditional probability (respectively, the conditional mean value) for P corresponding to the condition v = Y.

We say that a function / : X —> R 1 belongs to the class W^(X, P) if for any increasing sequence of bounded open sets {V^}, (Jn Vn = V, there exists a sequence of functions {/n}, fn € C ^ , with the following properties:

1) / n -> / in L«<°>(£,Py) for Pv"^almost all Y\ 2) the sequence of mappings

X -> f^Hx)[Qn(-)t... ,Qn(-)] € r{2 )(H(X)), Q n = Qn{Vn),

converges in L9^'^(3t, P y , £ L ( H ) ) for each j = 1 , . . . , I and for Pv~-^almost all Y". The limits of these sequences will be called the derivatives of f along Hy and

denoted by f^\ They are measurable fields of Hilbert-Schmidt forms, f^(X) G CLJH(X)). One can show that they are independent of the choice of Vn and fn.

In particular, the first derivative fW can be treated as a measurable vector field. In a similar way, we can define the classes W£(£, P , H) and W#°(£, P , H) for

vector fields (in the latter case, the approximating sequences of vector fields are taken from Cy°).

Page 132: Local Properties of Distributions of Stochastic Functionals

116 4. POISSON FUNCTIONALS

Next, we say that a function / belongs to the class VVyloc(X,P) (the subscript "loc" means that the function and its derivatives are integrable only locally) if there exist an increasing sequence {An} of sets A n e 0 f l E and a sequence of functions fn € W « ( £ , P ) such that

l ) P { U n ^ n } = l; 2) the restrictions of the functions / and fn to An coincide for each n. In this case, the limits fW = limn fn exist (P-almost surely) for j = 1 , . . . , I

and are independent of the choice of the sequences {An} and {/n}-We conclude this section with a very important remark. The norm in the

tangent space H(X) was chosen rather arbitrarily; we have never used the explicit form of this norm. All our statements remain valid if we assume that for each X e X we have a Hilbert space H(X) (a subset of R x ) equipped with some norm I * IH(X)- For example, we can set

/ \ l / 2

(14.7) I*IH(*)= E " ( * ) W * ) I R ™ > ^xex '

where x : G —> (0, oo) is some measurable function. In what follows, we need norms of the form (14.7), but, unless otherwise spec­

ified, we consider the l2-norm (that is, x = 1). The same is true of the choice of the metric on / C R m : we have nowhere used the explicit form of the metric.

In all applications, some Poisson random measure is given beforehand and for this measure we must choose an appropriate metric in / and a norm on H(X) (see Problem 13.4). If the Poisson measure has a Lebesgue intensity (this case is just the case that will be considered in most detail), then it is most convenient to use the Euclidean metric in / and the l2-norm in H(X). If the intensity II is not Lebesgue, then we can either choose an appropriate norm in H(X) (so that the functions in question will be differentiate) and a metric on / , or, vice versa, choose a transformation R m —> J that takes the Lebesgue measure to the measure II (see Problem 13.3). We shall use both methods.

§15. The Gauss-Ostrogradskii formula

This section deals with the "geometry" of the configuration space X. In the first part, we define the notion of a smooth surface in X (with respect to the differential structure introduced in §14). Then for a Poisson measure P whose intensity has the special form II = 7r x A, where A is the Lebesgue measure on / , we introduce the notion of the surface measure corresponding to P and prove the "factorization theorem," which states that the integral with respect to the measure P can be expressed in terms of integrals with respect to the surface measures on the level surfaces of a smooth function. This theorem readily implies existence conditions for the distribution density of some class of stochastic functional. Moreover, the density can be expressed in terms of the integral with respect to the surface measure on the corresponding level surface. Finally, we prove a formula that can naturally be called a Poisson analog of the classical Gauss-Ostrogradskii formula.

Surfaces in the configuration space. A set T C X is called an elementary surface of codimension k (for k = 1, we have an elementary hypersurface) if there exist an open measurable set il containing r ( i l G ( 9 f l E , i l D r ) and a function / =

Page 133: Local Properties of Distributions of Stochastic Functionals

§15. THE GAUSS-OSTROGRADSKII FORMULA 117

(fii--)fk) £ C^^(it,R^) such that T = f 1(0) and for some finite-dimensional projection Q = Q(V) (where V is an open set) we have

miudet ({QniX-J&QniXifj))^) > 0,

where n(X; /,) = f?\X) l\ft\x)\ e H(X), i = l,...,k. More generally, a set T c X is called a surface of codimension k if there exists

an at most countable family {Tj} of elementary surfaces of codimension k such that (J • Fj = T and each finite intersection of Vj is an elementary surface of the same codimension.

Subsets of surfaces, as well as at most countable disjoint unions of subsets of surfaces of the same codimension, are naturally called surface sets.

LEMMA 15.1. Suppose that U G O n E, / G C ^ i l ) , and t G / ( i i ) . Then / _ 1 ( 0 H {X | / ( 1 ) P 0 ^ 0} is a hypersurface.

PROOF. Let {Sj} be an increasing sequence of bounded subsets of S, (J. Sj = S. Let {ipj} be a sequence of smooth functions R m —> R 1 such that V^([—J> j ] m ) = 1, ^ ( R m \ [-2j, 2j]m) = 0, and ^ < 1 for all j = 1 ,2, . . . . By setting

we define a sequence {gj} of functions ( : G —> R1 . Next, we set

D J(X) = ^ ^ ( x ) | / ( 1 ) ( X ) ( x ) | 2 , X G i i ,

WO = {xeii| ^(X) > \\fM{x)f >o}.

The desired statement follows from the representation oo

f-\t) n {x I fW(x) * 0} = |J(/^(t) n w ). n

Surface measures on the configuration space related to a Poisson random measure. On (G, 05), let us consider a Poisson random measure v of intensity II = TT X A, where n is some locally finite measure on S and A is the Lebesgue measure on J. Let P be the corresponding Poisson measure on (3£, E).

Now let us construct surface measures corresponding to P on elementary sur­faces in X. It will be shown that this construction can naturally be generalized to arbitrary surfaces (the measure on a surface can be obtained by the standard "sewing" of measures on elementary surfaces) and to measurable surface sets.

Thus, suppose that T = f~l(0) ( / G C(1)(H,Rfc)) is an elementary surface of codimension k and Q = Q(V) is the corresponding finite-dimensional orthogonal projection, whose existence is required by the definition of an elementary surface. Since an arbitrary orthogonal projection Q{V\) with V\ D V can be taken instead of Q(V), we assume that the open set V has the form V = B x A. We use formula (14.1) to define a mapping v: £(G) - • £ ( G \ V).

Let us choose a configuration X eT and set I = \X fl V\ (obviously, lm> k). In what follows, let ( 0 i , r i ) , . . . , (^/,r/) ($i G £ , n G A) be a given representing sequence of the configuration X fl V. For each function h G Hy(X) , by h i , . . . , hi

Page 134: Local Properties of Distributions of Stochastic Functionals

118 4. POISSON FUNCTIONALS

we denote the corresponding representation of the restriction fr|XnK- If we consider the m-dimensional vectors / i i , . . . , hi as coordinates in (Rm)*, then we can identify the spaces Hy(X) and (R m ) ' . This identification, the definition of an elementary surface, and the implicit function theorem readily imply that the level sets of the function ht-*f(X + h) defined on {h G HV(X) \ X + h G il, (X n V) + h C V} are C^-manifolds of codimension k in Hv(X) . By cfLX) ox et

w e denote the standard Lebesgue surface measure defined on these manifolds. The mapping h t-» (XC\V)+h takes this measure to some measure crl

v(X) er 0, o n ^e s P a c e X(V). Choose Z £ X(G \ V) and define a measure az on the space X(V) by setting

az(dY) = e-n^ £ ^ l { , y N } fn(cW1)..Md0i)lii(ZUY)alZtei 6l{dY).

l>k/m ' J

Now we can define the surface measure on T. Let T be an elementary hypersurface. The measure

av{dX) = l r ( X ) | Q n | - 1N x ) ( d ( X n F ) ) P t ; - 1 ( ^ ( X ) ) )

where Pv~x is the image of P under the mapping v (v(X) = X fl (G \ V)) and n(X) = n(X-J) = fW(X)/\fW(X)l is called the surface measure on T.

More generally, if T is an elementary surface of codimension fc, then the surface measure ar is defined by the formula

ar(dX) = 1JX) de t (n , (X) ,n i (X) )^ . = 1 i W

Ldet (Qni(X)iQnj(X)Yij=1

av{x){d(XnV))Vv-'{dv(X)).

Here rii(X) = rii{X\ fa). Now we shall try to explain these formulas informally. The measure a z , Z e

£(G/V) , is a surface measure on the section Tz = {Y G X(V) \ Y U Z G T} of the surface T. Recall that the standard measure on a surface in R* = R*1+*2 can be expressed in terms of surface measures on the sections of this surface by affine subspaces parallel to R'1. A simple geometric argument shows that it suffices to multiply the surface measure of the section by |Qn|_ 1 , where n is the normal to the surface and Q is the projection on R*1, and integrate with respect to the remaining I2 variables. In our case, we need to integrate over the measure Pv~1(dZ).

The reader can readily show that the surface measure is well defined (that is, independent of the choice of V and / ) .

Suppose that it is an open measurable set, / : i l —> R 1 is a function from the class C^^(il), and f^ does not vanish on il. By Lemma 15.1, the level sets /_ 1(£) = {X G il I f(X) = i] are hypersurfaces. By at we denote the surface measure on f~l(t).

THEOREM 15.1. The measure P admits the representation

(15.1) dP = \fM\-1dtdat-)

in other words, for any P-integrable function # : i l —> R 1 the following conditions are satisfied:

(1) the function X 1—> ^ r(X)|/^1^(X)|~1 is at-integrable for Lebesgue almost all i G R 1 ;

Page 135: Local Properties of Distributions of Stochastic Functionals

§15. THE GAUSS-OSTROGRADSKII FORMULA 119

(2) the function t i-> J^iX^f^iX)]'1^ (dX) is integrable, and

f dt [ ^If^l'1 da. = f VdP.

REMARK. The vector case / e C^(HyKk) is a natural generalization of this

theorem. The corresponding representation has the form

dP=[det(f?\tf%=1]-1/2dtdat.

P R O O F OF THEOREM 15.1. Let us choose an approximating sequence Qj = Qj(Vj)y Vj = Bj x A?> °f finite-dimensional orthogonal projections and define the functions D3•,: i i —• R 1 just as in the proof of Lemma 15.1. We also choose an auxiliary continuous increasing function p: R 1 —> [0,1] such that p(l/3) = 0 and p(2/3) = 1. We set Kj(X) = p(\f^(X)\-2Dj(X)) (X G il) and define a partition of unity {XJ}J?=1 by setting

Xi = X i , X2 = (1 " X i ) X 2 , X 3 = (1 - X i ) ( l - X 2 ) X 3 ,

Note that f~l(t) D {XJ ^ 0} are elementary hypersurfaces, and we can take Qj to be the finite-dimensional projection occurring in the definition of an elementary hypersurface. Obviously, it suffices to prove the theorem with \£ replaced by tyj = ^Hj. For brevity, we omit the subscript j everywhere in what follows. First, we have the obvious relation / u # dP = Y^Zi /U | * d P ' w h e r e ^ = {X € C/ | |X n V| = /} (if X fl V = 0 , then tf (X) = 0; therefore, the term with Z = 0 is lacking). By the properties of Poisson measures, we can factor the measure in each term on the right-hand side,

P(dX) = Pv-HMX)) • ffi)t-n(V) . i r W . . . > W d r 1 . . . d r i >

I. [\i[V))

As before, ( 0 i , n ) , . . . , (0/,n) is the representing sequence of the configuration

xnv.

Thus,

/ V(X)P{dX) = rxe-n{y) [ Pv-1 {dZ) Jilt *" JX(G\V)

x / TT(C»I) •. .n(d6i) f i lUi(X)y(X)dn . ..drt.

Here X = ZU { (0 i , r i ) , . . . , (0/,n)} and Z = v{X). By a well-known formula, the inner (finite-dimensional!) integral can be ex­

pressed in terms of the surface measure alz di Ql as follows:

f liXi(XMX)dr1...drl = J ^dt j y{X)\QfM{X)\-lalZt()l 6[(dru... ,dn)

(the inner integral is taken over the level set {(r*i,..., r/) G A* | X e il/ fl / 1(t)}).

Page 136: Local Properties of Distributions of Stochastic Functionals

120 4. POISSON FUNCTIONALS

By substituting the last integral into the preceding formula, replacing the mea­sure 3^0 0, by the corresponding measure 0^,01 ,...,0i m t n e configuration space 3£(V), and summing over Z, we readily obtain

[ VdP [ dt f Pv~l(dZ) Ju JR1 JX(G\V)

x / lri(t)(Z U Y)*(Z U Y)\QfW(Z U Y)\~1az (dY)

= [ dt f ViXXQfWiXT^QniXiftWtidX)

= f dt [ ^(x^HxT^idx). D

Surface measures in the configuration space which correspond to other measures. For some applications, the above notions are not sufBciently general. For example, in what follows we shall consider functional of a process with independent increments and obtain some conditions under which the distributions of such functionals are absolutely continuous. To this end, we need to define surface measures that correspond to Poisson random measures and whose intensities are different from that of the Lebesgue measure.

Here we consider a very useful generalization of the surface measure. In this case the surface measure is defined by "locally" the same measure. In particular, this class contains Poisson measures whose intensities are absolutely continuous with respect to the Lebesgue measure.

Just as before, we suppose that G = S x J, but the metric in / is not necessary Euclidean (see Problem 13.4), X = £(G) , and for each X G 3£(G) there is a Hilbert space H(X) (a subset of R x ) equipped with the norm | • |~ = | • IH(X) = (("» O^)1^2-We only require that the norm | • IH(X) *S m s o m e sense "consistent" with the cr-algebra E. Namely, if K\ and Ki are two measurable vector fields, then the function X i—• (Ki(X),K2{X))~ must also be measurable.

Furthermore, suppose that P is some probability measure on (3£, E), satisfying the following condition. For some cr-finite measure ir on S and each bounded measurable set V C G, there exists a measurable function qv(Y, Z), Y e v(X)y

Z e X(V)y such that for Pv_1-almost all Y we have the relations

(15.2) P £ « P v , ^(Z)=qy(YiZ)i

where P y is the conditional measure (for P) on X(V) corresponding to the condition v = Y and Py is a Poisson measure of intensity TT X A on X(V).

For each measure P of this type and for each surface T, we define a surface mea­sure ar related to P . As was already noted, it suffices to define ar for elementary surfaces T.

Thus, suppose that T is an elementary hypersurface and that Q = Q(V) and / are, respectively, finite-dimensional projection and the function occurring in the

Page 137: Local Properties of Distributions of Stochastic Functionals

§15. THE GAUSS-OSTROGRADSKII FORMULA 121

definition of an elementary surface. We set

fW(X) n(X)=n(X;f)

I / ( 1 WIH ( *) '

Next, for Y G v(£), by FY we denote the corresponding section of T, that is, the set {Z G X(V) \ Y U Z G T}, and by arv we denote the surface measure on T y corresponding to the measure Py. Moreover, by | • \v we denote the norm on Hy(X) defined by (14.1) and by (-r)v the inner product corresponding to this norm. Note that Py and | • \v exist only locally and need not have a continuation to all X and H(X) (that is, the corresponding limit as V | G need not exist).

Now we define the surface measure ar by

(15.3)

ov(dY,dZ) = Vv-\dY){\Qn\~)-l\QfX\Qf{1)\vl<lv(Y>Z)°Tv(dZ)

= Pv-l\Qn\y1qv(Y,Z)ary(dZ).

Straightforward calculations show that this measure is well defined, that is, the definition is independent of the choice of the set V. We omit these calculations.

Similarly, for a surface of codimension k we define the surface measure ar as follows:

Here » , (*) = n,(X;}) = f!'\x)/\f!'\x)\~.

Now let us show that the measure P satisfies the assumptions of Theorem 15.1, that is,

(15.4) / dt [ S d / ^ D " 1 ^ = / * d P

for any P-integrable function # . Just as in the proof of Theorem 15.1, it sufBces to prove (15.4) with \I> replaced by tyj = Skxj. We have (the subscript j is omitted)

[ VdP = fpv~l{dY) [y(YL\Z)ln(YUZ)qv{YUZ)Pv(dZ)

= fpy-^dY) f dt

x / ^(YUZ)ln(YUZ)\Qf^\y1qv(Y,Z)ary(dY)

= fpv-\dY) f dt

x f ^(YUZn^YUZm^Cr'lQnlv'QviY^Kri^

=L• ! i/^ r( l / , I ) ^ ," , d ^••

Page 138: Local Properties of Distributions of Stochastic Functionals

122 4. POISSON FUNCTIONALS

In a similar way, we can prove the following analog of (15.1):

(15.4') / ¥dP = / dtf *[det [ ( / P . / ^ n f j J " 1 7 2 ^ .

Absolute continuity of distributions of functionals.

PROPOSITION 15.1. Suppose that f = (fu . . . , fk) G CW(il, Rk),

dimC{fl1)(X),...jl1\x)} = k

for all X G il, and £/ie probability measure P satisfies the assumptions stated in the previous subsection. Then the image of the measure Pu = P(- flit) under the mapping f is absolutely continuous with respect to the Lebesgue measure on Rk.

PROOF. In (15.4'), we set # = lff<t\ (the inequality / < t is understood coordinatewise) and obtain the following representation for the distribution function F of the measure P ^ / - 1 :

F(t) = [ dsf (d*((f?\fj1%=1))~1/2d*a.

J(-oo,t] Jf-^s) V '

Proposition 15.1 is thereby proved. •

For example, let us consider a functional of a process with independent in­crements. Let £(£), t G [0,T], be a separable stochastically continuous Rd-valued process with independent increments, and let g: Dd[0,T] —> R^ be a functional defined on some subset of Dd[0,T] containing almost all trajectories of £. By V^ we denote the measure generated by £ on Dd[0,T].

The representation (13.9) of the process £ allows us to choose a convenient probability space of the form L x 3£([0, T] x R d ) , equipped with the measure P L X P , on which we define the process £ and the stochastic functional g(£(•))• Here (L, P L ) is some probability space for the Gaussian process £i (usually, L is a vector space) and (3£([0,T] x R d ) , P ) is a configuration space with Poisson measure of intensity

n. In the sequel, for the mapping g = (<7i,. •. ,#&) we assume that the func­

tional gi'. Dd[0,T] —> R 1 satisfies the following differentiability condition for all i = 1 , . . . , k. There exists an open set DQ, V^(DQ) = 1, such that the limit

l i m 9i{y + el) - gj(y) = dfr,. e-»o e dl

exists for all y G DQ and any I G D . In this case, the functional I1-> dgi(y)/dl (we denote it by Vgi(y)) is linear for all y G DQ, and the functional (yj) >-> (Vpi(y),/) is continuous on DQ x D d .

One can readily see that these assumptions are sufficient for the functions of the form X H* gi(i(X)) to belong to the class c|/

1)(3£(S x J)) , S = [0,T], I = R d \ { 0 } , for any fixed value of the Gaussian coordinate and for any open set F c [ 0 , T ] x R d

such that the projection of its closure on Hd is bounded away from zero. Now let us write out explicit expressions for the derivatives of the functions of

the form X h-» gi(£.(X)). For simplicity, we restrict our consideration to the case

Page 139: Local Properties of Distributions of Stochastic Functionals

§15. THE GAUSS-OSTROGRADSKII FORMULA 123

d = 1. For h G HV(X)) we have

±9i(t.(X + eh)) = (vft(£(X)), E &(*.»))

(15.5) s<

= E (VftK.ra.i^Oj^y). (s.y)GXnV

The expression on the right-hand side in (15.5) is finite, since \X n V\ < oo. Now let us construct the "tangent space" H(X) at each point X G £ by

introducing the norm | • |~ on this space as follows:

!*£<*)=( E ^(^^(min^2,!))-1) 1/2

In what follows, we show that for P-almost all X G X and any h G H(X), the sum on the right-hand side in (15.5) is finite for V = G = [0,T] x (R1 \ {0}) as well. Indeed,

£ (V9i(i(X)),l[sT](.))h(s,y) (s,y)€X

<( £ (%(e(i)), iHo)2^,i))1 / 2( £ ^^)1/2.

Note that sup | ( V ^ ( e W ) , l [ s T ) ( - ) ) | < o o

for any fixed X. On the other hand, J V j3/)Gx mm(2/2> 1) < °° f° r almost all X. Thus, we have shown that the functions X i—> gi(£.(X)) belong to the class CW(X{G)))t so that

(15.6) ( f t t tW)) ( 1 ) (* ,v) = (V^(e(X)) , l [ S ) T l ( . ) )min( |y | 2 , l ) .

Now we can use Proposition 15.1.

THEOREM 15.2. Suppose that the measure II is absolutely continuous with re­spect to the Lebesgue measure on [0,T] x Rd ,

(15.7) j * [ U(dsdx) = oo

for any distinct t\,t<i G [0,T], and the functional g: Dd[0,T] —* Hk satisfies the condition

(15.8) dim£{V^i(»)>...>V f fJb(y)} = fc

for V^ -almost all y. Then the distribution of the stochastic functional #(£.) is absolutely continuous

with respect to the Lebesgue measure on Hk.

Page 140: Local Properties of Distributions of Stochastic Functionals

124 4. POISSON FUNCTIONALS

PROOF. For simplicity, we consider only the case d = 1. First, let us show that the assumptions of the theorem imply

(15.9) d im£{( 5 l otfiX), ...,(9ko £) (1) (X)} = k

for P-almost all X. Indeed, it readily follows from (15.6) that (15.9) is equivalent to linear inde­

pendence in H(X) of the vectors Ui = {^(x), x = (s, y) G X} , % = 1 , . . . , fc, where

Ui(x) = (V^(e) , l [ S ) T ) (0 )n i in (M 2 , l ) .

Since (15.8) holds and the functions 5 H-» (V<7;(£.(.X)), lr T,) are continuous, it

follows that there exist points «? , . . . , sjj and a number e > 0 (obviously, depending on X) such that for any s i , . . . , s/., Sj G (s® — e, s^ + £), the vectors

(15.10) 6 i = ( (Vf tK .W) , l [ , l i 3 1 ( - ) ) , . - - , (Vf t ( eW) , l [ a i k , r , ( - ) ) ) €R f c ,

i = 1 , . . . , fc, are linearly independent. Moreover, by (15.7), P-almost all configura­tions X satisfy the condition X D [(s^ - e, s!- + e) x R1] ^ 0 for all j = 1 , . . . , fc.

This fact together with the linear independence of the vectors bi implies the linear independence of the vectors Ui and (15.9) (the details are left to the reader).

Now it follows from Proposition 15.1 that the conditional distribution of the stochastic functional #(£.) is absolutely continuous (provided that the trajectory of the Gaussian process is fixed). Hence, the unconditional distribution is also absolutely continuous. D

The Gauss-Ostrogradskii formula. In what follows, we return to the con­sideration of the "usual" situation. Thus, X is the configuration space on S x J, the metric in i" is Euclidean, the norm on H(X) is defined by (14.1), and P is a Poisson measure of intensity n x A.

Suppose that ( ^ o f ) is a Poisson space and it is an inner r-open set. The set it is called a set with smooth boundary if for each configuration X G diX there exist a r-neighborhood of i lx , a function / G C^^(ilx)) and a finite-dimensional projection Q such that

l ) i l n i l x = { y G i t x | / ( F ) < 0 } ;

2) inf | Q n ( y ; / ) | > 0 . YexXx

Since the topology r is Lindelof, one can readily see that <9il is a hypersurface. We also note that for each set with smooth boundary, the field of the outward normal n is well defined by the formulas n(Y) = n(V; / ) , Y G itx-

If it is a set with smooth boundary and K G C^^ i l " ) is a vector field, then the relation

(15.11) [ trK^dP= [ {K,n)d(jd^

where GQH is a surface measure on <9il, is called the Gauss-Ostrogradskii formula. The conditions under which the Gauss-Ostrogradskii formula holds are stated

in the following two theorems.

Page 141: Local Properties of Distributions of Stochastic Functionals

§15. THE GAUSS-OSTROGRADSKII FORMULA 125

THEOREM 15.3. Suppose that K is a finite-dimensional vector field, that is, QoK = K for some finite-dimensional projection Q$. Then the inclusions tr K^ G L ^ i ^ P ) and (K,ri) G L1(9il,crau) are sufficient for (15.11) to hold.

THEOREM 15.4. Suppose that

KW(X) G £ ( i )(H(X)), / | | # ( 1 ) I M P < oo

for any configuration X G it. Then the inclusions \K\ G (J<5>oL l + 6( i l>p) and \K\ e Ll(d^adn) are effi­

cient for (15.11) to hold.

P R O O F OF THEOREM 15.3. First, we assume that the union of il with finitely many r-open sets i l 1 , . . . , i tn from the definition of a set with smooth boundary contains the set {X G il~ | K(X) ^ 0}. Let fo and Qi (i = l , . . . , n ) be the corresponding functions and orthogonal projections from this definition. We choose some finite-dimensional orthogonal projection Q = Q(V) > Qo V Q\ V • • • V Qn for which V = B x A is open. We can write

[ trK^dP = J T / t r t f W d P , il* = {X G il | \X n V\ = 1} Ju jr^ JUL

and factor the measure P in each term

/ tvK^dP = L-n^ [ lili *• JX(G\V)

[ trK^dP = ^e-n{v) f Pv-x{dZ) [ ir(Mi)...ir(Ml) Jill '• JX(G\V) JBl

x / lidi{X)trKi<1\X)drl...dri

(we again use the notation X = Z U {(#i, n ) , . . . , (0j, r/)} and Z = v(X)). Let us perform some transformations in the inner integral. We set

il? = { ( r i , . . . r f e ) G A ( | X e i l ( }

(here and in what follows, we omit the dependence on the outer variables Z,

0 i , . . . , 0i) and define a vector field Kl: U° -> (R m ) ' by

(Kl(ru...irl))j=K(X)((9j)rj% j = l , . . . , l ,

where ilj is the Euclidean closure of il^. We also set

ilj = {(n,...,n)eA'|XGit;, \xnv\ = l}) ; = i,...,n, and consider a (smooth) partition of unity y>o» y>i> • • • > V>n subordinate to the open cover {ilj}f=0 of the set {Kl ^ 0}. Then the desired integral is equal to

/ t rK™(X)dn . . . d r t = V / ti{kl(pi)wdr1...drl.

We apply the Gauss-Ostrogradskii formula to each term on the right-hand side. Then the term with i = 0 disappears. Indeed, the boundary of the set il^ contains two parts, the boundary dtti and the boundary dAl of the cube. On the first part, (po and hence Kl(po vanish. Since the vector field K is finite-dimensional (Q0K = K)) it follows from the inequality Q > Qo that the normal component of

Page 142: Local Properties of Distributions of Stochastic Functionals

126 4. POISSON FUNCTIONALS

the field Kl is zero on the second part of the boundary of the set iif. Thus, the integral over the boundary of il^ is zero.

For i > 1, similar arguments show that

/ tr(lifVi)(1) dn ...drm= f (K<pu ^ ) da = f (Kipun)\Qn\-1 da. Ai°rui> J \ \Qn\J Jdu*

Here a is the Euclidean surface measure. In the passage to the last integral, we have applied the relations QK = K and supp</?i C itj.

By combining all these formulas and using the definition of the surface measure adix> w e a r r r v e a t t n e Gauss-Ostrogradskii formula (15.11).

Before we consider the general case, note that the previous arguments remain valid if we assume that (Kl)j is continuously differentiate only with respect to Tj (j = 1 , . . . , I). In this case, we can again refer to the classical Gauss-Ostrogradskii formula.

Let us consider the general case. We choose some finite-dimensional orthogonal projection Q = Q(V) > Qo for which V = B x A is open and an increasing sequence {F^} of measurable compact sets in (v(Xo),v(r)) such that Pv~1{F^l} -» 1. We also note that S contains an increasing sequence {Bn} of compact sets such that Bn C B and ir(Bn) —> TT(B) (this is a consequence of the fact that all finite measures on 05(5) are regular). We set

K = {Ye X(Bn x A) | | F | < n}, Fn = iv{F£ x F'n)

(recall that the mapping iy is defined by the relation iy(Y, Z) = YUZ). Obviously, the sets Fn are compact in the topology iy (r(V) x v(r)) and, therefore, in the topology r . Moreover, P n { F n } —• 1.

Consider the vector fields Kn = K1F . By the above-treated special case and the subsequent remark for the fields Kny we have (15.11). By passing to the limit with respect to n, we prove (15.11) for the vector field K. The proof of the theorem is complete. •

P R O O F OF THEOREM 15.4. Let gn be the functions introduced in the proof of Lemma 15.1. We set Kn(X)(x) = gn(x)K(X)(x). Obviously, the vector fields Kn satisfy the assumptions of Theorem 12.3, and hence, (15.11) holds for Kn. By passing to the limit, we prove (15.11) for the field K. •

PROBLEM 15.1. Suppose that g: D[0,T] —» Rfc is an integral functional, that is,

9{x{-))= / u(x{s))ds= ( / ui(x(s))ds,..., / uk(x(s))dsY

where the functions Uj G C^^(R) satisfy the following condition:

for Lebesgue almost all £! , . . . ,£ / . . Prove that g satisfies assumption (15.8) in Theorem 15.2.

Page 143: Local Properties of Distributions of Stochastic Functionals

§16. SMOOTH FUNCTIONALS 127

§16. Smooth functionals

Suppose that / : il —> R 1 is a smooth function (in the sense of §14) denned on a subset of the configuration space X(S x / ) and P is a Poisson measure of intensity IT x A. To obtain some information about the smoothness of the density of the measure P / _ 1 (Theorem 15.2 gives existence conditions for the density), it is convenient to use the method of differential operators (see §6).

Here the differential operators used in Theorem 6.1 will be the operators of differentiation along vector fields. Specifically, to a measurable vector field K: X \-^ K{X) e H{X) we assign the differential operator D defined by

(16.1) (Dg)(X) = {gW(X),K(X))H{xy

To obtain conditions under which the measure P belongs to the classes W^D) introduced in §6, we need integration by parts formulas.

PROPOSITION 16.1. Suppose that a function f and a vector field K satisfy the following conditions:

(l) feC^(X),KeC^(xy

(2)f.KeCW(X-);

(3) the field K is finite-dimensional, that is, QK = K for some finite-dimen­sional projection Q\

( 4 ) / ( | ( / ( 1 ) , K ) | + | / t r ^ 1 ) | ) r f P < o o . Then

(16.2) f(f',K)dP = - f ftvK^dP.

PROOF. Equation (16.2) can be rewritten in the form

tv(fK)w dP = 0,

which is a special case of the Gauss-Ostrogradskii formula. • / '

Now we assume that the vector field K defining the differential operator D belongs to the class C^l\X).

We define functions ao, a i , . . . , ai: X —> R 1 as follows:

(16.3) a0 = 1, Oi+i = (a^\ K) + aitrK{1) = Dai + trK{1).

One can readily show that a* (1 < i < I) is a linear combination of the expres­sions

(16.4) (trK^1 >)* (DtrK™)02... (D^1 t r i f ( 1 ) ) ^ ,

where /?; > 0 and X)}=i JPj = i-Set

Ai = {ge C « ( £ ) | V j = 1 , . . . , i (a^D'-ig)*: G C ^ a T ) ,

(16.5) J [ l ^ - ^ K h - ! tvK^\ + K a ^ . l O I ) ] ^ < oo,

/|D i^'+1^||aj_i|dP<oo}.

Page 144: Local Properties of Distributions of Stochastic Functionals

128 4. POISSON FUNCTIONALS

PROPOSITION 16.2. Suppose that a finite-dimensional vector field K belongs to C^l\X) and D is the differential operator (16.1). Then the measure P belongs to w[l\D I Ll(P),Au .. .,A{), where A\y... ,Ai are defined in (16.5). Moreover, d\ii = aidP are the representing measures of the junctionals DlP on Ai, i = 1 , . . . , / .

To prove this theorem, it suffices to note that for g £ Ai relations (16.2) and (16.3) yield

(16.6) f DlgdP = • • • = (-iy fajD^gdP = • • • = (-1)* f aigdP.

To state the following proposition, we use the multi-indices introduced in §8. Suppose that I > 1 and d > I is an even number. Let pi = (d/(d — i ) , . . . , d/(d — i)) be a multi-index of length i + 1 (i = 1 , . . . , l)) and p = (d , . . . , d) a multi-index of length I + 1.

PROPOSITION 16.3. Suppose that V c G is an open set, K e H^° , and D is the corresponding differential operator (16.1). Then for Pv~l-almost all Y (the mapping v is defined by (14.1)), the conditional measure P y belongs to the class

W (D | L 1 (Py ) ,H^ 1 , . . . ,H^!). Fori = 1,...,Z, the representing measure fa of the functional DlPy is absolutely continuous with respect to P y and satisfies

d\ii = ai dPY, ^ e Ll/i(PY),

where the functions ai are defined in (16.3).

Before we prove this proposition, let us make two remarks. First, the definition of ai contains the traces of some operators. The proof implies that these traces are defined almost surely (recall that by our definitions these operators are only Hilbert-Schmidt operators). Second, a similar statement for the measure P readily follows from Proposition 16.3.

COROLLARY. Under the assumptions of Proposition 16.3, the measure P be­longs to the class

W[(D | LX(P), i l i , . . . , 4 ) , where Ai = { ^ G H J ; | f(ai-jDjg)dP < oo, j = 0 , 1 , . . . ,i), i = 1 , . . . , / .

To prove this corollary, it suffices to integrate the relation

f D'gdP = • • • = ( - 1 ) ' fajD'-igdPy = • • • = (-1)* jai9dPY

- l with respect to the quotient measure Pv

P R O O F OF PROPOSITION 16.3. First, we prove two auxiliary lemmas. The first lemma can be derived from the Stokes theorem and hence can be applied to a wider class of integration domains. To state this lemma, we need some auxiliary notation. By the letter r we denote a permutation of the numbers 1 , . . . ,d, and Y^T denotes the sum over all d\ permutations. By Y(r) we denote the set of cycles comprising the permutation r . For a cycle 7 = ( i i , . . . , is) G T(r), we write Yliei Ai instead of A^ • . . . • Ais. (Since we consider matrices, the order of the factors is important). By il we denote a bounded open parallelepiped in R r . A set of mappings v\,..., Vd: it —> R r is said to be admissible for it if all these mappings

Page 145: Local Properties of Distributions of Stochastic Functionals

§16. SMOOTH FUNCTIONALS 129

belong to the class C^\ can be continuously extended to dtt together with their derivatives, and possess the following properties: if x G dii is not a corner point (this means that the normal n to dii at this point is determined uniquely), then

1) (vun)(x) = 0, i = l , . . . , d ; 2) (^1}(^),n)(a;) = 0, ij = 1 , . . . ,d, i ^ j .

LEMMA 16.1. i4ny se£ v i , . . . , v^ o/ CK2) mappings admissible for it satisfies the relation

(16.7) B-1)1™ / II trdl^^O.

From now on, integration over il is with respect to Lebesgue measure.

P R O O F OF LEMMA 16.1. For d = 1, (16.7) becomes

(16.8) / tru ( 1 ) = 0 . Ju

Indeed, we have the more general relation

(16.9) / / W 1 ) = - / ( / ( 1 ) , v ) , Ju Ju

which can be obtained by applying the Stokes theorem to the vector field fv. The conditions on v imply that the surface integral is equal to zero.

For an arbitrary d, the lemma is proved by induction. We restrict our consid­eration to the passage from d = 2 to d = 3 (the passage from d = 1 to d = 2 is obvious and hence not instructive).

It follows from (16.9) that

[tTvPtTvPtiv™ = - [ ( ( t r ^ t r t ; ^ ) ^ , ^ ) Ju Ju

= - / [ t r t ^ t t o j t r t / ^ +tTv£)(v3)tTv[1)] Ju

= - / [ t r ( ^ 1 ) ( ^ 3 ) ) ( 1 ) t r 4 1 ) + t r ( ^ 1 ) ( ^ 3 ) ) ( 1 ) t r ^ 1 ) ] Ju

+ / , [ t r ( ^ 4 1 V r 4 1 ) + t r ( ^ M 1 V r ^ 1 ) ] . Ju

By applying the induction assumption (for d = 2) to the first two terms and by using (16.9), we obtain

Ju

+ / [ t r ^ ^ V ^ + t r ( ^ 1 ) 4 1 ) ) t r ^ 1 ) ] Ju

= - f {tr[«{2)(«3)t41) +v[%i1V21)] +tr[t42)(V3)v[l) +vi%i%{1)]}

Ju

+ / [tv(v[l)vil)) tr vP + tviv^v^) tr v[l)] Ju

Page 146: Local Properties of Distributions of Stochastic Functionals

130 4. POISSON FUNCTIONALS

= - / ( ( t r M 1 ^ ) ) ^ ) , v 3 ) - / [ t r ^ M 1 ^ ) + tv(vi%i%^)) Ju Ja

• / [tr(vi1)^1)) t r ^ 1 } + t r (4 1 } 4 1 } ) trvj1*]

'[tr(t;i1)t41)t41)) + tr(«i1)t41 ,t41))] /u

+ [MvPv^tTvP +tT(v[1)vi1))tVVi21) +tl(vi1)vi1))tTv[1)]. D

/J

« ( 1 ) l l i

LEMMA 16.2. Suppose that, under the assumptions of Lemma 16.1, d is even and Vi = v,i = lt...,d. Then

(16.10) [(trvWfKcafWvWWi Ju Ja

with some constant Cd independent of it and r (here || • H2 w tfie Hilbert-Schmidt norm).

P R O O F OF LEMMA 16.2. We consider only the case d = 4; the general case can be considered in a similar way. We set

a= f(tvv^)\ 0= [ \\v<

By Lemma 16.1, we have

a = 6 / t r ( ( ^ ) 4 ) - 8 / t r ( ( ^ ) 3 ) W ^ + 6 [ ti{v^)2(tvv^)2-3 / ( t r ( ^ ) 2 ) 2 . Ju Ju Jn Ja

Since the absolute value of the trace of any operator does not exceed the trace norm of this operator, we have the estimate | tr(v^^)n | < Uglily for any n > 1. Now we apply the Holder inequality and obtain

a < 8a1/4/?374 + 6a1/2/?1/2 + (6 + 3)/3 < 9(a1/4/33/4 + a1/2/?1/2 + /?).

Thus, a < 729/3. •

An obvious approximation shows that to ensure (16.10), one need not require the second derivatives to exist.

Now let us return to the proof of Proposition 16.3. First, we consider the case 1 = 1.

Suppose that / G H y *' ^ ~ \ We choose approximating sequences {Vn}y

{/n}, and {Kn} for V, / , and Ky respectively; let Qn be the orthogonal projection corresponding to Vn. By Proposition 16.1, we have

(16.11) £yU?kl\Kn) = -EY(fntvK^)

for P t r^a lmos t all Y. Since fnl) -» / in L ^ " 1 ) and Kn-> K in Ld , it follows

that the left-hand side of (16.11) converges to Ey(/ , K). To pass to the limit in the expression on the right-hand side in (16.11), we need to show that the sequence {tvKn '} is fundamental in Ld. Let us choose n and n± (n < n{) and estimate Ey(trifni — tr Kn ) d . Let us additionally assume that all sets Vn have the form Vn = B x J , with B e S and J C I. We shall see that there is no loss of generality due to this assumption. We have

Page 147: Local Properties of Distributions of Stochastic Functionals

§16. SMOOTH FUNCTIONALS 131

EY(trtf£> - tr * # > ) " = E y ( t r [ < ) o Q n i - tr *#> o Qn])d

= /(py<)(^)f;&)i:e-n(v'„1) • r=0

x / ( t r [Kg o Q n i ( I u { i 1 , . . . , x r } ) - K < 1 ) o Q n ( X U { a ; 1 i r } ) ] ) '

n(cfai). . .II(dx r)

tn(vni)]* V(d0r)

r=0 ~ ] { Y n J [ ]h H JBT W*)]' x y ^ ( t r [#£> o Qni (X U { ( « ! , n ) , . . . , (er,rr)})

- X W o Q n ( X u { ( ^ 1 > r 1 ) , . . . , ( ^ ) r r ) } ) ] ) d d r 1 . . . d r r .

By applying the estimate (16.10) to the inner integral fJr and transforming the resulting expression backward, we obtain

E y ( t r t f « - tr K^)d < cdEY\\K$ o Qni - K™ ° Qn\\*.

By the definition of an approximating sequence, the right-hand side of the last estimate tends to zero as n, rt\ —> oo. Thus, we have proved that in (16.10) we can pass to the limit as n —> oo. This proves our proposition in the case 1 = 1.

For an arbitrary Z, we need to pass to the limit in the chain of equations

EY(Dlnfn) = '•' = (-lYEyia^D^fn) = • . . = ( - l ) ' E y ( a z , n / n ) .

Here Dn is the differential operator defined by the field Kn and a i , n , . . . , a^n is the sequence of the form (16.3) that corresponds to the field Kn. This passage to the limit can be justified just as in the case 1 = 1. Proposition 16.3 is proved. •

By using Theorem 6.1 and Propositions 16.2 and 16.3, we can obtain some results about the smoothness of the image of a Poisson measure P under the action of smooth (in the sense of §14) functionals.

There are differential operators of two types. The operators of the first type are defined by "finite-dimensional" vector fields (see Proposition 16.2) and the operators of the second type are defined by "infinite-dimensional" fields that satisfy some restrictions on the moments (see Proposition 16.3).

Sometimes it is necessary to use differential operators that are sums of first-and second-type operators. We present the corresponding statement.

THEOREM 16.1. Suppose that I > 1 and f: X —> R 1 is a measurable mapping. The following conditions are sufficient for the distribution P / _ 1 to belong to the class W j .

1. / e C ' + 1 ( * ) .

2. There exist open sets V, V G G (V is bounded) and a vector field K = Kx + K2 such that KUK2 G C^(X~), Q^ = Ku QK2 = K2 (here Q and Q

Page 148: Local Properties of Distributions of Stochastic Functionals

132 4. POISSON FUNCTIONALS

are the orthogonal projections corresponding to V and V, respectively), and the following properties hold4:

(a) P{Df ? 0} = 1; (b) /Gn p <oo H v^ = (Po ,P i , . . . ,P /+ i ) ; Kef]p<oou

pv,p = (p0)pu...,piy)

(c) we have (gai(Df)~l)K G C(X~) {the sets St,i were defined in §0) for all t = 0,...J-l,i = 0,...9t,andgeStti(D, fj.

3. All mathematical expectations of the form

E\(Df)-l-W(D2f)k* . . . (Dwf)kl(tiK^)^(DtiK^)^ . . . (D*"1 tiK™)*], l i

i = 0,...,i, ^2jkj=l-it ^2jPj=h 3=1 3=1

are finite.

PROOF. This theorem readily follows from Theorem 6.1 and Propositions 16.2 and 16.3. •

Now we use this result for studying the distribution of quadratic functionals of a stable vector.

Suppose that B is a separable Banach space, \x a stable measure on B with the exponent a (0 < a < 2), / : B —> R 1 a quadratic functional of the form

f(u) = {Au,u) + (b\u),

where b* G B*, and A: B —> B* a continuous self-adjoint linear operator. Let #o be the affine support of the measure \i. There exists a unique operator

A0: B0 -> B% such that (Abub2) = (A0b1)b2)y bub2 G B0.

THEOREM 16.2. If dim A0(B0) > 21, then \xf~l ew[.

PROOF. By S we denote the unit sphere in B; that is, 5 = {b G B | ||fr|| = 1}. Furthermore, consider the configuration space X on G = S x (0, oo) and a Poisson random measure v of intensity II = 7r x A, where A is the Lebesgue measure on (0, oo) and ir is the spectral measure of a stable vector (see §13).

Just as in §13, we assume that the random measure v is determined on the probability space (3£, E, P) by the identity mapping X —> X. The Poisson random measure v allows us to construct a stable random vector u with distribution \x by using stochastic integration:

(i6.i2) ._*,+.-{ / F$jk*w+ / F5§k*<4 {x|r(x)>l} {x|0<r(z)<l}

Here 0(x) G S and r(x) G R 1 are the coordinates of a point x G G. By the change of variables r i—• r~ a , we can obtain the stochastic integral (16.12) from the usual representation (13.6) with u\ = 0 and the Levy measure (see Problem 13.4). We slightly simplify the notation in (16.12) and write \x\ instead of r{x)\ thus, x = (0(x), \x\). Now we define a mapping / : X —> R 1 by the formula f(X) = f(u(X)). Then we have the obvious equation /if-1 = P / _ 1 , which allows us to use Theorem 16.1 for studying the smoothness properties of the measure \if~x (as

4In this theorem, D is the differential operator (16.1) corresponding to the field K.

Page 149: Local Properties of Distributions of Stochastic Functionals

§16. SMOOTH FUNCTIONALS 133

was already shown in §14, for any n = 1,2,... we have / G Cn(3£o), where Xo is some measurable saturated set of full measure.) First, let us obtain an explicit expression for the first derivative (gradient) of / . By (14.5), we have

(16.13) (fW(X))(x) = --^±_-{2Au + b*,9(x)), u = u(X).

In the statement of Theorem 16.1, we set V = S x (0,2) and V = S x (l,oo). Obviously, / G f)p<oo ^-v- Now, to verify assumption 2 of the theorem, we need to choose an appropriate vector field. To this end, we choose and fix some infinitely differentiable function p such that p(z) = z~l~lfa for z > 2, p(z) = z1+1/<* for

z < 1, and p is bounded away from zero on [1,2]. We construct a vector field K by setting

(16.14) (K{X)){x) = -a2(2Au + b\6(x))p{\x\).

It is easy to see that the field K can be represented as the sum K\ + K^ of vector fields that satisfy assumption 2 of Theorem 16.1. It suffices to take some partition of unity {</?i, ^2} corresponding to the cover ((—00,2), (1,00)) of the real axis and set

(ffxPOXa:) = (K(X)){x)Vl(\x\), (K2(X))(x) = (K(X))(x)M\x\)-

The motivation of this choice of K is quite obvious. We take an "almost" gradient of / (it is well known that the gradient of a function is the direction of its steepest descent) rather than the gradient itself, because the gradient has singularities at the points where \x\ = 0. The function p is used just to get rid of these singularities.

Thus, to prove the theorem, we need to verify assumptions 2 (a), 2 (c), and 3 of Theorem 16.1. Assumption 2 (c) is the simplest since p rapidly decays at zero (the details are left to the reader).

Let us verify assumption 3. In particular, we shall prove that the Zth-order moment of {Df)~l exists, which will imply the validity of assumption 2 (a).

First, we introduce some notation. By c we denote various positive constants whose precise values are inessential to us. By M^ we denote the set of random variables on (3£, E, P) that have moments of all orders.

By (16.13) and (16.14), we have

Df(X) = {fM{X)tK(X))H(x) = £ ^ ^

The properties of the function p imply the following lower bound for Df:

Df > cR2, where c> 0,

(16.15) Hf- StM. + r,*))'* £ ^/"ff1"'. \x\<2 \x\>2 ' '

Now let us obtain an upper bound for the higher derivatives (in the direction of the field K) of the functions / and tr K^l\

Page 150: Local Properties of Distributions of Stochastic Functionals

134 4. POISSON FUNCTIONALS

LEMMA 16.3. For any j = 1,2,... and £ € M^, the following conditions hold: 1) |£>J7I <^R2(l + R)j-\ 2) \DHxKW\<Z(l + R¥+K

PROOF. The statement is obvious for j = 1. For j = 2, we have

D2f = D(Df)

„ v-^ (2Au + fr*,fl(s)) u ( 2 A i + 6»,fl(y)) ,. .. = 2 L —uu+i /a y/p(l»l)<2^(y).g(«))—yiTiT^—P(M)

- a 2 £ (2Au + b*,6(x))3>c'(\x\)p(\x\) = AX+ A2, x€X

Let us estimate the first term:

£11+17= ^ D ,

V |x|<2 |x|>2 ' ' 7

< c( cardpsT n 5 x (0,2]) ^ (2Au + 6*,0(x)}2

\x\<2

|x|>2 ' ' |x|>2 ' ' '

Let us estimate the second term. Since x'([0,1]) = 0, we have

\A2\<c( £ |(2AU + 6 * ) ^ ) ) | 3 + ^ | ( 2 ^ u + 6 * ) ^ ) > | 3] - ^

M<|x|<2 |z|>2 ' '

<c( max \(2Au + b*,e(x))\ £ <2i4u + &*,0(aO>2

\ {* l _|x|_ i l<|x|<2

|(2A* + &*,fl(s))| ^ ( 2 ^ + b*,g(a ;))2\ 2 +{.a R^ ^ N ^ ; -* '

Thus, we have obtained the desired estimate for j = 2. For j > 2, the proof is similar. We differentiate the function Df along K appropriately many times and estimate each term by one of the two methods. Namely, if the sum over x G X contains the first power of the expression (2Au + &*, 0(x))) then we use the Holder inequality. If this expression is raised to a power j > 2, then we replace the factor \{2Au + b*,6{x))\i-2 (for 1 < |x| < 2) or \(2Au + b\6{x))\^2 |x|-(i+i/<*)0-2) (for

\x\ > 2) by its maximum over {x | 1 < x < 2} or {x \ \x\ > 2}, respectively.

Page 151: Local Properties of Distributions of Stochastic Functionals

§16. SMOOTH FUNCTIONALS 135

Now let us prove the second statement of Lemma 16.3. By (16.14), we have

- a2 £ ( 2 A u + b\0{x))p\\x\)h{x)g{y) xex

for h,g G H(X). Hence,

tr^(1) = 2 £ W y ^ - o ' {JMu + ft'.^MW) i € X ' ' xex

<Z(l + R), eeMoo.

The estimates for the higher derivatives of t r i ^ 1 ) can be obtained just as in the proof of the first statement. •

Now let us verify assumption 3 of Theorem 16.1. We need an upper bound for products of the form

(16.16)

where

( D / ) _ / - ^ = i k* (D2f)kl . . . (Dwf)kl

i i

^2jkj=l-i, ^TfjPj=h i = 0,l,...,Z. j=l j=l

By (16.15) and Lemma 16.3, we obtain the upper bound

^R-W+ZWtfZkj (l + R)ZJkj (l + R)Z3Pi =^1 JR-2 I(1 + JR)I> & EMoo,

for (16.16) for all i = 0 , 1 , . . . , / . It follows from this estimate that the existence of the 2(Z+<5)th-order moment of

R~l for some 6 > 0 is sufficient for the expectations in assumption 3 of Theorem 16.1 to be finite.

LEMMA 16.4. For some 6 > 0, we have Ei?~2( /+6) < oo.

PROOF. Consider the subset So C S of 0 such that ir(Uo) > 0 for any neigh­borhood U$ of 0. Since Bo is the afEne support of \x, it follows that Bo coincides with the closure of the linear span So up to a translation by some vector.

Furthermore, the condition dim A ) B Q > 21 implies that there exist m = 21 + 1 linearly independent vectors 0J , . . . , 0 ^ G So and m linearly independent vectors ^i>--->0m e ^o such that the matrix (A0f,0f)™j=1 is nondegenerate. The sets {Of} and {Of} need not consist of different vectors. Suppose that the number of coincidences is equal to m — mo, 0 < mo < m (that is, m — mo = card{z | Of = 0® for somej}).

We renumber the vectors in both sets so that Of = Of for all i = mo + 1 , . . . , m. Next we choose disjoint neighborhoods U\,..., Um, U\,..., Umo, t/mo+i =

Umo + l) • • • > Um = Um Of 0\, . . . , 0 m , 0\, . . . , 0mo, 0 m o + l = 0 m o + l > • • • ) 0m =^rn SO

that the matrix (AOi, 0j)™j=1 is uniformly nondegenerate with respect to all Oi G Ui

Page 152: Local Properties of Distributions of Stochastic Functionals

136 4. POISSON FUNCTIONALS

and 6i e Ui. We also note that ir(Ui)7r(Ui) > 0 for all i. Furthermore, for ni > 2, . . . , n m > 2,m > 2, . . . , n m o > 2, by F(nu...,rim) (respectively, by F ( n i , . . . , nm o)) we denote the set of configurations X for which the point x of the configuration X H (Ui x [2,oo)), i = 1 , . . . ,ra, (respectively, of X fl ([/* x [2, oo)), i = 1, . . . ,rao) with the least real coordinate lies in [/* x (ui^rii + 1] (respectively, in [/; x (rii,ni + 1]). Note that the events F{jt\,... , n m ) and F ( n i , . . . ,n m o ) are independent for any sets of numbers ( n i , . . . , n m ) and ( n i , . . . , n m o ) .

We have

^ / ^ (2Au + b*,0(x))*y^6\ ~ Z^f ny Z^f \X\2+2/6 J J-F(n1 , . . . ,nm)± i7 ,(n1 , . . . ,nm o)-

wi,...,nm |x|>2 ni, . . . ,nm o

Let us separately estimate each term in the last sum. To this end, we represent lF(ni,...,nm) as the product of two independent indicators l i • I2, where l i is the indicator of the event {X fl (Uj=i Uj x (2> nj]) = 0}> and I2 is the indicator of the event {X fl C/j x (njyrij + 1] 0 for all j = 1 , . . . ,m}.

We have

F(ni,. . . ,nm)1^(^i»---»^m0) ^ni,...,nmo \ ^ |d2+2A* /

v (2Au + b;o{x))*yli+s> 1^ | z | 2 + 2 / a ^ 1 l 1 2 J - F ( n i l . . . l n m o )

xexnp

where THQ

P=(U^x(ni>ni + 1 ] ) n ( U UjXinj.nj + l]). .7=1 j = m 0 + l

In turn, the last summand does not exceed

(16.17) j=1 j=1

x E ( £ ( 2 ^ + 6* ) 0(x) ) 2 ) " ( ' + 4 ) l 1 l 2 l i , ( n i n m o ) .

We represent the last mean value as the integral of the conditional mathematical expectation under the condition that the configurations are fixed outside the set Uj=i Uj x (rtj^rij +1], in each set the number of points of a configuration is equal to

Uj x (rij^rij +1], and finally, in each of these sets all points of a configuration except one are fixed and the first coordinate of this exceptional point is also fixed, that is, 6 G S. Since the values of a Poisson measure on disjoint sets are independent and II(d0, dr) = 7r(d6) dr, it follows that the free coordinates that correspond to the sets Uj x {njyrij + 1] are uniformly distributed on (n^, rij + 1] and are independent of each other.

Page 153: Local Properties of Distributions of Stochastic Functionals

§16. SMOOTH FUNCTIONALS 137

Now we rewrite the conditional expectation as the integral over the distribution and obtain an upper bound for the integrand by taking only one (arbitrary) term in each sum over x G X fl (Uj x (rtj^rij + 1]) as well as in the sum over x G X fl (Uj x (njyrij + 1]). We obtain

/ ? £ -^r \ (2+2/a)(Z+6)

(i6.i8) A£;:::;£» < (IlK- +1) l i f t +*>) Euinni_nrno)i2 3=1 3 = 1

dr\... drm

((2Au + b*,O1)* + -.- + (2Au + b*i0rn)2y+6'

n^=i(«j»«i+ ii

Note that by construction we have 0; G E/i, z = l , . . . , r a . Let us estimate the inner integral by a constant independent of the conditions. By (16.12), u can be represented in the form

(16.19) u = l(J^ + ...+ 0%.]+Uu 1/a

where 6i G Ui corresponds to the unique point x = (6i,ri) G £/; x (n^,?^ + 1] whose second coordinate is free. In this representation, the vector u\ depends only on the assumptions.

Now we substitute (16.19) into the inner integral in (16.18) and perform the change of variables r3- = sJQ. We obtain (16.20)

/ {(2Au + 6*,0i)2 + • • • + (2Au + 6*,0m)2)~ (Z+6) dn ...drm

<amY[(nj + l)1+1/a J ((-A^s^) +2Au1 + b*,61)

3=1 nr=i("71/Q-("i+ l)-1/a] J = 1

d s i . . . dsm. -1=1 '

m 2N -0+6)

E ~ 3 = 1

Let us perform another change of variables,

2 m

ti = - 2 ^ , ^ ) ^ + (2Am + b\0i). 3=1

Since the matrix (A6j,6i)™j=i is uniformly nondegenerate, it follows that the Jaco-bian of this change of variables is bounded above by a constant that depends only on a. Hence, the right-hand side of (16.20) has the upper bound

f dt\... dtm CQ /r i CPm t2)l+6'

Since m = 21 + 1, it follows that the last integral is finite for 6 < 1/2.

Page 154: Local Properties of Distributions of Stochastic Functionals

138 4. POISSON FUNCTIONALS

Summarizing, we obtain the upper bound

/ S - -^r N(2+2/a)(H-«)

AH:;:::;~i°<c(riK + 1)n^+1)) (16.21)

x n ( ^ + i ) i + i / a / i i d p - p { ^ i . - - - . ^ m o ) }

for A-1 '•'~mo. On the other hand, we have the estimates n i , . . . , n m >

/m

l i d P < c e x p < — /]njir(Uj) [>,

P { F ( m , . . . ,n m o )} < cexp | - ^ ^ ( C / , ) j i = i

for / l i dP and P { F ( n i , . . . , n m o ) } . These estimates together with (16.21) readily imply that the series A~*''*"'~m° converges. This completes the proof of Lemma 16.4 and hence of Theorem 16.2 •

§17. Dis t r ibut ion of t he no rm of a s table vector

Suppose that (B, || • ||) is a separable Banach space and u e B is a random element of B whose distribution is stable with exponent a, 0 < a < 2. We set F(x) = P{\\u\\p < x} for p > 0. The aim of the present section is to study the smoothness of F. The methods introduced in §6 allow us to obtain sufficient conditions for the existence and boundedness of the derivatives of F in a wide class of spaces.

First, let us describe the conditions to be imposed on B. We consider spaces equipped with a uniformly convex differentiable norm whose derivatives satisfy the conditions of boundedness (12.13) and power-law uniform convexity (12.8). We denote the value of the /cth-order derivative at a point b G B by V^(6)[&i, . . . , &&], and the norm of the derivative treated as a polylinear form by | |V^(6) | | .

Let us state the main result of this section

THEOREM 17.1. Suppose that \x is the distribution of a stable vector u in a Banach space B in which the norm satisfies the condition of mth-order power-law uniform convexity and the derivatives of order 1,2,. . . , / + 1 are bounded. If the support of ji is infinite-dimensional, then for any p > 0 the distribution of the random variable \\u\\p belongs to the class W x .

PROOF. Let us use the representation (16.12) of a stable random vector in terms of a stochastic integral over a random Poisson measure. We define a function / : X -> R 1 by setting f(X) = \\U(X)\\P. By (14.5), we have

(17.1) (/(1)(*))(x) = - -^TI7^( V ( 1 ) W M D I M r 1

Of ICC I '

(here, as before, |x| = r(x), that is, x = (0(#), \x\)). The remaining part of the proof is quite similar to the proof of Theorem 16.2.

First, we need to choose a vector field along which / varies sufficiently rapidly.

Page 155: Local Properties of Distributions of Stochastic Functionals

§17. DISTRIBUTION OF THE NORM OF A STABLE VECTOR 139

To this end, we choose and fix some monotone infinitely differentiable function x\ R 1 —> R + such that x((—oo,0]) = 0 and x([l,oo)) = 1. Using x, we introduce smoothing functions (fi(x) = H{2 — x), <po = 1 — y?i, <ps = x(x — 3), and ty(x) = x{x — 99). In the sequel, we need two random vectors related to the vector u as follows:

(17.2) ui = a - 1 X ) E ^ ^ i ( M ) > u0 = u- tii.

Then it is easy to see that the number of summands in the definition of u\ is finite, the random variable ||uo|| has moments of any order, and the vector u$ can be represented in the form

u0 = b0 + / j^^M\x\)v(dx),

where &o is some nonrandom vector. Now we consider the partition of unity (A, J2) in the configuration space X,

where h = # ( K | | / ( | | u 0 | | + 1)) and J2 = 1 - h.

LEMMA 17.1. 1. The inequality

(17.3) < V ^ ) , u i ) > ^ N I

is satisfied on the set {I\ > 0}.

2. On the set {I2 > 0}, the random variables \\u\\ and \\ui\\ have moments of any order.

PROOF. Statement 2 readily follows from the fact that ||iio|| has moments of all orders. Let us prove (17.3). On the set Ii > 0, we have

||tii| |>99(||tio|| + l)>99| | t io | | ,

and hence,

IHI = | | « i + « o | | > | | « i | | - | | « o | | > 9 8 | | t i o | | . Thus,

(V(1H«),tti> = (V<x>(«),«> - (V^(u),u0)

> INI - Kvf1)^),^)! > ||U|| - ||«o|| > |||«||. •

Let us introduce two vector fields. The "radial" vector field is defined by the formula

(17.4) Kx{x) = - | a # i ( | * | ) (x - (0(x), |x|) G X).

This field vanishes for \x\ > 2 and hence has only finitely many nonzero coordinates for all X e X.

The "gradient" vector field is a truncated, normalized and smoothed gradient of the function X i-> ||ti(X)||. This field is constructed as follows. Let 0 j , . . . , 0° G S be linearly independent unit vectors that belong to the support of the spectral measure. The number q must be sufficiently large (we estimate this number in the sequel). Let J7i , . . . , Uq be small neighborhoods of the vectors 0f such that their

Page 156: Local Properties of Distributions of Stochastic Functionals

140 4. POISSON FUNCTIONALS

spectral measures are positive and the following linear independence property is satisfied:

q

(17.5) inf y^Cifli > 0 { c i e R i . E ^ i . **€£/«} II £ ^ ||

(the existence of such vectors follows from the fact that the linear support of i± is finite-dimensional).

We set

(17.6) (K2(X))(x) = - V ( 1 ) ( u ( X ) ) ^ ] | x r 1 - 1 / V 3 ( | x | ) l { , e U ? = i Ui}

for x = (0(x)) \x\) G X. Finally, we define the desired vector field by the formula

K(X) = K1(X)I1+K2(X)I2.

The field K is constructed according to the following idea. Although the deriv­ative of / along K will be sufficiently far from the zero, for large values of ||n|| we fail to obtain good estimates for the higher derivatives of / along K2. Therefore, for large values of ||ti||, we must use the field K instead of K2\ in a sense, K is almost linear and does not present problems in estimating the higher derivatives.

We need to prove that the distribution of the random variable ||u||p belongs to

the class W j . By Theorem 16.1, it suffices to prove that the expectations

E | D / r ( ' + E U * i ) | D V l * l . - . | i 3 l + 1 / | f c |

x l t r K ^ l ^ l ^ t r ^ 1 ) ^ . . . ^ - 1 ^ ^ 1 ) ^

are finite for all possible sets of nonnegative numbers {kj} and {f3j} such that YJJ jPj = i and J2j Jkj = I — i, i = 0 , 1 , . . . , /. Just as above, by c we denote various positive constants whose values are of no importance for our estimates, and by Moo we denote the set of random variables on (£, E, P) that have moments of all orders.

Now let us estimate the higher derivatives of / .

LEMMA 17.2. 1. For any j = 0 , 1 , . . . J + 1, we have

(17.8) i ^ / i < « i + i i t i i n , €€Moo.

2. For any j = 0 , 1 , . . . , I, we have

(17.9) iD^tvK^leMoo.

3.

(17.10) H ^ E M o o , K H " 1 G Moo.

PROOF. Statements 1 and 2 can be proved in the same way as Lemma 16.1. Statement 3 follows from the fact that the support of the spectral measure is finite-dimensional. •

Page 157: Local Properties of Distributions of Stochastic Functionals

§17. DISTRIBUTION OF THE NORM OF A STABLE VECTOR 141

Now to prove that the expectations (17.7) are finite, it suffices to obtain a lower bound for \Df\. We have

nt p\u,\\'-i( V (V(1)(tt)[fl(*)])*>i(M)r

{xex||s|<2}

(V< \x\2+V"

v - (.V{1Hu)[e(x)])2

+ Z , — M 2 T 2 M — M m ) h {x€X\\x\>2te€UiUi}

(17.11) = ^ | H r i r a ( V W ( t i ) , « i ) / i

^ V^)[^)p {x6X||x|>2,eeUi^} ' '

> ' l l „ r r + ' ll„l|P-i V (V (1 )(")[g(»)])2f

{xeA-||s|>3,eeUiyi} ' '

(at the last step, we have used statement 1 of Lemma 17.1). Let us rewrite (17.7) as the sum of integrals over the domains {Ii > 1/2} and

{h < 1/2}. By Lemma 17.2 and (17.11), the integral over the first domain has the upper

bound

/ p(dx)c[i+iiuxin^^/iitiii^+E^^p, J{h>l/2} I

where £ G M^ and £ > 0. The last integral is finite, since ||wi|| < 2||w|| and N l > IMI - IKH > IK II - ||^i||/99 > 99/2 in the domain in question.

Now let us consider the integrals over the domain {I\ < 1/2}. To esti­mate the expression \D2f\kl...\Dl+1f\kl\trK^\^...\Di-1trK^f\ we again use Lemma 17.2 and take into account the fact that the factor [1 + | | tii | | ']£*' can be included in £ in the domain considered. Moreover, to obtain a lower bound for JD/ , we need to retain only the second summand in (17.11). We have (17.12)

/ |U/|-('+S>*>|D2/|fci . . . \Dwf\kl

J{h<l/2}

x| t r i ^ 1 ^ 1 . . . ID1-1 tiK^f* P(dX)

< [ P(dX)t/(\\u\\>-1Z(X)?+Xk>, J{h<l/2) '

where Z(X) = ^ | V^> (ti) [0(x)] |2 |x| -2(i+i/«).

{x\\x\>3}e€\JiUi}

Since in the domain of integration any power of ||IA|| belongs to Moo, we see that the existence of the corresponding moment of Z~l is sufficient for the convergence of the integral (17.14). Taking the maximum of ^kj, we see that it is necessary to estimate the moment of order 2(1 + 6)1, 6 > 0.

LEMMA 17.3. For any 6 > 0, we have EZ~2/(1+<5) < oo.

Page 158: Local Properties of Distributions of Stochastic Functionals

142 4. POISSON FUNCTIONALS

PROOF. For positive integers n i , . . . , nq) by £>(ni,. . . , nq) we denote the set of configurations X e X such that the point x G X fl (Ui x [3, oo)) with the minimal real coordinate lies in Ui x (rii^rii + 1] for all i = 1 , . . . , <?. We divide each set £>(ni,. . . , ng) into g-dimensional layers. Then we fix the part of the configuration that lies outside the set Ui K x (ni, n>i + i\ and all but one points of the configuration in each of the sets Ui x (rti^rii + 1]. Finally, we fix the first coordinate 02 G Ui of each of the q points (0*, |x;|) € Ui x (ni,rii + 1] that remain free. Now only \xi\ remains free in each element of the partition.

Since disjoint points are independent with respect to P , it follows that the conditional measure that arises on an element of the partition is just the Lebesgue measure on A n i x • • • x AUq, where An i = (n^, n2- +1], and the conditional moment is bounded above by the integral

, / Q v - 2 ( 1 + 6 ) 1

(17.13) / dn • •. dr J £ \V^(u)[ei]\2r-2-2/a

JAnix-xAnq \ i = 1 /

In addition to (17.13), we have the following expression for u:

u(X) = u(ru...>rq)=b+1-(^ + ... + L.),

where the vector b G B is constant within any element of the partition. We need an estimate of (17.13) which does not depend on b too strongly. In (17.13), we perform the change of variables Ti = a~asf and arrive at the integral

cf_ _ ds1...dJ±\v^(b + ±sA)[0iP^2/a)a)fls-^a\ JAnix-xAnq \ i = 1 i=1 / 2=1

We replace the inessential factor of the form (mini sz)~c by its upper bound, namely, by the function maxi(nz + l ) c , which has all moments, eliminate it from the integral, extend the domain of integration to the cube [0 ,a - 1 ] 9 , and obtain

(17.14) / rfsi...cfaJvM1)(6 + V ^ ) [ f t ] | - 2 ( 1 + 6 ) 1

where c = a~l. In this integral, we pass to polar coordinates centered at the point of the cube

where the convex function ||6 + YLl=\ si^i\\ attains its minimum. By replacing the vector b and by extending the domain of integration, we can assume that this minimum is attained at s t = 0.

Now let us estimate the integrand in (17.14). We set s® = s ; / ( S j = i s])

By the linearity of the form V^1) and the Holder inequality, we obtain

(17.15)

V « (& + £ sA) [ £ s%] | = | £ a ? V « (& + £ sA) [0i] 2 = 1 2 = 1

E|V(1)(6+E^)M2-

2 = 1 2 = 1 2 = 1 2 = 1

2 = 1 ' 2 = 1

Suppose that s° = {s?} G Rd , 70 = £ s ? 0 ; , and 7 = 7°/| |7°| | is the unit vector in B. The choice of the sets Ui implies that the vectors 02 are uniformly linearly

Page 159: Local Properties of Distributions of Stochastic Functionals

§17. DISTRIBUTION OF THE NORM OF A STABLE VECTOR 143

independent, that is, ||7°|| is uniformly bounded away from zero and infinity. By passing to polar coordinates in (17.14) and substituting the estimate (17.15), we obtain

(17.16) J ds0[ dy\^1\b + y1M\-*ll+^-%vWWb]>0).

To estimate the one-dimensional integral, we can use the following lemma, which follows from the homogeneity of the norm and the definition of power-law uniform convexity.

LEMMA 17.4. Suppose that the space (B, || • ||) satisfies the condition of power-law uniform convexity (12.8). Let k, AX > 0, b G B, 7 G S C B, and V(1)(6)[7] > 0. Then

(17.17) j * |vW(6 + i ry ) [7] | "V m + C dv < c ( m a x { ^ , | |&| |}) f c m + C + \

PROOF. We perform the change of variables z = 2/||6||-1 on the left-hand side in (17.17) and set 60 = &II&II-1- Since the gradient V ^ is constant along the radial rays, we have

/ |V^(6 + 2 / 7 ) [ 7 ] r V m + C ^ = / |vW(6o + ^)[7]r fcW|6||) fcm+c||6||cfa. Jo Jo

We estimate the integral over [0, a] by using the uniform convexity condition:

|V(1)(6o + «7)[7ir fcW fcm+ccfa • | | 6 | | * m + c + 1 < c||6||fcm+c+1. / Jo

In the domain [cr, -A||6|| *], since the derivative of a convex function is monotone, we obtain the estimate

rA\\b\\ -1

/ |V (1)(60 + ^7)[7]|~ *km+(:dz • | |6 | | f c m + c + 1

Let us return to the proof of Lemma 17.3. We need to apply (17.17) to the integral (17.16); therefore, we need to ensure that £ = q — 1 — 4(1 + 6)lm > 0. This can be achieved by choosing a sufficiently large q. Since the estimate (17.17) is uniform in 7, we obtain a majorant of the form c(l + ||6||)c for (17.16). Furthermore, since the vector b has been chosen so as to minimize the norm, we can pass from conditional moments to unconditional moments, thus obtaining

[p(dX)(Z(X))-2(1+W < c f P(dX)(l + MX)| | )C£ < 00. J J{xex\ii<i/2}

This implies that the integrals (17.12) are finite. The proofs of Lemma 17.3 and Theorem 17.1 are complete. •

Now let us discuss some consequences of Theorem 17.1.

Page 160: Local Properties of Distributions of Stochastic Functionals

144 4. POISSON FUNCTIONALS

1. As follows from the properties of distributions of the class W ^ under the assumptions of the theorem, the function F has I bounded derivatives and the Zth derivative is a function of bounded variation.

2. In fact, the assumptions of the theorem are also satisfied for the distribution of the random variable ||M + 6||P, 6 e B , since u + b is also a stable vector.

3. For a Hilbert space and the spaces Lp (where p is an even integer) the

theorem allows us to claim that the distribution of the norm belongs to W x for any / > 0, that is, the distribution function is infinitely differentiable. For other spaces Lp, p > 2, the theorem implies that \p — 1] derivatives of the distribution function are bounded. (Here [s] denotes the integral part of s for noninteger s and the integral part of s — 1 for integer s).

4. Here we consider only stable vectors with exponents a £ (0,2). For Gaussian vectors (a = 2), a result similar to Theorem 17.1 was given in §12.

5. Since p is arbitrary in the statement of the theorem, we see that in a neighborhood of zero the distribution of the norm not only is bounded, but decreases faster than any power-law function.

6. The statement of the theorem remains valid if instead of the assumption that the support is infinite-dimensional, we only require that its dimension exceeds some critical value n = n(Z,ra,p).

7. For an arbitrary Banach space B, the function F may have a more compli­cated structure. For example, in a space with a nonsmooth norm, there exists a stable vector such that F' is unbounded for p = 1. This means that the distribution of the norm does not belong to any class W \ . Still, some facts can be proved for the space with unsmooth norm. If 0 < a < 1 and the distribution of u is strongly stable, then F' is bounded [65].

Page 161: Local Properties of Distributions of Stochastic Functionals

CHAPTER 5

Local Limit Theorems

§18. Strong convergence theorems

We now proceed to studying the strong convergence of distributions of stochas­tic functionals. The problem can be stated as follows. Suppose that a sequence (P n ) of probability measures on a complete separable metric space (X, Q5x) weakly converges to some measure Poo- Let / : X —> R 1 be a functional. The question is whether P n / - 1 converges in variation to Poo / - 1 - If the distributions P n / - 1 and P o o / - 1 are absolutely continuous, then (see Problem 2.4) the convergence in vari­ation is equivalent to the convergence of the densities dPnf~

1/d\ to dPoof~l/dX in the metric of L ^ R 1 ) . Thus, we essentially deal with local limit theorems for distributions of stochastic functionals.

If the measures P n themselves converge in variation to Poo? then the answer to our question is obvious, since in this case the convergence P n / - 1 ^ P o o / - 1 holds for any functional / (see Theorem 2.5). However, this situation rarely occurs in practice. In a majority of the interesting cases, the measures P n are singular with respect to PQO for each n (for example, this is the case if P n is the distribution of random open polygons constructed from sums of independent identically distributed random variables in the ordinary way and Poo is the Wiener measure). It is just the mutual singularity of P n and PQO that causes difficulties in solving the problem.

Pre l iminary results. For a measure P n and a measurable partition V of the space X we denote the quotient measure and the system of conditional measures of P n with respect to T by Pn >r and {P n > 7 ,7 G X / r } .

THEOREM 18.1. Let P n =>» PQO. Suppose that the following assumptions are satisfied for the functional f and some partition T:

W / x / r H p ^ 7 / _ 1 " Poo, 7 / - 1 | |Pn,r (c*y) -> 0, n -> oo;

(ii) the mapping 7 1—> P o o ^ / - 1 o /X/T into Z(RX) is P00^-al'most everywhere continuous.

Then P n / " 1 V ^ P o o / - 1 .

REMARK 1. Since | | P n ) 7 / _ 1 - Poo , 7 / - 1 | | < 2, it follows that (i) is equivalent to the following assumption: P n , r {7 || P n ^ / " 1 — Poo , 7 / - 1 | | >e}—>0asn—>oo for all e > 0. We also note that | | P n , 7 / - 1 - P o o , 7 / - 1 | | < | |P n , 7 -Poo, 7 | | . Therefore, (i) is necessarily satisfied if the conditional distributions for the measures P n tend in variation to the corresponding conditional distributions for Poo-

REMARK 2. The structure of the range of / is irrelevant. In this theorem, we can assume that / is a measurable mapping of X into an arbitrary measurable space (E, (£) (in this case, (ii) will contain the space Z(E) instead of Z(R*)). In particular, a result similar to Theorem 18.1 is valid for functionals with values i n R d .

145

Page 162: Local Properties of Distributions of Stochastic Functionals

146 5. LOCAL LIMIT THEOREMS

PROOF. We define a measure Q n by setting

Q n = / P o o l 7 / " 1 P n i r ( d 7 ) .

Jx/r

By (3.4) and (2.3), we have

\\Pnf~1 ~ P o o / " 1 1 | < / H P r v r / - 1 - P o o l 7 / - 1 | | P n i r {<h) + | | Q n - P o o / " 1 II-Jx/r

By (ii), the first term on the right-hand side tends to zero as n —> oo. Let us consider the second term:

IIQn - Poo/"1!! = II / Poo l 7 / - 1 Pn,r (<*y) - / Poo.-y/^Poo.r (d-y)||-II Jx/r Jx/r II

Condition (ii) permits us to use Theorem 2.3 and thus obtain the desired conver­gence | |Qn - Poo/"11| -+ 0. •

The following result shows how Theorem 18.1 can be applied.

THEOREM 18.2. Let (fj,n) and (yn) be two sequences of probability measures on Hd. Suppose that jin => \i and vn

w—> v. If v is absolutely continuous, then var

PROOF. We set X = Kd x Rd , P n = \in x i/n, and Poo = M x > a n d define / : X -> R d by / (x , j/) = a; + y. Obviously, /xn * i/n = P n / _ 1 and /i * v = P o o / - 1 , so that it is natural to use Theorem 18.1 (with regard to Remark 2). Let T be the partition of X into the d-dimensional manifolds 7 = {x} x Rd , x G Rd . Since the measure P n > 7 / _ 1 (respectively, Poonf~

1) is just the measure vn (respectively, v) translated by x, we have | | P n > 7 / _ 1 — Poo, 7 / _ 1 | | = \Wn — HI- Hence, condition (i) of Theorem 18.1 is satisfied. By assumption, we obtain v <C Xd. We denote the density dv/d\d by p. Let 71 = {xi} x Rd and 72 = {x2} x Rd . Then

| | P c x > , 7 l / - 1 - P o o i 7 f t / - 1 | | = / \p(z-X1)-P(z-X2)\Xd(dz).

Jnd

Since the translation operator is continuous in Lx(Rd) , it follows that this expres­sion tends to zero as #2 —> X\. Thus, the mapping 7 —• Poo j 7 / _ 1 is everywhere continuous, and hence, condition (ii) of Theorem 18.1 is satisfied. •

Unfortunately, conditional distributions of weakly convergent measures seldom approach each other in variation. This fact hinders the straightforward application of Theorem 18.1 and forces one to seek other ways of solving the problem.

However, before stating the main result, Theorem 18.4, we present one more generalization of Theorem 18.1 to be used in the sequel.

THEOREM 18.3. Let P n => PQO, and let fn, n G N , be a sequence of function­al. Suppose that some partition V satisfies the following conditions:

(0 / H P n l 7 / n 1 - P o o l 7 / ^ 1 | | P n l r ( d 7 ) ^ 0 l 7i -> oo; Jx/r

(ii) the mappznq 7 ' * - oo ,7/00* ° / X / r into Z(R*) is 'P00^-cdmost everywhere continuous.

Page 163: Local Properties of Distributions of Stochastic Functionals

§18. STRONG CONVERGENCE THEOREMS 147

Then Pnfc1™ P^f-1.

PROOF. The proof virtually coincides with that of Theorem 18.1, and we leave it to the reader as an exercise. •

The superstructure method. This method was introduced in §5, where it was used to study the absolute continuity of distributions of stochastic function­a l . In what follows, we use the superstructure method for studying the strong convergence. Although Theorem 18.4 looks rather cumbersome at first glance, it is particularly convenient in applications.

THEOREM 18.4. Consider a sequence of probability measures ( P n ) ^ £ N), defined on the o-algebra Q5x of Borel subsets of a complete separable metric space (X, p). Suppose that P n => Poo and for Poc-almost all x there exists an open ball V centered at x, a number e > 0, and a family (GnjC, n G N, c G (0, e]) of measurable transformations o /X such that

(i) for each c G (0,£), one has Gn,c —> Goc,c in the measure P n as n —> oo; (ii) for each c G (0, e), the mapping Goo.c is Pco-almost everywhere continuous,

and moreover, d(S, c) = sup p(z, Goo.c^) —* 0 as c - > 0

zes for each open ball 5;

(iii) limc^o I5n"n HPnG-i - P n | | = 0;

(iv) for each 6 G (0,£), one has

HV^n,1* ~ V f l ^ . J P n (dz) -• 0 as n -> oo,

where ipn,z(c) = f{Gn,cz), n G N , and c G (0,e];

(v) for each 6 G (0,e), the mapping z i—> X^s^^z °fV into ^ (R 1 ) is Pco-almost everywhere continuous.

Then P n / - 1 ^ ' Poo / - 1 . Before proving this theorem, let us discuss conditions (i)-(v). Condition (i) needs no clarification. Condition (ii) says that the mappings GnjC

are close to the identity mapping for small c on bounded sets. Condition (iii) shows that the perturbations of P n by the mappings Gn>c are

uniformly small for small c. Although this condition does not cover the case n = oo, we have

(18.1) l imllPooG"1^- Poo | |=0 ,

which follows from (iii) by virtue of (i) and (ii) (see the reasoning after Lemma 18.1). Here the assumption that the mapping Goo,c is continuous is important. However, once (18.1) is established, we can drop this assumption.

According to Proposition 2.1, condition (iv) is satisfied if for Poo-almost all z the convergence zn —• z implies

llA[0,6]Vn,ln ~ *[0,6]<P^tzn\\ -> 0

as n —• oo. To verify this relation as well as assumption (v), we can use the results of §4 concerning the transfer of one-dimensional measures.

/

Page 164: Local Properties of Distributions of Stochastic Functionals

148 5. LOCAL LIMIT THEOREMS

PROOF. Let XQ be the set of x for which there exist an open ball V, a number e > 0, and a family (Gn>c) with the desired properties. By assumption, P^XQ) = 1. Decreasing the radius of the ball V if necessary, we can assume that Poo{<9V} = 0. The family (14, x G X0) is a cover of XQ. Using the separability, we can choose a countable subcover (14, k G N), where 14 is a ball centered at x^. Let £& and (G4,c) be, respectively, the number and the family of mappings corresponding to Xk> We can assume that Sk —> 0. We fix a /? > 0 and choose M so large that

< /?, n <E N. ••{(y,v')c} Setting W = [}f=l V, and Uk = Vk\ U j l j Vj, we note that

| | P n / _ * - Poo/" X | | < 2/J + l l P n , ^ / - 1 " P o o . ^ r ' I I M

<2/3+^npn, f / j.ri-p00,c / ir

iii.

Therefore, to prove the theorem, it suffices to prove that each term in this sum tends to zero as n —> oo. We fix k and omit the index k in the sequel: U = £/&, Gn)C = G i j , etc. Let us show that

(18.2) | | P n , t / / - 1 - P o o l c / r 1 | H 0 > n 00 .

To this end, we use the superstructure method. Let 6 G ( 0 , £ M ) - We set / = (0,5) and ii = 5_1A[0j5], and consider the measures Q n = \i x Pn,t/ and the functions Fn(c,x) = f(GniCx)y n G N, on the direct product I xU. Obviously, we have

HPn.t//-1 ~ Pocc//-1!! < Ai + A2 + A3,

where

A j = H P n . u / - 1 - Q n O l . A 2 = \\QnF~1 - QooF c oo1!

AaHIQooi^-Poo.t/r1!!.

To estimate Ai and A3, we need the following two lemmas.

LEMMA 18.1. Under conditions (i) and (ii) of Theorem 18.4, for all c G (0,£J we have the weak convergence PnG~*c => PooG^ c as n —> 00.

PROOF. Let ft be a uniformly continuous function bounded on X. We have

3n = \ [ hdPnG-%- [ hdP^G^ l i x Jx

< / l + / 2 ,

where

Ji = / |h(Gn,cz) - /i(Goo,cz)|Pn (<&)» ./x

h = \ h(GooiCx)Pn (dx) - / h(Goo,cx)Poo (dx) 17x 7x

Page 165: Local Properties of Distributions of Stochastic Functionals

§18. STRONG CONVERGENCE THEOREMS 149

Since the mapping GQO,c Is Poo-

almost everywhere continuous, it follows that the function x —> h(Goo,cx) is Poo-almost everywhere continuous and bounded. There­fore, by Proposition 2.1 we have I2 —> 0 as n —• 00. The integral I\ can be estimated as follows. Suppose that a > 0 and L = sup^ e X |ft(^)|; then

h < 2LPn{x I p(GnyCx,Goo,cx) > a}+u>h(<x).

Hence, limn/3n < cjfc(a). Since a is arbitrarily small and h is continuous, we see that limn (3n = 0, which implies the desired weak convergence. •

Note that the weak convergence together with Theorem 2.7 yields the inequality

HPooG^c - Pooll < H E H P n G - i - Pnll, n

which shows that under the assumptions of Theorem 18.4 one has

(18.3) l imJPooG- 1 , . - Pooll - 0 .

LEMMA 18.21. Suppose thafPn => Poo and conditions (i)-(iii) of Theorem 18.4 are satisfied for some family (Gn,c> n G N, c G (0,6)) of transformations of the space X. Then

(18.4) l iminSllPn^G-1 , - Pn | C , | | = 0. c—*0 n

for any set U such that Poo{dU} = 0.

PROOF. We have

\u{A H G~l{B)) - u{A HB)< \v(A n B) - v(G~l{A n B))\

+ \v{A n G~l{B)) - v(G-\A) n G~\B))\

for any measure v on Q5x, any transformation G of the space X, and any A, B G 2$x-By setting v = P n , G = Gn,c> and 4 = [/ in this inequality and by taking the

supremum over B x <8x> we obtain

(18.5) \\Pn,uG-}c - P n , c | | < IIPnG"1. - P„ | | + Pn{UAG^c(U)}.

The estimate of the first term is obvious. Let us consider the second term. For any e > 0, we have

UAG-*C(U) c {x I p(xyGniCx) >e}u {U£ n (C/C)e}.

Let us choose a sufficiently large ball S such that Poo{5 c} < e and Poo{9S} = 0, and use the fact that

{x \p(x,GntCx) >e}

cScL\{x\ p{GooiCx, Gn}Cx) > e/2) U {x G S \ p(x, Goo,cx) > e/2}.

By (ii), the last set is empty for sufficiently small c. Therefore,

P„{tf A G - ^ t / ) } < P n { S c } + Pn{x I ^(Goo.ea;, Gn,cx) > e/2} (18.6) v + P „ { [ / e n ( t f c ) £ } .

xCf. Lemma 5.1.

Page 166: Local Properties of Distributions of Stochastic Functionals

150 5. LOCAL LIMIT THEOREMS

Let us pass to the limit as n —> oo. The weak convergence and the choice of S imply that l i m n P n { 5 c } = P { S C } < e. By (i), the second term in (18.6) converges to zero. Therefore,

Hi5Pn{tf AG-^CO} < e + Iim"Pn{t/e n {Uc)£} (18.7) n ' n

< e + p { ( [ / e n ( c / c ) e ) - } .

By letting e and c tend to zero, we obtain

(18.8) limI5[Pn{J7AG^1c(J7)} < e + P{dU} -+ 0.

c—>0 n '

Inequalities (18.8) and (18.5) imply (18.4). •

Now we can estimate Ai . We have

A i = / P n . f / / " 1 ^ - f Pn.uG-^f-1 dfJL \\< [\\PntU-'PntuG-]c\\dlM. \\Jl JI II JI

By Fatou's lemma and Lemma 18.2, the last inequality readily implies that

lim lim Ai = 0. 6—>0 n

Just as in estimating Ai, we obtain the inequality

A3<y , | |Poo,t /-Poo,c/G-1) C | |d )u.

By (18.3), the sequence of measures P n = Poo and the family of mappings Gn,c = Goo.o n € N, satisfy the assumptions of Lemma 18.2. By this lemma, we have l im^o A3 = 0.

It remains to estimate A2. We consider the partition T of the space I x U into the fibers J x {z}, z e U, and apply Theorem 18.3 to the measures Q n and the functionals Fn. Conditions (iv) and (v) of Theorem 18.4 imply the validity of conditions (i) and (ii) of Theorem 18.3. Therefore, A2 —> 0 as n —> 00.

Thus, lim llPn.c//"1 - Pocc// - 1!! < HS Ai + A3.

n n The left-hand side is independent of 8, whereas the right-hand side tends to zero as 6 —> 0. Hence, we arrive at (18.2). •

PROBLEM 18.1. It would be interesting to obtain an analog of Theorem 18.2 for the case of an infinite-dimensional linear topological space. The above proof cannot be generalized to this case. Why?

PROBLEM 18.2. State and prove an analog of Theorem 18.4 for the strong convergence of the distributions Pn/rT1 to Poo/^1-

§19. Strong convergence of distributions of Gaussian functionals

In this section, we consider sequences (P n ) of Gaussian measures in a separa­ble Banach space X and show that the weak convergence P n =>• Poo implies the strong convergence of P n / _ 1 to P o o / - 1 for a wide class of functionals without any additional assumptions.

Page 167: Local Properties of Distributions of Stochastic Functionals

§19. STRONG CONVERGENCE OF DISTRIBUTIONS OF GAUSSIAN FUNCTIONALS 151

The classes Mp and Mp . First, we define two classes of functionals and prove some properties of these classes. In this section, P is an arbitrary probability measure, so that the definition is meaningful not only for Gaussian measures.

Sets of the form A = {x + cl,c G [0,a]}, x,l E X, will be called segments parallel to a vector I. We say that segments A n = {xn + cln,c G [0, an]} converge to A if xn —> x, ln —* Z, and an —> a. For a functional / and a segment A, we define a function / A by setting

/A(C) = f(x + d)9 ce [0,an].

Recall that Hp denotes the set of admissible directions for the measure P . We say that a functional / belongs to the class Mp if for P-almost each x

there exist a vector I G Hp and an open ball V centered at x with the following properties: for P-almost all y G V and any segment A C V, A = {y + cl, c G [0, a]}, the convergence A n —> A implies the convergence

(19.1) A / ^ ^ A / ^ 1 .

By Mp we denote the class of / G Mp such that, in addition to (19.1), we have

(19.2) A/^1 < A.

These classes are quite broad: Problem 19.1 shows that the class Mp is everywhere dense with respect to the uniform norm in the class of all P-almost everywhere continuous functionals. Problem 19.2 shows that the distributions of functionals from the class .Mp are absolutely continuous.

Let us consider smooth functionals.

THEOREM 19.1. Suppose that the following conditions are satisfied for P -almost all x:

(i) the derivative Df(x) is weakly continuous at x\ (ii) £>/(aO(HP) ^ {0}.

ThenfeM^.

PROOF. Let Xo be the set of all points x for which conditions (i) and (ii) are satisfied. Let x & Xo- Condition (ii) means that there exists an IQ € H p such that

(lo,Df{x)) = Dtof(x) = 6?0.

Without loss of generality, we can assume that 6 > 0. Since the derivative is continuous at x, we can find an open ball V centered at x such that

Dif(y) > 6/2

for all y G V and all vectors / close to io- Let us take some segment A = {y+cfo, c G [0, a]} in V such that y G X0. We intend to show that (19.1) is satisfied as A n —> A.

Let A n = {yn + cln)c G [0,an]}. Since an —> a, without loss of generality we can assume that all functions f&n are defined on the same interval [0,a]. It is easy to verify that the / A U uniformly converge to / A on [0,a]. Since f&n(c) = Dinf(yn+dn) and the derivative of / is continuous at y, it follows that the functions f'A also converge uniformly to fA on [0,a]. By the choice of V, the function fA is bounded away from zero on the entire interval [0, a]. Therefore, (19.1) follows from

Page 168: Local Properties of Distributions of Stochastic Functionals

152 5. LOCAL LIMIT THEOREMS

Theorem 4.5. Theorem 4.2 implies that A/^1 is absolutely continuous. The proof is complete. •

Let us apply this result to integral functionals. Let X = C(T), where T is a metric compact set and (£t, t G T) is a continuous Gaussian process with correlation function K(s,t) and distribution P .

We consider integral functionals of the form

(19.3) /(*) = J q(x(t)Mdt),

where {i is a finite measure on 2$T and q is a measurable function on R1 . Consider the following nondegeneracy condition:

(19.4) / K{syt)(p(t)ip(s)n(dt)fx(ds) > 0 , <p ^ 0, (p G L 1 ^ ) . JTxT

This condition means that any mixture

&<^)/z(cft), X where </> ^ 0 and (p G L1(d^), has a nondegenerate distribution. A condition of this type was used in §9 when we studied conditions under which integral functionals are absolutely continuous.

THEOREM 19.2. Suppose that (19.4) is satisfied, q is locally Lipschitzian, and

the derivative q' is almost everywhere continuous and nonzero. Then f £ Mp .

PROOF. We have already considered the functional / in Proposition 9.2, and proved that

Dif(x) = J l(t)q'(x(t))fi(dt).

Moreover, we have verified that Dif converges weakly at almost all x G X and that for each of these x there exists an I G Hp such that Dif(x) ^ 0. Thus, all assumptions of Theorem 19.1 are satisfied, and so / G Mp. •

The condition imposed on q can be weakened for some specific processes; see Problem 19.3.

Let us consider sufficient conditions under which convex functionals belong to the class Mp .

WTe assume that the measure P has a barycenter a p and / is a continuous convex functional. Set

ao = inf /(#)• xGsupp(P)

THEOREM 19.3. Suppose that the following conditions are satisfied: (i) ( a p + H p ) - = s u p p ( P ) ; (ii) P{x | f(x) = a 0 } = 0. ThenfeM{p\

Page 169: Local Properties of Distributions of Stochastic Functionals

§19. STRONG CONVERGENCE OF DISTRIBUTIONS OF GAUSSIAN FUNCTIONALS 153

PROOF. Let x G supp(P) be a point such that f(x) > OLQ. Since / is convex, it follows that there exist an I G Hp and an open ball V centered at x such that for any segment A c 7 parallel to i, the function / A is strictly monotone and convex. Therefore, A/^1 <C A. Furthermore, if A n —> A, then the sequence f&n converges to / A pointwise. Thus, (19.1) follows from Theorem 4.4. •

COROLLARY. Let P be a Gaussian measure that is not concentrated at one point and let f(x) = ||x||. Then the following assertions hold:

(i) ifaP G H P , then f G M$\ (ii) if (X, || • ||) is strictly convex, then f G Mp .

PROOF. Condition (i) of Theorem 19.3 is satisfied since P is Gaussian (see Proposition 7.4). Condition (ii) of Theorem 19.3 holds by Proposition 12.1. •

REMARK. For a Gaussian measure P , a convex functional / belongs to Mp if and only if P / " 1 < A.

The main theorem and its applications. Now we are in a position to state the main result of this section.

THEOREM 19.4. Suppose that ( P n , n G N) is a sequence of Gaussian measures, P n ^ P o o , andfeMPoo. Then Pnf-1™ P^f-1.

This theorem combined with the result of Problem 19.2 implies the following functional local limit theorem.

THEOREM 19.5. Suppose that P n => Poo and f G Mp*. Then the density of the absolutely continuous component of the distribution P n / - 1 converges to the density of the distribution Poof-1 in the It1-metric.

Note some applications of Theorems 19.4 and 19.5. Let (£fc) be a stationary Gaussian sequence with E£fc = 0. We set Sn = Z)?£fc

and consider a continuous random open polygon £n(0> * € [0,1], with vertices (*/n, SfcA/D^), fc = 0 , . . . ,n . By P n we denote the distribution in C[0,1] corre­sponding to the Gaussian process (£n0O>* ^ [0> !])• We assume that the variance of the sums Sn regularly varies as n increases, that is,

DSn = tfL{n))

where L(-) is a slowly varying function and 7 G (0,2). In this case, as was shown in [29], P n => PQO, where Poo is the distribution in C[0,1] of the semistable Gauss­ian process w1 considered in Example 7.7. In this case, we can apply Theorems 19.4 and 19.5, thus obtaining local limit theorems for all f e Mp simultaneously.

Suppose that P is a Gaussian measure, f(x) = ||x||, and X n is an increasing system of subspaces of X such that C(\Jn X n ) ~ = X. Next, let nn: X —> X n be a system of projections such that 7rnx —• x for P-almost all x. Then we have the weak convergence P n = PIT'1 => P. If P / - 1 < A, then, as we have seen, / G Mp , and by Theorem 19.5, the distribution density of the functional ||7rn(-)|| converges to the distribution density of P / - 1 in L1.

The limit behavior of conditional distributions. When one applies The­orem 18.4 to Gaussian measures, the perturbing transformations are naturally cho­sen to be translations along admissible directions. Indeed, suppose that P is a

Page 170: Local Properties of Distributions of Stochastic Functionals

154 5. LOCAL LIMIT THEOREMS

Gaussian measure, I G H p , and Gc is the translation mapping in the direction J, Gcx = x + cl. To estimate ||P — PG"1!!, we consider the partition T of the space X into lines parallel to I. By (2.3), we have

| | P _ P G - i | | < / | | P 7 - P 7 G f - 1 | | P r ( d 7 ) . Jx/r

We know that the conditional measures P 7 are one-dimensional Gaussian measures with variance 0p(Z) independent of 7. If we denote the density of the measure P 7

by p(t)y then the density of the measure P7G~X is p(t — c). Therefore,

(19.5) ||P7 - P ^ - 1 1 | = J \p(t) -p(t -c)\dc< - ^ (0'

(This inequality becomes obvious if we draw two Gaussian densities with variance <7p(Z) translated by c with respect to each other.)

Thus,

•,|i£fe (19.6) l | P - P < ? « , ^ „ _ 7 p ( | ) -

It is natural to try to use this inequality to derive condition (iii) of Theorem 18.4. However, one can show that using the same direction for all measures P n is not sufficient.2 It is necessary to carry out a more detailed analysis.

The following theorem provides a clue to the use of Theorem 18.4 in the Gauss­ian case.

THEOREM 19.6. Let (Pn>w G N) fee a sequence of Gaussian measures in a separable Banach space X. Suppose thafPn => Poo. Then for each l^ G H p ^ there exists a sequence (ln), ln G H p n , such that \\ln — loo\\ —> 0 andapn(ln) —> <7p00(i0o) as n —> 00 .

PROOF. By Ball^ we denote the closed unit ball in X*. Let \pn be the charac­teristic functional of the measure P n .

Since the spaces Hp n are invariant under translations of the measures P n and since an —> aoo by the result of Problem 11.6, we can assume that the measures P n

have zero barycenters an. Let Kn: X* —> Hp n be the correlation operator. We set

pn= sup \\Kn(x*) - KcoWl

x*GBalIJ

LEMMA 19.1. 0n -> 0 as n -> 00.

P R O O F OF THE LEMMA. Since the means are zero, we have *n(x*) = exp{-l\Kn(x*)\l},

where | • |n is the norm in H p n . By Theorem 2.4, tyn(x*) —• ^00(#*) as n —> 00 uniformly with respect to x* G Ball^. Therefore,

(19.7) SUp | | X n ( ^ ) | n - | ^ o o ( ^ * ) | o o H 0 , U -> 00. x*€Ball*

2See Problem 19.4.

Page 171: Local Properties of Distributions of Stochastic Functionals

§19. STRONG CONVERGENCE OF DISTRIBUTIONS OF GAUSSIAN FUNCTIONALS 155

Noting that (x,y)n = ^{\x\n + \y\n - \x - y\n), from (19.7) we obtain

8n = sup \(Kn(x*),Kn(y*))n - (Koo(z*)>#oo(y*))oo| -> 0, n -> oo.

By the definition of Kn) we have (Z, Kn(y*))n = (Z, y*) for any Z G Hp7i and y* G X*. Therefore, 6n = j3ni and so (3n —> 0. D

Obviously, in Lemma 19.1 the unit ball can be replaced by a ball of any other radius. Thus, if Zoo G H p ^ has the form l^ = Koo(x*), then, by setting ln = iiTn(o;*), we obtain the desired sequence. Indeed, the convergence of ln to Zoo in the norm of X follows from Lemma 19.1, and the convergence of conditional variances follows from (19.7), since o-pn(ln) = l^n(#*)ln2- F° r a n arbitrary Zoo G H p ^ , l^ ^ 0, we must use the fact that ifoo(X*) is everywhere dense in H p ^ both with respect to the norm of H p ^ and with respect to the norm of X. Hence, there exists a sequence ( i^ ), loo' = K00(x^n), such that \\l&' — loo\\ —> 0 and I'TO |oo —» I'ooloo- We have already proved that for each m there exists a sequence Zim) such that \\ln

m) - I<S°|| -> 0 and |z£m)|n ~* l oo loo- Now the desired sequence approximating Zoo can be constructed by the diagonal process. •

P R O O F OF THEOREM 19.4. Let X0 be the set of Poo-full measure that arises in the assumption that / G Mp . Suppose that x e Xo] then Zoo and W are, respectively, the Poo-admissible direction and the open ball corresponding to the point x. Let V be the open ball of radius e centered at x, where e is half the radius of W. Using Theorem 19.6, we can find elements ln such that they determine Pn-admissible directions convergent to Zoo and such that the variances an of the conditional measures for P n on the lines parallel to Zn converge to the variances cr^ of the conditional measures for Poo on the lines parallel to Zoo-

Set Gn,cx = x + cln, c > 0, n G N. Let us show that the neighborhood V, the number £, and the family of mappings Gn>c satisfy the assumptions of Theorem 18.4.

Assumptions (i) and (ii) are obviously satisfied. To prove that assumption (iii) is satisfied, we consider the partition Fn of X

into lines parallel to Zn. By (19.6), we have

\\PnG-}c-Pn\\<Jl^. V 7T(7n

Since an —> cr^, we see that this inequality proves assumption (iii). Now let us verify assumption (iv). Note that for 6 G (0,£) and z G V, the

measure A[0,6]^,l coincides with A/^1 , where A n = {z + cln,ce [0,5]}. Moreover, the segment A n is parallel to Zn and lies entirely in the ball W. Since / G Mp , it follows from the choice of Zoo and W that the function

has the following property: it follows from the convergence zn —• z that 6n(zn) —> 0. This, together with the remark following Theorem 18.4, means that assumption (iv) is satisfied.

Assumption (v) can be verified similarly to assumption (iv). Thus, all assumptions of Theorem 18.4 are satisfied, and Theorem 19.4 is

thereby proved. •

Page 172: Local Properties of Distributions of Stochastic Functionals

156 5. LOCAL LIMIT THEOREMS

PROBLEM 19.1. Suppose that H p ^ {0}. Then for any P-almost everywhere continuous functional / and any e > 0, there exists a functional / e G M p such that suPa,GX \f(x) — f£(x)\ < e. (Hint: in a neighborhood of a point of continuity, approximate / by a linear functional that grows in the direction of I G Hp.)

PROBLEM 19.2. Suppose that / G M^. Then P / " 1 < A. (Hint: let x be a point of the set of P-full measure occurring in the definition of the class Mp ; let V and / be the corresponding ball and admissible direction. Consider the partition T of the space X into lines parallel to Z, and use (3.4) and (19.2) to prove that Pvf-1 « A.)

PROBLEM 19.3. Let W be the Wiener measure on the space X = C[0,1], and let f(x) = f0 q(x(i)) d(t). Prove that / G M$ provided that for each e > 0 there exists an open interval J c (—e,e) on which the derivative q' is continuous and does not vanish. (Hint: use the fact that for any interval [a, b] C [0,1] there exists a function I G H w that is equal to zero outside (a, b) and positive on (a, &).)

PROBLEM 19.4. Let W be a Wiener measure on the space X = C[0,1]. By A we denote the set of all nondecreasing automorphisms of the interval [0,1] whose derivatives are bounded away from zero and infinity. To each u G A, we assign a bounded invertible linear operator Tu by setting

Tux(s) = x(u(s)), s G [0,1], xeX.

Let us choose automorphisms un G A so that 1) un converges to Uoo(t) = t uniformly on [0,1]; 2) for all E G 55[0ji] and \(E) > 0, one has

JE <(*) ds —• oo

as n —> oo. Prove that the sequence of Gaussian measures P n = WT~X has the following properties:

1) P n =» W; 2) the correlation operators Kpn converge strongly to i^w! 3) for each n, H p n coincides (as a set) with H w ; 4) for each / G H w , the variance crn(/) of the conditional distributions for P n

on lines parallel to I tends to zero as n —• oo. (Hint: verify that a^(l) = (f0 |/ /(^n(5))|2 |^(s)|2cf5) , where vn is the inverse of Un.)

§20. The local invariance principle

The classical Donsker-Prokhorov invariance principle [4, Ch. 2, §10] states that the distributions P n of continuous random open polygons in C[0,1] constructed in the standard way from sums Sn of independent identically distributed random vari­ables with finite variance converge weakly to a Wiener measure W. In other words, for any W-almost everywhere continuous functional / , the distributions P n / _ 1

converge weakly to the distribution W / _ 1 . At the same time, for some specific functional (for example, f(x) = supx(t) or f(x) = sup|#(£)|), it is known that,

Page 173: Local Properties of Distributions of Stochastic Functionals

§20. THE LOCAL INVARIANCE PRINCIPLE 157

under natural additional restrictions on the distribution 5i , strong convergence takes place.

In this section, we prove that the convergence P n / _ 1 ™ W / - 1 holds for all / G Mw. A similar result in the case of attraction to a stable law will be obtained in §21.

Suppose that {£n)7i € N} is a sequence of identically distributed random vari­ables with zero mean and finite variance, Sn = £i H h £n are the corresponding partial sums, a% = D5n are their variances, and P n are the distributions in the space X = C[0,1] of the random elements

(20.1) Cn(t) = ^n 'OV] + (nt - [**l)f[nt] + l), t G [°» ^

To guarantee the strong convergence of distributions for a wide class of func­tional, we need to impose additional restrictions on the joint distribution V of the variables £n. We assume that V has an absolutely continuous density p such that p'lp G L2(dP). In other words, we assume that the Fisher information is finite,

(20.2) Ip= [ (?-)2dV= [ ^-d\<oo.

This means that the density p possesses a certain smoothness. Indeed, the weaker condition p' jp G L1(d/P) is equivalent to the assumption that p is a function of bounded variation.

By analogy with the case of weak convergence, the following theorem is called the local invariance principle, or the functional local limit theorem.

THEOREM 20.1. Suppose that fn, n = 1,2,. . . , are independent identically dis­tributed random variables with zero mean and finite nonzero variance a2. Suppose that the distribution density p of the random variables £n is absolutely continuous and satisfies Ip < oo.

Then

(20.3) P n / " 1 ^ W / " 1

for any functional f G Mw.

PROOF. Since / G Mw, we can choose a set XQ, a point x G Xo, balls U and V, a number e, and a vector Zoo just as in the proof of Theorem 19.4. We set ln = 7rn(Zoo)j where 7rn is the self-mapping of X that assigns the continuous piecewise linear open polygon with vertices (k/ny x(k/n))) k = 0 , 1 , . . . , n, to each function x. Let Gn,cx = x + c/n, c > 0, and n G N + . Let us verify that the ball V, the number e, and the family (Gn>c) satisfy the assumptions of Theorem 18.4.

Assumptions (i) and (ii) can be readily verified. Assumptions (iv) and (v) can be verified by the method used in the proof of Theorem 19.4, since they are related only to the fact that / G Mw. It remains to prove (iii).

Let us consider the mapping Jn: X —* R n given by

.Let / n = irnJn and ln = Jn^n) = v ^ n » • • • »^n )>

For c > 0 and all n G N, we define mappings Gn)C: R n —> R n by setting

^*7i,cX- — X "T Cifi.

Page 174: Local Properties of Distributions of Stochastic Functionals

158 5. LOCAL LIMIT THEOREMS

Since the subspace En = 7rn(X) is the support of the measure P n and Jn is an isomorphism of En onto R n , we have

An(C) = HPnG-i - P n | | = \\VnG~l -Vn\\.

Since the measure Vn is absolutely continuous and its density is equal to ]X\P(xi)i we obtain

/» lb IV

A„(c)= / lUpM-Ylpixi-clW dx.

LEMMA 20.1. Suppose that p is absolutely continuous. Then

(20.4) A n ( c ) < c / y 2 | | 4 | | L f o i ] .

PROOF. First, note that An(c) = E|l - Zn(c)|, where

" • p ( 6 - c t f ) )

If loc(t) = £, then /n = V v ^ a n d ^n(c) is just the likelihood ratio in the estimation problem for the translation parameter. This relation allows us to prove (20.4) by using a specific method of the asymptotic estimation theory.

Let

Q(s,x) = Y[p(xi-sW).

Obviously,

Therefore,

(20.5)

i=\

A„(C) = / 1 r dQ(s,x) ds

ds dx

< A UR-IV P(^-^})

> v 1/2

Q(s ,x)dx) ds.

For a given s, the functions

7iW = W\ p'(Xj - Sl%})

p(Xi - Sl$)

treated as random variables on the probability space (Rn , Q5n, Q), where Q is the measure with density Q(s, x), are independent and have zero means. Their variance is

J /p'(^-s£)\ 2Q ( S ) X ) d x

JR« Vp(a:;-s4V 2

p(xi - sl$)dxi = Ip. = r (p'jxi-sW)}

Page 175: Local Properties of Distributions of Stochastic Functionals

§20. THE LOCAL INVARIANCE PRINCIPLE 159

Therefore,

IJL ... ^ , _ . / ^ l 2

Q(s, x) dx

(20.6)

1 jRn I 1

U p{Xi - sVk>

= D Q ( E 4 S ) = E(^)2Dg7i = IPjy$?. ^ 1 ' 1 1

Since

r(i+l)/n

Bw..?(.(4i)-.(i)) ,-iz:(.jr''«* n /.(i+l)/n

< £ / (i,(«))2d» = ||CII [o,i|. j Ji/n

we see that relations (20.5) and (20.6) prove the lemma. •

This lemma readily implies assumption (iii). An application of Theorem 18.4 completes the proof of Theorem 20.1. •

Along with continuous random open polygons £n, one often deals with processes of the form

(20.7) CnW = a - 1 5 M , t € [ 0 , l ] ,

whose trajectories are right continuous step-functions. The space C[0,1] cannot be used to study weak convergence of such processes and hence is usually replaced by the space D[0,1] of functions without second-order discontinuities, equipped with the Skorokhod topology. However, in this situation it is easier to use a different space, which is more convenient for our goals.

Let S be the cr-algebra of Borel subsets of D[0,1], and let il be the cr-algebra of subsets of D[0,1] generated by the uniform topology. Let P n be the probability measures on 2) corresponding to £n, and let W be a Wiener measure on 2). The measures P n and W can be continued from 2) to it (see [4, Ch. 3, §18]. We use the same notation P n and W for these continuations. If the variables (£n) that determine Cn are independent and identically distributed and have zero mean and finite variance, then, by analogy with the invariance principle in C[0,1], we have

the weak convergence P n => W. By E[0,1] we denote the uniform closure of the set of functions / G D[0,1]

that have finitely many jumps at rational points. The set E[0,1] equipped with the uniform norm is a separable Banach space; the cr-algebra of Borel subsets of E[0,1] coincides with the cr-algebra E[0,1] fl il. By W E we denote the restriction of the Wiener measure W to this cr-algebra.

Since E[0,1] is the support of the measures P n and W, we can literally repeat the proof of Theorem 20.1 and obtain the following statement.

THEOREM 20.2. Let P n , n— 1,2,.. . , be distributions in E[0,1] corresponding to the processes (n constructed according to (20.7) from independent identically distributed random variables £&. Suppose that E ^ = 0, D ^ = a2 < oo, and the

Page 176: Local Properties of Distributions of Stochastic Functionals

160 5. LOCAL LIMIT THEOREMS

density p of the distribution of £& is absolutely continuous. If Ip < oo and f is a functional on the space E[0,1] belonging to the class MWE, then P n / _ 1 ™ W / " 1 .

One can readily see that for any functional / G Mw on the space D[0,1], the restriction of / to E[0,1] belongs to the class A4W . Therefore, Theorem 20.2 implies the following statement.

THEOREM 20.3. The convergence P n / _ 1 ™ W / _ 1 holds under the assump­tions of Theorem 20.2 for any functional f G Mw on the space D[0,1].

We conclude this section by giving some problems that deal with some other analogs and generalizations of the local invariance principle for independent random variables with finite dispersion (see also the "Bibliographical notes").

PROBLEM 20.1. Consider independent random variables (£n) uniformly dis­tributed on [—1,1]. Let P n be the distributions in C[0,1] of random open polygons £n of the form (1), and let Gn>c be the group of translations along ln(t) = loo(t) = t. Then for any c > 0 one has

n^iip„G-1c-pn|i = 2.

n

This shows that the condition in Lemma 20.1 stating that the Fisher informa­tion is finite is essential.

PROBLEM 20.2 (a generalization of Lemma 20.1). Consider independent ran­dom variables £ i , . . . , £ n with distribution densities p i , . . . , p n . Let J i , . . . , In be the corresponding Fisher information. Then for any real numbers a\,..., an we have the inequality

r n n / n \ 1/2

j*>n i i • v i /

PROBLEM 20.3 (P. Breier). Suppose that (£n,i>^ = 1 , . . . , n ;n G N) is a se­quence of series of random variables that are independent in each series. Sup­pose that Fn,i is the distribution of fn|i; E£n,2- = 0; D£nti = ani\ Sn = YliZnti\

an = DSn]tn,k = an-2Zi<rli.

Let P n be the distributions in C[0,1] of continuous piecewise linear random open polygons £n with vertices at the points (tn^^~lSk)' By In^ we denote the Fisher information of the distribution Fn^. If

n

WmyZlnjali <oo n ' *

2 = 1

and / G Mw, then P n / - 1 ™ W / _ 1 . (Hint: to verify assumption (iii) of Theo­rem 18.4, use the result of Problem 20.2.)

PROBLEM 20.4. From the previous statement, derive the following version of the local limit theorem for the series.

If the variables (£n,i) satisfy the assumptions of Problem 20.3, then the distri­bution densities of the normalized sums cr~lSn converge in the metric of L1(R1) to the distribution density of the standard Gaussian law.

Page 177: Local Properties of Distributions of Stochastic Functionals

§21. THE CASE OF ATTRACTION TO A STABLE LAW 161

PROBLEM 20.5 (a bound for the convergence rate). Suppose that f i , . . . , f n

are independent identically distributed random variables, E£& = 0, D ^ = 1, and E|ffc|3 < oo. Let P n be the distribution in D[0,1] of the random element £n, constructed from £ i , . . . , £n. Let W be a Wiener measure on D[0,1]. We set

/ ex1 /2

lfc(t) = ( t i n - J , fo(t) = t^2\\nt\-\ t G [0,1];

Hi(L) = {xe D[0,1] | ux(t) < lA/>i(t) Vt G [0,1]}, i = 1,2.

Consider the functional

f£(x) = swp{x{t) + h(t)}> ge(x) = sup \x{t) + h(t)l [e,l] M ]

where x G D[0,1], h is a given function from D[0,1], and e G [0,1/2]. Then the following statements hold. 1) (Davydov [34]) Suppose that the absolutely continuous component of the

distribution of the sum Y^l° £& is not zero for some no- Then there exists a constant Ci, depending only on L and E |^ | 3 , such that

sup l lPn/" 1 - W/" 1 ! ! < C i e - ^ O n n ) 3 / ^ - 1 / 8

for all n > e _ 1 , e > 0. 2) (Lifshits) Suppose that for some no the distribution of the sum ^ ^ ° £& has

a bounded density. Then there exist constants C^ and C3, depending only on L and E|£/j;|3, such that

sup HPn/o"1 - W/o"1!! < C2n-^8(\nn)6exp(C3Vh^i). heH2{L)

The same estimates hold for the functionals g£.

§21. The local invariance principle in the case of attraction to a stable law

Now we assume that independent identically distributed variables (£n) lie in the domain of attraction of a stable law Fa with exponent a G (0,2). This means that the normalized sums B~lSn, where Sn = Y^l ffc> Bn = n 1 / aL(n) , and L is a slowly varying function that weakly converges to Fa. In this case, Skorokhod proved that the random processes

(2l.l) Ut) = B-1S[nt], * e [ 0 , l ] ,

weakly converge in the space D[0,1] to a stable process va for which VVot{\) = Fa. Thus, he obtained an analog of the Donsker-Prokhorov invariance principle. Note that in this case the continuous open polygons of the type (20.1) (with an replaced by Bn) do not weakly converge in C[0,1], since the trajectories of the process va

are discontinuous with probability 1. However, the main difficulty that arises when we try to obtain an analog of the local invariance principle in this case is related not to the passage from the space of continuous functions to the Skorokhod space D[0,1], but to the fact that the set of admissible translations of the measure Q a , the distribution of the process vaj is trivial. Therefore, the translation mappings

Page 178: Local Properties of Distributions of Stochastic Functionals

162 5. LOCAL LIMIT THEOREMS

used in the proof of Theorem 20.1 prove useless, and we have to seek other families of mappings.

First of all, we list some well-known properties of the processes vQ. The characteristic function of the variable va{t) has the form

(21.2) Eexp{isva(t)} = exp <ias + t I exp{ius} — 1 - ^ J7ra(^) du >,

where cuc2 > 0, cuc2 ^ 0, and na(u) = | r i | -1 _ a(ci l ( 0 | O o )(u) + c2l(_OO)0)(^)) is the density of the Levy measure of the distribution FQ.

We shall consider the case of strong stability; this means that the constant a in (21.2) is zero.

With probability 1, for any e > 0 the trajectories of the process vQ have only finitely many jumps of magnitude > e. At the same time, with probability 1 the process va has at least one jump in any subinterval of [0,1]. Moreover, if the constant C\ in (21.2) is not zero, then there is a jump of positive sign; a similar statement holds in the case c2 7 0.

If C2 = 0 and c\ ^ 0, then

suppQQ = X+ = {xG D[0,1] I x(0) = 0, x does not decrease}.

If c\ = 0 and c2 ^ 0, then

suppQa = X_ = {x e D[0,1] I x(0) = 0, x does not increase}.

If Ci 7 0 and c2 ^ 0, then

suppQQ = X 0 = {x e D[0,1] I x{0) = 0}.

These and some other properties of stable processes are described in [94]. In this section, we consider the following families {(7C, c € R+} of transforma­

tions of the space X = D[0,1]:

(21.3) Gcx = x + dx,

where (lXi x G X) is a set of vectors from X, called a local direction field. Obviously, if lx = I for all x, then we arrive at the familiar translation mappings.

It turns out that the direction field (lx) can be chosen so that the measure Q a

is quasi-invariant with respect to the mapping Gc, that is,

QoG;1 - QQ

for all c > 0. In this case, the local direction field (lx) is said to admissible for Q a . The set of such fields is rather wide, and this allows us to apply the general

results of §18 for the derivation of the local invariance principle. To describe the local direction fields, we introduce some auxiliary notation. We denote the value of the jump of a function # at a point s by 6x(s); that is,

Sx(s) = x(s) -x(s-). For e > 0, we set

J+(e) = {se[0)l]\6x(s)>e})

J-(e) = {se[0,l]\5x(s)<-e},

Jx(e) = J+(e)UJ~{e).

Page 179: Local Properties of Distributions of Stochastic Functionals

§21. THE CASE OF ATTRACTION TO A STABLE LAW 163

As was already noted, card Jx(e) < oo. Suppose that m G N, A i , . . . , A m are disjoint open subintervals of the interval

[0,1], and Ti , . . . , r m are real numbers. These objects and the number e are called the parameters of the field.

For s,t G [0,1], we set 771

¥>.(*) = ET*1A1(S)1(.,oo)(*)> 1 = 1

and for e > 0 we define the local direction field (lx,x G X) by setting

(21.4) *«= E <tf- E *>•"; seJ+(e) seJx(e)

here </?+ and </?J" are the positive and the negative part of y>5, respectively. Relation (21.3) defines the mappings Gc corresponding to this field. Let us describe the action of Gc on the function x informally. Suppose that

x has a jump at some point s € 4 , so that 6x(s) > e and r» > 0. Then at this point the function Gcx also has the jump 6x(s) + CTI. The functions x and Gcx have equal increments between neighboring jumps.

This relation implies that the family (Gc>c G R+) is a semigroup, that is, GCl+C2£ = GC2{GClx) for all ci,C2 > 0. Note that the trajectories of the semigroup (Gc) are half-lines, that is, rays (which are however not parallel to each other). That is why the local direction fields in question are said to be radial The semi-group (Gc) is a very special case of the set of semigroups of transformations, described by Lifshits in connection with functionals of random processes with independent increments. He obtained a result (see Theorem 9.2 in [40]), which readily implies the following statement.

THEOREM 21.1. The radial field {lx} is admissible for the measure Q a . The conditional measures for Qa on the orbits of the semigroup (Gc) exist and have densities.

REMARK. Precise expressions for these densities are given in [40, §9]; we do not need them in the sequel.

The following statement shows that the set of radial fields is sufficiently large.

LEMMA 21.1. Suppose that ci • C2 ^ 0 in (21.2). Then for Qa-almost each x and any open ball U in Xo, there exist an open ball Vx centered at x and parameters e, N, { A i , . . . , Ayv}, and {TI, . . . , ryv} such that the radial field {lx} defined by these parameters has the following property: for each y eVx the vector ly belongs to U.

If c\ ^ 0 and ci = 0 (or c\ = 0 and c<z ^ 0), then a similar statement remains valid with Xo replaced by X + (respectively, by X_) . The hint to Problem 21.1 outlines the proof of this lemma.

We say that a functional / : X —> R 1 belongs to the class /CQQ if for Qa-almost each x there exist an open ball Vx centered at x and a radial field {ly>y G X} such that for Qa-almost each y eVx and for any segment A = {y + cly, c G [0, a]} lying in Vx the convergence of segments A n to A implies that A A W / ^ ^ A A / A 1 -

This definition is close to the definition of the classes Mp', therefore, it is not surprising that the class /CQQ has similar properties (see Problems 21.2 and 21.3).

Now we can state the main result of this section.

Page 180: Local Properties of Distributions of Stochastic Functionals

164 5. LOCAL LIMIT THEOREMS

THEOREM 21.2. Suppose that (£n) are independent identically distributed ran­dom variables that lie in the domain of attraction of a strongly stable law with exponent a, 0 < a < 2 (the constant a in representation (21.2) is equal to zero). Let P n be the distribution in the space D[0,1] of the random processes (21.1) con­structed from the variables £n. Suppose that the distribution F of the variables £n

has density q which is an absolutely continuous function. Let Aq(u) be the total variation of q on the set R 1 \ [—u, u],

(21.5) Aq(u)= f \q'(s)\ds, J\s\>u

and let

(21.6) supnBnAq(Bn) < oo,

where the Bn are the normalizing constants in (21.1). Then for any f G /CQQ we have the convergence in variation

(21.7) P n / ^ - Q o / - 1 .

Condition (21.6) provides some regularity of the density q. This condition plays the same role as the condition stating that the Fisher information is finite in Theorem 20.4.

Before proving the theorem, we formulate the following corollary.

COROLLARY 1. Suppose that under the assumptions of Theorem 21.2 the dis­tribution of the variables £n is stable, that is, F = Fa. Then (21.7) holds for any /e /CQ a .

The proof follows from the fact that in this case \q'(s)\ ~ b\s~2~a as s —> oo and \q'{s)\ ~ b2S~2~a as s —> —00 for some choice of the constants 61 and 62 [51]. Therefore, Aq(u) ~ Cu~1~a

) and inequality (21.6) holds since Bn ~ Cnl/a.

P R O O F OF THE THEOREM. Just as in the proof of the local invariance principle for random variables with finite dispersion, we use Theorem 18.4.

By Xo we denote the set of Qa-full measure of points x for which there exist an open ball Vx and an admissible radial field {ly,y G X} that possess the properties listed in the definition of the class /CQ0 .

For each n G N+, we set

Gn,cy = Gcy = y + cly, yeX.

Let us show that all assumptions of Theorem 18.4 are satisfied. Assumptions (i) and (iv) are obvious. Assumption (v) readily follows from the

fact that / G /CQQ . Since on the orbits of the semigroup (Gc) there exist densities of the conditional

measures for the measure Q a , we have

(21.8) | | Q a G c - i _ Q a | | _ 0 , c - + 0

(see Problem 21.4). Therefore, we need not verify that the mappings Gc are continu­ous Qa-almost everywhere (see the discussion of the assumptions of Theorem 18.4). The verification of the second part of assumption (ii) does not pose any problems.

Page 181: Local Properties of Distributions of Stochastic Functionals

§21. THE CASE OF ATTRACTION TO A STABLE LAW 165

Indeed, let e > 0. Let M£(x) be the number of jumps of x with magnitude > e. Then

sup \lx(t)\ <TM£{x), *E[0,1]

where T = max |T~»|, and therefore,

Pn{x | p(x,Gcx) >e} = Pn{x | c sup \lx(t)\ > e} < Pn{x \ Me(x) > 4^}-

Since for any k G N the set {x \ M£(x) > k} is closed (see Problem 21.6), we have

hmPn{a; | p(x,Gcx) > e} < Pooja; | M£(x) > [ ^ ; ] } .

Since for Q^-almost all x the number of jumps whose absolute values are bounded away from zero by some positive number is finite, it follows that the right-hand side of the above inequalities tends to zero as c —> 0, and the desired statement is thereby proved.

Thus, it remains to verify that (iii) is satisfied under our assumptions. This can be performed by arguments close to those used in the proof of Lemma 20.1.

By E n we denote the subspace of functions that are constant on the intervals [(k-l)/n) k/n). Obviously, P n ( E n ) = 1. Let Jn: E n —> R n be the mapping defined by the formula Jn(x) = x = (x i , . . . ,xn)> x^ = x(k/ri). We set P n = Pn^T 1 a n d Gc = JnoGcoJ-1. Then

^ ( c ) = | | P n - P n G - 1 | | = | | P n - P n G - 1 | | = / |Q(c, x) - Q(0, x ) | dx,

where Q(s,x) is the density of the distribution PnGj1. It follows from the construction of the transformation Gc that Q(s, x) has the

form n

Q(5,x) = ]\q(xi - sBnWi(xi)), l

where J Tj if \t\ > eBny i/n G Aj , and Tjt > 0, \ 0 otherwise.

Hence,

and therefore,

OS Mc) = / I f

JRn I JO

< [ [ ( ± sBn\Wi{Xi)\ WjXi " SnnW1 *f ) <?('• *> d*ds

Jo Jit" V i q(xi - sBnWi(xi)) )

= Bn s[^2 \wi(xi)\ W(xi ~ sBnWi{xi))\dxAds.

Page 182: Local Properties of Distributions of Stochastic Functionals

166 5. LOCAL LIMIT THEOREMS

Since \wi(xi)\ < Tlr. . i > e B i, we have

6n(c) < TBn J s( V / \q'(t - sBnWi(xi))\ dt) ds JO \iJ\t\>eBn /

< TnBn f s [ \q'(u)\ duds = ^-TnBnAq(^Bn) JO J\t\>%Bn * \Z /

for c < e/2T. For any positive integer k we have Bn ~ kl/aB[nm\ therefore, the inequalities

LK n[n/k] - ^n - 2 [n/k]

hold for sufEciently large n. Let us choose k so that k > (4/e) a . Then

nBnAq(^Bn) < 2nkl^B[n/k]Aq(^kl^B[n/k^

Hence,

lim nBnAq(-Bnj = C < oo,

whence we obtain lim Sn(c) < CTc2.

n This inequality means that assumption (iii) of Theorem 18.4 is satisfied. •

PROBLEM 21.1. Prove Lemma 21.1. (Hint: suppose that c\ ^ 0 and c<i ^ 0. The other cases can be considered in a similar way. Without loss of generality, we can assume that U is a ball centered at Z, where I has the form

M

1

here M, c^, tj, j = 1 , . . . ,M, are given numbers, and tj G [0,1]. Suppose that 6 > 0 is so small that the intervals Ij = (tj — £, ij + 6) are disjoint and for any Uj G Ij the function l(t) = Y^i aj^-\u. ii ^ e s m U- Suppose that on each interval Ij the function x has at least one jump of the same sign as aj (say, at a point Wj G Ij). The set of such x is of full QQ-measure. Note that for an e sufEciently close from the left to minj \6X(WJ)\, there exist subintervals Ij centered at Wj and an open ball Vx centered at x such that each function y G Vx has exactly one discontinuity point in Ij at which the value of the jump is greater than e. Then the field {ly} with parameters e, N = M, A^ = J^, and Tj = OLJ, j = 1 , . . . , TV, is the desired field.)

PROBLEM 21.2. A functional / : X —> R 1 is said to belong to the class /CQQ C /CQQ if / G /CQQ and A A / ^ 1 <C A. Prove the following analogs of Problems 19.1 and 19.2:

a) the class /CQQ is everywhere dense in the class of all Qa-almost everywhere continuous Junctionals;

b) if / G /C£a, then Q * / - 1 < A.

Page 183: Local Properties of Distributions of Stochastic Functionals

§22. THE INFINITE-DIMENSIONAL LOCAL LIMIT THEOREM 167

PROBLEM 21.3. Suppose that for Qa-almost each x eX. the following condi­tions hold:

a) there exists an / such that Dif(x) = 0; b) the mapping (h,y) —» Dhf(y) is continuous at the point (/,#) (in the direct

product topology). Then / G /CQQ- (Hint: see the proof of Theorem 19.1.)

PROBLEM 21.4. Suppose that f(x) = J0 h(x(t))dt, h is locally Lipschitzian, and h! is continuous and nonzero almost everywhere on R1 . Then / G /CQQ . (Hint: use the result of the preceding problem.)

PROBLEM 21.5. Prove relation (21.8). (Hint: use the total probability formula (3.4) with respect to the partition T into orbits of the semigroup (Gc).)

PROBLEM 21.6. Suppose that Mx(e) = ca,rdJx(e), k G N. Prove that the set {x | Mxe > k} is closed.

§22. The infinite-dimensional local limit theorem

Suppose that £ is a random element of a separable Banach space (X, || • ||) and P is the distribution of £ on the Borel cr-algebra 2$x of the space X. Suppose that fii&j • • • are independent copies of £, Sn = n _ 1 / 2 ( f i + • • • + £n), and P n is the distribution of Sn. Suppose that the central limit theorem (CLT) holds for P n , that is, the P n converge weakly to a nondegenerate Gaussian measure Poo.

If X is finite-dimensional, then we understand the local limit theorem for the sequence (£n) as the statement that the densities dPn/d\ converge to dP^/dX, where A is the Lebesgue measure. In the infinite-dimensional case there is no analog of the Lebesgue measure. Therefore, the problem of how to interpret the local limit theorem is nontrivial.

If we admit that for any given measure \x the densities dPn/d/i converge to dPoo/djj, in mean with respect to \±, then, as is well known, this means that the measures P n converge in variation to Poo. This means that there is no dependence on the auxiliary measure, and we must study convergence in variation. In this connection, it is interesting to know whether the following analog of Prokhorov's theorem [51] is valid in the infinite-dimensional case.

THEOREM A. The convergence P n ™ Poo holds if and only if for some no the measure P n o has a nonzero component that is absolutely continuous with respect to Poo-

Results of this type are interesting because in this case we have the convergence

(22.1) P^-1 ™ Poo / " 1

for any functional / on X. In this section we prove an auxiliary result; namely, we obtain the conditions

under which (22.1) is satisfied for a certain class of functionals / . Note that for all n the measures P n can be singular with respect to Poo, and hence, our result is not covered by Theorem A.

To give the precise statement, we need some notions and notation. Just as in the preceding sections, by

/ , - / <52!<(A

Page 184: Local Properties of Distributions of Stochastic Functionals

168 5. LOCAL LIMIT THEOREMS

we denote the Fisher information for a probability distribution F on R 1 with den­sity q.

For each Z G X, by Ti we denote the partition of the space X into lines j y = {y + c ! , cGR 1 } parallel to I. By {ay} and x, we denote, respectively, the system of conditional measures and the quotient measure for the measure P with respect to the partition T/ (naturally, they are independent of I). We set I^y = Iay.

Suppose that /C is a subset of X and Q is a measure on Q3x- We say that a functional / belongs to the class MQ(IC) if for Q-almost each x G X there exist an open ball U and a vector I G /C such that for any segment A that lies in U and is parallel to I the condition A n —> A implies that

Note that the class MQ introduced earlier in this book is just A ^ Q ( H Q ) , where H Q is the space of admissible translations of the measure Q. In the sequel, by /C we denote the set

/C = {I G X | Mi = ess sup// y < oo}.

THEOREM 22.1. Suppose that the central limit theorem holds, that is, P n => Poo. Then (22.1) is satisfied for any f G Mp (/C).

Note that specific conditions under which the central limit theorem can be used are not essential to us (we refer the reader to [84]).

Let us consider another example in which Theorem 22.1 can be used. Suppose that X is a Banach sequence space with strictly convex norm and

f(x) = \\x\\. Suppose that the first component (j of a random element £ = (Ci» C2» • • •) is independent of the other components (£2, C3> • • • )• I*1 other words, the distribution P of this element can be viewed as a product measure. We also assume that the distribution of £i has finite Fisher information I\.

COROLLARY. If the central limit theorem is satisfies for £, then the distribution of the variable \\Sn\\ converges in variation to the distribution of the norm of the Gaussian random element 5QO w^ distribution Poo.

PROOF. In this case, the class /C necessarily contains the vector I = (1 ,0 ,0 , . . . ) , since I^y = I\ for P r i -

a l m o s t all y. Then we have / G Mp (/C), thanks to the strong convexity and Theorem 4.4. •

REMARKS. 1) The condition of strong convexity is, in fact, too strong. For the corollary to hold, it suffices to require that for x ^ 0 the function c —> \\x + d||, c G R1 , attain its minimum only at one point.

2) The measures P n can be singular with respect to PQO for all n. Indeed, it suffices to assume that £2 is independent of (£3, £4,. . .) and the distribution of the variable £2 is discrete.

P R O O F OF THEOREM 22.1. We need the following auxiliary fact. Let us consider the partition Ti of the space X into lines parallel to /. We

identify the quotient space X/r^ with some subspace Y that does not contain /. By Y n we denote the direct product Yl™ Y.

Page 185: Local Properties of Distributions of Stochastic Functionals

§22. THE INFINITE-DIMENSIONAL LOCAL LIMIT THEOREM 169

LEMMA 22.1. Let Q i , . . . , Q n be a system of probability measures on 2$x- Let {ay , y e Y} and x& be the system of conditional measures and the quotient mea­sure for Qk with respect to the partition i y Suppose that {ay,y e Y} and x are the system of conditional measures and the quotient measure for the convolution Q = Qi * • • • * Q n with respect to i y Then the following assertions hold:

(i) x = x\ * • • • * xn\

( i i ) a y = / o£U---*o$xy(dy), JEy _

where Ey = {y € Y n \y\-\ h yn = y} and xy are the conditional measures for x\ x • • • x xn on the fibers Ey.

PROOF. Suppose that h is a nonnegative bounded measurable function on X and the mapping a: X n —> X is defined by a{xi)...,xn) = x\ -\ h xn. Then Q = (Qx x • • • x Qn)cr_1; therefore

hdQ= h(xi H h x n )Qi x • • • x Q n (dx) Jx. Jxn

(22.2) = I \ I M(»l + ClO + • • • + (Vn + Cnl))

xo$x---xo$(dc) xi x ••• x x n (dy) ;

here we use the partition Ti on each copy of X. The partition of the space Y n into the fibers Ey can be considered as the partition into preimages of points under the mapping a: Yn —> Y. Thus, it is natural to identify the quotient space with Y. Obviously, for x\ x • • • x xn the quotient measure with respect to this partition is equal to x = x\ * • • • * xn. Therefore, we can transform the integrals in (22.2) as follows:

f hdQ= [ [ [ fc(y + (ci + - " + Cn)Z) 7X ./Y JEy Ai/i><-X7yn

(22.3) x o$ x . . . x a£>(dc)xy (dy)x (dy)

= h(y + cl)ay(dc)x(dy), JYJly

where ay = fEy ay\] * • • • * ay

n^xy (dy).

Relation (22.3) readily proves our lemma. •

Now we return to the proof of the theorem. Let us verify that under our assumptions all five conditions of Theorem 18.4 are satisfied.

Let x £ Xo be the set of Poo-full measure mentioned in the definition of the inclusion / G Mp (/C), let U be the corresponding open ball, and let I be the corresponding vector from /C.

We define a mapping Gn>c = Goo,c = Gc by setting

Gcz = z + cl, ce [0,1].

We can assume that U is a ball centered at the point x. Suppose that r is its radius. Then we can take V to be the open ball of radius r /2 centered at x and set e = r| | / | | /2.

Page 186: Local Properties of Distributions of Stochastic Functionals

170 5. LOCAL LIMIT THEOREMS

Let us show that the above mappings, the ball V, and the number e satisfy all the necessary assumptions.

Assumptions (i) and (ii) of Theorem 18.4 are automatically satisfied. Assump­tions (iv) and (v) can br readily derived from the fact that / G Mp (/C) and from Theorem 4.5. It is more difficult to verify assumption (iii).

By P n we denote the distribution of the sum £i H h £n. Obviously, we have

I I P n - P n G f ^ l ^ H P n - P n G ^ H .

Suppose that {vy,y G Y} and v are the system of conditional measures and the quotient measure for P n with respect to the partition T/; next, suppose that {vy,y G Y} and v are similar objects for PnG~\^. Obviously, v = v. Therefore,

| | P n - P » G j 1 | | = jh |*/y- i7y | | l> (dy).

By applying Lemma 22.1 to the measures Q^ = P , k = 1 , . . . , n, and then to the measures Qfc = ^nG~}r^, k = 1 , . . . , n, we obtain

\Wv ~ *Vll = / « » ! * • • • * a X n x y (dx) - / aXl*--* aXnJcy (dx) (22.4) "j!Ey JEy

< / \\aXl * • • • * aXn - aXl * • • • * aXn \\xy (dx); JEy

here {otyyy G Y} and {5y ,y G Y} are, respectively, the systems of conditional measures for P and PG",1,^- with respect to T/; x is the corresponding quotient measure.

Since / G /C, it follows that the measures ax have finite Fisher information 7X, and Mi = ess sup Ix < oo. It follows from (22.4) and the estimate in Problem 20.2 (with di = Cy/n) that

P / n \ 1 / 2

\\"V ~ ?y|| < JE ^ E ^ c 2 n _ 1 J *V (dX) ^ °Ml '

The obtained estimate is uniform in y and n. Therefore,

IIP — P n~1\\<rM1^2

and assumption (iii) is thereby proved. Thus, the measures P n and the functional / satisfy all assumptions of Theorem 18.4. Hence, P n / _ 1 ™ Poo / - 1 - D

Page 187: Local Properties of Distributions of Stochastic Functionals

Bibliographical Notes

§1. The notions considered in this section are conventional. The problems discussed here are considered in more detail in the books [4, 15, 16, 18, 24, 27, 44, 53, 88, 97].

§2. A. D. Aleksandrov was the first to study weak convergence of measures. The theory of weak convergence was developed by Kolmogoroff, Prokhorov, Sko­rokhod, and Donsker. For references to the classical works in this field, see [4]. Theorem 2.2 is a special case of results obtained by Billingsley and Tops0e [125]. Apparently, the result of Theorem 2.3 has not been mentioned previously.

§3. Rokhlin constructed the theory of measurable partitions. Concerning the existence of conditional measures, see [78, §§45, 46].

§4. The main idea of the stratification method first appeared in Davydov [30]. Later on, this method was used for solving various problems in [31-36, 56-58, 60—62, 89—101]. A review of these papers can be found in [40]. The general scheme in terms of semigroups was proposed in [60]. The one-dimensional analog of Lemma 4.1, which is very important, was proved by Vershik and Skorokhod [95]. Our proof of this lemma is based on an idea proposed by Polishchuk.

Theorem 4.2 for absolutely continuous functions was proved by Geman and Horowitz [137]; the corresponding result for monotone functions was obtained by Zaretskii (see [74]). Wide generalizations of Theorems 4.2 and 4.3 [41, 111] are well known; their proofs go back to Banach's theorem on the indicatrix [121, 74]. Theorems 4.4-4.6 are taken from [34, 40]. The classical Theorems 4.7 and 4.8 are due to Gnedenko [26; 5, Ch. 4], but the proof of Theorem 4.8 in this book is new. It was given by Lifshits.

§5. This section is a slight modification of Davydov's paper [170]. §6. The method presented in this section was stated by Smorodina in [104].

The main ideas of this approach have common features with the methods used by Malliavin [146] and his numerous successors [42, 20, 122, 123, 124, 160, 163] to prove the smoothness of one-dimensional distributions of diffusion processes.

§7. The first examples and definitions of Gaussian measures were published by Wiener [165] and Kolmogorov [141]. The most important properties of these measures can be found in the monographs [27, 44, 88, 97, 120]; see also [129, 138, 161]. Cameron and Martin [131] were the first to consider a special case of the very important formula for the density of the translated Gaussian measure in terms of the original measure. Systems of conditional Gaussian measures that appear due to the partition into parallel lines, as well as the description of the space of admissible translations in terms of the factorization of the correlation vector (Theorem 7.3), were considered in [40, 57]; for the factorization, see also [15, 16]. The following classes of Gaussian measures and processes, which are most interesting to us, have been considered: measures in the Hilbert space, by Skorokhod [97]; stationary

171

Page 188: Local Properties of Distributions of Stochastic Functionals

172 BIBLIOGRAPHICAL NOTES

processes by Rozanov [52, 88]; semistable processes by Molchan and Golesov [70]; the white noise by Bochner [126]; and the multiparameter field by Yeh [166, 167]. Aronszajn studied reproducing kernels [119, 116].

§8. The material considered in this section goes back to [104]. The system of definitions follows in many aspects the cycle of Shigekawa's papers [158, 159, 139] in which he considered a special case of the Wiener measure. The book by Daletskii and Fomin [44] contains numerous integration by parts formulas for Gaussian mea­sures, as well as an alternative approach to the definition of differentiability of a measure [44, Ch. 4]. Among the papers dealing with the properties of differentiable measures, we point out [6, 7, 43, 72, 108, 109], which are closely related to our monograph.

§9. We discuss the results about absolute continuity following [40], but these results were published earlier in [30, 34, 37, 58, 99, 158]. We consider the boundedness condition for the distribution density of functionals, which is very useful for estimating the rate of convergence in the invariance principle [8, 9, 11; 84, Ch. 5]. Theorem 9.1 was proved in [40, 64]. In [157, 92, 10, 57, 93], successively weakened versions of the boundedness condition for integral functionals of the Wiener process were found. Some results about the smoothness of functional distributions were obtained in [102, 104]; some closely related classes of functionals were considered in [21].

An important example of smooth functionals is the value of the diffusion process if we consider this value as a functional on the space of trajectories of the Wiener process that generates the diffusion. The analysis of studying this situation in relation to smooth solvability of differential equations led to the so-called "Malliavin calculus"; see the remarks to §6.

§10. Sudakov and Tsirel'son [107] and Borell [128] independently proved the isoperimetric property of half-spaces with respect to a Gaussian measure. Various kinds of Brunn-Minkowski inequality were obtained in [127, 128, 135], and some special cases of this inequality, in [49, 118]. In our monograph, we follow Ehrhard's approach [135], which allows us to obtain both the isoperimetric property and the Brunn-Minkowski inequality. For "non-Gaussian" generalizations, see [5, 127].

§11. Tsirel'son [113] presented a characterization of the distribution structure for convex functionals. The important Example 11.2 is taken from [117], Theo­rem 11.2 from [168], Theorem 11.3 from [116], and its proof from [58]. For the geometric characterization of the separation point, see [40]; in [59] it was shown how this point is related to degenerate functionals. The simplest results about the absolute continuity [27] and the boundedness of the distribution density [150] of a convex functional were pointed out long ago; some similar facts about non-Gaussian measures are proved in [35, 36].

More or less sharp exponential estimates for the measure of large deviations of a functional, which are close to Theorem 11.5, were obtained in [71, 96, 127, 129, 136, 144]. Freidlin [17, 112] used the statement with the functional ac­tion (Proposition 11.5); the statement with the norm of the correlation operator (Proposition 11.6) probably originates somewhere in mathematical folklore. For a review and a list of papers dealing with sharp exponential estimates for specific classes of Gaussian measures and functionals, see [19, 85, 86, 98]. The new Theo­rem 11.6 (Lifshits) appeared because of further development of the theory of large deviations [2].

Page 189: Local Properties of Distributions of Stochastic Functionals

BIBLIOGRAPHICAL NOTES 173

§12. The absolute continuity of the norm distribution was considered in [40]. The idea of constructing the Gaussian measure with unbounded density of the norm distribution was proposed by Tsirel'son; then this idea was independently implemented by Paulauskas [81, 84, 149] and Lifshits [40]. Bentkus proposed Lemma 12.1, which is very useful for constructing a counterexample. Further con­sideration of examples with bad properties of the distribution of the norm can be found in [75, 77, 151, 152]. Theorem 12.1 belongs to Rhee and Talagrand [153]. This theorem generalizes earlier results due to Davydov [40] and Paulauskas [80] to the spaces Lp and lp, as well as the remarks from [15, 143] that pertain to Hilbert spaces. The dependence of the uniform estimate of the density on the correlation operator and the barycenter of the measure was studied in [79, 80, 84, 110, 117].

The entropy approach to estimating the density was proposed in [62-64]. This approach is based on the ideas of Dudley and Sudakov and on estimates obtained by Dmitrovskii [46]. Smorodina [102, 103] and Uglanov [109] proved the smoothness of the norm distribution for Lp.

§§13, 14. The method used in these section appeared first in [105, 106]. §15. The main results of this section were obtained by Smorodina. These

results are new; this is their first publication. §16. The results of this section first appeared in [105, 106]. In [123], similar

methods were used to study the smoothness of densities of one-dimensional distri­butions for locally infinitely divisible processes. The results about the boundedness of the distribution densities of functionals of processes with independent increments can be found in [40, 61].

§17. This result appeared first in [106]. Some other interesting results about the properties of the norm distribution for random vectors are proved in [65, 130, 145, 154, 169].

§18. Theorem 18.1, which is the simplest application of the stratification method for solving the strong convergence problem, was obtained in [34]. That paper also contains more difficult results on strong convergence.

Theorem 18.2, which illustrates the possibilities of Theorem 18.1, is a special case of a result obtained by Parthasarathy and Steerneman [148] by quite a different method. Note that this result can be substantially improved if we use Theorem 18.1.

Theorem 18.4 is an improved version of a theorem from [38]. The result of Problem 2 was also obtained in [38]. §19. Theorem 19.4 was proved by Davydov [40]. The superstructure method

(Theorem 18.4), which appeared later, allowed us to simplify the proof considerably. §20. The local invariance principle for independent identically distributed ran­

dom variables (Theorems 20.1 and 20.2) was obtained by Davydov [40]. By using the superstructure method, just as in the Gaussian case, the proof can be substan­tially simplified and clarified.

The idea of using the spaces £"[0,1] for the consideration of discontinuous ran­dom open polygons is due to Borodin.

The condition that the Fisher information is finite seems too strong in Theo­rems 20.1 and 20.2. Recently, Davydov proved that if one considers a somewhat narrower functional class than Mw, then instead of the condition lp < oo it suf­fices to require that the initial values of F have a distribution density. The proof of this result is rather complicated and cannot be presented in this monograph.

Various generalizations of Theorems 20.1 and 20.2 can be found in [100, 101, 67, 68, 12, 13].

Page 190: Local Properties of Distributions of Stochastic Functionals

174 BIBLIOGRAPHICAL NOTES

Some other results of local invariance principle type for special classes of func­tional can be found in the review [40].

§21. Theorem 21.2 was obtained by Breier. §22. Theorem 22.1 was obtained in [39]. Bentkus [171] and later Zitikis [172]

obtained local limit theorems in Banach spaces for some cases in which / is a homogeneous symmetric polynomial.

Page 191: Local Properties of Distributions of Stochastic Functionals

Bibliography

1. Ya. I. Belopol'skaya and Yu. L. Daletskii, Stochastic equations and differential geometry, "Vysshaya Shkola", Kiev, 1989; English transl., Kluwer, Dordrecht, 1990.

2. V. Yu. Bentkus, Large deviations in Banach spaces, Teor. Veroyatnost. i Primenen. 31 (1986), 710-716; English transl., Theory Probab. Appl. 31 (1986), 627-633.

3. V. Yu. Bentkus and D. Pap, On the norm distribution for a stable random vector in a Hilbert space, Litovsk. Mat. Sb. 26 (1986), 211-220; English transl. in Lithuanian Math. J. 26 (1986).

4. P. Billingsley, Convergence of probability measures, Wiley, New York, 1968. 5. D. G. Bobkov, Upper functions and oscillations of Gaussian processes and related random

processes, Thesis, Leningrad State University, Leningrad, 1988. (Russian) 6. V. I. Bogachev, Some results on differentiable measures, Mat. Sb. 127 (1985), 336-351;

English transl. in Math. USSR Sb. 55 (1986). 7. , Subspaces of differentiability of smooth measures on infinite-dimensional spaces,

Dokl. Akad. Nauk SSSR 299 (1988), 18-22; English transl. in Soviet Math. Dokl. 37 (1988). 8. I. S. Borisov, On the rate of convergence of distributions of junctionals of integrable type,

Teor. Veroyatnost. i Primenen. 21 (1976), 293-308; English transl., Theory Probab. Appl. 21 (1976), 283-298.

9. , On the rate of convergence in the "conditional" invariance principle, Teor. Vero­yatnost. i Primenen. 23 (1978), 67-79; English transl., Theory Probab. Appl. 34 (1978), 63-75.

10. , On conditions for the existence of a bounded density for the distribution of additive functionals of some random processes, Teor. Veroyatnost. i Primenen. 25 (1980), 464-475; English transl., Theory Probab. Appl. 25 (1980), 454-465.

11. A. A. Borovkov, On the rate of convergence in the invariance principle, Teor. Veroyatnost. i Primenen. 18 (1973), 217-234; English transl., Theory Probab. Appl. 18 (1973), 207-224.

12. P. Breier, Local limit theorems for functionals in the conditional invariance principle, Zap. Nauchn. Sem. LOMI 158 (1987), 32-38; English transl. in J. Soviet Math. 43 (1988), no. 6.

13. , Local invariance principle for independent non-identically distributed random vari­ables, Rings and Modules. Limit Theorems in Probability Theory, vol. 2, Leningrad State Univ., Leningrad, 1988, pp. 143-149. (Russian)

14. , Local invariance principle in the case of convergence to a stable law, Manuscript. (Russian)

15. N. N. Vakhaniya, Probability distributions in linear spaces, "Metsniereba", Tbilisi, 1971; English transl., North-Holland, Amsterdam, 1981.

16. N. N. Vakhaniya, V. I. Tarieladze, and S. A. Chobanyan, Probability distributions in Banach spaces, "Nauka", Moscow, 1985; English transl., Reidel, Dordrecht, 1987.

17. A. D. Wentzel, Theorems about the action functional for Gaussian random functions, Teor. Veroyatnost. i Primenen. 17 (1972), 542-544; English transl., Theory Probab. Appl. 17 (1072), 515-518.

18. , A course in the theory of random processes, "Nauka", Moscow, 1975; German transl., Theorie zufallige Prozesse, Akademie-Verlag, Berlin, and Birkhauser, Basel, 1979.

19. M. I. Freidlin and A. D. Wentzel, Random perturbations of dynamical systems, "Nauka", Moscow, 1979; English transl., Springer-Verlag, New York, 1984.

20. A. Yu. Veretennikov, A probability approach to hyperellipticity, Uspekhi Mat. Nauk 38 (1983), no. 3, 113-125; English transl. in Russian Math. Surveys 38 (1983).

175

Page 192: Local Properties of Distributions of Stochastic Functionals

176 BIBLIOGRAPHY

21. T. R. Vinogradova, On the accuracy of normal approximation on sets defined by a smooth function. I, II, Teor. Veroyatnost. i Primenen. 30 (1985), 219-229, 554-557; English transl., Theory Probab. Appl. 30 (1985), 235-245, 590-593.

22. J. Galambos, The asymptotic theory of extremal order statistics, Wiley, New York, 1978. 23. W. Hoeffding, On a theorem of V. M. Zolotarev, Teor. Veroyatnost. i Primenen. 9 (1964),

96-99; English transl., Theory Probab. Appl. 9 (1964), 89-92. 24. I. I. Gikhman and A. V. Skorokhod, Theory of random processes. Vol. 1, "Nauka", Moscow,

1971; English transl., Springer-Verlag, New York, 1974. 25. I. I. Gikhman and A. V. Skorokhod, Stochastic differential equations and their applications,

"Naukova Dumka", Kiev, 1982; English transl. of 1st ed., Springer-Verlag, New York, 1972. 26. B. V. Gnedenko, A local limit theorem for densities, Dokl. Akad. Nauk SSSR 95 (1954), 5-7.

(Russian) 27. H.-H. Kuo, Gaussian measures in Banach spaces, Lecture Notes in Math., vol. 463, Springer-

Verlag, New York, 1976. 28. V. M. Gol'dstein and Yu. G. Reshetnyak, Introduction to the theory of functions with gen­

eralized derivatives and quasiconformal mappings, "Nauka", Moscow, 1983; English transl., Quasiconformal mappings and Sobolev spaces, Kluwer, Dordrecht, 1990.

29. Yu. A. Davydov, The invariance principle for stationary processes, Teor. Veroyatnost. i Primenen. 15 (1970), 498-509; English transl., Theory Probab. Appl. 15 (1970), 487-498.

30. , On absolute continuity of distributions of Junctionals of random processes, Teor. Veroyatnost. i Primenen. 23 (1978), 228-229; English transl. in Theory Probab. Appl. 23 (1978).

31. , On strong convergence of distributions of Junctionals of random processes, Teor. Veroyatnost. i Primenen. 24 (1979), 429; English transl. in Theory Probab. Appl. 24 (1979).

32. , Local limit theorems for functionals of supremum type in a lattice case, Zap. Nauchn. Sem. LOMI 97 (1980), 51-61; English transl. in J. Soviet Math. 24 (1984), no. 5.

33. , Local invariance principle. I, II, Zap. Nauchn. Sem. LOMI 97 (1980), 45-50; 130 (1983), 69-77; English transl. in J. Soviet Math. 24 (1984), no. 5; 27 (1984), no. 6.

34. , On strong convergence of distributions of functionals of random processes. I, II, Teor. Veroyatnost. i Primenen. 25 (1980), 782-799; 26 (1981), 266-286; English transl., Theory Probab. Appl. 25 (1980), 772-788; 26 (1981), 258-278.

35. , The structure of distributions of convex functionals, Fourth Internat. Vilnius Conf. Probab. Theory and Math. Statist., Abstracts of Communications. Vol. I, Inst. Mat. i Kiber-net. Akad. Nauk Litovsk. SSR, Vilnius, 1985, pp. 201-203. (Russian)

36. , Distributions of convex functionals of random processes, Rings and Moduli. Limit Theorems in Probability Theory, vol. 1, Leningrad State Univ., Leningrad, 1986, pp. 157-163. (Russian)

37. , A remark on the absolute continuity of distributions of Gaussian functionals, Teor. Veroyatnost. i Primenen. 33 (1988), 170-172; English transl., Theory Probab. Appl. 33 (1988), 158-160.

38. , Local limit theorems for functionals of random processes, Teor. Veroyatnost. i Prime­nen. 33 (1988), 788-794; English transl., Theory Probab. Appl. 33 (1988), 732-738.

39. , A version of infinite-dimensional local limit theorem, Zap. Nauchn. Sem. LOMI 177 (1989), 46-50; English transl. in J. Soviet Math. 61 (1992), no. 1.

40. Yu. A. Davydov and M. A. Lifshits, Stratification method in some probability problems, Itogi Nauki i Tekhniki: Teor. Veroyatnost. Mat. Statist. Teoret. Kibernet., vol. 22, VINITI, Moscow, 1984, pp. 61-157; English transl. in J. Soviet Math. 37 (1987), no. 1.

41. Yu. A. Davydov and A. L. Rozin, On sojourn times for functions and random processes, Teor. Veroyatnost. i Primenen. 23 (1978), 650-654; English transl., Theory Probab. Appl. 23 (1978), 633-637.

42. Yu. L. Daletskii, Stochastic differential geometry, Uspekhi Mat. Nauk 38 (1983), no. 3, 87-111; English transl. in Russian Math. Surveys 38 (1983).

43. Yu. L. Daletskii and B. D. Maryanin, Smooth measures on infinite-dimensional manifolds, Dokl. Akad. Nauk SSSR 285 (1985), 1297-1300; English transl. in Soviet Math. Dokl. 32 (1985).

44. Yu. L. Daletskii and S. V. Fomin, Measures and differential equations on infinite-dimensional spaces, "Nauka", Moscow, 1983; English transl., Kluwer, Dordrecht, 1991.

45. J. Dixmier, Les C*-algebres et leurs representations, 2nd ed., Gauthier-Villars, Paris, 1969.

Page 193: Local Properties of Distributions of Stochastic Functionals

BIBLIOGRAPHY 177

46. V. A. Dmitrovskii, Estimates for the distribution of the maximum of a random Gaussian field, Random Processes and Fields, Moscow State Univ., Moscow, 1979, pp. 22-31. (Russian)

47. , Boundedness condition and estimates for the distribution of the maximum of random fields on arbitrary sets, Dokl. Akad. Nauk SSSR 253 (1980), 271-274; English transl. in Soviet Math. Dokl. 22 (1980).

48. , On the distribution of the maximum and local properties of Gaussian random fields, Fifth Internat. Vilnius Conf. Probab. Theory and Math. Statist., Abstracts of Communica­tions, vol. 1, Inst. Mat. i Kibernet. Akad. Nauk Litovsk. SSR, Vilnius, 1989, p. 196.

49. V. A. Zalgaller, Compound volumes and the probabilities of convex domains under a multi­dimensional normal distribution, Mat. Zametki 2 (1967), 97-104; English transl. in Math. Notes 2 (1967).

50. V. M. Zolotarev, Concerning a certain probability problem, Teor. Veroyatnost. i Primenen. 6 (1961), 219-222; English transl., Theory Probab. Appl. 6 (1961), 201-204.

51. I. A. Ibragimov and Yu. V. Linnik, Independent and stationary sequences of random vari­ables, "Nauka", Moscow, 1965; English transl., Wolters-Noordhoff, Groningen, 1971.

52. I. A. Ibragimov and Yu. A. Rozanov, Gaussian random processes, "Nauka", Moscow, 1970; English transl., Springer-Verlag, New York, 1978.

53. K. Matthes, J. Kerstan, and J. Mecke, Infinitely divisible point processes, Akademie Verlag, Berlin, and Wiley, New York, 1978.

54. Yu. Ch. Kokaev, The behavior of oscillations and conditional Gaussian distributions of linear functional, Zap. Nauchn. Sem. LOMI119 (1982), 128-143; English transl. in J. Soviet Math. 27 (1984), no. 5.

55. A. N. Kolmogoroff, Foundations of the theory of probability, 2nd ed., "Nauka", Moscow, 1974; English transl. of 1st ed., Chelsea, New York, 1956.

56. M. A. Lifshits, Some problems of the theory of random processes related to mappings of measures, Thesis, Leningrad State University, Leningrad, 1981. (Russian)

57. , The fiber method and its application to the study of junctionals of stochastic pro­cesses, Teor. Veroyatnost. i Primenen. 27 (1982), 67-80; English transl., Theory Probab. Appl. 27 (1982), 69-82.

58. , On the absolute continuity of distributions of junctionals of random processes, Teor. Veroyatnost. i Primenen. 27 (1982), 559-566; English transl., Theory Probab. Appl. 27 (1982), 600-607.

59. , Absolute continuity of supremum type junctionals of Gaussian processes, Zap. Nauchn. Sem. LOMI 119 (1982), 154-166; English transl. in J. Soviet Math. 27 (1984), no. 5.

60. , Stratification method for processes with independent increments, Zap. Nauchn. Sem. LOMI 130 (1983), 109-121; English transl. in J. Soviet Math. 27 (1984), no. 6.

61. , An application of the stratification method to the study of Junctionals of processes with independent increments, Teor. Veroyatnost. i Primenen. 29 (1984), 723-734; English transl., Theory Probab. Appl. 29 (1984), 753-764.

62. , Density of the maximum of a Gaussian process, Teor. Veroyatnost. i Primenen. 29 (1984), 814-815; English transl. in Theory Probab. Appl. 29 (1984).

63. , The probability density of the maximum of a Gaussian process, Fourth Internat. Vilnius Conf. Probab. Theory Math. Statist., Abstracts of Communications, vol. 2, Inst. Mat. i Kibernet. Akad. Nauk Litovsk. SSR, Vilnius, 1985, pp. 113-114. (Russian)

64. , On the distribution of the maximum of a Gaussian process, Teor. Veroyatnost. i Primenen. 31 (1986), 134-142; English transl., Theory Probab. Appl. 31 (1986), 125-133.

65. , On the probability density of the norm of a stable vector, Zap. Nauchn. Sem. LOMI 158 (1987), 105-114; English transl. in J. Soviet Math. 43 (1988), no. 6.

66. , Oscillations and a lower bound of the distribution of the maximum of a Gaussian field, Zap. Nauchn. Sem. LOMI 177 (1989), 78-82; English transl. in J. Soviet Math. 61 (1992), no. 1.

67. G. V. Martynova, Limit theorems for Junctionals of random processes, Thesis, Leningrad State University, Leningrad, 1987. (Russian)

68. , Strong convergence of distributions of functional of a sequence of processes linearly generated by independent random variables, Zap. Nauchn. Sem. LOMI 158 (1987), 122-126; English transl. in J. Soviet Math. 43 (1988), no. 6.

Page 194: Local Properties of Distributions of Stochastic Functionals

178 BIBLIOGRAPHY

69. , On the strong convergence of distributions of multidimensional functional, Rings and Modules. Limit Theorems in Probability Theory, vol. 2, Leningrad State Univ., Lenin­grad, 1988, pp. 163-171. (Russian)

70. G. M. Molchan and Yu. N. Golesov, Gaussian stationary processes with asymptotic power law spectrum, Dokl. Akad. Nauk SSSR 184 (1969), 546-549; English transl. in Soviet Math. Dokl. 10 (1969).

71. S. V. Nagaev, On probabilities of large deviations for a Gaussian distribution in a Banach space, Izv. Akad. Nauk UzSSR Ser. Fiz.-Mat. Nauk 1981, no. 5, 18-21. (Russian)

72. N. V. Norin, Stochastic integrals and differentiate measures, Teor. Veroyatnost. i Primenen. 32 (1987), 114-124; English transl., Theory Probab. Appl. 32 (1987), 107-117.

73. S. V. Nagaev and A. N. Startsev, Asymptotic properties of distributions of an infinite qua­dratic form of Gaussian random variables, Limit Theorems for Random Process and Statis­tical Conclusions, "Fan", Tashkent, 1981, pp. 144-160. (Russian)

74. I. P. Natanson, Theory of functions of a real variable, 3rd ed., "Nauka", Moscow, 1974; English transl., Vols. 1 (1st ed., Chaps. I-IX), 2 (2nd ed., Chaps. X-XVI, XVIII), Ungar, New York, 1955, 1961.

75. D. Pap, On the distribution of the norm of a Gaussian vector in a Banach space, Litovsk. Mat. Sb. 24 (1984), no. 1, 140-145; English transl. in Lithuanian Math. J. 24 (1984).

76. , Boundedness of the norm density of stable random vectors in a Hilbert space, Teor. Veroyatnost. i Mat. Statist. 36 (1987), 102-105; English transl. in Theor. Probability Math. Statist. 36 (1988).

77. D. Pap and V. I. Paulauskas, A remark on the distribution of the norm of a Gaussian vector in the Banach space co, Litovsk. Mat. Sb. 24 (1985), no. 4, 112-117; English transl. in Lithuanian Math. J. 24 (1985).

78. K. R. Parthasarathy, An introduction to probability and measure theory, Macmillan of India, Delhi, 1980; Springer-Verlag, New York, 1977.

79. V. I. Paulauskas, On the closeness of the distributions of two sums of independent random variables with values in a Hilbert space, Litovsk. Mat. Sb. 16 (1975), no. 3, 177-200; English transl. in Lithuanian Math. J. 16 (1975).

80. , On the rate of convergence in the central limit theorem in certain Banach spaces, Teor. Veroyatnost. i Primenen. 21 (1976), 775-791; English transl., Theory Probab. Appl. 21 (1976), 754-770.

81. , On the rate of convergence in the Banach space co, Dokl. Akad. Nauk SSSR 254 (1980), 286-288; English transl. in Soviet Math. Dokl. 22 (1980).

82. , On the probability density of the norm of a Gaussian vector in Banach spaces, Dokl. Akad. Nauk SSSR 266 (1982), 1301-1304; English transl. in Soviet Math. Dokl. 26 (1982).

83. , On the density of the norm of a Gaussian vector in a Banach space, Abstracts Fourth Soviet-Japan Sympos. Probab. Theory Math. Statist., Tbilisi, 1982, pp. 145-146. (Russian)

84. V. I. Paulauskas and A. Yu. Rachkauskas, Approximation theory in the central limit theorem. Exact results in Banach spaces, "Mokslas", Vilnius, 1987; English transl., Kluwer, Dordrecht, 1989.

85. V. I. Piterbarg, Gaussian random processes, Itogi Nauki i Tekhniki: Teor. Veroyatnost. Mat. Statist. Teoret. Kibernet., vol. 19, VINITI, Moscow, 1982, pp. 155-198; English transl. in J. Soviet Math. 25 (1983), no. 3. (Russian)

86. , Asymptotic methods in the theory of Gaussian processes and fields, Moscow State Univ., Moscow, 1988; English transl., Amer. Math. Soc , Providence, RI, 1996.

87. A. P. Robertson and W. J. Robertson, Topological vector spaces, Cambridge Univ. Press, Cambridge, 1964.

88. Yu. A. Rozanov, Infinite-dimensional Gaussian distributions, Trudy Mat. Inst. Steklov. 108 (1968); English transl., Proc. Steklov Inst. Math. 108 (1971).

89. V. I. Rotar' and T. L. Shervashidze, Some estimates for distributions of quadratic forms, Teor. Veroyatnost. i Primenen. 30 (1985), 549-554; English transl., Theory Probab. Appl. 30 (1985), 585-590.

90. R. Rudzkis, On the probability of a large excursion of a nonstationary Gaussian process. I, Litovsk. Mat. Sb. 25 (1985), no. 1, 143-154; English transl. in Lithuanian Math. J. 25 (1985).

Page 195: Local Properties of Distributions of Stochastic Functionals

BIBLIOGRAPHY 179

91. , On the probability density of a large excursion of a Gaussian process. I, II, Litovsk. Mat. Sb. 26 (1986), 512-519; 27 (1987), 731-746; English transl. in Lithuanian Math. J. 26 (1986); 27 (1987).

92. A. I. Sakhanenko, A condition for the existence of a bounded density of the distributions of integral type Junctionals, Second Internat. Vilnius Conf. Probab. Theory Math. Statist., Ab­stracts of Communications, vol. 2, Inst. Mat. i Kibernet. Akad. Nauk Litovsk. SSR, Vilnius, 1977, pp. 149-150. (Russian)

93. , A bound for the probability density of integral type junctionals, Trudy Mat. Inst. Sibirsk. Otdel. Akad. Nauk SSSR 1 (1982), 180-186. (Russian)

94. A. V. Skorokhod, Stochastic processes with independent increments, "Nauka", Moscow, 1964; English transls., (a) Clearing House Federal Sci. Tech. Information, Springfield, VA, 1966; (b), Theory of random processes, British lending Library Sci. Tech., Boston Spa, Yorkshire, England, 1971; 2nd rev. ed., "Nauka", Moscow, 1986; English transl., Kluwer, Dordrecht, 1991.

95. , On admissible translations of measures in a Hilbert space, Teor. Veroyatnost. i Primenen. 15 (1970), 577-598; English transl., Theory Probab. Appl. 15 (1970), 557-578.

96. , A note on Gaussian measures in a Banach space, Teor. Veroyatnost. i Primenen. 15 (1970), 519; English transl., Theory Probab. Appl. 15 (1970), 508-509.

97. , Integration in Hilbert space, "Nauka", Moscow, 1975; English transl., Springer-Verlag, New York, 1974.

98. Yu. K. Belyaev (editor), Random processes. Sample functions and intersections, Collection of translated papers, "Mir", Moscow, 1978. For the contents, see MR 81j:60049.

99. N. V. Smorodina, Absolute continuity of distributions of Junctionals of diffusion processes, Uspekhi Mat. Nauk 37 (1982), no. 6, 185-192; English transl. in Russian Math. Surveys 37 (1982).

100. , A local invariance principle for processes linearly generated by independent random variables, Teor. Veroyatnost. i Primenen. 29 (1984), 680-691; English transl., Theory Probab. Appl. 29 (1984), 707-718.

101. , Local limit theorems for functional of random fields, Zap. Nauchn. Sem. LOMI 136 (1984), 168-182; English transl. in J. Soviet Math. 33 (1986), no. 1.

102. , Conditions for smoothness of probability densities of functionals of random pro­cesses, Fourth Internat. Vilnius Conf. Probab. Theory Math. Statist., Abstracts of Commu­nications, vol. 3, Inst. Mat. i Kibernet. Akad. Nauk Litovsk. SSR, Vilnius, 1985, pp. 143-145. (Russian)

103. , Conditions for smoothness of the probability density of the Lip-norm of a Gaussian vector, Zap. Nauchn. Sem. LOMI 158 (1987), 161-166; English transl. in J. Soviet Math. 43 (1988), no. 6.

104. , Differential calculus on measurable spaces and conditions for smoothness of proba­bility densities of random variables, Dokl. Akad. Nauk SSSR 292 (1987), 1053-1057; English transl. in Soviet Math. Dokl. 35 (1987).

105. , Differential calculus in configuration space and stable measures. I, Teor. Veroyatnost. i Primenen. 33 (1988), 522-534; English transl., Theory Probab. Appl. 33 (1988), 488-500.

106. N. V. Smorodina and M. A. Lifshits, On the distribution of the norm of a stable vector, Teor. Veroyatnost. i Primenen. 34 (1989), 304-313; English transl. in Theory Probab. Appl. 34 (1989).

107. V. N. Sudakov and B. S. Tsirel'son, Extremal properties of half-spaces for spherically in­variant measures, Zap. Nauchn. Sem. LOMI 41 (1974), 14-24; English transl. in J. Soviet Math. 9 (1978), no. 1.

108. A. V. Uglanov, Surface integrals in a Banach space, Mat. Sb. 110 (1979), 189-217; English transl. in Math. USSR Sb. 38 (1981).

109. , On the smoothness of distributions of functionals of random processes, Teor. Vero­yatnost. i Primenen. 33 (1988), 535-544; English transl., Theory Probab. Appl. 33 (1988), 500-509.

110. V. V. Wyanov, Bounds of the density of the norm of a Gaussian random variable, Numerical Methods in Mathematical Physics, "Nauka", Moscow, 1986, pp. 107-109. (Russian)

111. H. Federer, Geometric measure theory, Springer-Verlag, New York, 1978. 112. M. I. Freidlin, The action functional for a class of stochastic processes, Teor. Veroyatnost. i

Primenen. 17 (1972), 536-541; English transl., Theory Probab. Appl. 17 (1972), 511-516.

Page 196: Local Properties of Distributions of Stochastic Functionals

180 BIBLIOGRAPHY

113. B. S. Tsirel'son, The density of the distribution of the maximum of a Gaussian process, Teor. Veroyatnost. i Primenen. 20 (1975), 864-873; English transl., Theory Probab. Appl. 20 (1975), 847-856.

114. , A natural modification of a random process and its applications to random functional series and Gaussian measures, Zap. Nauchn. Sem. LOMI 55 (1976), 35-63; English transl. in J. Soviet Math. 16 (1981), no. 2.

115. , Addendum to the paper on a natural modification, Zap. Nauchn. Sem. LOMI 72 (1977), 202-211; English transl. in J. Soviet Math. 23 (1983), no. 4.

116. , Geometrical approach to maximal likelihood estimation for infinitely-dimensional Gaussian location. I, Teor. Veroyatnost. i Primenen. 27 (1982), 388-395; English transl., Theory Probab. Appl. 27 (1982), 411-418.

117. V. I. Chebotarev, An estimate for the rate of convergence in the central limit theorem in lp, Sibirsk. Mat. Zh. 20 (1979), 1099-1116; English transl. in Siberian Math. J. 20 (1979).

118. T. V. Anderson, The integral of a symmetric unimodal function over a symmetric convex set and some probability inequalities, Proc. Amer. Math. Soc. 6 (1955), 170-176.

119. N. Aronszajn, Theory of reproducing kernels, Trans. Amer. Math. Soc. 68 (1950), 337-404. 120. A. Badrikian and S. Shevet, Mesures cylindriques, espaces de Wiener et fonctions aleatoires

gaussiennes, Lecture Notes in Math., vol. 379, Springer-Verlag, Berlin, 1974. 121. S. Banach, Sur les lignes rectifiables et les surfaces dont I'aire est finie, Fund. Math. 7

(1925), 225-237. 122. D. Bell, The Malliavin calculus, Longman Sci. Tech., Harlow, 1987. 123. K. Bichteler, J. B. Graveraux, and J. Jacod, Malliavin calculus for processes with jumps,

Gordon and Breach, London, 1987. 124. J.-M. Bismut, Martingales, the Malliavin calculus and hypoellipticity under general Hor-

mander's conditions, Z. Wahrsch. Verw. Gebiete 56 (1981), 469-505. 125. P. Billingsley and F. Tops0e, Uniformity in weak convergence, Z. Wahrsch. Verw. Gebiete 7

(1967), 1-16. 126. S. Bochner, Harmonic analysis and the theory of probability, Univ. of California Press, Berke­

ley, 1955. 127. C. Borell, Convex measures on locally convex spaces, Ark. Mat. 12 (1974), 239-252. 128. , The Brunn-Minkowski inequality in Gauss space, Invent. Math. 30 (1975), 207-216. 129. , Gaussian Radon measures on locally convex spaces, Math. Scand. 38 (1976), 265-

284. 130. T. Byczkowski and K. Samotij, Absolute continuity of stable seminorms, Ann. Probab. 14

(1986), 299-312. 131. R. H. Cameron and W. T. Martin, Transformations of Wiener integrals under translation,

Ann. of Math. 45 (1944), 386-396. 132. R. M. Dudley, Sample functions of the Gaussian process, Ann. Probab. 1 (1973), 66-103. 133. R. M. Dudley, J. Feldman, and L. Le Cam, On seminorms and probabilities, and abstract

Wiener spaces, Ann. of Math. 93 (1971), 390-408. 134. A. Ehrhard, Sur la densite du maximum d'une fonction aleatoire gaussienne, Lecture Notes

in Math., vol. 920, Springer-Verlag, Berlin, 1982, pp. 581-601. 135. , Symetrisation dans Vespace de Gauss, Math. Scand. 53 (1983), 281-301. 136. X. Fernique, Integrabilite des vecteurs gaussiens, C. R. Acad. Sci. Paris Ser. A 270 (1970),

1698-1699. 137. D. Geman and J. Horowitz, Occupation times for smooth stationary processes, Ann. Probab.

1 (1973), 131-137. 138. L. Gross, Abstract Wiener spaces, Proc. 5th Berkeley Symp. Math. Statist. Probab., vol. 2,

Univ. of California Press, Berkeley, 1965, pp. 31-42. 139. N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, Kodan-

sha, Tokyo, and North-Holland, Amsterdam, 1981. 140. K. Ito and M. Nisio, On the oscillation function of Gaussian processes, Math. Scand. 22

(1968), 209-223. 141. A. N. Kolmogoroff, La transformation de Laplace dans les espaces lineaires, C. R. Acad. Sci.

Paris 200 (1935), 1717-1718. 142. J. Kuelbs, Abstract Wiener spaces and applications to analysis, Pacific J. Math. 31 (1969),

433-450.

Page 197: Local Properties of Distributions of Stochastic Functionals

BIBLIOGRAPHY 181

143. J. Kuelbs and T. Kurtz, Berry-Esseen estimates in Hilbert space and application to the law of the iterated logarithm, Ann. Probab. 2 (1974), 387-407.

144. H. L. Landau and L. A. Shepp, On the supremum of a Gaussian process, Sankhya Ser. A 32 (1970), 369-378.

145. W. Linde, Infinitely divisible and stable measures on Banach spaces, Teubner, Leipzig, 1983. 146. P. Malliavin, Ck-hypoellipticity with degeneracy. I, Stochastic Analysis, Academic Press,

New York, 1978, pp. 199-214. 147. M. B. Marcus and L. A. Shepp, Sample behavior of Gaussian processes, Proc. 6th Berkeley

Sympos. Math. Statist. Probab., vol. 2, Univ. of California Press, Berkeley, 1972, pp. 423-442.

148. K. R. Parthasarathy and T. Steerneman, A tool in establishing total variation convergence, Proc. Amer. Math. Soc. 95 (1985), 626-630.

149. V. Paulauskas, On the density of the norm of Gaussian vector in Banach spaces, Lecture Notes in Math., vol. 990, Springer-Verlag, Berlin, 1983, pp. 179-197.

150. L. Pitt, Local sample paths properties of Gaussian fields, Ann. Probab. 7 (1979), 477-493. 151. W. S. Rhee, On the distribution of the norm for a Gaussian measure, Ann. Inst. H. Poincare

Probab. Statist. 20 (1984), 277-286. 152. W. S. Rhee and M. Talagrand, Bad rates of convergence for the central limit theorem in

Hilbert space, Ann. Probab. 12 (1984), 843-850. 153. , Uniform convexity and the distribution of the norm for a Gaussian measure, Probab.

Theory Related Fields 71 (1986), 59-68. 154. M. Ryznar, Density of stable seminorms, Bull. Polish Acad. Sci. Math. 33 (1985), 431-440. 155. H. Sato, Souslin support and Fourier expansion of a Gaussian Radon measure, Lecture Notes

in Math., vol. 860, Springer-Verlag, Berlin, 1981, pp. 299-313. 156. H. Sato and Y. Okazaki, Separabilities of a Gaussian Radon measure, Ann. Inst. H. Poincare

Probab. Statist. 11 (1975), 287-298. 157. S. Sawyer, Rates of convergence for some functionals in probability, Ann. Math. Statist. 43

(1972), 273-284. 158. I. Shigekawa, Absolute continuity of probability laws of Wiener functionals, Proc. Japan

Acad. Ser. A Math. Sci. 54 (1978), 230-234. 159. , Derivatives of Wiener functionals and absolute continuity of induced measures, J.

Math. Kyoto Univ. 20 (1980), 263-289. 160. D. Stroock, The Malliavin calculus and its applications, Lecture Notes in Math., vol. 851,

Springer-Verlag, Berlin, 1981, pp. 394-432. 161. M. Talagrand, Mesures gaussiennes sur un espace localement convexe, Z. Wahrsch. Verw.

Gebiete 64 (1983), 181-210. 162. , Sur I'integrabilite des vecteurs gaussiens, Z. Wahrsch. Verw. Gebiete 68 (1984), 1-8. 163. S. Watanabe, Lectures on stochastic differential equations, Springer-Verlag, Berlin, 1984. 164. M. Weber, Sur la densite de la distribution du maximum d'un processus gaussien, J. Math.

Kyoto Univ. 25 (1985), 515-523. 165. N. Wiener, Differential space, J. Math, and Phys. Mass. Inst. Tech. 2 (1923), 131-174. 166. J. Yeh, Wiener measure in a space of functions of two variables, TVans. Amer. Math. Soc.

95 (1960), 433-450. 167. , Cameron-Martin translation theorem in Wiener space of functions of two variables,

Trans. Amer. Math. Soc. 107 (1963), 409-420. 168. N. D. Ylvisaker, The expected number of zeros of a stationary Gaussian process, Ann. Math.

Statist. 36 (1965), 1043-1046. 169. T. Zak and M. Lewandowsky, On the density of the distribution of p-stable seminorms,

0 < p < 1, Proc. Amer. Math. Soc. 100 (1987), 345-351. 170. Yu. A. Davydov, On the exact continuity of images of measures, Zap. Nauchn. Sem. LOMI

142 (1985), 48-54; English transl. in J. Soviet Math. 36 (1987), no. 4. 171. V. Yu. Bentkus, Asymptotic expansions in a local limit theorem in a Hilbert space, Litovsk.

Mat. Sb. 25 (1985), no. 1, 9-22; English transl. in Lithuanian Math. J. 25 (1985). 171. R. E. Zitikis, Asymptotic expansions in a local limit theorem for the Cramer-Mises-Smimov

statistics, Thesis, Vilnius, 1988. (Russian)

Page 198: Local Properties of Distributions of Stochastic Functionals
Page 199: Local Properties of Distributions of Stochastic Functionals

Index

Action functional, 40 Additive mapping, 113 Admissibility of a translation, 40 Admissible direction, 38 Admissible mapping, 14, 128

sufficient class of, 21 Admissible semigroup, 14 Admissible translation, 38, 44 Anderson's inequality, 72

Barycenter, 3

Canonical bijection, 105 Charge, 2 Configuration, 105

close, 113 Convergence

strong, 6 total, 6 in variation, 6

Core of a measure, 38, 47 Cylinder algebra, 2

Differential operators 29, 30 Dudley integral, 87, 92

Ellipsoid of concentration, 67, 69 e-dimension, 94 Erhard inequality, 66, 67, 71

Family of conditional measures, 9 Fiber, 13 Fractional derivative, 45 Functional, 1

action, 40 ( £ , #)-smooth, 21 centered linear, 37 convex, 72 differentiable of order q, 47 lower semicontinuous, 72 measurable affine, 36 measurable linear, 36

Gauss-Ostrogradskii formula, 116, 124 Gaussian distribution

degenerate, 3 nondegenerate, 3 standard, 4

Gaussian white noise, 46, 47

Hellinger distance, 8

Intensity of a random measure, 106 Isoperimetric inequality, 55, 57, 59 Isoperimetric property of a half-space, 59, 67 Ito-Nisio oscillation, 85

fc-symmetrization, 57

Levy-Khinchin representation, 109 Local invariance principle, 157

Measurable partition, 10 Measure

core of, 38, 47 Gaussian, 4 infinite, 2 inner, 2 Leray, 109, 110 mixture of, 7 outer, 2 Poisson random, 106

intensity of, 106 quotient, 9 Radon, 1 representing, 31 signed, 2 spectral, 109 support of 2, 3 surface, 118, 121 translate of, 16 Wiener, 45

Metric entropy, 87, 92 Mixture of measures, 7 Modulus of uniform convexity, 95

Orbits of a semigroup, 14 Orthogonal projection, 111

Pettis mean value, 3

Quotient space, 9

Radial field, 163 Random process, 1 Reduction of a surface, 59

183

Page 200: Local Properties of Distributions of Stochastic Functionals

184 INDEX

Saturated subset of configuration space, 113 Semistable process of order a, 27 Sequence

locally finite, 105 representing, 105 weakly convergent, 4

Set with smooth boundary, 124 Stable vector, 109

with exponent a, 109 Stochastic integral, 107 Strata, 13 Stratification, 13 Sufficient class of mappings, 21 Superstructure, 24 Support of a measure

linear, 3 topological, 2

Surface of codimension k> 116, 117 elementary, 116

Surface sets, 117

Topology coarse, 112 Lindelof, 113 natural, 111 O-. 112 Poisson, 112, 113 weak, 2

TVanslate of a measure, 16 Transport operator, 114

Uniformly convergent space, 95

Variation of a charge, 6 Vector field, 48, 111

Page 201: Local Properties of Distributions of Stochastic Functionals

Copying and reprinting. Individual readers of this publication, and non­profit libraries acting for them, are permitted to make fair use of the material, such as to copy a chapter for use in teaching or research. Permission is granted to quote brief passages from this publication in reviews, provided the customary acknowledgment of the source is given.

Republication, systematic copying, or multiple reproduction of any material in this publication (including abstracts) is permitted only under license from the American Mathematical Society. Requests for such permission should be addressed to the Assistant to the Publisher, American Mathematical Society, P. O. Box 6248, Providence, Rhode Island 02940-6248. Requests can also be made by e-mail to [email protected].

Page 202: Local Properties of Distributions of Stochastic Functionals

Selected Titles in This Series (Continued from the front of this publication)

134 V. V. Prasolov, Problems and theorems in linear algebra, 1994

133 P. I. Naumkin and I. A. Shishmarev, Nonlinear nonlocal equations in the theory of waves, 1994

132 Haj ime Urakawa, Calculus of variations and harmonic maps, 1993

131 V. V. Sharko, Functions on manifolds: Algebraic and topological aspects, 1993

130 V. V. Vershinin, Cobordisms and spectral sequences, 1993

129 Mi tsuo Morimoto , An introduction to Sato's hyperfunctions, 1993

128 V. P. Orevkov, Complexity of proofs and their transformations in axiomatic theories,

1993

127 F . L. Zak, Tangents and secants of algebraic varieties, 1993

126 M. L. Agranovskit, Invariant function spaces on homogeneous manifolds of Lie groups

and applications, 1993

125 Masayoshi Nagata , Theory of commutative fields, 1993

124 Masahisa Adachi, Embeddings and immersions, 1993

123 M. A. Akivis and B. A. Rosenfeld, Elie Cartan (1869-1951), 1993

122 Zhang Guan-Hou, Theory of entire and meromorphic functions: deficient and

asymptotic values and singular directions, 1993

121 I. B . Fesenko and S. V. Vostokov, Local fields and their extensions: A constructive

approach, 1993

120 Takeyuki Hida and Masuyuki Hitsuda, Gaussian processes, 1993

119 M. V. Karasev and V. P. Maslov, Nonlinear Poisson brackets. Geometry and

quantization, 1993

118 Kenkichi Iwasawa, Algebraic functions, 1993

117 Boris Zilber, Uncountably categorical theories, 1993

116 G. M. Fel'dman, Arithmetic of probability distributions, and characterization problems

on abelian groups, 1993

115 Nikolai V. Ivanov, Subgroups of Teichmuller modular groups, 1992

114 Seizo Ito, Diffusion equations, 1992

i l 3 Michail Zhitomirskff, Typical singularities of differential 1-forms and Pfaffian equations,

1992

112 S. A. Lomov, Introduction to the general theory of singular perturbations, 1992

111 S imon Gindikin, Tube domains and the Cauchy problem, 1992

110 B . V . Shabat, Introduction to complex analysis Part II. Functions of several variables,

1992

109 Isao Miyadera, Nonlinear semigroups, 1992

108 Takeo Yokonuma, Tensor spaces and exterior algebra, 1992

107 B . M. Makarov, M. G. Goluzina, A. A. Lodkin, and A. N . Podkorytov, Selected

problems in real analysis, 1992

106 G . -C . W e n , Conformal mappings and boundary value problems, 1992

105 D . R. Yafaev, Mathematical scattering theory: General theory, 1992

104 R. L. Dobrushin, R. Kotecky, and S. Shlosman, Wulff construction: A global shape

from local interaction, 1992

103 A. K. Tsikh, Multidimensional residues and their applications, 1992

102 A. M. Il'in, Matching of asymptotic expansions of solutions of boundary value problems,

1992

101 Zhang Zhi-fen, Ding Tong-ren, Huang Wen-zao, and Dong Zhen-xi, Qualitative

theory of differential equations, 1992

100 V. L. Popov, Groups, generators, syzygies, and orbits in invariant theory. 1992

99 Norio Shimakura, Partial differential operators of elliptic type, 1992

98 V. A. Vassiliev, Complements of discriminants of smooth maps: Topology and applications, 1992 (revised edition, 1994)

(See the AMS catalog for earlier titles)

Page 203: Local Properties of Distributions of Stochastic Functionals

m H^B SHE

ISBN 0-8218-0584-3

^ ^ | H^^H

1 AMS <w Web | www.ams.org |