sparse tomography - samuli siltanen

22
Copyright © by SIAM. Unauthorized reproduction of this article is prohibited. SIAM J. SCI. COMPUT. c 2013 Society for Industrial and Applied Mathematics Vol. 35, No. 3, pp. B644–B665 SPARSE TOMOGRAPHY KEIJO H ¨ AM ¨ AL ¨ AINEN , AKI KALLONEN , VILLE KOLEHMAINEN , MATTI LASSAS § , KATI NIINIM ¨ AKI , AND SAMULI SILTANEN § Abstract. A wavelet-based sparsity-promoting reconstruction method is studied in the context of tomography with severely limited projection data. Such imaging problems are ill-posed inverse problems, or very sensitive to measurement and modeling errors. The reconstruction method is based on minimizing a sum of a data discrepancy term based on an 2 -norm and another term containing an 1 -norm of a wavelet coefficient vector. Depending on the viewpoint, the method can be considered (i) as finding the Bayesian maximum a posteriori (MAP) estimate using a Besov-space B 1 11 (T 2 ) prior, or (ii) as deterministic regularization with a Besov-norm penalty. The minimization is performed using a tailored primal-dual path following interior-point method, which is applicable to problems larger in scale than commercially available general-purpose optimization package algorithms. The choice of “regularization parameter” is done by a novel technique called the S-curve method, which can be used to incorporate a priori information on the sparsity of the unknown target to the recon- struction process. Numerical results are presented, focusing on uniformly sampled sparse-angle data. Both simulated and measured data are considered, and noise-robust and edge-preserving multireso- lution reconstructions are achieved. In sparse-angle cases with simulated data the proposed method offers a significant improvement in reconstruction quality (measured in relative square norm error) over filtered back-projection (FBP) and Tikhonov regularization. Key words. sparse-angle tomography, X-ray tomography, wavelets, Besov space, primal-dual interior-point methods AMS subject classifications. 65R32, 62P10, 92C55, 65T60, 42C40, 90C51, 65F22 DOI. 10.1137/120876277 1. Introduction. The idea of tomographic imaging is to reveal the inner struc- ture of an unknown body from a collection of X-ray projection images taken of the body from different directions. In this paper we concentrate on tomographic projec- tion data collected from two-dimensional targets from a full angle of view but only from a few directions. Such problems are very sensitive to measurement and modeling errors. The motivation for our study comes from medical X-ray imaging, where it is nec- essary to keep the radiation dose given to the patient as low as possible. Recently, dose reduction of traditional computerized tomography (CT) through optimizing hardware settings has been discussed [69, 45, 50, 79]. However, in our view it is more effective to reduce the dose by taking fewer images in the first place; such an approach offers an over tenfold dose reduction in dental implant planning [11, 29, 53]. In medical imaging, the most interesting features are often the boundaries be- tween different tissues, and that kind of information can be sparsely represented Submitted to the journal’s Computational Methods in Science and Engineering section May 7, 2012; accepted for publication (in revised form) February 26, 2013; published electronically June 6, 2013. This work was supported by the Academy of Finland (projects 119270, 140731, 141094) and by the Finnish Centre of Excellence in Inverse Problems Research 2006–2011 (Academy of Finland CoE-projects 213476, 250215, 125925). http://www.siam.org/journals/sisc/35-3/87627.html Department of Physics, University of Helsinki, FI-00014 Helsinki, Finland (keijo.hamalainen@ helsinki.fi, aki.kallonen@helsinki.fi). Department of Applied Physics, University of Eastern Finland, FI-70211 Kuopio, Finland (ville. kolehmainen@uef.fi, kati.niinimaki@uef.fi). § Department of Mathematics and Statistics, University of Helsinki, FI-00014 Helsinki, Finland (matti.lassas@helsinki.fi, samuli.siltanen@helsinki.fi). B644 Downloaded 06/07/13 to 128.214.5.10. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

Upload: khangminh22

Post on 25-Jan-2023

1 views

Category:

Documents


0 download

TRANSCRIPT

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SIAM J. SCI. COMPUT. c© 2013 Society for Industrial and Applied MathematicsVol. 35, No. 3, pp. B644–B665

SPARSE TOMOGRAPHY∗

KEIJO HAMALAINEN†, AKI KALLONEN†, VILLE KOLEHMAINEN‡, MATTI LASSAS§ ,KATI NIINIMAKI‡ , AND SAMULI SILTANEN§

Abstract. A wavelet-based sparsity-promoting reconstruction method is studied in the contextof tomography with severely limited projection data. Such imaging problems are ill-posed inverseproblems, or very sensitive to measurement and modeling errors. The reconstruction method is basedon minimizing a sum of a data discrepancy term based on an �2-norm and another term containing an�1-norm of a wavelet coefficient vector. Depending on the viewpoint, the method can be considered(i) as finding the Bayesian maximum a posteriori (MAP) estimate using a Besov-space B1

11(T2) prior,

or (ii) as deterministic regularization with a Besov-norm penalty. The minimization is performedusing a tailored primal-dual path following interior-point method, which is applicable to problemslarger in scale than commercially available general-purpose optimization package algorithms. Thechoice of “regularization parameter” is done by a novel technique called the S-curve method, whichcan be used to incorporate a priori information on the sparsity of the unknown target to the recon-struction process. Numerical results are presented, focusing on uniformly sampled sparse-angle data.Both simulated and measured data are considered, and noise-robust and edge-preserving multireso-lution reconstructions are achieved. In sparse-angle cases with simulated data the proposed methodoffers a significant improvement in reconstruction quality (measured in relative square norm error)over filtered back-projection (FBP) and Tikhonov regularization.

Key words. sparse-angle tomography, X-ray tomography, wavelets, Besov space, primal-dualinterior-point methods

AMS subject classifications. 65R32, 62P10, 92C55, 65T60, 42C40, 90C51, 65F22

DOI. 10.1137/120876277

1. Introduction. The idea of tomographic imaging is to reveal the inner struc-ture of an unknown body from a collection of X-ray projection images taken of thebody from different directions. In this paper we concentrate on tomographic projec-tion data collected from two-dimensional targets from a full angle of view but onlyfrom a few directions. Such problems are very sensitive to measurement and modelingerrors.

The motivation for our study comes from medical X-ray imaging, where it is nec-essary to keep the radiation dose given to the patient as low as possible. Recently, dosereduction of traditional computerized tomography (CT) through optimizing hardwaresettings has been discussed [69, 45, 50, 79]. However, in our view it is more effectiveto reduce the dose by taking fewer images in the first place; such an approach offersan over tenfold dose reduction in dental implant planning [11, 29, 53].

In medical imaging, the most interesting features are often the boundaries be-tween different tissues, and that kind of information can be sparsely represented

∗Submitted to the journal’s Computational Methods in Science and Engineering section May 7,2012; accepted for publication (in revised form) February 26, 2013; published electronically June 6,2013. This work was supported by the Academy of Finland (projects 119270, 140731, 141094) andby the Finnish Centre of Excellence in Inverse Problems Research 2006–2011 (Academy of FinlandCoE-projects 213476, 250215, 125925).

http://www.siam.org/journals/sisc/35-3/87627.html†Department of Physics, University of Helsinki, FI-00014 Helsinki, Finland (keijo.hamalainen@

helsinki.fi, [email protected]).‡Department of Applied Physics, University of Eastern Finland, FI-70211 Kuopio, Finland (ville.

[email protected], [email protected]).§Department of Mathematics and Statistics, University of Helsinki, FI-00014 Helsinki, Finland

([email protected], [email protected]).

B644

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B645

using wavelets. Furthermore, in medical applications a priori sparsity level can be es-timated from CT image atlases. One could, for example, take a collection of archivedCT images of different individuals and compute the mean of the amounts of nonzerowavelet coefficients over the sample images.

We present a new sparsity-promoting reconstruction method for tomographicimaging from projection data with sparse angular sampling. Our method achievesnoise-robustness by complementing the insufficient measurement data by a prioriknowledge about the sparsity of the attenuation coefficient in a wavelet basis. Fur-thermore, the a priori sparsity level, together with a novel S-curve method, allows usto determine the “regularization parameter” of our algorithm. We demonstrate themethod on simulated data and on X-ray data measured from a biological specimen.

In tomography we want to recover the X-ray attenuation function f from indirectprojection measurements m modeled by

(1.1) m = Kf + ε,

where m ∈ Rk is the projection data vector, f ∈ R

n is a vector of pixel values ofthe X-ray attenuation function, K ∈ R

k×n is a matrix that implements the transformfrom the pixel values to the projection data, and ε is the measurement noise takingvalues in R

k. We do not assume any relationship between the numbers k and n: thenumber k of measurements is determined by the measurement device and number ofprojections, while n depends only on how finely we choose to discretize the unknown.We build our inversion method in a discretization-invariant way: if k is kept fixed andn → ∞, there should be a well-defined infinite-dimensional model in the limit. Thatallows us to express the same a priori information at different discretizations, whichis in turn crucial in computational strategies involving a fine and a coarse grid. See[47, 53] for more details.

In Bayesian inversion, m and f are modeled as random variables, and a completesolution to the inverse problem, the posterior distribution, is given by the Bayes’formula

(1.2) πpost(f |m) =π(m|f)π(f)

π(m).

Here π(m|f) is the likelihood function, which is a probability density model for themeasurement process, and π(f) is the prior density, which is a probabilistic model forthe a priori information of the unknowns. Further, π(m) is the marginal density ofthe measurement m and can be considered as a normalizing constant.

When the noise ε is modeled by white noise with density ε ∼ N (0, σ2) and inde-pendent of f , the likelihood function is given by

(1.3) π(m|f) = πε(m−Kf) = C exp

(− 1

2σ2‖Kf −m‖2�2

),

where C is a constant. Often, a single representative of the posterior distribution ispresented as the solution to the inverse problem. The most common estimates of theposterior distribution are the conditional mean (CM) and the maximum a posteriori(MAP) estimate. In this work we study the MAP estimate defined by

(1.4) πpost(fMAP |m) = max

f∈Rnπpost(f |m), f ≥ 0,

where the constraint comes from the nonnegativity prior on the X-ray attenuationfunction f . For general references on Bayesian inversion, see [36, 66].

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B646 HAMALAINEN ET AL.

We study the Besov space prior π(f) = C exp(−α‖f‖B111), where α > 0 is a

prior parameter controlling the marginal variances of the density, using discretizationsbased on truncated wavelet bases. Besov space priors are known to be discretization-invariant [47] and can produce edge-preserving MAP estimates [41]. While the totalvariation prior can produce edge-preserving MAP estimates as well, it cannot beused for discretization-invariant Bayesian inversion [48]. The estimates are known tobe sparse since they can be viewed as regularized reconstructions using Besov-normpenalties, which are sparse [16, 10, 25, 26]. In this paper we present the first to-mographic reconstructions based on the use of Besov space priors and Haar wavelets,and the results are edge-preserving. Moreover, we apply the sparsity-based parameterchoice rule for the selection of the prior parameter α introduced in [41]. Discretization-invariance allows us to choose the parameter α at a coarser resolution and then com-pute the actual estimate at a finer resolution using the same α.

The literature on reconstruction algorithms for sparse-angle data tomography isalready quite extensive. We review here only sparsity-promoting and wavelet-basedapproaches. Total variation regularization has been used in [17, 57, 43, 61, 62, 44, 49,63, 28, 65, 19, 5, 4, 33, 30, 67], level set methods in [76, 21, 58, 42, 75, 40], varioussparsity-promoting methods in [9, 8, 77, 78, 12, 32, 46], and multiresolution-sparsitymethods in [59, 54, 64, 73, 39, 23].

The computation of the MAP estimate with Besov B111 space prior and nonneg-

ativity constraint is equivalent to the following optimization problem:

(1.5) fMAP = arg minf∈Rn

⎧⎨⎩ 1

2σ2‖Kf −m‖2�2 + α

∑j

|〈f, ψj〉|⎫⎬⎭ , f ≥ 0,

where ψj is a wavelet basis and wj = 〈f, ψj〉 are the wavelet coefficients. The ob-jective functional in the minimization problem (1.5) with mixed �2- and �1-norms isnondifferentiable. In addition the minimization is constrained due to the nonnega-tivity requirement. However, we can reformulate (1.5) as a quadratic programming(QP) problem with linear constraints:

minx

{1

2xTQx+ cTx+ d

}, AIx ≥ bI , AEx = bE ,(1.6)

where Q is a symmetric matrix and indices I and E denote the inequality and equalityconstraints, respectively. The trick in moving from (1.5) to (1.6) is to write the waveletcoefficients of f ∈ R

n in the form w = w+ − w−, where w+ ≥ 0 and w− ≥ 0 arethe nonnegative and nonpositive parts of w, respectively. The new unknown is thenx = [fT (w+)T (w−)T ]T ∈ R

3n. See section 3 below for the full explanation.Convex QP problems of the form (1.6) can be solved using standard convex opti-

mization methods such as interior-point methods [74, 56]. For standard interior-pointmethods there exists general-purpose solvers such as mosek or cvx. The standardmethods can handle only small and medium-scale problems, whereas problems aris-ing from tomography imaging are often large-scale problems. However, specializedinterior-point method can handle large-scale problems [34]. Furthermore, [1] showsthat primal-dual interior-point methods are more efficient for minimizing a sum ofnorms or sum of absolute values than interior-point methods or other classical meth-ods. Hence in this work we use a tailored primal-dual path following interior-pointmethod to solve the constrained minimization problem, similarly to [41, 14].

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B647

Primal-dual interior-point methods can be applied to a wide range of inverseproblems arising from real applications. Johnson and Sofer used them to solve aconstrained MAP minimization problem in image reconstruction from emission to-mography data [35], Fu et al. for deblurring problems [24], Nikolova et al. for imagerestoration [55], and Kim et al. for sparse signal recovery problems in magnetic res-onance imaging [38]. Borsic et al. derived a primal-dual interior-point framework forEIT (electrical impedance tomography) imaging with total variation regularizationin [6]. Primal-dual path following interior-point methods were used to solve one-dimensional deconvolution problems in [41]; a similar method was used to recover thediscontinuous reaction-diffusion coefficients of a coupled parabolic system in [14].

Of course, there are other types of algorithms for compressed sensing and sparsitypromotion [31, 60, 27, 18, 3, 20] in addition to QP. Also, alternative large-scale opti-mization methods include, for example, [2]. However, we use the QP approach and aprimal-dual interior-point method because we have an implementation available andbecause it allows a simple way to enforce the nonnegativity constraint.

This paper is organized as follows. In section 2 we define the two-dimensionalperiodic wavelets functions and construct the B1

11(T2) space prior. In section 3 we

give details of the reformulation of (1.5) into a constrained quadratic form and presentthe primal-dual path following interior-point method tailored to the problem. Sec-tion 4 is devoted to an explanation of the S-curve method. Section 5 presents theX-ray projection data sets used in the computations. The computational results arepresented in section 6. Section 7 discusses and concludes the results.

2. Wavelets and Besov space priors.

2.1. Periodic wavelets in dimension two. Following standard references [15,52, 37, 13, 68] we construct two-dimensional periodic tensor-product wavelet functionson a two-dimensional torus T2.

Let φC and ψC be compactly supported scaling and wavelet functions of an or-thonormal multiresolution analysis in L2(R). In particular, we will use the Haarscaling and wavelet function. Following Daubechies [15, section 9.3] we construct1-periodic wavelets on the one-dimensional torus (circle) T. Set

φperj,k (x) =∑�∈Z

2jφC(2j(x+ �)− k),(2.1)

ψperj,k (x) =

∑�∈Z

2jψC(2j(x+ �)− k)(2.2)

and define the spaces

Vj = span{φperj,k | k = 0, 1, . . . , 2j − 1},Wj = span{ψper

j,k | k = 0, 1, . . . , 2j − 1}.

Now for j ≤ 0 the spaces Vj contain only constant functions, and for j ≥ 0 they arenested: V0 ⊂ V1 ⊂ V2 . . . . Furthermore,Wj = Vj∩V ⊥

j−1 and we have ∪j≥0Vj = L2(T).The two-dimensional periodic scaling functions and wavelets are obtained using

the tensor-product construction. The functions are parameterized by three indices:the type index � = 1, 2, 3, the scale index j ≥ 0 as above, and a two-dimensional

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B648 HAMALAINEN ET AL.

location index k = (k1, k2), where 0 ≤ k1 ≤ 2j − 1 and 0 ≤ k2 ≤ 2j − 1. Define

φj,�k(x, y) = 2j φperj,k1(x)φperj,k2

(y),

ψ1j,�k

(x, y) = 2j φperj,k1(x)ψper

j,k2(y),

ψ2j,�k

(x, y) = 2j ψperj,k1

(x)φperj,k2(y),

ψ3j,�k

(x, y) = 2j ψperj,k1

(x)ψperj,k2

(y),

where we use the definitions (2.1) and (2.2). Now we can represent discrete imagesby means of wavelet expansion

(2.3) f(x, y) =2J0−1∑k1=0

2J0−1∑k2=0

cJ0�kφJ0,�k

(x, y) +J−1∑j=J0

3∑�=1

2j−1∑k1=0

2j−1∑k2=0

wj�k�ψ�j,�k

(x, y),

where the coefficients cJ0�k and wj�k� are defined by

cJ0�k = 〈f, φJ0

�k〉 =∫T2

f(x, y)φj�k(x, y)dxdy,

wj�k� = 〈f, ψ�j�k〉 =

∫T2

f(x, y)ψ�j�k(x, y)dxdy.

The use of wavelets in computational inversion is attractive because fast algorithmsare available for computing the wavelet coefficients cJ0

�k and wj�k� for discrete images.

2.2. Besov space priors. Let us now present the Besov spaces Bspq(T

2) with asmoothness parameter s ∈ R and integrability exponents 1 ≤ p, q <∞. Following [68]we can characterize periodic Besov space functions using wavelets: if the functionsφC and ψC are smooth enough, a function f belongs to Bs

pq(T2) if and only if the

following expression is finite:

‖f‖Bspq(T

2) =

⎛⎝2J0−1∑k1=0

2J0−1∑k2=0

|cJ0�k|p

⎞⎠1p

+

⎛⎜⎝ ∞∑j=J0

2jq(s+1− 2p )

⎛⎝ 3∑�=1

2j−1∑k1=0

2j−1∑k2=0

|wj�k�|p⎞⎠

qp

⎞⎟⎠1q

.

In this paper we focus on the above Besov space norm with p = 1 and q = 1 ands = 1:

(2.4) ‖f‖B111(T

2) =

2J0−1∑k1=0

2J0−1∑k2=0

|cJ0�k|+

∞∑j=J0

3∑�=1

2j−1∑k1=0

2j−1∑k2=0

|wj�k�|.

The infinite-dimensional Besov B111(T

2) prior can be written formally as

(2.5) π(f)formally

= C exp(−α‖f‖B1

11(T2)

).

Rigorously speaking, the infinite-dimensional Besov B111(T

2) prior is the distribution,in the space Bs

11(T2) with s < −1, of the random variable

(2.6) F (x, y) =

2J0−1∑k1=0

2J0−1∑k2=0

CJ0�kφJ0,�k

(x, y) +

∞∑j=J0

3∑�=1

2j−1∑k1=0

2j−1∑k2=0

Wj�k�ψ�j,�k

(x, y),

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B649

where CJ0�k and Wj�k� are independent, identically distributed real valued random

variables having the density function π(x) = ce−|x|, x ∈ R.Consider now working with a periodic function f ∈ B1

11(T2) discretized as a

matrix also denoted by f ∈ RN×N . For simplicity, we renumber the matrix elements

using only one index and represent the image as vector f = [f1, . . . , fn]T ∈ R

n withn = N2. We denote the direct and inverse wavelet transforms, respectively, by

(2.7) w = B−1f ∈ Rn, f = Bw ∈ R

n,

that is, w = B−1f are the first n wavelet coefficients of the function f when thewavelets are ordered in a sequence and renumbered by an index ν ∈ Z+. Then theprobability density function of the finite-dimensional version of (2.5) is

(2.8) π(f) = C exp

(−α

n∑ν=1

|(B−1f)ν |),

where α > 0 is a parameter controlling the marginal variances of the prior distribution.

3. MAP estimation as quadratic programming. Using the Besov B111(T

2)prior leads to the following posterior distribution:

(3.1) πpost(f |m) = C exp

{− 1

2σ2‖Kf −m‖2�2 − α

n∑ν=1

|(B−1f)ν |}, f ≥ 0,

where σ > 0 is the standard deviation of the Gaussian measurement noise ε. Thecomputation of the MAP estimate for the posterior (3.1) amounts to the constrainedminimization of a nondifferentiable functional:

fMAP = argminf∈Rn

{1

2σ2‖Kf −m‖2�2 + α

n∑ν=1

|(B−1f)ν |},(3.2)

fν ≥ 0 for 1 ≤ ν ≤ n.

The minimization of (3.2) can be reformulated into a form of a QP problem withlinear constraints. Denoting

B−1f = w = w+ − w−,

with w±ν ≥ 0 for all 1 ≤ ν ≤ n, we can write (3.2) in the form

minf,w+,w−

{1

2σ2fTKTKf − 1

σ2fTKTm+ α1Tw+ + α1Tw− +

1

2σ2mTm

}f ≥ 0, w+ ≥ 0, w− ≥ 0, B−1f = w+ − w−.

Furthermore, denoting

x =

⎡⎣ fw+

w−

⎤⎦ , Q =

⎡⎣ 1σ2K

TK 0 00 0 00 0 0

⎤⎦ , c =

⎡⎣ − 1σ2K

Tmα1α1

⎤⎦ , d =1

2σ2mTm,

the minimization problem takes the standard form

minx∈R3n

{1

2xTQx+ cTx+ d

}, Ax = b, xν ≥ 0,(3.3)

where A =[B−1 −I I

].

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B650 HAMALAINEN ET AL.

3.1. A primal-dual interior-point algorithm. We consider the primal prob-lem (3.3) where x ∈ R

nx and b ∈ Rny . Furthermore, Q is an nx × nx matrix and A is

an ny ×nx matrix. We can eliminate the nonnegativity constraints xν ≥ 0 by placingthem in a barrier term, resulting in the barrier problem

minimize qμ(x) =1

2xTQx+ cTx+ d− μ

nx∑i=1

log(xi)(3.4)

such that Ax = b,

where the objective function qμ is the classical Fiacco–McCormick-type logarithmicbarrier function [22]. The Lagrangian function for the problem (3.4),

(3.5) L(x, y;μ) = 1

2xTQx+ cTx+ d− μ

nx∑i=1

log(xi)− yT (Ax− b),

yields the first order optimality conditions, often referred to as the Karush–Kuhn–Tucker (KKT) conditions

∇xL(x, y;μ) = Qx+ c− μX−11−AT y = 0,

∇yL(x, y;μ) = b−Ax = 0,

where X denotes a diagonal matrix with elements xi and 1 is a vector of all ones.Denoting μX−11 =: z, the KKT conditions can be written as

AT y + z −Qx = c,

Ax = b,(3.6)

XZ1 = μ1,

where Z denotes a diagonal matrix with elements zi and μ > 0 is the central pathparameter; X and Z are diagonal matrices with elements xi, zi, respectively. Thevariable z is complementary to the nonnegative variable x, which implies that z ≥ 0.The central path, parameterized by μ > 0, is a trajectory leading to the optimal so-lution of the QP problem. Note that at the optimal point μ ≡ 0 the barrier objectivefunction becomes equivalent with the primal objective function. When Q is positivesemidefinite these KKT conditions are both necessary and sufficient optimality con-ditions [56] for the QP problem (3.3), and the problem can be solved by finding asolution to the system (3.6). Writing the KKT conditions in a form of a mappingF : R2nx+ny → R

2nx+ny ,

(3.7) F(x, y, z;μ) =

⎡⎣ Qx−AT y − z + cAx− b

XZ1− μ1

⎤⎦ = 0,

assuming that μ is fixed, and applying Newton’s method we obtain

(3.8)

⎡⎣ −Q AT IA 0 0I 0 Z−1X

⎤⎦⎡⎣ ΔxΔyΔz

⎤⎦ =

⎡⎣ σργz

⎤⎦ ,where σ = Qx + c − AT y − z and ρ = b − Ax and γz = μX−11 − z +X−1ΔXΔZ.We can eliminate Δz from (3.8), resulting in the reduced KKT system

(3.9)

[ − (Q+X−1Z

)AT

A 0

] [ΔxΔy

]=

[σ −X−1Zγz

ρ

],

where Δz = X−1Z (γz −Δx).

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B651

The primal-dual path following interior-point algorithm we use to solve the QPproblem is based on Mehrotra’s predictor-corrector method [51] and it proceeds iter-atively from an initial point (x0, y0, z0) through a sequence of points determined by

xk+1 = xk + βprimalΔx,

yk+1 = yk + βdualΔy,

zk+1 = zk + βdualΔz.

The algorithm proposed here uses different step lengths for the primal and dual vari-ables. The unequal step lengths βprimal and βdual are chosen as follows: first, wecompute the largest feasible step lengths

βmaxp = 0.95×min

(min

Δxi<0

(− xiΔxi

), 1

), i = 1, . . . , nx,

βmaxd = 0.95×min

(min

Δzi<0

(− ziΔzi

), 1

), i = 1, . . . , nx,

which keep the variables xk+1i and zk+1

i strictly positive. Then we set βdual = βmaxd

and compute the βprimal by using a backtracking line search on interval [0, βmaxp ]. The

central path parameter μ which is the measure of duality is computed using a similarmethod first proposed by Vanderbei and Shanno in 1999 [72]; thus μ is defined by

(3.10) μ = 0, 1×min

(0.05

1− ξ

ξ, 2

)3xT z

nx,

where ξ = mini xizizT x/nx

.

For general references to QP methods, in particular interior-point methods, see[56, 7, 74, 71]. In the above we consider only equality constraints and nonnegativityconstraints. Other types of constraints, including general inequalities, can be handledsimilarly [56, 7, 74].

4. Sparsity-based parameter selection: The S-curve method. Assumethat we have a priori an estimate S for the number of nonzero wavelet coefficients(B−1f)ν of the unknown function f .

Given an estimate S for the number of nonzero wavelet coefficients, we select theprior parameter α using a modification of the method we proposed in [41]. Denote

(4.1) S(α) = #{ν : 1 ≤ ν ≤ n, |(B−1fMAP(α))ν | > κ

},

where #E denotes the number of elements in the set E. Theoretically we considerκ = 0, but in practical computations κ is set to a small but positive value.

In [41] estimates with 500 different values of α were computed to obtain pairs{α(h), S(α(h))}, and then the value was selected as

α = argminh

{|S − S(α(h))|}.

With respect to the computational burden, the approach has computational complex-ity similar to that of the classical L-curve method, which also involves computationof hundreds of estimates with different values of α. Computing so many estimatesis time consuming and can become a computational bottleneck in large dimensional

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B652 HAMALAINEN ET AL.

problems such as X-ray tomography. Thus, to decrease the computational complexity,we modify the approach in [41] as follows.

First, instead of computing {α(h), S(α(h))} with a large number of α(h)’s, wedecrease the number of samples and compute S(α(h)) only for a small number ofvalues and interpolate the data {α(h), S(α(h))} to get the S-curve S(α). Thus, wetake a collection of the prior parameters α ranging on the interval [0,∞] such that

0 < α(1) < α(2) < · · · < α(M) <∞

and compute the corresponding estimates fMAP(α(1)), . . . , fMAP(α(M)). The firstvalue α(1) is taken to be so small that almost all of the wavelet coefficients of fMAP(α(1))are nonzero. The last value α(M) is taken so large that almost all of the wavelet coef-ficients of fMAP(α(M)) are zero; see [41, Appendix A] for a proof that this is possible.Then we compute the numbers of (essentially) nonzero wavelet coefficients of eachof the computed MAP estimates and fit a smooth interpolation curve to the data{α(h), S(α(h))}. The interpolated sparsity curve is then used to find the value of α

for which S(α) = S.Second, in the selection of α we employ the discretization-invariance of the MAP

estimate with the B111 prior. By [47, 41] the solution of (1.5) converges to a well-

defined asymptotical estimate as discretization is refined. This implies that we mayuse, in the computations for selecting the value of α, a coarser discretization than isused in the computation of the final MAP estimate.

5. Measurement data. In this work we consider two test cases: (i) simulatedprojection data from the Shepp–Logan phantom, and (ii) experimental sparse-angleprojection data from a walnut. As a reference method for the MAP estimates with theBesov space prior we compute the filtered back-projection (FBP) reconstructions. Inboth cases we study the performance of the MAP and FBP estimates as the numberof projection directions is decreased. In addition, reconstruction using the Tikhonovregularization, i.e.,

(5.1) fTik = minf∈Rn

(1

2σ2‖Kf −m‖22 + α‖f‖22

), f ≥ 0,

is computed from the simulated measurement data such that the regularization pa-rameter α is selected using the Morozov discrepancy principle.

5.1. Simulated data. The simulated measurement data was computed usingthe modified Shepp–Logan phantom. Gaussian measurement noise ε ∼ N (0, σ2I)with a standard deviation of 1% of the maximum of the computed noise-free datawas added to the projection data. A more dense grid was used in the generation ofthe noise-free data than was used in any of the reconstructions, thus avoiding inversecrime. With the simulated data we consider five data sets consisting of 148, 74, 37,19, and 13 projections with uniform angular sampling from angular spanning of 180◦.

5.2. Tomographic projection data. The X-ray tomography data of the wal-nut were acquired with the custom-built μCT device nanotom 180 supplied by Phoenixand X-ray Systems and Services GmbH (Wunstorf, Germany). The measurementsetup is depicted in Figure 5.1, and the chosen geometry resulted in a magnificationwith a resolution of 18.33 μm/pixel. The X-ray detector is a 12-bit CMOS (comple-mentary metal oxide semiconductor) flat panel detector with 2304× 2284 pixels of 50μm size (Hamamatsu Photonics, Japan). A set of 90 projection images was acquired

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B653

Fig. 5.1. Left: Experimental setup for collecting tomographic X-ray data of a walnut. Thedetector plane is on the left and the X-ray source on the right in the picture. The walnut is attachedto a computer-controlled rotator platform. Right: Two examples of the resulting projection images.

over a 180-degree rotation with a uniform angular step of 2 degrees between projec-tions. Each projection image was composed of an average of six 750 ms exposures.The X-ray tube acceleration voltage was 80 kV and tube current 200 μA, and the fullpolychromatic beam was used for image acquisition. For this work we chose only theprojections corresponding to the middle cross-section of the walnut, thus resultingin a two-dimensional reconstruction task. From the measured data we picked foursubsets consisting of 90, 45, 30, and 15 projections with uniform angular samplingfrom a total opening angle of 180 degrees.

5.3. Photographic data. We wish to use the S-curve method of section 4 todetermine the prior parameter α for two-dimensional tomographic reconstructionsfrom the projection data measured from a walnut (see section 5.2). For this we needan a priori estimate of the sparsity of the cross-section of the walnut in the appropriatewavelet basis. We simulate the use of an anatomical atlas for the selection of α bytaking photographs of dissected walnuts.

We carefully sawed three walnuts in half. The exposed cross-sections were il-luminated using three LED flashlights placed at 120-degree angles. The beams ofthe flashlights were roughly aligned with the plane of the cross-section, maximizingthe contrast between shadows and illuminated parts of the surface. The cross-sectionswere photographed with a Canon 5D Mark II digital single lens reflex camera featuringa 21.1-megapixel full-frame CMOS sensor. The lens was a Canon EF 100mm/f2.8USM macro optimized for close-up photography. See Figure 6.1 for the resultingimages.

The walnut used as the target in the X-ray experiment in section 5.2 was notincluded in the set of three walnuts used for estimating the sparsity.

6. Results. We solve the minimization problems (3.4) and (5.1) by using theprimal-dual path following interior-point method of section 3.1.

In the examples the prior parameter choice is carried out using 128 × 128 reso-lution, and final MAP estimates with the selected α are computed on a 256 × 256grid. The number M of interpolation points {α(h), S(α(h))} is set to M = 20. The

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B654 HAMALAINEN ET AL.

Fig. 6.1. Photographs of three walnuts split in half.

regularization parameters α in (5.1) are computed using the Morozov discrepancyprinciple.

Notice that in real-life applications we would measure the sparse-angle projectiondata only, and thus the value of α has to be chosen using the data at hand. Thus, wecompute a separate value of α for each data set.

6.1. Simulated data. We compute the a priori sparsity level S using the targetphantom sampled at n = 128×128 resolution and κ = 10−6; this resulted in S = 1622.This value is then used to compute the value of α (using 128 × 128 discretization).This situation corresponds to the ideal case that the sparsity level of the true unknownis exactly known.

Figure 6.2 shows the sparsity-based choice of α applied to each data set withdifferent numbers of projections. The left column shows the smooth interpolatedcurve obtained by interpolation using the 20 points of data (α(h), S(α(h))) and theright column shows the MAP estimate with the selected α.

The left column of Figure 6.3 shows the FBP reconstructions, the middle col-umn the MAP estimates with the selected parameters, and the right column theTikhonov regularized reconstructions with the Morozov discrepancy principle. Wealso computed the relative errors of the FBP, Tikhonov, and MAP reconstructionswith respect to the true target image ftrue:

δR =‖ftrue − fR‖

‖ftrue‖ .

The estimation errors are presented in Table 6.1.We also applied the L-curve method to the simulated data set consisting of 37

projections (third row in Figure 6.2). The MAP estimates were computed using128 × 128 discretization and 400 values of α ranging on interval [10−4, 107]. Theresults are shown in Figure 6.4. The left column shows the L-curve plot and theright column shows the resulting MAP estimate with the selected α. The reconstruc-tions when α is selected using the L-curve method, S-curve method, and Morozovdiscrepancy principle are presented in Figure 6.5 for the simulated data consisting of37 projections.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B655

S = 1622

12000

S = 1622

12000

S = 1622

10000

S = 1622

10000

S = 1622

10000

α = 32.64

148 angles

α = 23.77

74 angles

α = 15.24

37 angles

α = 8.61

19 angles

α = 5.18

13 angles

10−4 107

S-curve Reconstruction

Fig. 6.2. Sparsity-based choice of the prior parameter. Left: The numbers of nonzero waveletcoefficients in MAP estimates computed with 20 values of α ranging in the interval [10−4, 107](denoted by ◦) and plot of the interpolation curve used to determine the value of α. Right: Recon-structions (n = 128×128) using the selected α. The selected values for α and the numbers of equallydistributed projection angles are denoted on the right.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B656 HAMALAINEN ET AL.

148 angles

74 angles

37 angles

19 angles

13 angles

α = 32.64

α = 23.77

α = 15.24

α = 8.61

α = 5.18

αM = 0.059

αM = 0.026

αM = 0.009

αM = 0.014

αM = 0.018

Fig. 6.3. Sparse-angle tomography reconstructions of the Shepp–Logan phantom. Reconstruc-tions using FBP (left column), MAP estimates using B1

11 prior (middle column), and Tikhonovreconstructions (right column). The numbers of projections and α’s are indicated on the right. αdenotes the prior parameter selected using the S-curve method and αM denotes the values of regu-larization parameters selected using Morozov’s discrepancy principle. The reconstruction resolutionis 256× 256.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B657

Table 6.1

Relative errors and computational times of MAP, FBP, and Tikhonov reconstructions. Relativeerrors are computed with respect to the original Shepp–Logan phantom at resolution 256 × 256.

148 74 37 19 13δMAP 0.10 0.12 0.12 0.13 0.17tMAP 102 min 87 min 110 min 127 min 146 minδFBP 0.14 0.16 0.27 0.51 0.78tFBP 0.12 sec 0.04 sec 0.02 sec 0.01 sec 0.01 secδTik 0.10 0.12 0.13 0.23 0.29tTik 157 sec 75 sec 42 sec 26 sec 23 sec

101 10310−15

105MAP estimate with α = 33022

‖Kf −m‖

‖f‖B111

Fig. 6.4. L-curve method applied to data set with 37 projections. Left: The L-curve plotcomputed using 400 values of α ranging on interval [10−4, 107]. Right: MAP estimate (resolution128 × 128) with the selected α = 330022.

Fig. 6.5. L-curve method, S-curve method, and Morozov discrepancy principle applied to dataset with 37 projections. Reconstructions with selected α’s from left to right are MAP estimate withα selected using the L-curve method αL = 33022, MAP estimate with α selected using the S-curvemethod α = 15.24, and Tikhonov reconstruction with the α selected using the Morozov discrepancyprinciple αM = 6.68, respectively. The reconstruction resolution is 128 × 128.

6.2. Experimental data. We estimate the sparsity level S from photographsof three different walnuts that were split in half. The photographs of the split walnutsare shown in Figure 6.1. Each of the photographs was resized to 128× 128, and thenumber of nonzero wavelet coefficients was computed from each image using thresholdκ = 10−6. The sparsity level estimate S was then computed as a mean over the threesamples, leading to the estimate S = 5936. Figure 6.6 shows the sparsity-based

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B658 HAMALAINEN ET AL.

S = 5936

12000

S = 5936

12000

S = 5936

14000

S = 5936

16000

α = 0.024

90 angles

α = 0.011

45 angles

α = 0.019

30 angles

α = 0.015

15 angles

10−6 107

S-curve Reconstruction

Fig. 6.6. Sparsity-based choice of α. Left: The numbers of nonzero wavelet coefficients inMAP estimates computed with 20 values of α ranging in the interval [10−4, 107] (denoted by ◦) andplot of the interpolation curve used to determine the value of α. Right: MAP estimates (resolution128×128) using the selected values of α. The numbers of projection angles are denoted on the right.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B659

α = 0.024

α = 0.011

α = 0.019

α = 0.015

90 angles

45 angles

30 angles

15 angles

Fig. 6.7. Sparse angle tomography reconstructions of the walnut data. Reconstructions usingFBP (left column) and MAP estimates with B1

11 prior (right column). The reconstructions arecomputed at resolution 256× 256.

selection of α using the estimated S. The left column shows the smooth interpolatedcurve obtained by interpolation using the 20 points of data (α(h), S(α(h))), and theright column shows the MAP estimate with the selected α. Figure 6.7 shows the FBP

(top row) and final MAP estimates (bottom row) computed at 256× 256 resolution.

7. Discussion. We investigate the performance of Bayesian MAP estimateswith the Besov B1

11 space prior and Haar wavelet functions by computing two-dimensional reconstructions from sparse-angle X-ray projection data. As a referencemethod we compute FBP reconstructions. Both methods are applied to sparse-angledata sets in order to study how these methods perform when the number of projection

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B660 HAMALAINEN ET AL.

angles is progressively reduced.In [41] we proposed a novel method for selecting the prior parameter α based on

a priori knowledge about the number of nonzero wavelet coefficients. In this work wemodify the method in order to make it computationally less complex, and we call theresult the S-curve method. Instead of evaluating sparsity curve S(α) using a largenumber of values of α, we use smooth interpolation using a small number of datapoints to estimate the sparsity curve S(α). Further decrease in the computationalcost is obtained by utilizing the discretization-invariance of the MAP estimate withthe B1

11 prior by carrying out the selection of α using a coarser resolution than is usedin the final MAP estimates.

The results show that the MAP estimates with the B111 prior perform robustly

when the number of projections is decreased. The relative errors of the reconstructionswith respect to the true target in the simulated test case are smaller for the MAP

estimates than they are for the FBP reconstructions, and also the increment in theestimation error is much smaller for MAP than FBP when the number of projectionimages decreases. Whereas with relatively densely sampled data the relative errorswith the MAP estimates and Tikhonov regularization are the same, the errors in theMAP estimates are smaller than in the Tikhonov reconstructions when the numberof projections is decreased. However, the price of this benefit is paid in increasedcomputation time, as is seen in Table 6.1.

We compared the sparsity-based parameter selection method to the L-curvemethod. It turned out that the L-curve method fails with the X-ray projection data;the L-shape in the curve is barely recognizable and the selected α results in a recon-struction with inferior quality; see Figure 6.4. The results of applying the L-curvemethod to the tomographic data are shown only for one sparse-angle data set; how-ever, the results with the L-curve method for the other data sets were similar. Notethat whereas the L-curve finds the trade-off between the mismatch and the primalnorm of the regularizer, the S-curve identifies a target sparsity level in the MAP esti-mate. For an alternative approach to finding the trade-off between the least-squaresfit and the ‖·‖1 norm of the solution, see [70], where a basis pursuit denoising problemis solved by sampling the Pareto-curve.

The proposed S-curve method was also tested within the following case study.Let’s assume that we determine S with images from healthy subjects, but the pro-jection data is not tumor-free. What would happen? Would S turn out to be toolow to detect a tumor in the actual subject? We suspect the following. Waveletsare mostly needed near edges in the image. A tumor will contribute somewhat moreedges in the image, but usually we wouldn’t expect the amount of edges to doublecompared to a tumor-free situation. Rather, we have in any case a lot of boundariesbetween inner organs, and the presence of the tumor will probably result in a smallproportion of extra edges. If there are not enough wavelets for all parts of the edges,the available wavelets probably will be distributed at edges all around the image do-main representing all edges somewhat erroneously. Thus also the tumor will be asvisible as all the other structures. Figure 7.1 presents a numerical illustration of sucha situation; here the selection of α was carried out using the sparsity estimate S fromthe original “healthy” Shepp–Logan phantom that was used in Figure 6.2.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B661

αS = 32.64

αS = 23.77

αS = 15.24

αS = 8.61

αS = 5.18

αM = 0.0586

αM = 0.0255

αM = 0.0092

αM = 0.0140

αM = 0.0176

148 angles

74 angles

37 angles

19 angles

13 angles

Fig. 7.1. Sparse-angle tomography reconstructions of the Shepp–Logan phantom with tumor.Reconstructions using FBP (left column), MAP estimates using B1

11 prior (middle column), andreconstructions using Tikhonov’s regularization (right column). The number of projections and thevalues of the parameters α are indicated on the right. αS denotes the prior parameter selected usingthe S-curve method, and αM denotes the values of regularization parameters selected using Morozov’sdiscrepancy principle.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B662 HAMALAINEN ET AL.

The question of choosing the mother wavelet is curious. We used Haar waveletsabove because they provided us with edge-preserving reconstructions. Smootherwavelets would be needed for ensuring that the multiresolution analysis really givesa basis for the Besov B1

11 space (see Triebel’s book [68]). Further studies are neededfor a better understanding of the consequences of the mother wavelet choice.

Acknowledgments. The authors thank the anonymous referees for their con-structive criticism that helped improve the study. The authors thank Lauri Harhanenand Esa Niemi for their help in tomographic modeling.

REFERENCES

[1] K. D. Andersen, E. Christiansen, A. R. Conn, and M. L. Overton, An efficient primal-dualinterior-point method for minimizing a sum of Euclidean norms, SIAM J. Sci. Comput.,22 (2000), pp. 243–262.

[2] U. M. Ascher and E. Haber, Grid refinement and scaling for distributed parameter estimationproblems, Inverse Problems, 17 (2001), pp. 571–590.

[3] D. Baron, S. Sarvotham, and R. G. Baraniuk, Bayesian compressive sensing via beliefpropagation, IEEE Trans. Signal Process., 58 (2010), pp. 269–280.

[4] J. Bian, X. Han, E. Y. Sidky, G. Cao, J. Lu, O. Zhou, and X. Pan, Investigation of sparsedata mouse imaging using micro-CT with a carbon-nanotube-based X-ray source, TsinghuaSci Technol., 15 (2010), pp. 74–78.

[5] J. Bian, J. H. Siewerdsen, X. Han, E. Y. Sidky, J. L. Prince, C. A. Pelizzari, and

X. Pan, Evaluation of sparse-view reconstruction from flat-panel-detector cone-beam Ct,Phys. Med. Biol., 55 (2010), pp. 6575–6599.

[6] A. Borsic, B. Graham, A. Adler, and W. Lionheart, In vivo impedance imaging with totalvariation regularization, IEEE Trans. Medical Imaging, 29 (2010), pp. 44–54.

[7] S. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, Cam-bridge, UK, 2009.

[8] D. Calvetti and E. Somersalo, Microlocal sequential regularization in imaging, InverseProbl. Imaging, 1 (2007), pp. 1–11.

[9] E. J. Candes, J. Romberg, and T. Tao, Robust uncertainty principles: Exact signal recon-struction from highly incomplete frequency information, IEEE Trans. Inform. Theory, 52(2006), pp. 489–509.

[10] E. J. Candes, J. K. Romberg, and T. Tao, Stable signal recovery from incomplete andinaccurate measurements, Comm. Pure Appl. Math., 59 (2006), pp. 1207–1223.

[11] A. Cederlund, M. Kalke, and U. Welander, Volumetric tomography—a new tomographictechnique for panoramic units, Dentomaxillofacial Radiology, 38 (2009), pp. 104–111.

[12] K. Choi, J. Wang, L. Zhu, T.-S. Suh, S. Boyd, and L. Xing, Compressed sensing basedcone-beam computed tomography reconstruction with a first-order method, Med. Phys., 37(2010), pp. 5113–5125.

[13] C. K. Chui, Wavelets: A Mathematical Tool for Signal Analysis, Math. Model. Comput. 1,SIAM, Philadelphia, 1997.

[14] M. Cristofol, P. Gaitan, K. Niinimaki, and O. Poisson, Inverse problem for a coupledparabolic system with discontinuous conductivities: One-dimensional case, Inverse Probl.Imaging, 7 (2013), pp. 159–182.

[15] I. Daubechies, Ten Lectures on Wavelets, CBMS-NSF Reg. Conf. Ser. Appl. Math 61, SIAM,Philadelphia, 1992.

[16] I. Daubechies, M. Defrise, and C. De Mol, An iterative thresholding algorithm for linearinverse problems with a sparsity constraint, Comm. Pure Appl. Math., 57 (2004), pp. 1413–1457.

[17] A. H. Delaney and Y. Bresler, Globally convergent edge-preserving regularized reconstruc-tion: An application to limited-angle tomography, IEEE Trans. Image Process., 7 (1998),pp. 204–221.

[18] D. L. Donoho, A. Maleki, and A. Montanari, Message-passing algorithms for compressedsensing, Proc. Nat. Acad. Sci. USA, 106 (2009), pp. 18914–18919.

[19] X. Duan, L. Zhang, Y. Xing, Z. Chen, and J. Cheng, Few-view projection reconstructionwith an iterative reconstruction-reprojection algorithm and TV constraint, IEEE Trans.Nuclear Sci., 56 (2009), pp. 1377–1382.

[20] M. F. Duarte and R. G. Baraniuk, Kronecker compressive sensing, IEEE Trans. ImageProcess., 21 (2012), pp. 494–504.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B663

[21] H. Feng, W. Karl, and D. Castanon, A curve evolution approach to object-based tomographicreconstruction, IEEE Trans. Image Process., 12 (2003), pp. 44–57.

[22] A. V. Fiacco and G. P. McCormick, Nonlinear Programming: Sequential UnconstrainedMinimization Techniques, John Wiley and Sons, New York, 1968.

[23] J. Frikel, Sparse Regularization in Limited Angle Tomography, preprint, arXiv:1109.0385[math.NA], 2011.

[24] H. Fu, M. K. Ng, M. Nikolova, and J. L. Barlow, Efficient minimization methods of mixedl2-l1 and l1-l1 norms for image restoration, SIAM J. Sci. Comput., 27 (2006), pp. 1881–1902.

[25] M. Grasmair, M. Haltmeier, and O. Scherzer, Sparse regularization with lq penalty term,Inverse Problems, 24 (2008), 055020.

[26] M. Grasmair, M. Haltmeier, and O. Scherzer, Necessary and sufficient conditionsfor linear convergence of l1-regularization, Comm. Pure Appl. Math., 64 (2011),pp. 161–182.

[27] L. He and L. Carin, Exploiting structure in wavelet-based Bayesian compressive sensing, IEEETrans. Signal Process., 57 (2009), pp. 3488–3497.

[28] G. T. Herman and R. Davidi, Image reconstruction from a small number of projections,Inverse Problems, 24 (2008), 045011.

[29] N. Hyvonen, M. Kalke, M. Lassas, H. Setala, and S. Siltanen, Three-dimensional dentalX-ray imaging by combination of panoramic and projection data, Inverse Probl. Imaging,4 (2010), pp. 257–271.

[30] T. Jensen, J. Jørgensen, P. Hansen, and S. Jensen, Implementation of an optimal first-order method for strongly convex total variation regularization, BIT, 52 (2012), pp. 329–356.

[31] S. Ji, Y. Xue, and L. Carin, Bayesian compressive sensing, IEEE Trans. Signal Process., 56(2008), pp. 2346–2356.

[32] X. Jia, B. Dong, Y. Lou, and S. B. Jiang, GPU-based iterative cone-beam CT reconstructionusing tight frame regularization, Phys. Med. Biol., 56 (2011), pp. 3787–3807.

[33] X. Jia, Y. Lou, R. Li, W. Y. Song, and S. B. Jiang, GPU-based fast cone beam CT recon-struction from undersampled and noisy projection data via total variation, Med. Phys., 37(2010), pp. 1757–1760.

[34] C. Johnson, J. Seidel, and A. Sofer, Interior-point methodology for 3-d pet reconstruction,IEEE Trans. Medical Imaging, 19 (2000), pp. 271–285.

[35] C. A. Johnson and A. Sofer, A primal-dual method for large-scale image reconstruction inemission tomography, SIAM J. Optim., 11 (2000), pp. 691–715.

[36] J. Kaipio and E. Somersalo, Statistical and Computational Inverse Problems, Appl. Math.Sci. 160, Springer-Verlag, Berlin, New York, 2004.

[37] F. Keinert, Wavelets and Multiwavelets, Chapman and Hall, London, 2004.[38] S.-J. Kim, K. Koh, M. Lustig, S. Boyd, and D. Gorinevsky, An interior-point method for

large-scale l1-regularized least squares, IEEE J. Selected Topics Signal Process., 1 (2007),pp. 606–617.

[39] E. Klann, R. Ramlau, and L. Reichel, Wavelet-based multilevel methods for linear ill-posedproblems, BIT, 51 (2011), pp. 669–694.

[40] E. Klann, R. Ramlau, and W. Ring, A Mumford-Shah level-set approach for the inversionand segmentation of SPECT/CT data, Inverse Probl. Imaging, 5 (2011), pp. 137–166.

[41] V. Kolehmainen, M. Lassas, K. Niinimaki, and S. Siltanen, Sparsity-promoting Bayesianinversion, Inverse Problems, 28 (2012), 025005.

[42] V. Kolehmainen, M. Lassas, and S. Siltanen, Limited data X-ray tomography using nonlin-ear evolution equations, SIAM J. Sci. Comput., 30 (2008), pp. 1413–1429.

[43] V. Kolehmainen, S. Siltanen, S. Jarvenpaa, J. Kaipio, P. Koistinen, M. Lassas, J. Pirt-

tila, and E. Somersalo, Statistical inversion for medical X-ray tomography with fewradiographs: II. Application to dental radiology, Phys. Med. Biol., 48 (2003), pp. 1465–1490.

[44] V. Kolehmainen, A. Vanne, S. Siltanen, S. Jarvenpaa, J. Kaipio, M. Lassas, and

M. Kalke, Parallelized Bayesian inversion for three-dimensional dental X-ray imaging,IEEE Trans. Medical Imaging, 25 (2006), pp. 218–228.

[45] T. Kubo, P.-J. P. Lin, W. Stiller, M. Takahashi, H.-U. Kauczor, Y. Ohno, and

H. Hatabu, Radiation dose reduction in chest CT: A review, Amer. J. Roentgenology,190 (2008), pp. 335–343.

[46] H. Langet, C. Riddell, Y. Trousset, A. Tenenhaus, E. Lahalle, G. Fleury, and N. Para-

gios, Compressed sensing based 3D tomographic reconstruction for rotational angiogra-phy, in Medical Image Computing and Computer-Assisted Intervention (MICCAI 2011),G. Fichtinger, A. Martel, and T. Peters, eds., Lecture Notes in Comput. Sci. 6891, Springer,Berlin, Heidelberg, 2011, pp. 97–104.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

B664 HAMALAINEN ET AL.

[47] M. Lassas, E. Saksman, and S. Siltanen, Discretization-invariant Bayesian inversion andBesov space priors, Inverse Probl. Imaging, 3 (2009), pp. 87–122.

[48] M. Lassas and S. Siltanen, Can one use total variation prior for edge-preserving Bayesianinversion?, Inverse Problems, 20 (2004), pp. 1537–1563.

[49] H. Liao and G. Sapiro, Sparse representations for limited data tomography, in Proceedings ofthe 5th IEEE International Symposium, Biomedical Imaging: From Nano to Macro (ISBI2008), IEEE Press, Piscataway, NJ, pp. 1375–1378.

[50] C. H. McCollough, A. N. Primak, N. Braun, J. Kofler, L. Yu, and J. Christner, Strate-gies for reducing radiation dose in ct, Radiol. Clin. North Amer., 47 (2009), pp. 27–40.

[51] S. Mehrotra, On the implementation of a primal-dual interior point method, SIAM J. Optim.,2 (1992), pp. 575–601.

[52] Y. Meyer, Wavelets and Operators, Vol. 1, Cambridge University Press, Cambridge, UK, 1995.[53] J. Mueller and S. Siltanen, Linear and Nonlinear Inverse Problems with Practical Applica-

tions, Comput. Sci. Engrg. 10, SIAM, Philadelphia, 2012.[54] K. Niinimaki, S. Siltanen, and V. Kolehmainen, Bayesian multiresolution method for local

tomography in dental X-ray imaging, Phys. Med. Biol., 52 (2007), pp. 6663–6678.[55] M. Nikolova, M. K. Ng, S. Zhang, and W.-K. Ching, Efficient reconstruction of piecewise

constant images using nonsmooth nonconvex minimization, SIAM J. Imaging Sci., 1 (2008),pp. 2–25.

[56] J. Nocedal and S. Wright, Numerical Optimization, 2nd ed., Springer Ser. Oper. Res.,Springer-Verlag, New York, 2006.

[57] M. Persson, D. Bone, and H. Elmqvist, Total variation norm for three-dimensional it-erative reconstruction in limited view angle tomography, Phys. Med. Biol., 46 (2001),pp. 853–866.

[58] R. Ramlau and W. Ring, A Mumford-Shah level-set approach for the inversion and segmen-tation of X-ray tomography data, J. Comput. Phys., 221 (2007), pp. 539–557.

[59] M. Rantala, S. Vanska, S. Jarvenpaa, M. Kalke, M. Lassas, J. Moberg, and S. Siltanen,Wavelet-based reconstruction for limited-angle X-ray tomography, IEEE Trans. MedicalImaging, 25 (2006), pp. 210–217.

[60] P. Schniter, L. Potter, and J. Ziniel, Fast bayesian matching pursuit, in Proceedings ofthe Information Theory and Applications Workshop, IEEE Press, Piscataway, NJ, 2008,pp. 326–333.

[61] E. Sidky, C. Kao, and X. Pan, Effect of the data constraint on few-view, fan-beam CT imagereconstruction by TV minimization, in Nuclear Science Symposium Conference Record,Vol. 4, IEEE Press, Piscataway, NJ, 2006, pp. 2296–2298.

[62] E. Y. Sidky, C.-M. Kao, and P. Xiaochuan, Accurate image reconstruction from few-views and limited-angle data in divergent-beam ct, J. X-Ray Sci. Technol., 14 (2006),pp. 119–139.

[63] E. Y. Sidky and X. Pan, Image reconstruction in circular cone-beam computed tomographyby constrained, total-variation minimization, Phys. Med. Biol., 53 (2008), pp. 4777–4807.

[64] C. Soussen and J. Idier, Reconstruction of three-dimensional localized objects from limitedangle X-ray projections: An approach based on sparsity and multigrid image representa-tion, J. Electron. Imaging, 17 (2008), 033011.

[65] J. Tang, B. E. Nett, and G.-H. Chen, Performance comparison between total variation(TV)-based compressed sensing and statistical iterative reconstruction algorithms, Phys.Med. Biol., 54 (2009), p. 5781–5804.

[66] A. Tarantola, Inverse Problem Theory and Methods for Model Parameter Estimation, SIAM,Philadelphia, 2005.

[67] Z. Tian, X. Jia, K. Yuan, T. Pan, and S. B. Jiang, Low-dose CT reconstruction via edge-preserving total variation regularization, Phys. Med. Biol., 56 (2011), p. 5949–5967.

[68] H. Triebel, Function Spaces and Wavelets on Domains, EMS Tracts Math. 7, EuropeanMathematical Society, Zurich, 2008.

[69] V. Tsapaki, J. E. Aldrich, R. Sharma, M. A. Staniszewska, A. Krisanachinda, M. Re-

hani, A. Hufton, C. Triantopoulou, P. N. Maniatis, J. Papailiou, and M. Prokop,Dose reduction in CT while maintaining diagnostic confidence: Diagnostic reference levelsat routine head, chest, and abdominal CT—IAEA-coordinated research project, Radiology,240 (2006), pp. 828–834.

[70] E. van den Berg and M. P. Friedlander, Probing the pareto frontiers for basis pursuitsolutions, SIAM J. Sci. Comput., 31 (2008), pp. 890–912.

[71] R. J. Vanderbei, Linear programming: Foundations and Extensions, 3rd ed., Internat. Ser.Oper. Res. Management Sci. 114, Springer-Verlag, New York, 2008.

[72] R. J. Vanderbei and D. F. Shanno, An interior-point algorithm for nonconvex nonlinearprogramming, Comput. Optim. Appl., 13 (1999), pp. 231–252.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php

Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

SPARSE TOMOGRAPHY B665

[73] S. Vanska, M. Lassas, and S. Siltanen, Statistical X-ray tomography using empirical Besovpriors, Internat. J. Tomography Statist., 11 (2009), pp. 3–32.

[74] S. J. Wright, Primal-Dual Interior-Point Methods, SIAM, Philadelphia, 1997.[75] S. Yoon, A. R. Pineda, and R. Fahrig, Simultaneous segmentation and reconstruction: A

level set method approach for limited view computed tomography, Med. Phys., 37 (2010),pp. 2329–2340.

[76] D. F. Yu and J. A. Fessler, Edge-preserving tomographic reconstruction with nonlocal regu-larization, IEEE Trans. Medical Imaging, 21 (2002), pp. 159–173.

[77] H. Yu and G. Wang, Compressed sensing based interior tomography, Phys. Med. Biol., 54(2009), p. 2791–2805.

[78] H. Yu and G. Wang, A soft-threshold filtering approach for reconstruction from a limitednumber of projections, Phys. Med. Biol., 55 (2010), p. 3905–3916.

[79] L. Yu, X. Liu, S. Leng, J. M. Kofler, J. C. Ramirez-Giraldo, M. Qu, J. Christner, J. G.

Fletcher, and C. H. McCollough, Radiation dose reduction in computed tomography:Techniques and future perspective, Imaging Med., 1 (2009), pp. 65–84.

Dow

nloa

ded

06/0

7/13

to 1

28.2

14.5

.10.

Red

istr

ibut

ion

subj

ect t

o SI

AM

lice

nse

or c

opyr

ight

; see

http

://w

ww

.sia

m.o

rg/jo

urna

ls/o

jsa.

php