Abstract
This paper explores the use of single-shot digital holography data and a novel algorithm, referred to as multiplane iterative reconstruction (MIR), for imaging through distributed-volume aberrations. Such aberrations result in a linear, shift-varying or “anisoplanatic” physical process, where multiple look angles give rise to different point spread functions within the field of view of the imaging system. The MIR algorithm jointly computes the maximum a posteriori estimates of the anisoplanatic phase errors and the speckle-free object reflectance from the single-shot digital holography data. Using both simulations and experiments, we show that the MIR algorithm outperforms the leading multiplane image-sharpening algorithm over a wide range of anisoplanatic conditions.
© 2019 Optical Society of America under the terms of the OSA Open Access Publishing Agreement
1. INTRODUCTION
Digital holography (DH) uses coherent illumination and spatial-heterodyne detection to sense the amplitude and phase of light scattered off an object’s surface [1]. The resulting data are sensitive to phase errors caused by index-of-refraction perturbations in, for example, the atmosphere or optical systems. With this in mind, we can estimate these phase errors directly from the DH data for wavefront sensing purposes or to digitally correct aberrated images [2–6].
Image-sharpening (IS) algorithms can estimate the phase errors from DH data by maximizing an image-sharpness metric. In practice, the sharpened images result from estimates of the complex-valued reflection coefficient, g, which is the ratio of the reflected field to the incident field. For surfaces that are rough relative to the illumination wavelength, this estimation process leads to images with high-spatial-frequency variations known as speckle. Therefore, IS algorithms require incoherent averaging of multiple speckle realizations to accurately estimate the phase errors—a process referred to here as multishot DH [2].
Recently, we developed a model-based iterative reconstruction (MBIR) algorithm for jointly computing the maximum a posteriori (MAP) estimates of the phase errors, ϕ, and the real-valued reflectance, r, from a single data realization—a process referred to here as single-shot DH [5]. We define the reflectance as r = E[|g|²], where E[·] indicates the expected value [7]. In general, r is smoother and has higher spatial correlation than the reflection coefficient, g. Reconstructing r, rather than g, allows us to leverage this higher spatial correlation to better constrain the estimation process and produce more accurate estimates of the phase errors with fewer data and less signal, compared to IS algorithms [5,6].
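The definition above can be checked numerically: for fully developed speckle, the reflectance r is the expected squared magnitude of a circular complex Gaussian reflection coefficient g. The sketch below is illustrative only; the pixel value and draw count are assumptions, not the paper's settings.

```python
import numpy as np

# Monte-Carlo check of r = E[|g|^2] for the circular-Gaussian speckle model.
rng = np.random.default_rng(0)
r_true = 0.7                                   # assumed reflectance of one pixel
n_draws = 200_000                              # independent speckle realizations
g = np.sqrt(r_true / 2) * (rng.standard_normal(n_draws)
                           + 1j * rng.standard_normal(n_draws))
r_est = np.mean(np.abs(g) ** 2)                # sample estimate of E[|g|^2]
```

With this many realizations the sample mean sits within a few tenths of a percent of r, illustrating why incoherent averaging of many speckle realizations recovers the smooth reflectance.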
Both the IS algorithm in Ref. [2] and the MBIR algorithm in Ref. [5] were designed for the estimation and correction of phase errors that are isoplanatic. In general, isoplanatic phase errors (IPEs) result in a shift-invariant point spread function (PSF) in the image domain. We typically model IPEs using a single phase screen located near the entrance pupil of the imaging system [8]. This model accurately represents scenarios where the phase errors are concentrated near the observer, such as the imaging of space-based objects through atmospheric turbulence along mostly vertical propagation paths.
For other scenarios, the phase errors are often distributed along the propagation path, such as the imaging of Earth-based objects through atmospheric turbulence along mostly horizontal propagation paths. In these cases, the phase errors are anisoplanatic, meaning they result in a shift-variant PSF in the image domain. We typically model anisoplanatic phase errors (APEs) using a discrete representation consisting of multiple phase screens distributed along the propagation path [8]. To help quantify APE effects, we use the isoplanatic angle, θ₀ [9]. This atmospheric turbulence parameter, in practice, gives us a gauge for the image-domain angular separation over which the PSF is essentially unchanged. Therefore, points in the image separated by an angle larger than θ₀ have significantly different PSFs due to the APEs.
Importantly, the IS and MBIR algorithms in Refs. [2] and [5] can only correct for a single PSF across the entire image field of view (FOV). Thus, in the presence of APEs, these algorithms cannot obtain diffraction-limited performance. Estimating and correcting APEs using DH data is difficult due to increased model complexity and an increased number of unknowns. Extensions of the basic IS algorithm from [2] have been demonstrated for the correction of APEs [10–12]. In particular, Tippie and Fienup demonstrated an IS algorithm for estimating APEs modeled as multiple phase screens distributed along the propagation path [11]. However, this IS algorithm still requires multishot DH data to accurately estimate the phase errors and form a focused speckle-free image.
In this paper, we propose a novel algorithm for estimating APEs and reconstructing focused speckle-free images from single-shot DH data. Our algorithm, referred to as multiplane iterative reconstruction (MIR), uses a Bayesian framework to jointly estimate the APEs, modeled as multiple phase screens distributed along the propagation path, and the object’s reflectance, r. Using MIR, we compute the joint MAP estimates of each phase screen along with the focused image. We compare the proposed algorithm to the multiplane IS (MIS) algorithm found in Ref. [11] over a range of anisoplanatic conditions using both simulated and experimental data. The results show that the MIR algorithm can more accurately estimate APEs and form focused, speckle-reduced images from single-shot DH data, as compared to the leading MIS algorithm.
In what follows, we first formulate the overall concept for the MIR algorithm and review the prior models used in our Bayesian framework. Next, we formulate the execution of the MIR algorithm and our methods for quantifying performance. Results then follow with discussion and a conclusion.
2. MIR ALGORITHM
Figure 1 shows an example DH system using an off-axis image-plane recording geometry (IPRG) [4]. The system images an object with a reflectance function, r, and a corresponding reflection coefficient, g, through distributed-volume phase errors, ϕ. In this example, atmospheric turbulence causes anisoplanatic effects in the DH data; however, the analysis is equally applicable to more general types of APEs, as we show in the experimental results section. For mathematical simplification, it is common to treat the distributed-volume phase errors, ϕ, as a finite number of discrete layers, ϕ₁, ϕ₂, …, ϕ_K, where each layer is a thin unit-amplitude phase screen that represents the phase perturbations along an incremental volume [8]. After encountering the phase errors, the signal is imaged onto a detector array, where it is mixed with a reference field to form the digital hologram.
Figure 2 provides insight into the conventional image-processing steps used for a DH system with an off-axis IPRG. We first extract the signal, y, from the spectrum of the digital hologram. Due to the discrete Fourier transform (DFT) relationship between the pupil plane and the image/detector plane, y is a complex image of the signal at the pupil plane. Thus, the conventional approach to form an image is to take an inverse fast Fourier transform (FFT) of the data, y. Before moving on in the analysis, the reader should note that we provide additional details about this DH IPRG model in Appendix B.
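The conventional processing chain just described can be sketched numerically: form an off-axis hologram, take its FFT, crop the shifted cross term, and divide out the reference. The grid size, signal bandwidth, and carrier frequency below are illustrative assumptions.

```python
import numpy as np

# Sketch of conventional off-axis DH processing: the tilted reference shifts
# the signal spectrum away from DC, so a simple crop recovers the pupil data.
rng = np.random.default_rng(1)
N, B = 128, 8                                  # grid size, signal half-bandwidth
pupil = np.zeros((N, N), complex)              # band-limited "pupil" spectrum
blk = rng.standard_normal((2 * B, 2 * B)) + 1j * rng.standard_normal((2 * B, 2 * B))
pupil[:2 * B, :2 * B] = blk                    # signal support near DC
sig = np.fft.ifft2(pupil)                      # complex signal image at the FPA
kx = ky = N // 3                               # off-axis carrier (cycles/frame)
yy, xx = np.mgrid[0:N, 0:N]
ref = 4.0 * np.exp(2j * np.pi * (kx * xx + ky * yy) / N)   # tilted reference
holo = np.abs(sig + ref) ** 2                  # recorded real-valued hologram
spec = np.fft.fft2(holo)                       # hologram spectrum
# The sig*conj(ref) cross term carries the pupil spectrum shifted by (-ky,-kx).
lo = (-ky) % N
crop = spec[lo:lo + 2 * B, lo:lo + 2 * B]      # extract grid around cross term
recovered = crop / 4.0                         # divide out reference amplitude
```

Because the signal is band-limited and the carrier is large enough, the cropped region contains only the desired cross term, and `recovered` matches the original pupil-plane signal to machine precision.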
In this work, our goal is to jointly compute the MAP estimates of the reflectance, r, and the phase errors, ϕ = [ϕ₁ᵀ, ϕ₂ᵀ, …, ϕ_Kᵀ]ᵀ, from a single noisy data realization, y. Here, we use vectorized notation for all 2D variables. The joint MAP estimates are given by
(r̂, ϕ̂) = argmin_{r,ϕ} { −log p(y|r, ϕ) − log p(r) − log p(ϕ) },  (1)

where r and ϕ are assumed to be independent. Here, p(·) and p(·|·) represent conventional and conditional probability distributions, respectively. For coherent imaging applications, the log-likelihood function, log p(y|r, ϕ), is not easily evaluated due to its computational complexity [13]. Therefore, we require a simplified approach to solve Eq. (1). In the remaining subsections, we derive the form of log p(y|r, ϕ), develop a simplified framework using the expectation maximization (EM) algorithm, and define our distributions for p(r) and p(ϕ).
A. Log-Likelihood Model for Coherent Imaging

For a DH system using an off-axis IPRG, we model the complex pupil image as

y = A_ϕ g + w,  (2)

a linear transform of the reflection coefficient, g, plus complex-valued measurement noise, w [5]. In Ref. [5], Eq. (2) was derived for a DH system using an off-axis pupil-plane recording geometry. However, as shown in Appendix B, the model also holds for the pupil image, y, obtained from a DH system using an off-axis IPRG. The transform, A_ϕ, accounts for the propagation and measurement geometry and is dependent on ϕ.

For anisoplanatic conditions, the estimation framework of Eq. (1) and the data model of Eq. (2) are more complex than they are for the isoplanatic case treated in Ref. [5]. First, we must account for intermediate propagations between multiple phase screens, which adds complexity to the structure of A_ϕ. With this in mind, we use the split-step beam propagation method (BPM) to model A_ϕ as a series of Fresnel propagations between the planes of interest, as described in Appendix A [8]. Second, we must model and estimate higher-dimensional phase errors: for K phase screens, ϕ contains K times as many samples as a single screen. This makes the joint estimation of r and ϕ significantly more challenging, since the number of unknowns increases relative to the measurements.
To determine the likelihood function, p(y|r, ϕ), we first define distributions for g and w; then we relate these distributions to y using Eq. (2). For a surface with reflectance, r, which is rough relative to the illumination wavelength, we model the vector g as

g | r ∼ CN(0, D(r), 0),  (3)

where D(·) denotes an operator that produces a diagonal matrix from its vector argument and CN(μ, C, P) indicates a multivariate complex normal distribution with mean, μ, covariance matrix, C, and pseudo-covariance matrix, P [5]. Here, P = 0, since the phase of g is uniformly distributed. Next, to model the stochastic nature of coherent detection, we treat the measurement process as being shot-noise limited and use an additive, zero-mean, complex Gaussian model [14]. The distribution for w is therefore given by

w ∼ CN(0, σ_w² I, 0),  (4)

where σ_w² is the noise variance. Finally, using the data model from Eq. (2) and the distributions from Eqs. (3) and (4), the likelihood function for y, given r and ϕ, becomes

p(y|r, ϕ) = (1 / (π^M |C|)) exp{ −y^H C^{−1} y },  with C = A_ϕ D(r) A_ϕ^H + σ_w² I,  (5)

where the superscript H indicates the Hermitian transpose and M is the number of measurements.
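The generative model of Eqs. (2)–(4) can be sketched in a few lines. Here a unitary DFT stands in for the forward operator A_ϕ, and all sizes and noise levels are illustrative assumptions.

```python
import numpy as np

# Sketch of the coherent data model: g ~ CN(0, D(r)), y = A g + w.
rng = np.random.default_rng(2)
N = 32
r = rng.uniform(0.2, 1.0, (N, N))              # assumed reflectance map
g = np.sqrt(r / 2) * (rng.standard_normal((N, N))
                      + 1j * rng.standard_normal((N, N)))   # speckled field
forward = lambda u: np.fft.fft2(u, norm="ortho")            # stand-in for A_phi
sigma_w = 0.05                                 # measurement-noise std. dev.
w = sigma_w * np.sqrt(0.5) * (rng.standard_normal((N, N))
                              + 1j * rng.standard_normal((N, N)))
y = forward(g) + w                             # noisy pupil-plane data
```

Because the stand-in operator is unitary, the average power of y is approximately the mean reflectance plus the noise variance, consistent with the covariance in Eq. (5).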
B. Prior Models

Both the reflectance, r, and the phase errors, ϕ, are modeled using Markov random fields (MRFs) with the form

p(x) = (1/z) exp{ −Σ_{{s,t}∈P} b_{s,t} ρ(x_s − x_t) },  (6)

where x_s − x_t is the difference between neighboring pixel-pair values, z is the partition function, b_{s,t} is the weight between pixel pairs, P is the set of all pair-wise cliques falling within the same neighborhood, and ρ(·) is the potential function [15]. We obtain different models for the reflectance and phase errors by choosing different potential functions.
1. Reflectance

For the reflectance function, r, we used a Q-generalized Gaussian Markov random field (QGGMRF) potential function with the form

ρ(Δ) = ( |Δ|^p / (p σ_r^p) ) · ( |Δ/(T σ_r)|^{q−p} / (1 + |Δ/(T σ_r)|^{q−p}) ),  (7)

where T is a unitless threshold value that controls the transition of the potential function from having the exponent q to having the exponent p, and σ_r controls the variation in r [16]. The parameters of the QGGMRF potential function affect its shape, and therefore the influence neighboring pixels have on one another. In general, the QGGMRF potential function is designed to preserve edges.
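The QGGMRF potential can be sketched directly from its definition. The parameter values below (p, q, T, σ) are illustrative, not the paper's settings; the key behavior is that small differences are penalized roughly quadratically (exponent q) while large differences grow only like |Δ|^p, which preserves edges.

```python
import numpy as np

# QGGMRF potential: ~|d|^q for small differences, ~|d|^p for large ones.
def qggmrf_rho(d, sigma=1.0, p=1.1, q=2.0, T=0.1):
    u = np.abs(d / (T * sigma)) ** (q - p)     # transition term set by T
    return (np.abs(d) ** p / (p * sigma ** p)) * (u / (1.0 + u))
```

The asymptotic exponents can be verified by forming ratios: doubling a large difference scales the potential by about 2^p, while doubling a small difference scales it by about 2^q.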
2. Phase Errors

We assume that the layers of the distributed-volume phase error, ϕ, are statistically independent [8]. Therefore, we can write the distribution for ϕ as the product of the distributions for the individual screens, such that

p(ϕ) = ∏_{k=1}^{K} p(ϕ_k).  (8)

Estimating phase-error functions for multiple planes can significantly increase the number of unknowns that must be estimated relative to the number of measurements. For K screens of N samples each, we must estimate KN unknowns for ϕ, plus N unknowns for r, giving a total of (K + 1)N unknowns. To reduce the number of unknowns, we model the phase errors, ϕ_k, using a low-resolution Gaussian Markov random field (GMRF), denoted as ϕ̃_k [5]. If ϕ_k is subsampled by a factor of m in both dimensions to obtain ϕ̃_k, each screen will have N/m² unknowns. The value of m can be a function of the propagation plane, k. When the same value is used for all planes, this technique reduces the total number of unknowns to N(1 + K/m²).

The Gaussian potential function for ϕ̃_k is given by
ρ(Δ) = Δ² / (2 σ_ϕ²),  (9)

where Δ is the difference between neighboring phase samples and σ_ϕ controls the assumed variation in ϕ̃_k. This potential function enforces a strong influence between neighboring samples, which generates a more smoothly varying output when compared to the QGGMRF model.

While the phase errors are estimated as a low-resolution function, ϕ̃_k, we must up-sample the estimate to the resolution of the phase screens when applying the forward-model operator, A_ϕ. Our interpolation scheme is given by

ϕ_k = P ϕ̃_k,  (10)

where P is an N × (N/m²) nearest-neighbor-interpolation matrix with elements in the set {0, 1}. More complex interpolations can be used; however, nearest neighbor allows for fast computation.
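A nearest-neighbor up-sampling matrix of the kind in Eq. (10) can be built with a Kronecker product; each low-resolution sample is replicated over an m × m block. Sizes below are illustrative assumptions.

```python
import numpy as np

# Nearest-neighbor up-sampling matrix P with elements in {0, 1}.
m, n_lo = 4, 8                                 # upsample factor, low-res grid size
N = m * n_lo                                   # high-res grid size per dimension
E = np.kron(np.eye(n_lo), np.ones((m, 1)))     # 1D replication operator (N x n_lo)
P = np.kron(E, E)                              # 2D operator ((N*N) x (n_lo*n_lo))
phi_lo = np.arange(n_lo * n_lo, dtype=float)   # toy low-resolution phase screen
phi_hi = P @ phi_lo                            # up-sampled phase, length N*N
```

Applying P to a vectorized low-resolution screen is equivalent to `np.kron(screen, np.ones((m, m)))` on the 2D array, which is why this interpolation is so cheap in practice.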
C. MIR Framework

Using Eqs. (5)–(10), we can write the MAP cost function from Eq. (1) as
c(r, ϕ) = log |C(r, ϕ)| + y^H C(r, ϕ)^{−1} y − log p(r) − log p(ϕ),  (11)

where C(r, ϕ) = A_ϕ D(r) A_ϕ^H + σ_w² I. As noted in Ref. [5], the determinant and inverse in Eq. (11) make it difficult to evaluate; however, we may use the EM algorithm to replace Eq. (11) with a more tractable surrogate function.

The EM algorithm represents a special case of majorization minimization using a surrogate function. For any function f(x), we define a surrogate function, Q(x; x′), to be an upper-bounding function, such that f(x) ≤ Q(x; x′) + c, where x′ is the current value of x that determines the functional form of Q, and c is a constant that ensures the two functions are equal at x = x′. Surrogate functions have the property that minimization of Q implies minimization of f [15].
The EM algorithm provides a formal framework for developing a surrogate function [17]. If we choose g to be the unobserved data, our surrogate function becomes
Q(r, ϕ; r′, ϕ′) = −E[ log p(y, g | r, ϕ) | y, r′, ϕ′ ] − log p(r) − log p(ϕ),  (12)

where the conditional expectation is taken over g, given the data, y, the current estimate of the reflectance, r′, and the current estimate of the phase errors, ϕ′. By introducing g as the unobserved data, we create a cost function that is more tractable than Eq. (11) because of the linear relationship between the data, y, and the reflection coefficient, g. Taking the expectation helps overcome the fact that we do not know g. Evaluation of the expectation in Eq. (12), with respect to the random vector, g, constitutes the E-step of the EM algorithm. This step establishes the specific functional form of Q. Next, we conduct the M-step to jointly minimize Q according to

(r̂, ϕ̂) = argmin_{r,ϕ} Q(r, ϕ; r′, ϕ′).  (13)

The EM algorithm consists of iteratively applying the E- and M-steps shown in Eqs. (12) and (13) and updating r′ and ϕ′. For the optimization during the M-step, we use iterative coordinate descent (ICD) to sequentially minimize the entire surrogate cost function with respect to each element, r_s and ϕ̃_{k,i}, where the joint subscripts k, i indicate the ith sample of the kth phase screen [15]. Figure 3 shows the steps of the MIR algorithm. In Appendix D, we provide the update equations for each step. The update for each reflectance element, r_s, has a closed-form solution, and the update for each phase-error element, ϕ̃_{k,i}, requires a 1D line search. Notice that the separable form of Q derived in Appendix C ensures that sequential optimization of r and ϕ is equivalent to joint optimization of these two quantities.

Since Q is a surrogate function for the MAP cost, the alternating steps of the EM algorithm converge in a stable manner to a local minimum of the original MAP cost function given by Eq. (11) [13]. Furthermore, since both the MAP cost function and its surrogate are nonconvex, the algorithm is likely to converge to a local minimum that is not global. Since we cannot directly evaluate the MAP cost function, we cannot easily select among different local minima based on the values of their MAP cost function.
Thus, to ensure we find a good local minimum, in the sense that it produces a focused and useful image, we pay close attention to our initial conditions, and we employ an iterative initialization process first developed in Ref. [13]. For a set number of outer-loop iterations, we allow the basic MIR algorithm to run for a fixed number of inner-loop iterations. Each time the MIR algorithm is called, r is reinitialized, but the previous value of ϕ is used. After the outer loop completes, we again run the basic MIR algorithm. For this final reconstruction run, we stop the algorithm when a metric, ε, falls below a threshold value, ε_T, where ε measures the normalized change in the reflectance estimate between successive iterations.
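A stopping rule of this kind can be sketched as below. The exact norm used by the paper's metric is not reproduced here; this is one common choice (relative ℓ₂ change between successive reflectance estimates), and the threshold is an assumed value.

```python
import numpy as np

# Relative-change stopping criterion for an iterative reconstruction loop.
def rel_change(r_new, r_old):
    return np.linalg.norm(r_new - r_old) / np.linalg.norm(r_old)

eps_T = 1e-3                                   # assumed threshold value
r_old = np.ones((16, 16))                      # previous reflectance estimate
r_new = r_old + 1e-4                           # nearly converged new estimate
eps = rel_change(r_new, r_old)
converged = eps < eps_T                        # stop when the change is small
```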
3. RESULTS
In this section, we demonstrate the utility of our proposed MIR algorithm using both simulated and experimental data. In both cases, we compare the performance of MIR to the MIS algorithm from [11] over a range of anisoplanatic conditions. Additionally, we include reconstructions from the MBIR algorithm [5], which assumes IPEs. For the reader’s convenience, we provide an overview of the MIS algorithm in Appendix E.
A. Simulation
To generate synthetic data, we simulated the scenario shown in Fig. 1 in which atmospheric turbulence causes APEs. We used the split-step BPM MATLAB code from [8] to simulate the propagation of light from the object to the pupil plane. For atmospheric turbulence, we used three Kolmogorov phase screens of equal strength, distributed over the propagation path. We quantified the strength of the distributed-volume phase errors using two parameters, D/r₀ and θ̄₀. The first parameter, D/r₀, is the ratio of the pupil diameter, D, to the spherical-wave coherence length, r₀ [9]. High values of D/r₀ correspond to strong phase errors, and low values correspond to weak phase errors. The second parameter, θ̄₀, is the isoplanatic angle, θ₀, normalized by the diffraction-limited viewing angle, λ/D. As previously noted, θ₀ provides a gauge for the image-domain angular separation over which the PSF is essentially unchanged [9]. Thus, high values of θ̄₀ indicate neighboring points in the image have similar PSFs. As θ̄₀ → 1, points that are just barely resolved will have different PSFs, making reconstruction difficult. After propagating the fields to the pupil plane, we simulated the detection process for a DH system using an off-axis IPRG. The signal-to-noise ratio (SNR) was fixed for all data sets, and in Appendix F, we provide additional details about the simulations.
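A small worked example of the two parameters just defined follows. All numerical values are illustrative assumptions, not the paper's settings.

```python
# Computing D/r0 and the normalized isoplanatic angle theta0/(lambda/D).
lam = 1.064e-6          # wavelength [m] (assumed)
D = 0.01                # pupil diameter [m] (assumed)
r0 = 2.0e-3             # spherical-wave coherence length [m] (assumed)
theta0 = 60e-6          # isoplanatic angle [rad] (assumed)

D_over_r0 = D / r0                   # turbulence strength (higher = stronger)
theta0_bar = theta0 / (lam / D)      # isoplanatic angle over lambda/D
```

Here D/r0 = 5 indicates moderately strong phase errors, and θ̄₀ below 1 means even barely resolved points see noticeably different PSFs.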
To numerically quantify algorithm performance in simulation, we used two metrics: normalized root mean square error (NRMSE) and peak Strehl ratio. NRMSE measures the distortion between the reconstructed image, r̂, and the simulation input, r. It is given by
NRMSE = ||α r̂ − r|| / ||r||,  (16)

where α = argmin_α ||α r̂ − r||² is the least-squares fit for any multiplicative scalar offset between r̂ and r.

The peak Strehl ratio, S_p, measures the quality of our phase-error estimate, ϕ̂. It is important to note that we should not simply compare ϕ̂ to the simulated phase errors, ϕ, directly. The imaging system is underdetermined, and there are infinitely many solutions that will produce the same effects. It is possible to obtain an estimate, where ϕ̂ ≠ ϕ, that can still conjugate the effects of the distributed-volume phase errors and form a focused image. Therefore, to assess the quality of the reconstructed phase errors, we measure how well ϕ̂ conjugates the effects of ϕ during backpropagation. Specifically, we backpropagate the pupil function, with uniform phase, from the pupil plane to the object plane, through the set of residual phase screens given by the difference between ϕ and ϕ̂. Our estimate, ϕ̂, should ideally cancel out the effects of ϕ during backpropagation. We define the peak Strehl ratio as
S_p = max{ |u_bp(ϕ, ϕ̂)|² } / max{ |u_bp(0, 0)|² },  (17)

where u_bp(0, 0) indicates the backpropagation through a vacuum and max{·} indicates that we take the maximum value of the argument. In Eq. (17), the numerator is the maximum intensity in the object plane that occurs when we backpropagate the pupil function, using ϕ̂ to conjugate the effects of ϕ at each partial-propagation plane. The denominator represents the maximum intensity that occurs in the object plane when there are no turbulence effects (i.e., backpropagation through a vacuum). Thus, S_p quantifies how close we are to the diffraction limit, ignoring the effects of tilt. Note that this Strehl-ratio metric only measures the phase-error conjugation for a single propagation angle; however, we observed that other angles have similar degrees of correction. Furthermore, we found this Strehl-ratio metric more useful for assessing the quality of ϕ̂ than image-quality metrics, such as those used in Ref. [11].

Using the methods described above, we generated 10 independent and identically distributed (i.i.d.) data realizations for each of 10 values of θ̄₀ and three values of D/r₀. Each i.i.d. data realization had unique turbulence, speckle, and noise realizations. Figures 4–6 show example reconstructions. The images displayed are those with the median value of S_p from the 10 i.i.d. realizations at each value of θ̄₀ and D/r₀. We also show the original uncompensated image in the left column. Figures 7 and 8 show S_p and NRMSE, averaged over the 10 i.i.d. realizations, as a function of θ̄₀, for each D/r₀. We include the Strehl ratio for the uncompensated phase errors, computed using Eq. (17) with ϕ̂ = 0, and the NRMSE for the original uncompensated image.
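The NRMSE metric above is simple to implement; the scalar α is the closed-form least-squares fit removing any global multiplicative offset so that only shape errors are penalized. The arrays below are illustrative stand-ins.

```python
import numpy as np

# NRMSE with a least-squares scalar fit, insensitive to global scaling.
def nrmse(r_hat, r):
    alpha = np.vdot(r_hat, r).real / np.vdot(r_hat, r_hat).real
    return np.linalg.norm(alpha * r_hat - r) / np.linalg.norm(r)

rng = np.random.default_rng(3)
r = rng.uniform(0.0, 1.0, (32, 32))            # stand-in ground-truth reflectance
score_scaled = nrmse(2.5 * r, r)               # pure scalar offset: near zero
score_noisy = nrmse(r + rng.normal(0, 0.1, r.shape), r)   # distorted estimate
```

A reconstruction that differs from the truth only by a global scale scores essentially zero, while additive distortion yields a strictly positive score.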
The results show that the polynomial-based MIS algorithm is able to correct aberrations in the less challenging conditions, when D/r₀ is low and θ̄₀ is high. For a fixed value of D/r₀, the MIS algorithm’s performance tapers off as θ̄₀ decreases. For fixed values of θ̄₀, MIS performance falls off quickly as D/r₀ increases. Similar to MIS, for a fixed value of D/r₀, the MIR performance tapers off as θ̄₀ decreases. However, the MIR algorithm achieves much higher peak Strehl ratios and lower NRMSE than the MIS algorithm in all scenarios. The significantly lower NRMSE produced by the MIR algorithm can be attributed to both a better estimate of ϕ and a reduction in speckle variation, as seen in Fig. 4. Unlike the MIS algorithm, MIR performance does not fall off as rapidly as D/r₀ increases for fixed values of θ̄₀. Lastly, we see that in most cases the MBIR algorithm, which again assumes IPEs, performs as well as or better than the MIS algorithm in terms of average Strehl ratio and outperforms the MIS algorithm in terms of average NRMSE. When visually compared to the MIR reconstructions, the MBIR reconstructions look similar in the weaker turbulence cases and differ more drastically in the stronger turbulence cases. Upon close inspection of the weaker turbulence cases, we see more irregularities in the MBIR reconstructions than in the MIR reconstructions. This is consistent with the large difference in the average Strehl ratios of the two algorithms, shown in Fig. 7: the MBIR algorithm is simply not able to estimate the APEs as accurately as the MIR algorithm. The difference is not as apparent in the images, since some of the energy that gets spread out by the turbulence is removed during the MBIR regularization process.
To assess algorithm run time, we measured S_p as a function of time for two different cases: in Case 1 the turbulence was relatively weak, and in Case 2 it was strong. In both cases, we used MATLAB on a PC with a 2.2 GHz Intel Xeon processor. Figure 9 plots the results for both the MIS and MIR algorithms, and Fig. 10 shows the intermediate estimates, r̂, as a function of time for Case 1. The plots show that, in general, the MIR algorithm achieves higher values of S_p much faster than the MIS algorithm. Additionally, we see that, for Case 1, the MIR estimate changes very little over the last 90% of the total run time; thus, we can shorten the run time and still obtain similar results. However, as noted in Appendix F, for simplicity we fixed the number of MIR and MBIR outer-loop and inner-loop iterations for all cases. During the last outer loop, we let the algorithms run until the stopping criteria were met. On average, MIS ran 60 iterations in 67 min, MBIR ran 2565 iterations in 11 min, and MIR ran 2674 iterations in 28 min.
B. Experiment
To demonstrate the utility of the MIR algorithm with real data, we also conducted a laboratory experiment, as outlined in Appendix F. A laser was used to illuminate an Air Force resolution chart 2.5 m away. To simulate APEs over the propagation path, we introduced multiple phase screens made from plastic CD jewel cases. We started with three screens, then incrementally added screens at distances of 2.1 m, 2.0 m, and 1.9 m. However, to simplify reconstruction, we estimated only four screens at fixed locations for all data. Figure 11 shows the images reconstructed from the experimental data for three, four, five, and six plastic phase screens. The left column shows the original images reconstructed with no phase-error mitigation. Due to the nonuniformity of the plastic phase screens, we see anisotropic (not to be confused with anisoplanatic) phase-error effects in the images. Here, the multiplane phase errors encountered in the experiment had stronger effects along the x axis than along the y axis. In Fig. 11, we also see that the phase errors were stronger in some areas of the image than in others, making for a more challenging reconstruction problem.
Since the underlying statistics of the plastic phase screens used in the experiment were unknown, we cannot easily quantify performance in terms of atmospheric turbulence parameters such as D/r₀ and θ̄₀. However, we can compare against the simulated results to get a general idea of the experimental conditions. Doing so, we see that the overall turbulence strength falls within the range of our simulated conditions. Furthermore, including the reconstructions from the MBIR algorithm, which assumes IPEs, provides a visual representation of the degree of anisoplanatic effects.
Figure 11 shows that the MBIR algorithm is not able to fully correct the multiplane phase errors encountered in the experiment. Alternatively, the MIS algorithm is able to improve upon the original image in all cases, although the sharpness is reduced as the number of screens increases. Similar to the simulations, the MIR algorithm is able to produce sharper and higher-contrast images with less speckle. Note that while the experimental phase errors were clearly anisoplanatic and anisotropic, the conditions were not as challenging as those produced in the simulation.
4. CONCLUSION
In summary, we have developed a novel algorithm for jointly estimating APEs and an object’s reflectance from single-shot DH data using a Bayesian framework. Our MIR algorithm models the APEs using multiple phase screens distributed along the propagation path and computes the MAP estimates of each phase screen along with the focused reflectance image.
To test our algorithm, we designed a simulation and experiment to produce data representative of a DH system using an off-axis IPRG. We generated data for a range of anisoplanatic conditions and compared the MIR algorithm to the polynomial-based MIS algorithm from [11]. To assess the results, we presented both qualitative and quantitative comparisons.
In all cases, we observed that the polynomial-based MIS algorithm works moderately well for the less severe APEs (e.g., when θ̄₀ is high and D/r₀ is low). However, the performance of the MIS algorithm falls off quickly as the APEs worsen (i.e., as θ̄₀ decreases and D/r₀ increases). Additionally, we observed that the MIR algorithm produces high-quality images and phase-error estimates over a wide range of anisoplanatic conditions. We therefore believe the MIR algorithm provides a robust alternative to the MIS algorithm for estimating APEs and for producing focused, speckle-reduced images from single-shot DH data.
APPENDIX A: PROPAGATION MODEL
In Eq. (2), the linear transform, A_ϕ, represents the propagation of light from the object to the observer through a number of intermediate planes containing phase screens. Using the split-step BPM found in Ref. [8], we model A_ϕ as a series of partial propagations between consecutive planes. This technique uses the angular spectrum method (ASM) to model each partial propagation, followed by the application of a phase screen. In this section, we first review the ASM; then we use the split-step BPM to provide an explicit definition for A_ϕ in Eq. (A9).
1. ASM
To model the vacuum propagation of a monochromatic field, ũ(x, y), over some arbitrary distance, z, we use the ASM [18]. Here, we use tildes to indicate continuous functions. The ASM computes the convolutional form of the Fresnel diffraction integral by multiplying the input field in the spatial-frequency domain with the free-space transfer function,

H̃(f_x, f_y) = exp{ j(2π/λ)z } exp{ −jπλz (f_x² + f_y²) },  (A1)

where λ is the wavelength and f_x and f_y are the spatial-frequency coordinates [18]. Thus, at a distance z away, the diffraction pattern from an input field, ũ₀(x, y), is given by

ũ₁(x, y) = F⁻¹{ F{ ũ₀(x, y) } H̃(f_x, f_y) },  (A2)

where F{·} and F⁻¹{·} indicate the continuous-space Fourier transform (CSFT) operation and its inverse, respectively.

In practice, we leverage the discrete version of Eq. (A2) found in Ref. [8]. Here, we write that discrete model, using vector notation, as

u₁ = F⁻¹ H̄ F u₀,  (A3)
where u₀ is the vectorized input field, u₁ is the vectorized output field, and F is a 2D DFT matrix. The diagonal matrix, H̄, is the matrix form of the free-space transfer function and is given by

H̄ = D( vec{ exp[ −jπλz ((m Δ_f)² + (n Δ_f)²) ] } ),  (A4)

where vec{·} is an operator that forms a 1D vector from a 2D argument, Δ_f is the spatial-frequency sample spacing, N is the number of samples in each dimension of the 2D transfer function, and (m, n) are indices in the spatial-frequency domain. For simplicity, we have dropped any complex scalars and assumed a square sampling grid.

We can only use Eq. (A3) when the input- and output-field sample spacings and grid lengths are the same. Since this is restrictive, we use a modified version of the ASM found in Ref. [8] to control the ratio of the output sample spacing to the input sample spacing, and thus adjust the output grid size. The modified propagation model is given by

u₁ = Q₁ F⁻¹ H̄₀,₁ F Q₀ u₀,  (A5)
where Q₀ and Q₁ are diagonal matrices that apply quadratic phase factors in the input and output planes, m = Δ₁/Δ₀ is the ratio of the sample spacing in each grid, and (p, q) are indices in the spatial domain. We add the subscript 0,1 to indicate that H̄₀,₁ is the free-space transfer function between planes 0 and 1.
2. Split-Step BPM Model

The split-step BPM model combines multiple partial propagations between phase screens using the modified ASM approach described by Eq. (A5). It accounts for differences between the object-plane sample spacing, Δ₀, and the pupil-plane sample spacing, Δ_K. If we convert the split-step BPM operators found in Ref. [8] to vector notation, we can write the field passing through the pupil in the Kth propagation plane as

u_K = A_ϕ g,  (A8)

where

A_ϕ = D(a) Q_K [ F⁻¹ H̄_{K−1,K} F D(e^{jϕ_{K−1}}) ] ⋯ [ F⁻¹ H̄_{1,2} F D(e^{jϕ₁}) ] F⁻¹ H̄₀,₁ F Q₀.  (A9)
Here, Δ_k is the sample spacing in the kth plane, N is the number of samples in each dimension of the 2D functions, (p, q) are indices in the spatial domain, and (m, n) are indices in the spatial-frequency domain. Additionally, in Eq. (A9), ϕ_k is the phase screen in the kth plane, Δz_k is the partial-propagation distance between plane k and plane k + 1, and Z = Σ_k Δz_k is the total propagation distance. We add the subscript k, k + 1 to indicate that H̄_{k,k+1} is the free-space transfer function from plane k to plane k + 1. As previously noted, D(a) is a diagonalization of the pupil-aperture vector, a, and F is a 2D DFT. Note that we have dropped any complex scalars and assumed a square sampling grid for simplicity.

In Eq. (A9), we see that in order to model the propagation using A_ϕ, we must have knowledge of the geometry. Specifically, we must know the total distance, Z, and the sample spacings, Δ₀ and Δ_K. In practice, these values can be determined from knowledge of the object range, sensor FOV, and pupil diameter. The partial-propagation distances can be an arbitrary set, depending on how many screens are to be modeled and their desired locations. Following [8], the intermediate sample spacings, Δ₁ through Δ_{K−1}, are set automatically from the values of Δ₀ and Δ_K.
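The split-step model of this appendix can be sketched as alternating Fresnel transfer-function propagations with thin phase screens. This minimal version assumes uniform sampling (no grid rescaling, i.e., no Q matrices), and all grid and propagation parameters are illustrative.

```python
import numpy as np

# Fresnel transfer function on an N x N grid with sample spacing delta.
def fresnel_tf(N, delta, lam, dz):
    fx = np.fft.fftfreq(N, d=delta)            # spatial-frequency coordinates
    FX, FY = np.meshgrid(fx, fx, indexing="ij")
    return np.exp(-1j * np.pi * lam * dz * (FX ** 2 + FY ** 2))

# One partial propagation per phase screen: propagate, then apply the screen.
def split_step(u, screens, delta, lam, dz):
    for phi in screens:
        u = np.fft.ifft2(np.fft.fft2(u) * fresnel_tf(u.shape[0], delta, lam, dz))
        u = u * np.exp(1j * phi)               # thin, unit-amplitude phase screen
    return u

N, delta, lam, dz = 64, 1e-3, 1e-6, 0.5
rng = np.random.default_rng(4)
u0 = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
screens = [rng.standard_normal((N, N)) for _ in range(3)]
u1 = split_step(u0, screens, delta, lam, dz)
```

Because the transfer function and the screens are unit-modulus, the propagation is energy-conserving, which is a useful sanity check on any split-step implementation.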
APPENDIX B: MEASUREMENT MODEL
In this section, we derive a discrete input–output relationship for a DH system using an off-axis IPRG. We start with the complex signal image, s̃(x, y), a continuous quantity located at the focal-plane array (FPA) of the DH sensor. For the off-axis IPRG, we perform spatial-heterodyne detection by mixing the signal image with a reference field from a local oscillator (LO) that is split off from the master-oscillator laser. For a point-source LO, located at the point (x_r, y_r) in the pupil plane, the reference field at the FPA is given by
where is the amplitude, assumed to be spatially uniform. For brevity, we ignore the quadratic phase factor, , applied to the reference, , by the imaging lens. Since this phase is applied to both the reference and the signal, it cancels out of Eq. (B2). In Eq. (B1), and are the spatial frequencies associated with the modulation from the off-axis reference, and is the focal length of the imaging lens. The resulting hologram intensity is given by where indicates the complex conjugate. Substituting Eq. (B1) into Eq. (B2) gives We model the discrete and noisy digital hologram, measured from the intensity in Eq. (B3), as where and are the sample spacing on the FPA and and are the sample indices in and dimensions. Here, we have simplified the model by ignoring the blurring effects of the finite-sized pixels. Furthermore, we assume that the DH system is shot-noise limited with noise , driven by the power of the reference field [14]. Thus, we model as additive, zero-mean, white Gaussian noise. Substituting Eq. (B3) into Eq. (B4) gives where is the digitized reference amplitude, and and are the modulation frequencies associated with the digitized reference field. Following the example shown in Fig. 2, we extract the signal by first taking a DFT of the digital hologram, . The resulting hologram spectrum is given by
Here, is the 2D DFT operator, is a discrete 2D delta function, denotes a discrete convolution, and are the spatial-frequency shifts corresponding to the modulation from the off-axis reference, , and is the DFT of the complex-signal image, . Due to the Fourier-transform relationship between the image and pupil planes, is a discrete representation of the signal field in the pupil plane [4]. From Eq. (B6), we see that the modulation from the off-axis reference shifts the spectrum of the two cross terms away from the spectrum of the first two terms, which are centered at D.C. in . Thus, we can spatially extract a grid around the term of interest (fourth term) and divide by to get our signal data, such that
where is the extracted and scaled noise. Finally, we can represent Eq. (B7) using vector notation as where , , and are in the set and are obtained by vectorizing , , and , respectively. To obtain our final measurement model, given by Eq. (2), we substitute the discrete version of the propagated field, obtained using Eq. (A8), into Eq. (B8) for .
APPENDIX C: SURROGATE FUNCTION
In this section, we derive the EM surrogate for the MAP cost function. We start by writing Eq. (12) as
where we have used Bayes’ theorem inside the expectation and the fact that . Next, we substitute in the likelihood and prior models from Eqs. (5)–(10). This gives where is a constant with respect to and . In Ref. [5], we showed that the conditional posterior distribution of is complex Gaussian with mean
and covariance Using Eqs. (C3) and (C4), we evaluate the expectations in Eq. (C2) to get the final form of our EM surrogate function. This gives us where is the th element of the posterior mean and is the th diagonal element of the posterior covariance.
APPENDIX D: MIR ALGORITHM
In this section, we present details for executing each step of the MIR algorithm. The discussion describes how we evaluate the surrogate function in the E-step and how we optimize it in the M-step. We also present initialization and stopping procedures.
1. E-Step: Surrogate Function Formation
The E-step forms Eq. (C5) so that it can be minimized during the M-step. Specifically, this consists of computing the posterior mean and covariance according to Eqs. (C3) and (C4). We simplify this process by approximating the posterior covariance as
which requires that . The matrix can be decomposed into several matrices that are all unitary, with the exception of the , which applies the pupil-aperture function. Therefore, when and/or , the approximation of Eq. (D1) is exact. Typically, this is not the case, since is circular, but we approximate it as such to dramatically simplify computations. In practice, we have found this to work well. Alternatively, a square subset of the DH data can be used for which and Eq. (D1) is exact [6].
2. M-Step: Surrogate Function Optimization
During the M-step, we minimize the surrogate function according to Eq. (13). Specifically, we employ iterative coordinate descent (ICD) to sequentially minimize Eq. (C5) with respect to each element, and . Here, the joint subscripts , indicate the th sample of the phase screen [15].
To update each reflectance element, , we ignore all terms in Eq. (C5) that do not contain . The resulting 1D cost function can be minimized with a line search over ; however, a closed-form solution can be obtained using the symmetric bound method of majorization described in Ref. [15]. Using this technique, we replace the QGGMRF potential function from Eq. (7) with a quadratic surrogate function. The resulting cost function is given by
where are the modified weights between neighboring pixel pairs given by [15] To obtain a closed-form solution for , we differentiate the right-hand side of Eq. (D2) with respect to , set it equal to zero, and multiply both sides by . The solution, , can then be found by rooting the polynomial , where Thus, we update each sample using the closed-form solution for the roots of a third-order polynomial. If multiple real roots exist, we choose the one with the lowest cost according to Eq. (D2). While the reflectance can be updated using a closed-form solution, the phase-error update requires a line search. To compactly express the cost function for the update of the phase-error element, , it is helpful to write the forward-model operator as
where represents all the matrices on the left-hand side of in Eq. (A9), and represents all the matrices on the right-hand side of . Using this notation, we can write the cost function for the th phase screen as If we consider just the terms in Eq. (D6) that depend on the element , we obtain the 1D cost function for that element given by where In Eq. (D7), is an index over neighboring low-resolution phase samples, is the set of vector indices of the high-resolution phase, , corresponding to the low-resolution element , and indicates the phase of the complex-valued scalar, . Intuitively, is the difference between the fields on either side of the screen , summed over all high-resolution points that correspond to the single low-resolution point being updated. The cost function of Eq. (D7) is the superposition of a cosine term, which has an infinite number of minima with equal cost, and a quadratic term. The global minimum of the surrogate lies near the minimum of the quadratic term, bracketed on either side by one period of the cosine. Thus, we minimize Eq. (D7) using a 1D line search over , where minimizes the quadratic term in Eq. (D7). After repeating this for all , where is the set of samples in the lower-resolution phase-error grid, we obtain the full-resolution estimate of the unwrapped phase screen, , according to Eq. (10).
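To make the two M-step updates concrete, the sketch below pairs a cubic-rooting reflectance update with a bracketed phase line search. The polynomial coefficients and the 1D cost functions are passed in as placeholders for the expressions derived above; this illustrates the update logic under those assumptions, not the paper's exact implementation.

```python
import numpy as np

def icd_reflectance_update(poly_coeffs, cost):
    """Closed-form ICD update: root a third-order polynomial and, if
    multiple real roots exist, keep the one with the lowest cost.

    poly_coeffs : [c3, c2, c1, c0], coefficients of the derivative
                  polynomial (their derivation is given in the text).
    cost        : callable evaluating the 1D surrogate cost at a root.
    """
    roots = np.roots(poly_coeffs)
    real = [r.real for r in roots if abs(r.imag) < 1e-9]
    return min(real, key=cost) if real else None

def phase_line_search(cost, phi_quad, n_grid=201):
    """1D line search for the cosine-plus-quadratic phase cost.

    phi_quad : minimizer of the quadratic term alone; the global
               minimum lies within one cosine period of this point.
    """
    # search the bracket [phi_quad - pi, phi_quad + pi]
    grid = phi_quad + np.linspace(-np.pi, np.pi, n_grid)
    return grid[np.argmin([cost(p) for p in grid])]
```

The grid-based line search is a simple stand-in; any bracketed 1D minimizer over the same interval would serve.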
3. Initialization and Stopping Criteria
Figure 12 summarizes the steps of the basic MIR algorithm. The parameters and are set automatically, although the regularization of can be adjusted using a unitless parameter, . Note that while we initialize using a simple back-projection of the noisy data, we use an iterative process to initialize , based on the heuristic developed in Refs. [5,13].
Figure 13 details the steps of this iterative initialization process, referred to as the MIR initialization. The initial estimate of the phase-error vector is simply . Then, for a set number of outer-loop iterations, , we allow the basic MIR algorithm to run for iterations. Each time the MIR algorithm is called, is reinitialized, but the previous value of is used. After the outer loop runs times, we again run the basic MIR algorithm. For this final reconstruction run, we stop the algorithm when , in accordance with Eq. (14). It is important to note that while the MIR algorithm has many parameters, we set most of them automatically. In the methods section, we provide settings that we have found to robustly provide good results over a wide range of anisoplanatic conditions.
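The outer initialization loop just described can be sketched as follows; `run_mir` and `init_r` are hypothetical names standing in for the basic MIR algorithm (run for a fixed number of EM iterations) and the back-projection initializer.

```python
def mir_initialization(data, run_mir, init_r, phi0, n_outer, n_em):
    """Iterative MIR initialization (sketch of the Fig. 13 procedure).

    Each outer-loop pass reinitializes the reflectance estimate but
    carries the phase-error estimate forward to the next pass.
    """
    phi = phi0
    for _ in range(n_outer):
        r = init_r(data)                   # reinitialize r every pass
        r, phi = run_mir(data, r, phi, n_em)
    return phi                             # seed for the final MIR run
```

The returned phase estimate then seeds the final MIR reconstruction, which runs until the Eq. (14) stopping criterion is met.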
APPENDIX E: MULTIPLANE IS ALGORITHM
To investigate the performance gain of the MIR algorithm over conventional techniques, we considered two different multiplane IS (MIS) algorithms that differ in their representation of the phase errors, . A 15th-order polynomial is used for each phase screen in Ref. [11], while a point-by-point representation is used in Ref. [12]. Unfortunately, neither of these algorithms was designed for single-shot data, and only the polynomial-based algorithm produced usable results. Even with the bootstrapping initialization approach, we found that the point-by-point MIS algorithm produced poor results by over-sharpening single-shot images to a few points. Conversely, the polynomial-based MIS algorithm has fewer unknowns and is naturally constrained by the 15th-order representation. We found it to produce more useful results for single-shot data. Therefore, we chose the polynomial-based algorithm in Ref. [11] as our reference for comparisons.
The polynomial-based MIS algorithm computes the phase estimate according to
where is a matrix with columns corresponding to the first Zernike polynomials and indicates a matrix direct sum. The number of matrices direct-summed is equal to the number of phase-error planes modeled. In Eq. (E1), the value is an estimate of the polynomial coefficients for all three phase screens found according to Here, indicates the L1 norm, is an image-sharpness parameter, is a scalar that balances the two optimization terms, indicates a dependence of on , and is a vector of weights that applies a high-pass filter to the magnitude squared of the back-projected image spectrum, . The second term in Eq. (E2) is a regularization term that prevents object demagnification [11,12,20]. Following the algorithm in Ref. [11], we use conjugate gradient (CG) to optimize Eq. (E2) in an iterative manner by first estimating only the coefficients corresponding to a third-order polynomial for all the screens, then fourth, and so on, continuing up to the 15th order. We generated by first creating an inverse rectangular function with width (i.e., , where and are the spectral indices in the set ; then we applied a raised-cosine transition from the inner section of zeros to the outer section of ones. Through trial and error, we found to be optimal.
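The construction of the weight vector described above — an inverse rectangle of zeros around D.C. with a raised-cosine transition to ones — might be sketched as follows. The parameter names (`width`, `taper`) and the use of a Chebyshev (square) radius are assumptions for illustration, not details from Ref. [11].

```python
import numpy as np

def highpass_weights(n, width, taper):
    """Build high-pass regularization weights (sketch).

    Zeros inside a centered square of side `width`, ramping to ones
    over a raised-cosine transition of length `taper` samples.
    """
    idx = np.arange(n) - n // 2
    # Chebyshev radius gives a square (rectangular) inner region
    R = np.maximum(np.abs(idx)[:, None], np.abs(idx)[None, :])
    ramp = np.clip((R - width / 2.0) / taper, 0.0, 1.0)
    # raised-cosine transition from inner zeros to outer ones
    return 0.5 * (1.0 - np.cos(np.pi * ramp))
```

The resulting 2D array is zero at D.C. and one in the outer spectral region, so it penalizes only low-frequency content of the back-projected image spectrum.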
APPENDIX F: METHODS
In this section, we provide details about the methods used to produce data. We begin with a full description of the wave-optics simulation used to produce synthetic data and then describe our laboratory setup used to produce experimental data.
1. Simulation Details
Figure 1 of the paper shows the reflectance function, , used as the simulation input. We generated a reflection coefficient according to . Next, we propagated a distance, , through three phase screens, located at [8]. We generated the three phase screens using a Kolmogorov model for the refractive-index power spectral density (PSD) [8].
The strength of each individual screen was parameterized using the plane-wave coherence length, , also known as the Fried parameter [9]. Each of the three screens had equal values of . We quantified the overall strength of the distributed-volume phase errors with the parameter . Note that the spherical-wave coherence length is more appropriate than the plane-wave coherence length for distributed-volume phase errors. The value of specifies the degrees of freedom in the phase-error function at the pupil plane and allows us to quantify the cumulative effects of the distributed-volume phase errors. High values of correspond to strong phase errors and low values correspond to weak phase errors. For our simulations, we controlled by varying the value of for the three screens.
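A common way to generate such screens is the FFT-based method described in Ref. [8]. The sketch below uses the standard Kolmogorov phase PSD, Φ(f) = 0.023 r0^(-5/3) f^(-11/3), with a simplified overall scaling and no subharmonic correction, so it understates low-frequency content; it illustrates the dependence on the Fried parameter r0 rather than reproducing the paper's exact screens.

```python
import numpy as np

def kolmogorov_screen(n, dx, r0, rng=None):
    """FFT-based Kolmogorov phase screen (minimal sketch).

    n : grid size, dx : sample spacing (m), r0 : Fried parameter (m).
    """
    rng = rng or np.random.default_rng()
    df = 1.0 / (n * dx)                        # frequency-grid spacing
    fx = np.fft.fftfreq(n, d=dx)
    FX, FY = np.meshgrid(fx, fx, indexing="ij")
    f = np.sqrt(FX**2 + FY**2)
    f[0, 0] = np.inf                           # suppress undefined D.C. term
    psd = 0.023 * r0 ** (-5.0 / 3.0) * f ** (-11.0 / 3.0)
    # complex Gaussian spectrum shaped by the PSD
    cn = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    cn *= np.sqrt(psd / 2.0) * df
    return np.fft.ifft2(cn).real * n * n
```

Smaller r0 (stronger turbulence) yields a screen with larger phase variance, consistent with the r0^(-5/3) scaling of the PSD.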
We quantified the anisoplanatic conditions with the parameter . For our simulations, we controlled by adjusting . Specifically, we adjusted the grid spacing in the object plane, , and set the corresponding pupil-plane sample grid spacing according to . The intermediate grid spacings, and , were set according to the split-step BPM MATLAB code from [8].
To simulate detection by a DH system using an off-axis IPRG, we padded the propagated field to obtain an array size of . Next, we applied a binary circular pupil-aperture function, , which had a circle of ones, 256 pixels wide, centered in an array of zeros. We then applied a thin-lens phase function to collimate the propagated light, followed by an FFT to form an image in the focal plane. Next, we mixed the image with a reference field given by Eq. (B1) and detected the resultant power. Figure 2(a) shows an example of the resulting digital hologram. The reference-beam power was set at approximately 80% of the well depth per detector element [i.e., 80% of ]. We also modeled Gaussian read noise with a standard deviation of 40 photoelectrons and digitized the output to 12 bits.
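The detection step described above might be sketched as follows, using the 40-photoelectron read noise and 12-bit quantization from the text; the well depth, reference amplitude, and tilt frequencies are placeholder inputs, not values from the paper.

```python
import numpy as np

def simulate_off_axis_hologram(signal_image, ref_amp, f_tilt, well_depth,
                               rng=None, read_noise=40.0, bits=12):
    """Simulate off-axis IPRG detection (sketch).

    signal_image : complex field at the FPA.
    ref_amp      : reference amplitude (spatially uniform).
    f_tilt       : (fx, fy) reference tilt in cycles/sample.
    """
    rng = rng or np.random.default_rng()
    n = signal_image.shape[0]
    x = np.arange(n)
    X, Y = np.meshgrid(x, x, indexing="ij")
    # tilted (off-axis) reference field, as in Eq. (B1)
    ref = ref_amp * np.exp(2j * np.pi * (f_tilt[0] * X + f_tilt[1] * Y))
    intensity = np.abs(signal_image + ref) ** 2
    intensity = intensity + read_noise * rng.standard_normal((n, n))
    # digitize to `bits` bits over the detector well depth
    levels = 2 ** bits - 1
    return np.round(np.clip(intensity, 0.0, well_depth) / well_depth * levels)
```

Taking an FFT of the returned array and cropping around the cross term then recovers the pupil-plane signal, as described in Appendix B.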
After detection, we took an FFT of the digital hologram data and isolated the signal of interest, which was a complex image of the signal in the pupil plane. Figure 2(b) shows the magnitude squared of the digital hologram spectrum. The signal of interest, , is highlighted by a white dotted line and is also shown in Fig. 2(c) after it is removed from the spectrum. The SNR was set to 100 for all data sets, where
and is the sample-variance operator. For simulated reconstructions using the MIR algorithm, we allowed the outer initialization loop to run times, with EM iterations each time. Once the iterative initialization process was complete, we set a stopping criterion of and let the EM algorithm run to completion. During initialization, we used , , , , and , where indicates a Gaussian kernel with standard deviation, . For the final MIR reconstruction, we set , , and . Additionally, for the phase-error prior model parameters, we used , , and set .
2. Laboratory Experiment Details
Figures 14 and 15 describe our experimental setup. For our master oscillator (MO) laser, we used a LightWave nonplanar ring oscillator operating at a wavelength of 1064 nm with 700 mW of continuous-wave power and approximately 1 km of coherence length. After passing through a Faraday isolator and two alignment mirrors, the beam encountered a half-wave plate and polarizing beam-splitting cube that created two optical legs. In the first optical leg, we coupled the beam from the MO laser into a single-mode, polarization-maintaining fiber to create an off-axis LO. This off-axis LO created a quasi-uniform reference beam with the appropriate tilt for digital-holographic detection in the off-axis IPRG. The mean strength of this reference beam was set to approximately 75% of the camera’s pixel-well depth. In the second optical leg, we focused the beam from the MO laser through a pinhole and then used an adjustable collimating lens to flood-illuminate the object after reflection from a steering mirror.
After setting up the imaging system with an object distance of approximately 2.5 m and an image distance of approximately 10 cm, we simulated the effects of anisoplanatic aberrations by placing six different phase screens in front of the imaging lens with a spacing of 10 cm in between. Each experimental phase screen contained two crisscrossed plastic plates (from CD jewel cases) placed back-to-back. For digital-holographic detection, we used a single lens to image the scattered light from the object onto a Guppy PRO f-125 FireWire camera. We then saved the recorded digital hologram in a , 16-bit TIFF file. Note that our recorded digital hologram had an SNR of approximately 10 according to Eq. (F1). Also note that we positioned the object, a chrome-on-glass Air Force resolution chart backed by Labsphere Spectralon, at an angle to minimize specular reflections and ensured that it was uniformly illuminated.
For all experimental reconstructions, MIS, MIR, and MBIR, we chose to digitally estimate four phase screens located at . Note that we observed better results when we included a phase screen at the aperture plane located at . We allowed the outer initialization loop to run times, with EM iterations each time. Once the iterative initialization process was complete, we set a stopping criterion of and let the EM algorithm run to completion. During initialization, we used , , , , and . For the final MIR reconstructions, we set , , , and . Additionally, for the phase-error prior model parameters, we used , and set . For the MBIR reconstructions, we set the parameters equal to the MIR parameters, except that only the phase screen was reconstructed.
Acknowledgment
The authors would like to thank B. T. Plimmer and D. E. Thornton for their assistance in conducting the laboratory experiments and in reviewing this paper.
REFERENCES
1. T.-C. Poon and J.-P. Liu, Introduction to Modern Digital Holography: With MATLAB (Cambridge University, 2014).
2. S. T. Thurman and J. R. Fienup, “Phase-error correction in digital holography,” J. Opt. Soc. Am. A 25, 983–994 (2008). [CrossRef]
3. J. C. Marron, R. L. Kendrick, N. Seldomridge, T. D. Grow, and T. A. Höft, “Atmospheric turbulence correction using digital holographic detection: experimental results,” Opt. Express 17, 11638–11651 (2009). [CrossRef]
4. M. F. Spencer, R. A. Raynor, M. T. Banet, and D. K. Marker, “Deep-turbulence wavefront sensing using digital-holographic detection in the off-axis image plane recording geometry,” Opt. Eng. 56, 031213 (2017). [CrossRef]
5. C. Pellizzari, M. F. Spencer, and C. A. Bouman, “Phase-error estimation and image reconstruction from digital-holography data using a Bayesian framework,” J. Opt. Soc. Am. A 34, 1659–1669 (2017). [CrossRef]
6. C. Pellizzari, M. T. Banet, M. F. Spencer, and C. A. Bouman, “Demonstration of single-shot digital holography using a Bayesian framework,” J. Opt. Soc. Am. A 35, 103–107 (2018). [CrossRef]
7. J. W. Goodman, Speckle Phenomena in Optics: Theory and Applications (Roberts and Company, 2006).
8. J. D. Schmidt, Numerical Simulation of Optical Wave Propagation with Examples in MATLAB (SPIE, 2010).
9. L. Andrews and R. Phillips, Laser Beam Propagation through Random Media, 2nd ed. (SPIE, 2005).
10. S. T. Thurman and J. R. Fienup, “Correction of anisoplanatic phase errors in digital holography,” J. Opt. Soc. Am. A 25, 995–999 (2008). [CrossRef]
11. A. E. Tippie and J. R. Fienup, “Phase-error correction for multiple planes using a sharpness metric,” Opt. Lett. 34, 701–703 (2009). [CrossRef]
12. A. E. Tippie and J. R. Fienup, “Multiple-plane anisoplanatic phase correction in a laboratory digital holography experiment,” Opt. Lett. 35, 3291–3293 (2010). [CrossRef]
13. C. J. Pellizzari, R. Trahan, H. Zhou, S. Williams, S. E. Williams, B. Nemati, M. Shao, and C. A. Bouman, “Synthetic aperture ladar: a model-based approach,” IEEE Trans. Comput. Imaging 3, 901–916 (2017). [CrossRef]
14. V. V. Protopopov, Laser Heterodyning, Vol. 149 of Springer Series in Optical Sciences (Springer, 2009).
15. C. A. Bouman, Model Based Image Processing, 2013, https://engineering.purdue.edu/~bouman/publications/pdf/MBIP-book.pdf.
16. J. B. Thibault, K. Sauer, C. Bouman, and J. Hsieh, “A three-dimensional statistical approach to improved image quality for multi-slice helical CT,” Med. Phys. 34, 4526–4544 (2007). [CrossRef]
17. A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum likelihood from incomplete data via the EM algorithm,” J. R. Stat. Soc. Ser. B 39, 1–38 (1977).
18. J. W. Goodman, Introduction to Fourier Optics, 3rd ed. (Roberts and Company, 2005).
19. C. Forbes, M. Evans, N. Hastings, and B. Peacock, Statistical Distributions (Wiley, 2011).
20. A. E. Tippie, “Aberration correction in digital holography,” Ph.D. dissertation (University of Rochester, 2012).