Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Removal of correlated background in a high-order harmonic transient absorption spectra with principal component regression

Open Access Open Access

Abstract

We demonstrate a 40x mean noise power reduction (NPR) in core-to-valence extreme ultraviolet (XUV) femtosecond transient absorption spectroscopy with a high harmonic generation (HHG) light source. An adaptive iteratively reweighted principal component regression (airPCR) is used to analyze and suppress spectrally correlated HHG intensity fluctuations. The technique requires significantly less user input and leads to a higher mean NPR than a previously introduced edge-pixel PCR method that relies on the manual identification of signal-free spectral regions. Both techniques are applied in a time-resolved XUV absorption study of the 2snp 1Po (n ≥ 2) autoionizing Rydberg states of helium, demonstrating sub-10−3 optical density sensitivity.

© 2021 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Transient absorption spectroscopy (TAS) is a well-established technique for investigating time-dependent dynamics extending from the terahertz energy range [13] to the extreme ultraviolet (XUV) and X-ray energies [46]. During the last decade, TAS with high-order harmonic generated (HHG) probe pulses has emerged as a powerful technique for probing ultrafast electron dynamics in the XUV and soft X-ray regimes in atoms, molecules and solids on femtosecond and attosecond timescales [723]. The table-top nature of HHG light sources has also provided a flexible platform for utilizing tailored driving laser radiation in useful ways [24,25].

Absorbance is, by convention, defined as the logarithmic ratio of two measured intensities, $I^1$ and $I^0$. It is the attenuation (optical density) that arises with/without a sample in linear absorption spectroscopy, or in the case of transient absorption, from a second coupling laser pulse that triggers time-dependent dynamics. Multiple terms contribute to the attenuation in transmitted intensity. Contributions from reflection and scattering processes are typically minor and will be omitted here. The remaining terms are the signal of interest ${\Delta }OD_{\mathrm {signal}}$, a multiplicative noise term $F$ and an additive noise term arising from changes in the probe light source intensity $I_{\mathrm {probe}}$ in the two measurements [26,27]:

$${\Delta}OD ={-}{\log_{10}\frac{I^1}{I^0}} \approx {\Delta}OD_{\mathrm{signal}} {\cdot} F -\log_{10}{\frac{I_{\mathrm{probe}}^{\mathrm{1}}}{I_{\mathrm{probe}}^{\mathrm{0}}}}.$$

In visible/NIR transient absorption spectroscopy, it is common to use a beam splitter to record the probe spectrum before and after interaction with the sample. Knowledge of the probe light intensity allows the additive noise to be minimized and enables the detection of changes in the mean optical density ${\langle {\Delta }OD}\rangle$ as small as $\sim 10^{-6}$ [28]. In the XUV/X-ray energy region, the lack of broadband beam samplers, the limited HHG flux and the high cost of detection systems have largely prevented implementation of this approach. Instead, the probe spectrum is measured with and without the coupling laser pulse in rapid succession, in an effort to reduce any differences between $I_{\mathrm {probe}}^{\mathrm {0}}$ and $I_{\mathrm {probe}}^{\mathrm {1}}$. Averaging over many acquisitions minimizes the additive noise. The relatively low photon flux and stability of HHG sources sets a lower limit to the detectable absorbance change, typically greater than 1 mOD ($10^{-3}$ optical density) [29,30].

The main source of noise in HHG transient absorption experiments comes from the fluctuation of the light source [31], which has been shown to be correlated with both the ion yield [32] and the laser intensity [33]. Recently, Volkov et al. reported that spectral shifts of the high-order harmonics can be ascribed to intensity-induced process, which depends on both the intensity of the laser and the phase-matching in the target [31]. The dependence of the HHG fluctuation on only a few parameters results in highly structured and spectrally correlated noise. Here, we demonstrate that we can exploit the high spectral correlation of the HHG light source to model the noise based on measurements without the coupling laser. The retrieved additive noise can then be subtracted from the transient absorption spectra. Implementations of this correction have been dubbed "edge-pixel referencing" [34], meaning that the noise calibration region of the spectrum must be free from any coupling laser induced signals, so that the difference between the noise calibration and the noise underlying the measured optical density can be minimized. Application to optical TAS [26,34] and XUV attosecond TAS [27] spectroscopy have recently been reported.

In this work, we apply principal component analysis (PCA) based approaches to characterize the spectral correlations within the noise in terms of linearly independent principal components. Two techniques are implemented for subtracting the noise. In the first case, principal component regression (PCR) of the measured signal in the edge-pixel referencing region is applied to retrieve the noise floor. This procedure is similar to the one implemented by R. Géneaux et al. [27] for TAS measurements using an isolated attosecond pulse (IAP). Our results demonstrate that sufficient correlation also exists in the HHG spectrum of an attosecond pulse train (APT) to enable edge-pixel referencing for background reconstruction. In the second case, an adaptive iteratively reweighted PCR (airPCR) algorithm is introduced. In this case, we demonstrate that the background noise floor can be retrieved without making any prior assumption about the edge-pixel referencing region. We demonstrate in both cases sub-mOD sensitivity in a time-resolved study of autoionizing Rydberg states of helium. Moreover, we demonstrate that the airPCR method produces a higher mean noise power reduction compared to the standard edge-pixel referencing approach, reaching a $\sim$40x mean noise power reduction across the entire spectral range, compared to a $\sim$20x reduction outside the edge-pixel referencing region when using the edge-pixel PCR technique.

2. Experiment

The femtosecond XUV transient absorption apparatus has been described in detail elsewhere [10,35]. In brief, the output of a Ti:Sapphire laser producing $13\,$mJ of $804\,$nm radiation at a repetition rate of $1\,$kHz and $35\,$fs in duration is split into two arms by a 70% / 30% beam splitter. In one arm, $4.0\,$mJ of the NIR is focused into a semi-infinite gas cell containing 70 Torr of argon to generate femtosecond XUV pulses. A differential pumping stage placed downstream from the generation point removes the gas load from the XUV propagation path. The NIR driving field is rejected by a $200\,$nm thick aluminum foil. A toroidal mirror focuses the XUV beam into a $4\,$mm long sample cell, continuously filled with helium gas, that is mounted inside an interaction chamber evacuated by a turbomolecular pump. The transmitted light is detected by an XUV spectrometer consisting of a variable line spaced grating ($1200$ lines$/$mm) and an X-ray CCD camera.

To record time-resolved data, $268\,$nm UV pulses (photon energy $E_{\mathrm {UV}} = 4.63\,$eV) with up to $300\,\mu$J of energy are produced by second harmonic generation followed by sum-frequency mixing with the NIR fundamental [35]. The UV energy is controlled with a combination of a half waveplate and a thin film polarizer to attenuate the fundamental NIR. Four high-reflective UV mirrors are used to remove the residual second harmonic and NIR fields. The UV pulses are then focused inside the sample cell with a $500\,$mm focal length lens. In the interaction region, the typical duration of the UV pulses is $100\,$fs. The XUV and UV beams are spatially overlapped inside the sample cell in a quasi-colinear geometry ($\sim 2^{\circ }$ angle) using a drilled-through mirror to transmit the XUV and reflect the UV pulses. For the data presented here, the UV beam is attenuated to energies of approximately $30\,\mu$J per pulse. The delay $t$ between the UV and XUV pulses is controlled with a motorized delay stage, with positive $t$ defined as the UV pulse preceding the XUV pulse. An optical 2-slots 101.6$\,$mm diameter chopper wheel with a $50\%$ duty cycle is placed in the path of the UV beam, whose diameter is $~$6$\,$mm. The rising and falling edges of the chopper transmission trigger CCD exposures every $125\,$ms to alternately record UV-on and UV-off spectra. The trigger delay is calibrated in order to maximize the UV transmission through the chopper during UV-on acquisitions. For each delay $t$ between the two pulses, we acquired $N_s=450$ UV-on and $N_s$ UV-off spectra. $\Delta OD_{\mathrm {raw}}$ is measured for $N_t=30$ values of $t$, corresponding to a total number of UV-off (and UV-on) spectra acquired of $p=N_sN_t=13500$, and a total acquisition time of approximately one hour.

3. Analysis of the Noise

A calibration dataset ${\Delta }OD_{\mathrm {calib}}$ is formed by pairing consecutively acquired UV-off spectra and computing the logarithm of their intensity ratios. All the $p=13500$ UV-off spectra acquired (regardless of the delay $t$), are used to form the calibration dataset. The dimensionality of ${\Delta }OD_{\mathrm {calib}}$ is therefore $n$ rows $\times p'$ columns, whereby $n$ is the number of pixels in the spectrum and $p'=p/2=6750$ is the number of paired observations. Deviations of ${\Delta }OD_{\mathrm {calib}}$ from zero encode changes in the XUV probe light source intensity between two consecutively acquired UV-off spectra throughout the experiment. We extract this information to correct the transient absorption signal ${{\Delta }OD}_{\mathrm {raw}}$ constructed from the consecutively acquired UV-on and UV-off spectra.

An exemplary UV-off XUV spectrum is shown in Fig. 1(a). We characterize the light source fluctuations by calculating the probability density function (PDF) of the absorbance noise ${\Delta }OD_{\mathrm {calib}}$ at each pixel/photon energy. The false color plot in Fig. 1(b) shows the calculated PDF, emphasizing that, while the mean absorbance is centered at zero, there is a considerable variation in the width of the distribution arising from noise. The $2\sigma _s$ standard deviation of $\Delta OD_{\mathrm {calib}}$ as a function of energy is quantified by an overlaid line, ranging from 40 to 150 mOD.

 figure: Fig. 1.

Fig. 1. Analysis of the noise of ${\Delta }OD_{\mathrm {calib}}$, formed by pairing consecutively acquired UV-off spectra. (a) Sample UV-off spectrum, corresponding to a $\sim 125\,$ms long acquisition of the 1$\,$kHz HHG source radiation. (b) False-color plot of the PDF of ${\Delta }OD_{\mathrm {calib}}$ across the XUV spectrum, demonstrating the noise associated with different harmonics. The standard deviation of the noise 2$\sigma _s$ is represented by the black line. (c) Correlation matrix of the noise in the photon energy domain, showing the PCC between any two photon energies. (d) Correlation matrix of the noise by CCD exposure, showing the PCC between any two of the first 1000 observations.

Download Full Size | PDF

We characterize the correlation of the noise by using the Pearson’s correlation coefficient (PCC) metric. The PCC $\rho _{x,y}$ between two variables $x$ and $y$ is defined as the covariance $\sigma _{xy}$ of the two variables divided by the product of their standard deviations $\sigma _{x}\sigma _{y}$ [36]:

$$\rho_{x,y} = \frac{\sigma_{xy}}{\sigma_{x}\sigma_{y}} \quad\quad \sigma_{xy} = \langle(x-\langle x \rangle)(y-\langle y \rangle)\rangle, \quad\quad \sigma_x = \langle x^2\rangle^{1/2}, \quad\quad \sigma_y = \langle y^2\rangle^{1/2}$$
where the brackets $\langle \cdot \rangle$ represent the mean operation. When $x$ and $y$ are the entries in two rows of ${\Delta }OD_{\mathrm {calib}}$, associated with two pixels in the spectrum, the mean $\langle \cdot \rangle$ averages over the observation domain of length $p'$, and the resulting PCC describes the correlation between the noise at any two photon energies. The PCC correlation matrix is shown in Fig. 1(c). When $x$ and $y$ are the entries in two columns of ${\Delta }OD_{\mathrm {calib}}$, corresponding to two observations, the mean $\langle \cdot \rangle$ averages over the pixel domain of length $n$, and the resulting PCC describes the correlation between any two noise observations. The PCC correlation matrix obtained by considering only the first 1000 observations is shown in Fig. 1(d). In Figs. 1(c) and 1(d), the regions of positive and negative correlation are shown in red and blue, respectively, while the uncorrelated regions are in white. Crucially, a highly structured and regular correlation is only seen in the energy domain (Fig. 1(c)), while the observation/measurement correlation matrix shows a less regular structure (Fig. 1(d)), where in any case many off-diagonal elements show non-zero correlation. This analysis demonstrates that a highly correlated energy-dependent noise fluctuation observed at one particular time between two consecutive acquisitions has a chance to reoccur several times during the experiment, at least during the first 1000 observations taken into account in this analysis. Moreover, the correlation matrix in the energy domain shown in Fig. 1(c) demonstrates that different regions of the spectrum are strongly correlated (or anti-correlated) with each other. This suggests that it is possible to predict the noise fluctuations in the transient absorption spectra across the entire spectral range by first calibrating the correlations of the noise in the UV-off measurements and then characterizing the noise within a limited energy region in the transient absorption measurements.

A principal component analysis (PCA) is used to project the energy correlated noise onto a set of linearly independent principal components. PCA is a dimensionality-reduction technique that orthogonally transforms a large set of highly correlated data into a limited set of linearly uncorrelated principal components that accounts for as much variation in the data as possible [37]. The PCA is implemented in Python using the Scikit-learn library [38]. The first $q$ principal components returned by PCA, corresponding to the largest variance of the data, are collected in an $n\times q$ matrix ${\Delta }OD_{\mathrm {PC}}$. The absorbance noise ${\Delta }OD_{\mathrm {calib}}$ can be approximated as:

$${\Delta}OD_{\mathrm{calib}} \sim {\Delta}OD_{\mathrm{PC}} \cdot C'$$
where $C'$ is the $q\times p'$ coefficient matrix representing the projections of each noise terms on the principal component basis set ${\Delta }OD_{\mathrm {PC}}$. The utility of each principal component is that it describes how likely changes in one region of the spectrum affect another region of the spectrum. Moreover, the approximation in Eq. (3) allows to regularize the noise floor, removing uncorrelated sources of noise from the calibration dataset.

4. Methods

We present two methods for calibrating the noise floor based on linear regression of the observed spectrum on the most relevant PCs. In the edge-pixel PCR method, we restrict the fit to the edge-pixel referencing region free of UV-induced signal. In the air-PCR method, every pixel from the data is used, but weighted differently according to the signal observed.

4.1 First method: edge-pixel PCR

Figure 2 illustrates the background subtraction method using PCR on the edge-pixel referencing region. Background ${\Delta }OD$ spectra are estimated by calculating the logarithm of the intensity ratios of paired UV-off XUV spectra. These background ${\Delta }OD$ spectra form the columns of ${\Delta }OD_{\mathrm {calib}}$. Sample columns are shown in Fig. 2(a). A PCA on ${\Delta }\mathrm {OD}_{\mathrm {calib}}$ finds that $96.7\%$ of the variance of the measured $p' = 6750$ background spectra can be described with just 35 principal components, the first 8 of which are illustrated in Fig. 2(b).

 figure: Fig. 2.

Fig. 2. (a) First three columns of ${\Delta }OD_{\mathrm {calib}}$, obtained from two adjacent pairs of UV-off spectra ($p_1$ and $p_2$). (b) First 8 principal components ordered by decreasing relevance from top to bottom (first 8 columns of ${\Delta }OD_{\mathrm {PC}}$). (c) Recovery of additive noise across the entire spectrum (red line), compared to the mean noise (green line) for different fit regions (shaded gray). (d) Average measured spectral change $\langle \Delta OD_{\mathrm {raw}}(t)\rangle$ as a function of the UV-XUV pulse delay $t$. (e) Average spectral change $\langle \Delta OD_{\mathrm {raw}}\rangle$ over all UV-XUV pulse delays (green line) and final fit region selection (shaded gray).

Download Full Size | PDF

A linear least-squares regression fit using the first $q=35$ PCs is performed on the conventional optical density ${\Delta }OD_{\mathrm {raw}}$ (dimensions: ${n}$ pixels $\times$ ${p}$ observations), formed by pairs of UV-on and UV-off spectra. The fit is performed choosing a fitting region (edge-reference pixel region) free of UV-induced signal. Each measured transient absorption spectrum is corrected using

$${\Delta}OD_{\mathrm{ref}} = {\Delta}OD_{\mathrm{raw}} - {\Delta}OD_{\mathrm{fit}} \quad\quad\quad {\Delta}OD_{\mathrm{fit}} = {\Delta}OD_{\mathrm{PC}}\cdot C$$
where ${\Delta }OD_{\mathrm {fit}}$ is the fitted noise, and $C$ is a $q\times p$ matrix that minimizes $|{\Delta }OD_{\mathrm {ref}}|^2$ in the edge-pixel referencing region. The coefficient matrix C is determined by:
$$C = \left(\Delta OD_{PC}^T \cdot W \cdot \Delta OD_{PC}\right)^{{-}1} \cdot \Delta OD_{PC}^T \cdot W \cdot {\Delta}OD_{\mathrm{raw}},$$
where the exponent ${-1}$ indicates the matrix inversion operation, and the weight matrix $W$ is an $n\times n$ diagonal matrix, whose $i$-th element $w_i$ of the main diagonal is equal to $1$ in the edge-referencing pixels region and $0$ otherwise. The $n$-dimensional column vector $\mathbf {w}$ of elements $w_i$ thus defines the regions of the spectrum where the fit is performed.

At a delay $\Delta t = -500\,$fs, negligible signal $\Delta {OD}_{\mathrm {signal}} \approx 0$ provides a benchmark $\Delta {OD}_{\mathrm {raw,test}}$ for testing the procedure. We perform fits within different limited regions of the spectrum. Each panel in Fig. 2(c) corresponds to a different fit region, indicated by a gray background. The fit result $\langle \Delta {OD}_{\mathrm {raw,fit}}\rangle$ (red line) is shown together with the measured noise $\langle \Delta {OD}_{\mathrm {raw,test}}\rangle$ (green line). The virtually constant red fit result for all panels in Fig. 2(c) indicates that a limited reference region is sufficient to predict the fluctuations across the entire spectral range. The performance of the method in reducing the noise can be estimated by computing the mean Noise Power Reduction ($\overline {\mathrm {NPR}}$) outside the edge-pixel referencing region. This corresponds to the weighted average:

$$\overline{\mathrm{NPR}}= \frac{\sum_{i=0}^n (1-w_i) \mathrm{NPR}_i}{\sum_{i=0}^n (1-w_i)} \quad\quad\quad \mathrm{NPR} = \frac{\sigma_{\mathrm{raw}}^2}{\sigma_{\mathrm{ref}}^2} = \frac{\langle\Delta OD_{\mathrm{raw}}^2\rangle}{\langle\Delta OD_{\mathrm{ref}}^2\rangle},$$
where $\mathrm {NPR_i}$ is the Noise Power Reduction obtained for the $i^{\mathrm {th}}$ pixel. The $\overline {\mathrm {NPR}}$ obtained in the test sample for each choice of the referencing region is noted in the figure. This result demonstrates that absorbance fluctuations across the entire harmonic spectrum due to HHG noise can be predicted based on a PCR on a limited spectral range.

Figure 2(d) shows a false color map of the measured mean spectral change $\langle {\Delta }OD_{\mathrm {raw}}(t)\rangle$ as a function of the UV-XUV pulse delay t. The mean spectral change $\langle {\Delta }OD_{\mathrm {raw}}\rangle$ over all UV-XUV pulse delays $t$ is shown in 2(e) (green line). No UV-induced signal is apparent in the energy regions $E < 55$eV and $E > 65.4$eV (shaded gray areas in Fig. 2(e)). This is consistent with the fact that the strongest transient features are expected at energies between 60 and 65.4 eV, corresponding to the $2s{n}p$ $^1$P$^o$ ($n\ge 2$) absorption lines of helium [39]. We used this referencing region for the final fit, for which we obtain an $\overline {\mathrm {NPR}} = 12.5\,$dB (18-fold NPR) using Eq. (6) on the test sample.

We emphasize an advantage of PCR over the correlation matrix method described in Ref. [26], that was already pointed out by R. Géneaux et al. [27]. Restricting the fit to only the first 35 principal components allows, in our implementation, to reproduce only the low frequency modulations of the noise, as shown in Fig. 2(c). Increasing the number of PCs generally leads to a higher NPR, but it introduces PCs with high frequency noise, that are not necessarily correlated between edge and signal regions. Filtering out these high frequency noise components results in a procedure that is less sensitive to weak signals (potentially present) in the edge-referencing pixel region.

4.2 Second method: airPCR

The edge-pixel PCR method requires the identification of a spectral region with no UV-induced signal, which is used as a reference for performing the fit of the noise floor. This procedure requires user intervention and is prone to some degree of variability. As an alternative, we present a different approach for calculating the PCR coefficient matrix $C$ that automatically calculates the weight matrix $W$ in Eq. (5) according to the signal observed without any user input. This technique follows an adaptive iteratively reweighted (air) algorithm similar to the one implemented in the air Penalized Least Squares (airPLS) algorithm by Z-M Zhang et al. [40]. In contrast to this procedure, we present an implementation where an air principal component regression (airPCR) is performed, allowing to take advantage of both the previous knowledge of the correlated noise and the adaptive reweighted approach.

Figure 3 compares the flowcharts of the airPCR algorithm and the edge-pixel PCR method. Both methods share the common first step illustrated in Fig. 3(a). In this step, a PCA is performed on the calibration dataset $\Delta OD_{\mathrm {calib}}$, and the $q$ most relevant PC $\Delta OD_{\mathrm {PC}}$ are selected, as described in Section 3. Then, in the edge-pixel PCR approach, a weighted PCR is performed through Eqs. (4) and 5, by setting the main diagonal entries $w_{i}$ of matrix W to 1 in the edge-referencing pixels region and 0 otherwise, as illustrated in Fig. 3(b). With the airPCR approach instead, the weight vector $\mathbf {w}$ of elements $w_i$ is computed according to the actual signal present in the data, through the iterative procedure illustrated in Fig. 3(c). We choose to use the same number of PC ($q=35$) for both approaches, in order to allow a direct comparison of the two methods.

 figure: Fig. 3.

Fig. 3. Flowcharts of edge-pixel PCR and airPCR methods: (a) PCA of the calibration dataset $\Delta OD_{\mathrm {calib}}$ (same for both methods), (b) edge-pixel PCR flowchart, (c) airPCR flowchart.

Download Full Size | PDF

In the following, we describe the iterative algorithm in more detail (an example Python code is shown in Code 1, Ref. [41]). In the first step $k=0$ of the iteration, the initial weight vector $\mathbf {w}_{k=0}$ is set everywhere to $1$, and the weight matrix $W_{k=0}$ is the identity matrix. Then, at each step of the iteration, the referenced spectra $\Delta OD_{\mathrm {ref}, k}$ are computed through the weighted PCR approach described in Eqs. (4) and 5. $\Delta OD_{\mathrm {ref}, k}$ can be used to estimate the regions of the spectra that contain UV-induced signal variations. The mean $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ is expected to have small magnitude in the region of pixels free from UV-induced signal variations. Its absolute value $\mathbf {d}_k = |\langle \Delta OD_{\mathrm {ref}, k}\rangle |$ is a vector of dimension $n$ that can be used as a metric for retrieving the weight vector $\mathbf {w}_{k+1}$ of the next iteration. This is computed from:

$$\mathbf{w}_{k+1} = \exp\left[-\alpha(k) \frac{\mathbf{d}_{k}}{||\mathbf{d}_{k}||}\right], \quad\quad \alpha(k)= (1+c)^k , \quad\quad c > 0$$
where the square norm ${||\mathbf {d}_{k}||}$ is used as a normalization factor, and $\alpha (k)$ is an increasing monotonic function of $k$. Equation (7) is a variation of Eq. (9) in Ref. [40] adjusted in order to take into account that the absorbance change $\Delta OD$ can be either positive or negative. Moreover, in Ref. [40], $\alpha (k)=k$, while in our implementation $\alpha (k)= (1+c)^k$, where $c$ is a positive constant, set and fixed to 0.2 hereafter. This allows to optimize the computational speed, while maintaining consistent results. The square norm $||\mathbf {d}_{k}||$ is chosen as a normalization factor in order to be consistent with the definition provided in Ref. [40]. Figure 4(a) shows the mean referenced spectra $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ within a limited energy region, obtained for several numbers of iterations $k$. The associated weights are shown in Fig. 4(b). As soon as the iteration proceeds, signals present in the spectrum are progressively assigned with lower weights. For example, for the main transient absorption signal at $60\,$eV, the weight approaches 0 at step $k = 21$ (yellow line). The weight of the weak signal at $57.43\,$eV shown in the inset of Fig. 4(a) approaches 0 at step $k = 41$ (green line). Figure 4(c) shows the square norm ${||\mathbf {d}_{k}||}$ and Fig. 4(d) shows the sum of $\mathbf {w}_k$, ${||\mathbf {w}_k||}_1$, which can be interpreted as the effective number of pixels that are used to fit the noise at step $k$. Its value starts at ${||\mathbf {w}_0||}_1 =2040$ and progressively decreases at each step of the iteration.

 figure: Fig. 4.

Fig. 4. (a-b) Mean referenced signal $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ (a) and airPCR weights $w_k$ (b) as a function of energy and for different values of the iteration index $k$ (see legend in panel a). The inset in (a) shows a y-axis magnification of $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ in a selected energy region. (c) Square norm $||\mathbf {d}_k||$ of the optical density as a function of $k$. (d) Sum of all weights ${||\mathbf {w}_k||}_1$ as a function of $k$. (e) $\overline {\mathrm {NPR}}$ for the test sample (dashed line) and $\overline {\mathrm {NPR}}$ averaged over all UV-XUV delays (solid line) as a function of $k$. (c-e) The iteration index $k=46$, corresponding to the optimum result, is indicated by a vertical dashed line.

Download Full Size | PDF

The performance of the algorithm in reducing the noise as the iteration proceeds is tracked by calculating $\overline {\mathrm {NPR}}$ for each step in the region where signal is detected. This is done through the weighted average in Eq. (6), using the weights $\mathbf {w}_k$ and referenced spectra $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ for each step. The dashed line in Fig. 4(e) shows, as a function of $k$, the $\overline {\mathrm {NPR}}$ obtained for the test dataset $\overline {\mathrm {NPR}}_{\mathrm {test}}$ (recorded at $\Delta t=-500\,$fs). For $k\le 46$, $\overline {\mathrm {NPR}}_{\mathrm {test}}$ is approximately constant, since a high effective number of pixels is used for calibrating the test noise floor (${||\mathbf {w}_0||}_1 > 420$). For $k > 46$, $\overline {\mathrm {NPR}}_{\mathrm {test}}$ starts to drop quickly, which indicates that the performance of the PCR fit on the test sample is reduced. At the same time, for $k > 46$, the square norm $||\mathbf {d}_{k}||$ is approximately constant, which indicates that no significant change of $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ is observed beyond this value. The solid line in Fig. 4(e) represents $\overline {\mathrm {NPR}}_{\mathrm {avg}}$, which results from averaging over all UV-XUV delays. Its value at $k=1$ is lower than the one obtained for the test sample, due to the fact that a high weight is assigned to pixels with signals, which have a higher noise. $\overline {\mathrm {NPR}}_{\mathrm {avg}}$ reaches a maximum at $k = 46$, where its value is comparable to the one obtained for the test sample. Then it quickly drops upon further iteration. In Fig. 5, the raw $\langle \Delta \mathrm {OD}_{\mathrm {raw}}(t)\rangle$ false-color map in Fig. 5(a) is compared with the referenced $\langle \Delta \mathrm {OD}_{\mathrm {ref,k}}(t)\rangle$ false-color maps in Figs. 5(b-f), which are obtained at different values of the iteration index $k$. The corresponding standard deviations of the optical densities, averaged over all the UV-XUV delays, are displayed in the bottom panels. At $k=1$, the standard deviation is drastically reduced compared to the raw dataset. However, artifacts appear in the spectrum (e.g. around 57 eV), due to the fact that pixels with non-zero transient signal are still assigned a high weight. As soon as the iteration proceeds, the artifacts are progressively reduced and almost disappear for $k\ge 41$. On the other hand, the standard deviation stays approximately constant for $k\le 41$, while it starts to increase for larger values of $k$, which is consistent with the decrease of $\overline {\mathrm {NPR}}_{\mathrm {avg}}$ beyond $k=46$ seen in Fig. 4(e). The increase of noise for high values of $k$ is also apparent in Fig. 5(f). Artifacts can be identified in the same way as discussed by R. Géneaux et al. [27], by comparing the result of the airPCR algorithm with the referenced spectra obtained with the edge-pixel PCR approach, which are presented in the next section. Another way to identify the presence of artifacts in the referenced airPCR spectra is to track the behavior of $||\mathbf {d}_{k}||$ as the iteration proceeds. The steep change of $||\mathbf {d}_{k}||$ during the first $\approx 40$ steps is due to the progressive reduction of artifacts in the referenced spectra. When all the pixels associated with signal are assigned with low weight, $||\mathbf {d}_{k}||$ reaches a plateau, and artifacts are not visible anymore. Further iteration can only increase the noise but cannot introduce any new artifacts, due to the choice of the weight function, that progressively assigns lower weights to the region of spectra with signal.

 figure: Fig. 5.

Fig. 5. Top panels: raw $\langle \Delta OD_{\mathrm {raw}}(t)\rangle$ (a) and referenced $\langle \Delta OD_{\mathrm {ref}, k}(t)\rangle$ for different values of $k$ (b-f). Bottom panels: standard deviation of optical density averaged over all UV-XUV delays, for raw (a) and referenced spectra obtained for different numbers of airPCR iterations $k$ (b-f).

Download Full Size | PDF

Identifying the optimum result of the iterative procedure is achieved by balancing two requirements. On one hand, ${||\mathbf {w}_k||}_1$ is required to be small, so that even small signals like the one represented in the inset of Fig. 4(a) are assigned a small weight, and the fit of the noise floor is not affected by them. On the other hand, the NPR of the test sample is required to be high. This means that the effective number of pixels ${||\mathbf {w}_k||}_1$ must not decrease below a certain threshold, beyond which the PCR performance is reduced. These two requirements are satisfied best at $k = 46$, just before both $\overline {\mathrm {NPR}}_{\mathrm {avg}}$ and $\overline {\mathrm {NPR}}_{\mathrm {test}}$ start to decrease exponentially. At this $k$, $\overline {\mathrm {NPR}}_{\mathrm {test}} = 16.2$ dB and $\overline {\mathrm {NPR}}_{\mathrm {avg}} = 15.9$ dB. This value is indicated by a vertical dashed line in Figs. 4(c-e). We note that the number of iterations (and, thus, the total computation time) needed to reach the optimal value reduces as the constant $c$ in Eq. (4) is increased. At the same time, this constant has to be kept small enough to allow a sufficiently fine sampling of the parameters shown in Figs. 5(c-e), so that the optimum result can be correctly identified. The value $c = 0.2$ was chosen as a compromise between these two requirements, but we have found that, for our implementation, the effectiveness of the algorithm in reducing the noise is still preserved for any value $c \le 3$.

5. Results

Raw time-resolved transient absorption spectra of helium are shown as a false color map in Fig. 6(a). An UV-induced change in the absorption profile of the autoionizing Rydberg $2s{n}p$ $^1$P$^o$ ($n\ge 2$) states can be observed in the range of photon energies above $58\,$eV. These transient phenomena are induced by the coupling of two-electron XUV-excited states by the UV laser pulses, as previously observed by C. Ott et al. with a visible coupling pulse [14]. The $2s2p$, $2s3p$ and $2s4p$ resonances are labelled. A substantial improvement of signal-to-noise is obtained with both the edge-pixel PCR and airPCR approaches, as demonstrated by the corrected transient absorption false-color maps in Fig. 6(b) and Fig. 6(c), respectively. In Fig. 6(d), the raw mean $\langle \Delta OD_{\mathrm {raw,test}}\rangle$ in red of the test dataset (recorded at $\Delta t=-500\,$fs) is overlaid with the fit $\langle \Delta OD_{\mathrm {fit,test}}\rangle$ obtained with the edge-pixel PCR and airPCR methods, represented by green and blue lines, respectively. $\langle \Delta OD_{\mathrm {fit,test}}\rangle$ describes the baseline associated with the driving laser additive noise floor well in both cases, and a notable improvement is observed for the airPCR method. The regions of high frequency (ringing) noise around 59, 62, and 65$\,$eV correspond to minima in the HHG flux (a sample HHG spectrum is shown in Fig. 6(e) for reference); other sources of noise such as the dark-current noise of the detector are expected to dominate here.

 figure: Fig. 6.

Fig. 6. (a-c) False-color maps of the raw (a), edge-pixel PCR referenced (b) and airPCR referenced (c) transient absorption spectra between $57-67\,$eV. (d) Comparison of the mean noise $\langle \Delta OD_{\mathrm {raw,test}}\rangle$ (red line) of the test sample and fit $\langle \Delta OD_{\mathrm {raw,fit}}\rangle$ obtained with the edge-pixel PCR method (green line) and the airPCR method (blue line). (e) Sample XUV spectrum for reference. (f-i) Statistical analysis of the noise before (red) and after referencing with the edge-pixel PCR method (green) and the airPCR method (blue): (f-g) single pixel PDF before and after referencing for the test sample (h) standard deviations before and after referencing, averaged over all UV-XUV delays; (i) NPR averaged over all UV-XUV delays.

Download Full Size | PDF

The reduction in noise power through the spectrum can be estimated by performing a statistical analysis of the measured $\Delta OD_{\mathrm {raw}}$ and referenced $\Delta OD_{\mathrm {ref}}$. Figure 6(f-g) show a comparison of the PDFs of $\Delta OD_{\mathrm {raw,test}}$ and $\Delta OD_{\mathrm {ref,test}}$ for a single detector pixel corresponding to an energy $E = 58\,$eV. After referencing, the edge-pixel PCR returns a standard deviation reduced from $35\,$mOD to $6.6\,$mOD (Fig. 6(f)). With the airPCR method (Fig. 6(g)), the standard deviation reduces to $3.1\,$mOD. The standard deviations averaged over all the UV-XUV delays of the measured and referenced datasets are reported in Fig. 6(h) for all detector pixels. Figure 6(i) displays the NPR for the two methods, averaged over all the UV-XUV delays. For both the edge-pixel PCR and the airPCR approaches, the NPR inside the referencing region (shaded gray area) reaches a maximum of $\sim$23 dB (300-fold NPR). Outside of this region, it varies between a minimum of $\sim$2 dB (1.6-fold NPR) and a maximum of about $\sim$17.7 dB (60-fold NPR) for the edge-pixel PCR method. The airPCR approach leads to a stronger noise power reduction, reaching a maximum of $\sim$ 22 dB (160-fold NPR) at 57.8 eV. Thanks to the weighted approach, pixels with no signal in this region are taken into account, in addition to the pixels in the edge region, leading to a better or equally good performance throughout the entire region compared to the edge-pixel PCR approach. The results in Fig. 6(i) show that, regardless of the method used, the noise reduction at different pixels varies dramatically, as the peaks and troughs of the HHG spectrum have different stability and signal intensity. With the edge-pixel PCR approach, the mean noise reduction across all pixels on the detector is $\sim$14.8 dB (30-fold NPR), while outside the referencing region it is $\sim$12.2 dB (16.8-fold NPR). The airPCR method enables a higher mean noise reduction, equal to $\sim$16.2 dB (41.3-fold NPR) across all pixels on the detector and $\sim$16.5 dB (44.8-fold NPR) outside the edge-pixel referencing region.

Figures 7(a), 7(b) and 7(c) show false-color maps of the raw, edge-pixel PCR and airPCR referenced time-resolved transient absorption spectra, respectively, between $54-58.5\,$eV. They highlight weak transient features and the dramatic improvement in signal-to-noise after referencing, especially for the airPCR referenced spectra. Spectra at two UV-XUV delays before correction, after edge-pixels PCR referencing and after airPCR referencing are highlighted in Figs. 7(d), 7(e) and 7(f), respectively. Before referencing, almost no signal is distinguishable from the $\sim$2$\,$mOD noise. After the noise subtraction, however, features around $E_1 = 55.2\,$eV and $E_2 = 57.43\,$eV are revealed, with maximum absorbances of $2\,$ and $3\,$mOD, respectively. The feature at $E_1$ is attributed to coupling to the 2s4p resonance by the three photon process $E_1 + 2E_{\mathrm {UV}} = E_{2s4p} = 64.47$eV, while the feature at $E_2$ is attributed to the two photon absorption process $E_2 + E_{\mathrm {UV}} = E_{2p^2}$ that accesses the dark $2p^2$ state at an energy E$_{2p^2}$ = 62.06$\,$eV. At these two energies, the average NPR obtained with the edge-pixel PCR approach is $15.5\,$dB (35.5-fold NPR) and $15.3\,$dB (33.9-fold NPR). With the airPCR approach, the NPR for these two energies increases to $20.6\,$dB (115-fold NPR) and $17.4\,$dB (55-fold NPR).

 figure: Fig. 7.

Fig. 7. (a-c) Raw (a), edge-pixel PCR referenced (b) and airPCR referenced transient absorption spectra between $54-58.5\,$eV. (d-f) Spectra at $t = -200\,$fs and $t = 20\,$fs: raw (d), edge-pixel PCR referenced (e) and airPCR referenced (f). (g-j) Temporal variations of absorption features $E_1 = 55.17\,$eV (g,i) and $E_2 = 57.43\,$eV (h,j). The transients are represented before background subtraction (semi-transparent red lines), after edge-pixel PCR referencing (solid green lines), and after airPCR referencing (solid blue lines).

Download Full Size | PDF

Single-pixel lineouts of the time-dependent behavior of the absorption features at $E_1$ and $E_2$ after edge-pixels PCR referencing are shown in Figs. 7(g) and 7(h), respectively (green lines), while the airPCR referenced transients at these two energies are shown in Figs. 7(i) and 7(j) (blue lines). In both cases, transients before background subtraction are reported for comparison (semi-transparent red lines). Substantial oscillations in ${\Delta }OD$ are suppressed after the subtraction and clear transient features emerge. In particular at $E_1$, the airPCR approach provides a better suppression of these oscillations compared with the edge-pixel PCR approach. The shaded red, green and blue areas represent $68\%$ confidence intervals before noise subtraction, after edge-pixels PCR, and after airPCR referencing, respectively, calculated from the estimated standard deviation of the mean $\sigma _{\mathrm {mean}} = \sigma _{\mathrm {ref}} \cdot {N_s}^{1/2}$ at each delay. Before correction, the average $\sigma _{\mathrm {mean}}$ for different UV-XUV delays is $1.96\,$mOD at $E_2$ and $1.62\,$mOD at $E_1$. After the edge-pixel PCR noise subtraction, the average $\sigma _{\mathrm {mean}}$ for different UV-XUV delays is reduced to $\sim$280$\mu$OD at $E_2$ and $\sim$340$\mu$OD at $E_1$. With the airPCR approach, a further reduction of $\sigma _{\mathrm {mean}}$ is observed, reaching $\sim$220$\mu$OD at $E_2$ and just $\sim$185$\mu$OD at $E_1$.

6. Conclusions

We present two methods to minimize the noise associated with fluctuations of the HHG light source in ultrafast XUV transient absorption spectra. The methods rely on the high spectral correlation of the noise and enable the recovery of UV-induced changes in XUV-spectra with sub-mOD sensitivity.

The results obtained with the edge-pixel principal component regression (edge-pixel PCR) method complement previous attosecond transient absorption spectroscopy (ATAS) measurements using a few-cycle laser to generate isolated attosecond pulses (IAP) [27] and demonstrate that sufficient correlation also exists in the HHG spectrum of an attosecond pulse train (APT) to enable edge-pixel referencing. In this case, we demonstrate that, in contrast to an IAP, the noise reduction for an APT varies dramatically through the entire spectral range.

An adaptive iteratively reweighted principal component regression (airPCR) algorithm is presented, which allows to calibrate the noise floor without any prior assumption on the position and extent of the referencing region. With this method, a 40x mean power reduction of the noise is demonstrated across the entire spectral range. The extension of this iterative retrieval procedure to other experiments that exploit broadband highly spectrally correlated sources, such as an IAP, is straightforward.

Funding

Chemical Sciences, Geosciences, and Biosciences Division (DE-AC02-05CH11231); Horizon 2020 Framework Programme (734690); Ministero dell’Istruzione, dell’Università e della Ricerca (PRIN 2017RKWTMY).

Acknowledgments

This work was supported by the Basic Energy Sciences, Office of Science, U.S. Department of Energy, Chemical Sciences, Geosciences, and Biosciences Division (DE-AC02-05CH11231). This project has received funding from the European Union’s Horizon 2020 research and innovation programme (SONAR) under the Marie Skłodowska-Curie grant agreement No. 734690. This study was supported by the Italian Ministry of Research and Education with the project PRIN aSTAR (2017RKWTMY).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request. An example code of the airPCR algorithm is available in Code 1, Ref. [41].

References

1. L. R. Vanderhoef, A. K. Azad, C. C. Bomberger, D. R. Chowdhury, D. B. Chase, A. J. Taylor, J. M. Zide, and M. F. Doty, “Charge carrier relaxation processes in tbas nanoinclusions in GaAs measured by optical-pump THz-probe transient absorption spectroscopy,” Phys. Rev. B 89(4), 045418 (2014). [CrossRef]  

2. F. H. Su, F. Blanchard, G. Sharma, L. Razzari, A. Ayesheshim, T. L. Cocker, L. V. Titova, T. Ozaki, J.-C. Kieffer, R. Morandotti, M. Reid, and F. A. Hegmann, “Terahertz pulse induced intervalley scattering in photoexcited GaAs,” Opt. Express 17(12), 9620–9629 (2009). [CrossRef]  

3. M. Schall and P. U. Jepsen, “Photoexcited GaAs surfaces studied by transient terahertz time-domain spectroscopy,” Opt. Lett. 25(1), 13–15 (2000). [CrossRef]  

4. M. B. Gaarde, C. Buth, J. L. Tate, and K. J. Schafer, “Transient absorption and reshaping of ultrafast XUV light by laser-dressed helium,” Phys. Rev. A 83(1), 013419 (2011). [CrossRef]  

5. A. R. Beck, D. M. Neumark, and S. R. Leone, “Probing ultrafast dynamics with attosecond transient absorption,” Chem. Phys. Lett. 624, 119–130 (2015). [CrossRef]  

6. L. X. Chen, X. Zhang, E. C. Wasinger, K. Attenkofer, G. Jennings, A. Z. Muresan, and J. S. Lindsey, “Tracking electrons and atoms in a photoexcited metalloporphyrin by X-ray transient absorption spectroscopy,” J. Am. Chem. Soc. 129(31), 9616–9618 (2007). [CrossRef]  

7. E. Goulielmakis, Z.-H. Loh, A. Wirth, R. Santra, N. Rohringer, V. S. Yakovlev, S. Zherebtsov, T. Pfeifer, A. M. Azzeer, M. F. Kling, S. R. Leone, and F. Krausz, “Real-time observation of valence electron motion,” Nature 466(7307), 739–743 (2010). [CrossRef]  

8. H. Wang, M. Chini, S. Chen, C.-H. Zhang, F. He, Y. Cheng, Y. Wu, U. Thumm, and Z. Chang, “Attosecond time-resolved autoionization of argon,” Phys. Rev. Lett. 105(14), 143002 (2010). [CrossRef]  

9. M. Holler, F. Schapper, L. Gallmann, and U. Keller, “Attosecond electron wave-packet interference observed by transient absorption,” Phys. Rev. Lett. 106(12), 123601 (2011). [CrossRef]  

10. M.-F. Lin, A. N. Pfeiffer, D. M. Neumark, S. R. Leone, and O. Gessner, “Strong-field induced XUV transmission and multiplet splitting in 4d(-1)6p core-excited Xe studied by femtosecond XUV transient absorption spectroscopy,” J. Chem. Phys. 137(24), 244305 (2012). [CrossRef]  

11. S. Chen, M. J. Bell, A. R. Beck, H. Mashiko, M. Wu, A. N. Pfeiffer, M. B. Gaarde, D. M. Neumark, S. R. Leone, and K. J. Schafer, “Light-induced states in attosecond transient absorption spectra of laser-dressed helium,” Phys. Rev. A 86(6), 063408 (2012). [CrossRef]  

12. M. Chini, B. Zhao, H. Wang, Y. Cheng, S. Hu, and Z. Chang, “Subcycle ac stark shift of helium excited states probed with isolated attosecond pulses,” Phys. Rev. Lett. 109(7), 073601 (2012). [CrossRef]  

13. M. Schultze, E. M. Bothschafter, A. Sommer, S. Holzner, W. Schweinberger, M. Fiess, M. Hofstetter, R. Kienberger, V. Apalkov, V. S. Yakovlev, M. I. Stockman, and F. Krausz, “Controlling dielectrics with the electric field of light,” Nature 493(7430), 75–78 (2013). [CrossRef]  

14. C. Ott, A. Kaldun, L. Argenti, P. Raith, K. Meyer, M. Laux, Y. Zhang, A. Blättermann, S. Hagstotz, T. Ding, R. Heck, J. Madro nero, F. Martín, and T. Pfeifer, “Reconstruction and control of a time-dependent two-electron wave packet,” Nature 516(7531), 374–378 (2014). [CrossRef]  

15. C.-T. Liao, A. Sandhu, S. Camp, K. J. Schafer, and M. B. Gaarde, “Beyond the single-atom response in absorption line shapes: probing a dense, laser-dressed helium gas with attosecond pulse trains,” Phys. Rev. Lett. 114(14), 143002 (2015). [CrossRef]  

16. A. S. Chatterley, F. Lackner, D. M. Neumark, S. R. Leone, and O. Gessner, “Tracking dissociation dynamics of strong-field ionized 1, 2-dibromoethane with femtosecond XUV transient absorption spectroscopy,” Phys. Chem. Chem. Phys. 18(21), 14644–14653 (2016). [CrossRef]  

17. S. P. Neville, M. Chergui, A. Stolow, and M. S. Schuurman, “Ultrafast x-ray spectroscopy of conical intersections,” Phys. Rev. Lett. 120(24), 243001 (2018). [CrossRef]  

18. P. Peng, C. Marceau, M. Hervé, P. Corkum, A. Y. Naumov, and D. Villeneuve, “Symmetry of molecular rydberg states revealed by XUV transient absorption spectroscopy,” Nat. Commun. 10(1), 5269 (2019). [CrossRef]  

19. C.-T. Liao, X. Li, D. J. Haxton, T. N. Rescigno, R. R. Lucchese, C. W. McCurdy, and A. Sandhu, “Probing autoionizing states of molecular oxygen with XUV transient absorption: Electronic-symmetry-dependent line shapes and laser-induced modifications,” Phys. Rev. A 95(4), 043427 (2017). [CrossRef]  

20. S. Bengtsson, E. W. Larsen, D. Kroon, S. Camp, M. Miranda, C. Arnold, A. L’Huillier, K. Schafer, M. Gaarde, L. Rippe, and J. Mauritsson, “Space–time control of free induction decay in the extreme ultraviolet,” Nat. Photonics 11(4), 252–258 (2017). [CrossRef]  

21. L. Drescher, G. Reitsma, T. Witting, S. Patchkovskii, J. Mikosch, and M. Vrakking, “State-resolved probing of attosecond timescale molecular dipoles,” J. Phys. Chem. Lett. 10(2), 265–269 (2019). [CrossRef]  

22. Y. Kobayashi, K. F. Chang, T. Zeng, D. M. Neumark, and S. R. Leone, “Direct mapping of curve-crossing dynamics in IBr by attosecond transient absorption spectroscopy,” Science 365(6448), 79–83 (2019). [CrossRef]  

23. H. Timmers, X. Zhu, Z. Li, Y. Kobayashi, M. Sabbar, M. Hollstein, M. Reduzzi, T. J. Martínez, D. M. Neumark, and S. R. Leone, “Disentangling conical intersection and coherent molecular dynamics in methyl bromide with attosecond transient absorption spectroscopy,” Nat. Commun. 10(1), 3133 (2019). [CrossRef]  

24. A. Ferré, C. Handschin, M. Dumergue, F. Burgy, A. Comby, D. Descamps, B. Fabre, G. Garcia, R. Géneaux, L. Merceron, E. Mével, L. Nahon, S. Petit, B. Pons, D. Staedter, S. Weber, T. Ruchon, V. Blanchet, and Y. Mairesse, “A table-top ultrashort light source in the extreme ultraviolet for circular dichroism experiments,” Nat. Photonics 9(2), 93–98 (2015). [CrossRef]  

25. D. D. Hickstein, F. J. Dollar, P. Grychtol, J. L. Ellis, R. Knut, C. Hernández-García, D. Zusin, C. Gentry, J. M. Shaw, T. Fan, K. M. Dorney, A. Becker, A. Jaroń-Becker, H. C. Kapteyn, M. M. Murnane, and C. G. Durfee, “Non-collinear generation of angularly isolated circularly polarized high harmonics,” Nat. Photonics 9(11), 743–750 (2015). [CrossRef]  

26. Y. Feng, I. Vinogradov, and N.-H. Ge, “General noise suppression scheme with reference detection in heterodyne nonlinear spectroscopy,” Opt. Express 25(21), 26262–26279 (2017). [CrossRef]  

27. R. Géneaux, H.-T. Chang, A. M. Schwartzberg, and H. J. Marroux, “Source noise suppression in attosecond transient absorption spectroscopy by edge-pixel referencing,” Opt. Express 29(2), 951–960 (2021). [CrossRef]  

28. C. Schriever, S. Lochbrunner, E. Riedle, and D. Nesbitt, “Ultrasensitive ultraviolet-visible 20 fs absorption spectroscopy of low vapor pressure molecules in the gas phase,” Rev. Sci. Instrum. 79(1), 013107 (2008). [CrossRef]  

29. M. Zürch, H.-T. Chang, L. J. Borja, P. M. Kraus, S. K. Cushing, A. Gandman, C. J. Kaplan, M. H. Oh, J. S. Prell, D. Prendergast, C. D. Pemmaraju, D. M. Neumark, and S. R. Leone, “Direct and simultaneous observation of ultrafast electron and hole dynamics in germanium,” Nat. Commun. 8(1), 15734 (2017). [CrossRef]  

30. F. Schlaepfer, M. Lucchini, S. A. Sato, M. Volkov, L. Kasmi, N. Hartmann, A. Rubio, L. Gallmann, and U. Keller, “Attosecond optical-field-enhanced carrier injection into the GaAs conduction band,” Nat. Phys. 14(6), 560–564 (2018). [CrossRef]  

31. M. Volkov, J. Pupeikis, C. R. Phillips, F. Schlaepfer, L. Gallmann, and U. Keller, “Reduction of laser-intensity-correlated noise in high-harmonic generation,” Opt. Express 27(6), 7886–7895 (2019). [CrossRef]  

32. B. D. Bruner, M. Krüger, O. Pedatzur, G. Orenstein, D. Azoury, and N. Dudovich, “Robust enhancement of high harmonic generation via attosecond control of ionization,” Opt. Express 26(7), 9310–9322 (2018). [CrossRef]  

33. N. Tsatrafyllis, I. Kominis, I. Gonoskov, and P. Tzallas, “High-order harmonics measured by the photon statistics of the infrared driving-field exiting the atomic medium,” Nat. Commun. 8(1), 15170 (2017). [CrossRef]  

34. K. C. Robben and C. M. Cheatum, “Edge-pixel referencing suppresses correlated baseline noise in heterodyned spectroscopies,” J. Chem. Phys. 152(9), 094201 (2020). [CrossRef]  

35. B. W. Toulson, M. Borgwardt, H. Wang, F. Lackner, A. S. Chatterley, C. Pemmaraju, D. M. Neumark, S. R. Leone, D. Prendergast, and O. Gessner, “Probing ultrafast C–Br bond fission in the uv photochemistry of bromoform with core-to-valence transient absorption spectroscopy,” Struct. Dyn. 6(5), 054304 (2019). [CrossRef]  

36. O. Dunn and V. Clark, Applied statistics: Analysis of variance and regression (John Wiley & Sons Inc, 1974).

37. S. Wold, K. Esbensen, and P. Geladi, “Principal component analysis,” Chemom. Intell. Lab. Syst. 2(1-3), 37–52 (1987). [CrossRef]  

38. F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay, “Scikit-learn: Machine learning in Python,” Journal of Machine Learning Research 12, 2825–2830 (2011).

39. R. Madden and K. Codling, “Two-electron excitation states in helium,” Astrophys. J. 141, 364 (1965). [CrossRef]  

40. Z.-M. Zhang, S. Chen, and Y.-Z. Liang, “Baseline correction using adaptive iteratively reweighted penalized least squares,” Analyst 135(5), 1138–1146 (2010). [CrossRef]  

41. D. Faccialà, B. W. Toulson, and O. Gessner, “Adaptive iteratively reweighted Principal Component Regression (airPCR),” figshare (2021), https://doi.org/10.6084/m9.figshare.16538847

Supplementary Material (1)

NameDescription
Code 1       Code example of airPCR: an adaptive iteratively reweighted Principal Component Regression technique for background removal in high-harmonic transient absorption spectra.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request. An example code of the airPCR algorithm is available in Code 1, Ref. [41].

41. D. Faccialà, B. W. Toulson, and O. Gessner, “Adaptive iteratively reweighted Principal Component Regression (airPCR),” figshare (2021), https://doi.org/10.6084/m9.figshare.16538847

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (7)

Fig. 1.
Fig. 1. Analysis of the noise of ${\Delta }OD_{\mathrm {calib}}$ , formed by pairing consecutively acquired UV-off spectra. (a) Sample UV-off spectrum, corresponding to a $\sim 125\,$ ms long acquisition of the 1 $\,$ kHz HHG source radiation. (b) False-color plot of the PDF of ${\Delta }OD_{\mathrm {calib}}$ across the XUV spectrum, demonstrating the noise associated with different harmonics. The standard deviation of the noise 2 $\sigma _s$ is represented by the black line. (c) Correlation matrix of the noise in the photon energy domain, showing the PCC between any two photon energies. (d) Correlation matrix of the noise by CCD exposure, showing the PCC between any two of the first 1000 observations.
Fig. 2.
Fig. 2. (a) First three columns of ${\Delta }OD_{\mathrm {calib}}$ , obtained from two adjacent pairs of UV-off spectra ( $p_1$ and $p_2$ ). (b) First 8 principal components ordered by decreasing relevance from top to bottom (first 8 columns of ${\Delta }OD_{\mathrm {PC}}$ ). (c) Recovery of additive noise across the entire spectrum (red line), compared to the mean noise (green line) for different fit regions (shaded gray). (d) Average measured spectral change $\langle \Delta OD_{\mathrm {raw}}(t)\rangle$ as a function of the UV-XUV pulse delay $t$ . (e) Average spectral change $\langle \Delta OD_{\mathrm {raw}}\rangle$ over all UV-XUV pulse delays (green line) and final fit region selection (shaded gray).
Fig. 3.
Fig. 3. Flowcharts of edge-pixel PCR and airPCR methods: (a) PCA of the calibration dataset $\Delta OD_{\mathrm {calib}}$ (same for both methods), (b) edge-pixel PCR flowchart, (c) airPCR flowchart.
Fig. 4.
Fig. 4. (a-b) Mean referenced signal $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ (a) and airPCR weights $w_k$ (b) as a function of energy and for different values of the iteration index $k$ (see legend in panel a). The inset in (a) shows a y-axis magnification of $\langle \Delta OD_{\mathrm {ref}, k}\rangle$ in a selected energy region. (c) Square norm $||\mathbf {d}_k||$ of the optical density as a function of $k$ . (d) Sum of all weights ${||\mathbf {w}_k||}_1$ as a function of $k$ . (e) $\overline {\mathrm {NPR}}$ for the test sample (dashed line) and $\overline {\mathrm {NPR}}$ averaged over all UV-XUV delays (solid line) as a function of $k$ . (c-e) The iteration index $k=46$ , corresponding to the optimum result, is indicated by a vertical dashed line.
Fig. 5.
Fig. 5. Top panels: raw $\langle \Delta OD_{\mathrm {raw}}(t)\rangle$ (a) and referenced $\langle \Delta OD_{\mathrm {ref}, k}(t)\rangle$ for different values of $k$ (b-f). Bottom panels: standard deviation of optical density averaged over all UV-XUV delays, for raw (a) and referenced spectra obtained for different numbers of airPCR iterations $k$ (b-f).
Fig. 6.
Fig. 6. (a-c) False-color maps of the raw (a), edge-pixel PCR referenced (b) and airPCR referenced (c) transient absorption spectra between $57-67\,$ eV. (d) Comparison of the mean noise $\langle \Delta OD_{\mathrm {raw,test}}\rangle$ (red line) of the test sample and fit $\langle \Delta OD_{\mathrm {raw,fit}}\rangle$ obtained with the edge-pixel PCR method (green line) and the airPCR method (blue line). (e) Sample XUV spectrum for reference. (f-i) Statistical analysis of the noise before (red) and after referencing with the edge-pixel PCR method (green) and the airPCR method (blue): (f-g) single pixel PDF before and after referencing for the test sample (h) standard deviations before and after referencing, averaged over all UV-XUV delays; (i) NPR averaged over all UV-XUV delays.
Fig. 7.
Fig. 7. (a-c) Raw (a), edge-pixel PCR referenced (b) and airPCR referenced transient absorption spectra between $54-58.5\,$ eV. (d-f) Spectra at $t = -200\,$ fs and $t = 20\,$ fs: raw (d), edge-pixel PCR referenced (e) and airPCR referenced (f). (g-j) Temporal variations of absorption features $E_1 = 55.17\,$ eV (g,i) and $E_2 = 57.43\,$ eV (h,j). The transients are represented before background subtraction (semi-transparent red lines), after edge-pixel PCR referencing (solid green lines), and after airPCR referencing (solid blue lines).

Equations (7)

Equations on this page are rendered with MathJax. Learn more.

Δ O D = log 10 I 1 I 0 Δ O D s i g n a l F log 10 I p r o b e 1 I p r o b e 0 .
ρ x , y = σ x y σ x σ y σ x y = ( x x ) ( y y ) , σ x = x 2 1 / 2 , σ y = y 2 1 / 2
Δ O D c a l i b Δ O D P C C
Δ O D r e f = Δ O D r a w Δ O D f i t Δ O D f i t = Δ O D P C C
C = ( Δ O D P C T W Δ O D P C ) 1 Δ O D P C T W Δ O D r a w ,
N P R ¯ = i = 0 n ( 1 w i ) N P R i i = 0 n ( 1 w i ) N P R = σ r a w 2 σ r e f 2 = Δ O D r a w 2 Δ O D r e f 2 ,
w k + 1 = exp [ α ( k ) d k | | d k | | ] , α ( k ) = ( 1 + c ) k , c > 0
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.