Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Use of numerical orthogonal transformation for the Zernike analysis of lateral shearing interferograms

Open Access Open Access

Abstract

A numerical orthogonal transformation method for reconstructing a wavefront by use of Zernike polynomials in lateral shearing interferometry is proposed. The difference fronts data in two perpendicular directions are fitted to numerical orthonormal polynomials instead of Zernike polynomials, and then the orthonormal coefficients are used to evaluate the Zernike coefficients of the original wavefront by use of a numerical shear matrix. Due to the fact that the dimensions of the shear matrix are finite, the high-order terms of the original wavefront above a certain order have to be neglected. One of advantages of the proposed method is that the impact of the neglected high-order terms on the outcomes of the lower-order terms can be decreased, which leads to a more accurate reconstruction result. Another advantage is that the proposed method can be applied to reconstruct a wavefront on an aperture of arbitrary shape from its difference fronts. Theoretical analysis and numerical simulations shows that the proposed method is correct and its reconstruction error is obviously smaller than that of Rimmer-Wyant method.

©2012 Optical Society of America

1. Introduction

In lateral shearing interferometry (LSI), the wavefront is interfered with a laterally shifted part of itself, thus eliminating the need of an extra reference wavefront. In addition, the two interference beams travel almost equivalent paths, which gives LSI an advantage that it is insensitive to mechanical vibration and environmental turbulence [1]. Owing to these advantages described above, LSI has been used extensively in many applications [2,3]. However, the interferograms of LSI is directly related to the difference between the wavefront and its sheared copy rather than the wavefront under test itself. Thus, an inversion problem that reconstructs the original wavefront from two wavefront differences (difference fronts) in two perpendicular shear directions has to be resolved. A variety of approaches to solve this problem have been devised [419]. They are mostly categorized as either zonal reconstruction or modal reconstruction. In zonal reconstruction, the wavefront is directly evaluated at specific grid points [49]. While in modal reconstruction, the wavefront is expanded into a set of certain basis functions and its corresponding coefficients are evaluated [1018]. As shown in [19], if the wavefront under test can be decomposed into a set of basis functions, modal reconstruction is superior to zonal reconstruction because of its better noise propagation properties and its efficient calculations.

For the modal reconstruction, Zernike polynomials are commonly used as basis functions for wavefront expansion [1417]. It was introduced to analyze the lateral shearing interferograms first by Rimmer and Wyant (1975) [14]. The two difference fronts in perpendicular shear directions as well as the original wavefront are expanded in terms of Zernike polynomials, the Zernike coefficients of difference front and original wavefront are related to each other by a shear matrix whose elements are functions of shear ratio. In fact, the number of terms of Zernike polynomials of a practical wavefront should be infinite in general case. However, the dimensions of the shear matrix cannot be extended to infinity. This contradiction leads to a result that the terms of Zernike polynomials of the original wavefront above a certain order have to be neglected because they cannot be represented by one or more elements of the shear matrix. These neglected terms are referred to as remaining high-order terms, and they have a negative effect on the outcomes of the lower-order terms. As can be seen below, this effect cannot be omitted even if the contributions of the remaining high-order terms to the original wavefront are slight compared with that of the lower-order terms. This problem was analyzed theoretically with matrix formulations by Herrmann [12], but he did not evaluate the error source that result from this problem. This error source was defined as remaining error and was evaluated theoretically and numerically by Dai [18], and the results showed that the remaining error can be reduced by use of Karhunen–Loève (K-L) functions instead of Zernike polynomials as basis functions to expand the wavefront. The K-L functions are optimal set of basis functions for atmospheric turbulence modal phase compensation. However, it is not employed in the field of optical testing.

In this paper, we propose a numerical orthogonal transformation method to reconstruct the wavefront from difference fronts based on Zernike polynomials. By using this method, the sensitivity of the outcomes of lower-order terms to the remaining high-order terms can be decreased, and a consequent result is that the remaining error is reduced and then the reconstruction accuracy is improved. This method can be implemented easily from Rimmer-Wyant method, and can be applied to reconstruct a wavefront on an aperture of arbitrary shape from its difference fronts. By theoretical analysis and numerical calculations, it is confirmed that the accuracy of the proposed method is superior to that of Rimmer-Wyant method.

2. Numerical orthogonal transformation

It is supposed that a wavefront W(x,y) under test is represented by the first Jfringe Zernike polynomials [20] as

W(x,y)=j=2JajZj(x,y).
wherexandyare normalized in Cartesian coordinates, Zj(x,y) denotes thejthfringe Zernike polynomials, and ajis the corresponding weighting coefficient. In Eq. (1) the pistonZ1was omitted because it is of no concern. Assuming that the two lateral shearing interfergrams in two perpendicular shear directions have been obtained within the two overlap regionsΣxandΣyon a series of discrete points, as shown in Figs. 1(a) and 1(b). And the shear ratiosin the two shear directions is assumed identical in this paper, thus the number of measurement pointsNfor the two difference fronts is the same. The two difference fronts on these discrete points can be evaluated from the two lateral shearing interferograms.

 figure: Fig. 1

Fig. 1 Schematic diagrams of lateral shearing interferograms. (a) Lateral shearing interferogaram when the shearing is in the x direction. (b) Lateral shearing interferogram when the shearing is in the y direction.

Download Full Size | PDF

The difference fronts are also expanded by Zernike polynomials. For the moment, we analyze the difference front when the shearing is in thexdirection. Then we have

ΔWx(x,y;s)=W(x+s,y)W(xs,y)=j=1JAx,jZj(x,y).
whereAx,j denotes thejthZernike coefficient ofΔWx(x,y;s), and the summation starts from j=1since the piston of the difference front cannot be omitted because it relates to some terms of the original wavefront, e.g. tilt and coma. Note that some high-orders of Ax,j may be zero, but they still be reserved to facilitate the derivation subsequently. Substituting Eq. (1) and the expression of Zernike polynomials in Cartesian coordinates into Eq. (2), the relationship between the Zernike coefficients of the difference front and the original wavefront can be derived. It is written in matrix form for simplicity as
Ax=Nxa.
whereAxandaareJ×1and (J1)×1vectors, respectively, they represent the arrays of Zernike coefficients of the difference front and the original wavefront, respectively. Nxis a shear matrix that relates the Zernike coefficients of the original wavefront and the difference front.

However, the Zernike coefficients of the difference fronts are not independent of each other, because Zernike polynomials are orthogonal basis functions over unit circle but not orthogonal over the overlap regionΣxof two circular beams where the difference front lying in. Thus, it can be expected that the use of orthogonal polynomials instead of Zernike polynomials to expand the difference fronts and a new shear matrix to relate the Zernike coefficients of the original wavefront and the orthogonal coefficients of the difference fronts may reduce the reconstruction error. Indeed, as can be seen below, the reconstruction of the original wavefront with orthogonal transformation results in a smaller remaining error and a more accurate reconstruction result.

2.1. Calculation of the numerical orthonormal polynomials

Although some methods have been presented to construct orthogonal polynomials analytically for non-circle pupils in simple shape, e.g. annular, hexagon, ellipse and rectangle [2126], it is still a challenging work to construct a set of orthogonal polynomials over the overlap region of two inference beams analytically, especially when the wavefront under test is in a more complicated region, e.g. annular region with a spider that emerged in the Extreme Ultraviolet Lithography [27]. In fact, what we needed is a set of polynomials that are orthogonal over the discrete points of the difference front data set withinΣxrather than polynomials that are orthogonal over the full region ofΣx.The orthogonal polynomials over the discrete points can be obtained conveniently from numerical calculation by computer regardless of the shape of the region. Since Zernike polynomials form a complete set, the numerical polynomials {Fx,j(xn,yn)},(n=1,2,...,N)that are orthonormal over the discrete points withinΣxare available from the linear combination of Zernike polynomials. The Zernike polynomials and the set{Fx,j(xn,yn)}are related to each other according to

Fx,i(xn,yn)=j=1JMx,ijZj(xn,yn).
whereMx,ijis the element of ithrow and jthcolumn of the conversion matrixMx which is a lower triangle matrix. Equation (4) can be written in matrix form as
Fx=ZxMxT.
whereFxandZxareN×Jmatrix representing each of J orthonormal polynomials and J Zernike polynomials over theNdata points within the regionΣx, respectively, and MxTis the transpose of matrixMx.An elegant nonrecursive method to determine the conversion matrix Mxin integrable domain has been described by Dai [24] which can be extended to discrete domain conveniently. Multiply both sides of Eq. (5) by FxTandZxT, respectively, we obtain

FxTFx=FxTZxMxT.
ZxTFx=ZxTZxMxT.

BecauseFxare orthonormal over the data set, so we have(FxTFx)/N=I, whereIis J×J identity matrix. Use this condition to Eq. (6), we get

MxZxTFx=NI.

Substitute Eq. (7) to Eq. (8) and letMx=(QxT)1, we obtain

QxTQx=(ZxTZx)/N.

Equation (9) can be solved forQxuniquely with, e.g, Cholesky decomposition due to the fact that ZxTZxis symmetric positive definite matrix [24], and then conversion matrixMxcan be determined fromMx=(QxT)1.The expressions of Zernike polynomials in Cartesian coordinate are known, and then the matrix Zxcan be calculated by substitute the coordinate values of the measurement points into the expressions of Zernike polynomials. Therefore, the numerical orthonormal polynomials Fx can be determined from Eq. (5) with the matrixZxand the conversion matrixMx.

2.2 Derivation of the corresponding shear matrix

By expanding the difference front within Σxwith the fistJ numerical orthonormal polynomials{Fx,j(xn,yn)}, the corresponding orthogonal coefficients are obtained as J×1 vectorBx. Since the fact that each term in the set{Fx,j(xn,yn)}is a linear combination of the Zernike polynomials, the difference front expanded with the orthonormal polynomials {Fx,j(xn,yn)}is identical to that expanded with corresponding set of Zernike polynomials. Then, we have a matrix formula,

ΔWx=ZxAx=FxBx.
whereΔWxisN×1vector representing the difference front values inNpoints within the region Σx.Substituting Eq. (5) to Eq. (10), the relationship between the orthonormal coefficients Ax and the Zernike coefficientsBxcan be obtained as

Ax=MxTBx.

Substituting Eq. (11) into Eq. (3), we have

Bx=(MxT)1Nxa=Hxa.

Here, Hx=(MxT)1Nx denotes the shear matrix that relates the Zernike coefficients of the original wavefront and the orthonormal coefficients of difference front.

2.3 Wavefront reconstruction

For the analysis of the difference front within Σywhere the shearing is in theydirection, we find that, similar to Eq. (3) and Eq. (12),

Ay=Nya.
By=(MyT)1Nya=Hya.
whereAyandByareJ×1vectors, which represent Zernike coefficients and orthonormal coefficients of the difference front withinΣy, respectively. Myis the conversion matrix which transforms Zernike polynomials to the numerical polynomials{Fy,j(xn,yn)}. The numerical polynomials are orthonormal over the discrete points withinΣy. Ny and Hy=(MyT)1Ny are the shear matrixes corresponding toNxandHx, respectively. All the four shear matrixesNx, Ny,Hxand Hy are J×(J1)upper triangular matrixes whose matrix elements are functions of shear ratiosor zero. Note that, NxandNyare given in analytical form, while HxandHyare given in numerical form.

Combing Axand Ayinto a single vector and matrixNxandNyinto a single matrix, a matrix formula is obtained from Eq. (3) and Eq. (13) as

(AxAy)=(NxNy)a.
which can be abbreviated to

A=Na.

Appling the same treatment to Eq. (12) and Eq. (14), a matrix formula is obtained as

(BxBy)=(HxHy)a.
which can also be simplified as

B=Ha.

Since Eqs. (16) and (18) are obviously over determined equations, the least-square solutions of the two equations are

a^=NA^.
a^=HB^.
whereA^andB^are estimation ofAandBobtained by fitting the two difference fronts ΔWx(xn,yn) and ΔWy(xn,yn)to Zernike polynomials and numerical orthonormal polynomials, respectively, and N=(NTN)1NTandH=(HTH)1HTare generalized inverse of matrixes N andH, respectively. Equations (19) and (20) are the representation of Rimmer-Wyant method and the proposed method, respectively.

We note that a similar method by using elliptical orthogonal transformation has been presented in Ref. 16. In this method, the difference front inxdirection withinΣxwas reduced to an elliptical region. And the elliptical Zernike polynomials which are orthogonal over this elliptical region were generated from Zernike polynomials by coordinate transformation. Then the reduced difference front was expanded into the obtained elliptical Zernike polynomials. The shear matrix that relates the elliptical Zernike coefficients of difference front and the Zernike coefficients of the original wavefront were derived from a double integral. The difference front inydirection withinΣywas analyzed following the same way.

However, the obtained elliptical Zernike polynomials are orthogonal over the full elliptical region, but not orthogonal over the discrete points at which the difference front are measured. Moreover, the applications of this method are limited to the situation that the wavefront under test is circular because the overlap region of the two interference beams cannot be approximated by an ellipse when the wavefront under test is generated from a non-circle aperture such as annular aperture.

For the proposed method, the numerical orthogonal transformation is implemented directly on the discrete points of the difference front data set, and the obtained numerical polynomials are orthogonal over these discrete points rather than the full overlap region. In addition, the orthogonality of the obtained numerical polynomials is not influenced by the shape of the region. Thus, the proposed method can be applied on apertures of arbitrary shape.

The fact that the outcomes of the lower-order terms are influenced by the remaining high-order terms was pointed out and illustrated by an example shown in Ref.16. However, the authors did not explain the effect of orthogonal transformation on the reduction of this influence. This effect will be made clear with a cross-coupling formula derived in the following section.

3. Impact of remaining high-order terms

Here, we assume the number J stands for infinity. Therefore, any practical wavefront can be represented completely by Eq. (1). Now, returning to Eq. (10), under the assumption that J stands for infinity, the column number of the matrixZxis extended to infinity. If the matrixZxand the vectorAxare splited into two blocks, a formula can be obtained from Eq. (10) as

ΔWx=ZxAx=(ZxfZxr)(AxfAxr)=ZxfAxf+ZxrAxr.

Here, ZxfandZxrare the two blocks containing the firstKand the remaining columns of the matrixZx,respectively. AxfandAxrare the firstKand the remaining elements of the vectorAx.

3.1 Least-square fitting of difference fronts

Since the difference front cannot be fitted with infinite terms of Zernike polynomials, we assume it is fitted by the firstKterms, and then the corresponding coefficients can be obtained by means of least-square fitting,

A^xf=ZxfΔWx.
whereA^xfis the estimation ofAxf,andZxf=(ZxfTZxf)1ZxfTis the generalized inverse of matrixZxf. Substituting Eq. (21) into Eq. (22), we obtain
A^xf=Zxf(ZxfAxf+ZxrAxr)=Axf+CxAxr.
whereCx=ZxfZxr. A similar expression can be obtained for the fitting of the difference front withinΣy where the shearing is in theydirection,
A^yf=Ayf+CyAyr.
whereA^yfis the estimation ofAyfwhich is the firstKZernike coefficients of the difference front, Ayris the Zernike coefficients of the remaining high-order terms, and Cy=ZyfZyr, whereZyf=(ZyfTZyf)1ZyfTis the generalized inverse of matrixZyf, and Zyfand Zyrare the firstKand the remaining columns of the matrixZy,respectively, andZyis N×Jmatrix representing each of JZernike polynomials over theNdata points within the regionΣy.

3.2 Splitting of the shear matrixes

To facilitate the derivation, Eq. (3) is written in another form as

(AxfAxr)=(Nxf1Nxf2Nxr1Nxr2)(afar)=(Nxf1Nxf20Nxr2)(afar).

Here, the shear matrixNxwith infinite rows and columns is splited into four blocks by a horizontal line under theKthrow and a vertical line behind the(K1)thcolumns. So, the block Nxf1is K×(K1)matrix, and the dimensions of other blocks can also be deduced easily. In addition, Nxr1=0is becauseNxis an upper triangular matrix. Vectorsafandarare arrays of the firstK1and the remaining elements of vectora, respectively.

The shear matrixNycan also be splited into four blocks in the same way, and a similar expression as Eq. (24a) for the analysis of the difference front inyshear direction can be obtained as

(AyfAyr)=(Nyf1Nyf2Nyr1Nyr2)(afar)=(Nyf1Nyf20Nyr2)(afar).

From Eq. (24a) and Eq. (24b) the following formula can be obtained easily,

Axf=Nxf1af+Nxf2ar
Axr=Nxr2ar.
Ayf=Nyf1af+Nyf2ar.
Ayr=Nyr2ar.

3.3 Reconstruction with finite-dimensional shear matrixes

As discussed in Sec.1, the wavefront cannot be reconstructed completely due to the fact that the dimensions of the shear matrix are finite. As Eq. (19), the estimation of the firstKZernike coefficients of the original wavefront except piston can be obtained by shear matrixNxf1, Nyf1and the estimation coefficientsA^xfand A^yfas

a^f=Nf1(A^xfA^yf)withNf1=(Nxf1Nyf1).
whereNf1=(Nf1TNf1)1Nf1Tis generalized inverse of matrixNf1.Substituting Eqs. (23a) and (23b) into Eq. (26), we obtain

a^f=Nf1(Axf+CxAxrAyf+CyAyr).

Substituting Eqs. (25a) - (25d) to Eq. (27), we have

a^f=Nf1(Nxf1af+Nxf2ar+CxNxr2arNyf1af+Nyf2ar+CyNyr2ar)=Nf1{(Nxf1Nyf1)af+(Nxf2Nyf2)ar+(CxNxr2CyNyr2)ar}.=af+TZar
where

TZ=Nf1Nf2+Nf1Ncr2,Nf2=(Nxf2Nyf2)andNcr2=(CxNxr2CyNyr2).

TZis referred to as cross-coupling matrix, and it represents the impact of the remaining high-order terms on the outcomes of the lower-order terms when the wavefront is reconstructed by use of Eq. (19). Note that this cross-coupling matrix is different from the cross-coupling matrix derived by Herrmann [12] and the cross-talk matrix derived by Dai [18].

3.4 Same analysis of numerical orthogonal transformation method

Although Eq. (28) is deduced from the situation that the Zernike polynomials are used as a basis to expand the difference fronts, it can also be used in the case that the numerical orthonormal polynomials are used as a basis to expand the difference fronts. To analyze the proposed method, by imitating Eq. (28) we obtain

a^f=af+THar.
where

TH=Hf1Hf2+Hf1Hcr2,Hf2=(Hxf2Hyf2)andHcr2=(DxHxr2DyHyr2).

Similar toCx=ZxfZxrandCy=ZyfZyr, we haveDx=FxfFxrandDy=FyfFyr. Note that, due to the orthonormality ofFxandFy, we have

Dx=FxfFxr=(FxfTFxf)1FxfTFxr=0.
Dy=FyfFyr=(FyfTFyf)1FyfTFyr=0.

Substituting Eqs. (30). (a) and (b) to the expression ofHcr2, the result ofHcr2=0 is obtained, and then substituting this result to the expression ofTH, we obtainTH=Hf1Hf2.

Obviously, bothTZandTHare zero matrixes whenK=J, under this condition we have a^f=af, that is the wavefront can be reconstructed without error regardless of the orthogonality of the basis functions that are used to expand the difference front. However, as discussed in Sec.1, this condition cannot be met in practical situation. Consequently, it is inevitable that the estimations of lower-order coefficients a^fare affected by the high-order coefficients aras long asKJ.This problem is inherent deficiency of the modal methods which cannot be prevented by the proposed method. However, it can be alleviated by the proposed method, because the level of the cross coupling of THis far below than that ofTZas demonstrated below.

4. Numerical simulation

4.1 Simulation condition

To confirm the proposed method and the theory analysis discussed above, numerical simulations were implemented. A digitized wavefront W(xn,yn) filtered by a circle pupil was generated over a 256×256 square grid by the first 20 fringe Zernike polynomials, that was J=20.The corresponding coefficients were generated randomly, but the last five elements were multiplied by101.This attenuation followed the assumption that all the terms that gave significant contributions to the wavefront were contained in the first 15 terms, and the contributions of the last five terms were very weak compared with those of the first 15 terms. Noise-free difference fronts data in two perpendicular shear directions was calculated with the same shear ratio ofs=0.1. The test wavefront and the difference fronts in two directions are shown in Figs. 2(a) to 2(c). The Zernike coefficients of the test wavefront are shown in Fig. 2(d). For comparison, the wavefront were reconstructed from the two difference fronts by Rimmer-Wyant method as Eq. (19) and by the proposed method as Eq. (20), respectively.

 figure: Fig. 2

Fig. 2 Simulation condition, (a) simulated wavefront under testW(xn,yn), (b) difference front ΔWx(xn,yn) when the shearing was in thexdirection, (c) difference front ΔWy(xn,yn) when the shearing was in theydirection and (d) random Zernike coefficients of the wavefront under test.

Download Full Size | PDF

4.2 Reconstruction without remaining high-order terms

First, we consider the reconstruction without the remaining high-order terms, that is, with K=J=20. In this case, all of the Zernike coefficients of the original wavefront are represented by one or more elements of the shear matrixNandH. The dimensions of both the two shear matrixesNandHwere40×19.The results of the evaluation of the Zernike coefficients of the original wavefront are shown in Table 1 . WhenK=20, the evaluated coefficients of both Rimmer-Wyant method and the proposed method are identical to the input coefficients. These results make it clear that both the two methods are capable to reconstruct the wavefront without error under the condition that all the terms of original wavefront are included in the analysis, that is there is no remaining high-order terms. Unfortunately, this condition can only be met in simulation because the existence of remaining high-order terms is inevitable in the analysis of a practical wavefront in general case, as discussed in Sec. 1.

Tables Icon

Table 1. Input and evaluated Zernike coefficients of the original wavefront by Rimmer-Wyant method (M1) and the proposed method (M2) when the reconstruction was performed without and with the affections of the remaining high-order terms

4.3 Reconstruction under the impact of remaining high-order terms

To examine the impact of the remaining high-order terms on the outcomes of the lower-order terms, the wavefront was reconstructed under the condition thatK=15, that is, the last five terms were the remaining high-order terms. The dimensions of both the two shear matrixes N andHwere shrunk to30×14. The results of the evaluation of the Zernike coefficients of the original wavefront are also shown in Table 1. The value of “Percentage Error” column was calculated by the formula ofrj=(aja^j)/aj×100, where ajis the input coefficient, and a^jis the corresponding evaluated value by the two methods. The calculation results of rjare also diagramed in Fig. 3(d) for visualization. It can be clearly seen from the “Percentage Error” column of Table.1 that all of the absolute values of the coefficients error that were retrieved by the proposed method are smaller than those by Rimmer-Wyant method. And the proposed method retrieved the input coefficients with small differences of less than 1% in most case, while these differences are higher than30%in some case by Rimmer-Wyant method.

 figure: Fig. 3

Fig. 3 Original wavefront and reconstructed results of the two methods: (a) a part of test wavefront Wf, (b) the reconstruction result of Rimmer-Wyant method, (c) the reconstruction result of the proposed method, (d) the percentage error of the retrieved Zernike coefficients by Rimmer-Wyant method and the proposed method, (e) the difference between the reconstruction result W1 of Rimmer-Wyant method and Wf, and(f) the difference between the reconstruction result W2 of the proposed method and Wf .

Download Full Size | PDF

To facilitate the analysis, the wavefront under test is divided into two parts. One part is denoted by Wf which represents the contributions of the first 15 terms, while the other part denoted byWrdescribes the contributions of the remaining 5 terms. In this simulation, we evaluate the reconstruction accuracy ofWf.The wavefront was reconstructed by use of the evaluated Zernike coefficients shown in “K=15”column of Table 1. The notations of W1andW2 were assigned to represent the reconstruction results of Rimmer-Wyant method and the proposed method, respectively. The contour plots of these wavefronts and the reconstruction errors of the two methods are shown in Fig. 3. The reconstruction error of the proposed method is obviously smaller than that of Rimmer-Wyant method, as can be clearly seen from Figs. 3(e) and 3(f).

4.4 Comparison of RMS and PV value

The root mean square (RMS) and peak-to-valley (PV) value were used to characterize the reconstruction accuracy. The RMS and PV value of the original wavefront and the reconstruction error of the two methods are shown in Table 2 . The RMS reconstruction error is6.25%for Rimmer-Wyant method and 0.81%for the proposed method. In other words, the remaining error of the proposed method is about1/8of that of Rimmer-Wyant method. Moreover, we know from Table 2 that the PV reconstruction error of the proposed method is about 1/10of that of Rimmer-Wyant method. In brief, the reconstruction accuracy of the proposed method is superior to that of Rimmer-Wyant method, which can also be confirmed from Figs. 3(a) to 3(c). Note that the reconstruction accuracy will change with some parameters, such as the number of the Zernike polynomialsKused in the reconstruction and the shear ratios, but the fact that the proposed method is more accurate will not change.

Tables Icon

Table 2. RMS and PV values of the original wavefront and of the reconstruction error

4.5 Evaluation of the cross-coupling matrix

To explain the reason why the reconstruction accuracy of the proposed method is superior to that of Rimmer-Wyant method under the impact of remaining high-order terms, the cross-coupling matrixes TZandTHof the two methods were evaluated. The two cross-coupling matrixes TZand TH are shown in Figs. 4(a) and 4(b). As discussed in Sec.1, the cross-coupling matrix manifests the impact of the remaining high-order terms on the outcomes of the lower-order terms. For example, as shown in Fig. 4(a), the high-order coefficient a16 has affections on the estimation of all the lower-order coefficients, especiallya14anda15, when Rimmer-Wyant method is used. When the proposed method is used, the coefficienta16just affects ona4anda9with more slight level, as shown in Fig. 4(b).

 figure: Fig. 4

Fig. 4 Cross-coupling matrix of the two methods, (a) the coupling-matrixTZof Rimmer-Wyant method, (b) the cross-coupling matrixTHof the proposed method

Download Full Size | PDF

On the other hand, the cross-coupling matrix also manifests the sensitivity of the outcomes of the lower-order terms to the remaining high-order terms. For example, as shown in Fig. 4(a), the evaluation of the coefficienta13is sensitive to almost all of the five remaining terms, especially toa16anda19, when Rimmer-Wyant method is used. When the proposed method is used, the evaluation ofa13is not sensitive to any one of the five remaining terms, as can be seen from Fig. 4(b). Thus the calculation error ofa13of the proposed method is far below than that of Rimmer-Wyant method which can also be confirmed from Table 1.

Anyway, it can be clearly seen from Fig. 4 that the level of cross coupling ofTHis far below than that of TZ.Therefore the sensitivity of the outcomes of lower-order terms to the remaining high-order terms can be decreased by the proposed numerical orthogonal transformation method which leads to a smaller remaining error and a more accurate reconstruction result than Rimmer-Wyant method.

4.6. Comparison of the computation time

To compare the computation time of the proposed method and Rimmer-Wyant method, several simulations were implemented in different sample sizes. The computation time of the proposed method was divided into the following two parts: one part is the time to do the numerical orthogonal transformation and the other part is the time to reconstruct the wavefront with the new shear matrixes and numerical orthogonal polynomials. The total reconstruction time is the summation of the two parts. It can be clearly seen from Table 3 that the proposed method is faster than Rimmer-Wyant method. The reason is that the time to obtain the orthogonal coefficients of difference fronts is shorter than the time to obtain the Zernike coefficients of difference fronts as follows: the orthogonal coefficients of the difference front in x direction can be obtained directly byA^x=FxTΔWx/N, while the Zernike coefficients are obtained by means of least-squares fitting as A^x=(ZxTZx)1ZxTΔWxwhich needs more computation time. The time difference is longer than the time to operate the numerical orthogonal transformation. Note that our calculation was performed on a personal computer equipped with a 2.80GHZ Pentium-4 processor and the software was MATLAB (Version 7.8.0).

Tables Icon

Table 3. Comparison of Computation Time of the Two Methods for Different Sample Sizes

4.7. Simulation with a general wavefront

Although all of the test wavefronts used above were devised in a special case, the obtained conclusions can be promoted reasonably to general situation. To support this viewpoint, another simulation was implemented. In this simulation, the test wavefront was constructed by use of the first 36 fringe Zernike polynomials and the corresponding coefficients were decided from the experiment results of Ref. 28. The coefficients of tilt and defocus were set to be zero. The difference fronts data in the two directions were calculated with the same shear ratio s=0.1. The wavefront was reconstructed by use of the first 31 fringe Zernike polynomials and the last 5 terms played the parts of remaining high-order terms. We note that the coefficients of the last 5 terms were comparable to that of the first 31 terms in this case. The RMS reconstruction error was 11.29% for Rimmer-Wyant method and 2.98% for the proposed method.

5. Conclusion

The number of terms of Zernike polynomials of a practical wavefront is infinite in general case, but the dimensions of the shear matrix which relate the Zernike coefficients of the original wavefront and difference fronts cannot be extended to infinity. As a result, the outcome of the lower-order terms affected by the remaining high-order terms is inevitable when the modal method is used to reconstruct a practical wavefront from its corresponding difference fronts. Nevertheless, this effect can be decreased by use of orthogonal transformation which results in a smaller reconstruction error. This is just the primary idea of the proposed method which indeed leads to a more accurate result than Rimmer-Wyant method. In theory, the prescription described in this paper can be applied to reconstruct a wavefront on an aperture of arbitrary shape from its difference fronts, and it can also be applied when other polynomials are used as basis functions to expand the wavefront under test.

Acknowledgments

This work was supported by the Grant from the National Natural Science foundation of China under no. 60938003.

References and links

1. D. Malacara, Optical Shop Testing, 3rd ed, (CRC Press, Taylor& Francis, 2007).

2. A. Dubra, C. Paterson, and C. Dainty, “Study of the tear topography dynamics using a lateral shearing interferometer,” Opt. Express 12(25), 6278–6288 (2004). [CrossRef]   [PubMed]  

3. Y. Zhu, K. Sugisaki, M. Okada, K. Otaki, Z. Liu, J. Kawakami, M. Ishii, J. Saito, K. Murakami, M. Hasegawa, C. Ouchi, S. Kato, T. Hasegawa, A. Suzuki, H. Yokota, and M. Niibe, “Wavefront measurement interferometry at the operational wavelength of extreme-ultraviolet lithography,” Appl. Opt. 46(27), 6783–6792 (2007). [CrossRef]   [PubMed]  

4. M. P. Rimmer, “Method for evaluating lateral shearing interferograms,” Appl. Opt. 13(3), 623–629 (1974). [CrossRef]   [PubMed]  

5. D. L. Fried, “Least-square fitting a wave-front distortion estimate to an array of phase-difference measurements,” J. Opt. Soc. Am. 67(3), 370–375 (1977). [CrossRef]  

6. R. H. Hudgin, “Wave-front reconstruction for compensated imaging,” J. Opt. Soc. Am. 67(3), 375–378 (1977). [CrossRef]  

7. B. R. Hunt, “Matrix formulation of the reconstruction of phase values from phase differences,” J. Opt. Soc. Am. 69(3), 393–399 (1979). [CrossRef]  

8. J. Herrmann, “Least-squares wave front errors with minimum norm,” J. Opt. Soc. Am. 70(1), 28–35 (1980). [CrossRef]  

9. X. Liu, Y. Gao, and M. Chang, “A partial differential equation algorithm for wavefront reconstruction in lateral shearing interferometry,” J. Opt. A, Pure Appl. Opt. 11(4), 045702 (2009). [CrossRef]  

10. S. Okuda, T. Nomura, K. Kamiya, H. Miyashiro, K. Yoshikawa, and H. Tashiro, “High-precision analysis of a lateral shearing interferogram by use of the integration method and polynomials,” Appl. Opt. 39(28), 5179–5186 (2000). [CrossRef]   [PubMed]  

11. P. Liang, J. Ding, Z. Jin, C. S. Guo, and H. T. Wang, “Two-dimensional wave-front reconstruction from lateral shearing interferograms,” Opt. Express 14(2), 625–634 (2006). [CrossRef]   [PubMed]  

12. J. Herrmann, “Cross coupling and aliasing in modal wavefront estimation,” J. Opt. Soc. Am. 71(8), 989–992 (1981). [CrossRef]  

13. K. R. Freischlad and C. L. Koliopoulos, “Modal estimation of a wave front from difference measurements using the discrete Fourier transform,” J. Opt. Soc. Am. A 3(11), 1852–1861 (1986). [CrossRef]  

14. M. P. Rimmer and J. C. Wyant, “Evaluation of large aberrations using a lateral-shear interferometer having variable shear,” Appl. Opt. 14(1), 142–150 (1975). [PubMed]  

15. W. Shen, M. W. Chang, and D. S. Wan, “Zernike polynomial fitting of lateral shearing interferometry,” Opt. Eng. 36(3), 905–913 (1997). [CrossRef]  

16. G. Harbers, P. J. Kunst, and G. W. R. Leibbrandt, “Analysis of lateral shearing interferograms by use of Zernike polynomials,” Appl. Opt. 35(31), 6162–6172 (1996). [CrossRef]   [PubMed]  

17. H. van Brug, “Zernike polynomials as a basis for wave-front fitting in lateral shearing interferometry,” Appl. Opt. 36(13), 2788–2790 (1997). [CrossRef]   [PubMed]  

18. G.- Dai, “Modal wavefront reconstruction with Zernike polynomials and Karhunen-Loève functions,” J. Opt. Soc. Am. A 13(6), 1218–1225 (1996). [CrossRef]  

19. W. H. Southwell, “Wave-front estimation from wave-front slope measurements,” J. Opt. Soc. Am. 70(8), 998–1006 (1980). [CrossRef]  

20. J. C. Wyant and K. Creath, Basic Wavefront Aberration Theory for Optical Metrology, Vol. XI of Applied Optics and Optical Engineering Series (Academic, 1992), 28.

21. V. N. Mahajan, “Zernike annular polynomials for imaging systems with annular pupils,” J. Opt. Soc. Am. 71(1), 75–85 (1981). [CrossRef]  

22. R. Upton and B. Ellerbroek, “Gram-Schmidt orthogonalization of the Zernike polynomials on apertures of arbitrary shape,” Opt. Lett. 29(24), 2840–2842 (2004). [CrossRef]   [PubMed]  

23. V. N. Mahajan and G. M. Dai, “Orthonormal polynomials for hexagonal pupils,” Opt. Lett. 31(16), 2462–2464 (2006). [CrossRef]   [PubMed]  

24. G. M. Dai and V. N. Mahajan, “Nonrecursive determination of orthonormal polynomials with matrix formulation,” Opt. Lett. 32(1), 74–76 (2007). [CrossRef]   [PubMed]  

25. V. N. Mahajan and G. M. Dai, “Orthonormal polynomials in wavefront analysis: analytical solution,” J. Opt. Soc. Am. A 24(9), 2994–3016 (2007). [CrossRef]   [PubMed]  

26. G. M. Dai and V. N. Mahajan, “Orthonormal polynomials in wavefront analysis: error analysis,” Appl. Opt. 47(19), 3433–3445 (2008). [CrossRef]   [PubMed]  

27. M. Hasegawa, C. Ouchi, T. Hasegawa, S. Kato, A. Ohkubo, A. Suzuki, K. Sugisaki, M. Okada, K. Otaki, K. Murakami, J. Saito, M. Niibe, and M. Takeda, “Recent progress of EUV wavefront metrology in EUVA,” Proc. SPIE 5533, 27–36 (2004). [CrossRef]  

28. Y. Zhu, S. Odate, A. Sugaya, K. Otaki, K. Sugisaki, C. Koike, T. Koike, and K. Uchikawa, “Method for designing phase-calculation algorithms for two-dimensional grating phase-shifting interferometry,” Appl. Opt. 50(18), 2815–2822 (2011). [CrossRef]   [PubMed]  

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (4)

Fig. 1
Fig. 1 Schematic diagrams of lateral shearing interferograms. (a) Lateral shearing interferogaram when the shearing is in the x direction. (b) Lateral shearing interferogram when the shearing is in the y direction.
Fig. 2
Fig. 2 Simulation condition, (a) simulated wavefront under test W( x n , y n ) , (b) difference front Δ W x ( x n , y n ) when the shearing was in thexdirection, (c) difference front Δ W y ( x n , y n ) when the shearing was in theydirection and (d) random Zernike coefficients of the wavefront under test.
Fig. 3
Fig. 3 Original wavefront and reconstructed results of the two methods: (a) a part of test wavefront Wf, (b) the reconstruction result of Rimmer-Wyant method, (c) the reconstruction result of the proposed method, (d) the percentage error of the retrieved Zernike coefficients by Rimmer-Wyant method and the proposed method, (e) the difference between the reconstruction result W1 of Rimmer-Wyant method and Wf, and(f) the difference between the reconstruction result W2 of the proposed method and Wf .
Fig. 4
Fig. 4 Cross-coupling matrix of the two methods, (a) the coupling-matrix T Z of Rimmer-Wyant method, (b) the cross-coupling matrix T H of the proposed method

Tables (3)

Tables Icon

Table 1 Input and evaluated Zernike coefficients of the original wavefront by Rimmer-Wyant method (M1) and the proposed method (M2) when the reconstruction was performed without and with the affections of the remaining high-order terms

Tables Icon

Table 2 RMS and PV values of the original wavefront and of the reconstruction error

Tables Icon

Table 3 Comparison of Computation Time of the Two Methods for Different Sample Sizes

Equations (38)

Equations on this page are rendered with MathJax. Learn more.

W(x,y)= j=2 J a j Z j (x,y) .
Δ W x (x,y;s)=W(x+s,y)W(xs,y)= j=1 J A x,j Z j (x,y) .
A x = N x a .
F x,i ( x n , y n )= j=1 J M x,ij Z j ( x n , y n ) .
F x = Z x M x T .
F x T F x = F x T Z x M x T .
Z x T F x = Z x T Z x M x T .
M x Z x T F x =NI .
Q x T Q x = ( Z x T Z x ) /N .
Δ W x = Z x A x = F x B x .
A x = M x T B x .
B x = ( M x T ) 1 N x a= H x a .
A y = N y a .
B y = ( M y T ) 1 N y a= H y a .
( A x A y )=( N x N y )a .
A=Na .
( B x B y )=( H x H y )a .
B=Ha .
a ^ = N A ^ .
a ^ = H B ^ .
Δ W x = Z x A x =( Z xf Z xr )( A xf A xr )= Z xf A xf + Z xr A xr .
A ^ xf = Z xf Δ W x .
A ^ xf = Z xf ( Z xf A xf + Z xr A xr )= A xf + C x A xr .
A ^ yf = A yf + C y A yr .
( A xf A xr )=( N xf1 N xf2 N xr1 N xr2 )( a f a r )=( N xf1 N xf2 0 N xr2 )( a f a r ) .
( A yf A yr )=( N yf1 N yf2 N yr1 N yr2 )( a f a r )=( N yf1 N yf2 0 N yr2 )( a f a r ) .
A xf = N xf1 a f + N xf2 a r
A xr = N xr2 a r .
A yf = N yf1 a f + N yf2 a r .
A yr = N yr2 a r .
a ^ f = N f1 ( A ^ xf A ^ yf )with N f1 =( N xf1 N yf1 ) .
a ^ f = N f1 ( A xf + C x A xr A yf + C y A yr ) .
a ^ f = N f1 ( N xf1 a f + N xf2 a r + C x N xr2 a r N yf1 a f + N yf2 a r + C y N yr2 a r ) = N f1 { ( N xf1 N yf1 ) a f +( N xf2 N yf2 ) a r +( C x N xr2 C y N yr2 ) a r } . = a f + T Z a r
T Z = N f1 N f2 + N f1 N cr2 , N f2 =( N xf2 N yf2 )and N cr2 =( C x N xr2 C y N yr2 ) .
a ^ f = a f + T H a r .
T H = H f1 H f2 + H f1 H cr2 , H f2 =( H xf2 H yf2 )and H cr2 =( D x H xr2 D y H yr2 ) .
D x = F xf F xr = ( F xf T F xf ) 1 F xf T F xr =0 .
D y = F yf F yr = ( F yf T F yf ) 1 F yf T F yr =0 .
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.