
Registration of fluorescein angiography and optical coherence tomography images of curved retina via scanning laser ophthalmoscopy photographs

Open Access

Abstract

Accurate and automatic registration of multimodal retinal images such as fluorescein angiography (FA) and optical coherence tomography (OCT) enables utilization of their supplementary information. FA is a gold-standard imaging modality that depicts the neurovascular structure of the retina and is used for diagnosing neurovascular diseases such as diabetic retinopathy (DR). Unlike FA, OCT is a non-invasive retinal imaging modality that provides cross-sectional data of the retina. Due to differences in contrast, resolution, and brightness of multimodal retinal images, the images resulting from vessel extraction of image pairs are not exactly the same. Also, prevalent feature detection, extraction, and matching schemes do not result in perfect matches. In addition, the relationship between retinal image pairs is usually modeled by an affine transformation, which cannot generate accurate alignments because the retinal surface is not planar. In this paper, a precise registration scheme is proposed to align FA and OCT images via scanning laser ophthalmoscopy (SLO) photographs as intermediate images. For this purpose, retinal vessel segmentation is first applied to extract the main blood vessels from the FA and SLO images. Next, a novel global registration is proposed based on a Gaussian model for the curved surface of the retina. To this end, a global rigid transformation is first applied to the FA vessel-map image using a new feature-based method to align it with the SLO vessel map, such that outlier matches caused by imperfect vessel segmentation are completely eliminated. The transformed image is then globally registered again using the Gaussian model for the curved retinal surface to improve the precision of the previous step. Eventually, a local non-rigid transformation is exploited to register the two images precisely. The experimental results indicate that the presented scheme is more precise than other registration methods.

© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Nowadays, much research in medical image processing aims at systems that can detect and analyze various diseases automatically. Minimizing costs, increasing detection speed, reducing subjective mistakes, and enhancing accuracy are the chief goals of these systems. Particularly in ophthalmology, considerable research has been conducted to propose systems for the diagnosis and analysis of several retinal diseases, including Diabetic Retinopathy (DR) and Age-related Macular Degeneration (AMD) [1].

Image registration is the procedure of aligning two or more images into a common coordinate system. Applying registration to multimodal images integrates their information and thus benefits from their complementary nature. There are various ocular imaging modalities, such as Fluorescein Angiography (FA) and Optical Coherence Tomography (OCT), each of which shows some features of the retina better than the others. FA is an invasive imaging technique that clearly illustrates the retinal vasculature through injection of fluorescein dye. FA is known as the gold-standard ocular imaging modality for presenting neurovascular structure and assessing vascular diseases such as DR and Diabetic Macular Edema (DME) [2]. On the other hand, OCT is a prevalent, non-invasive ocular imaging modality that works on the principle of Michelson interferometry, using near-infrared light instead of the sound waves of ultrasound, and presents a cross-sectional view of the retinal structure. OCT can be used to identify macular edema, macular cysts, vitreomacular traction, sub-retinal fluid, pigment epithelial detachment, and choroidal neovascularization, and to measure retinal thickness [3].

As mentioned before, FA is used to diagnose DR, a common retinal disease, with hyperfluorescent spots as hallmarks of Microaneurysms (MAs) [4]. A greater number of MA spots indicates DR progression. Although FA depicts the neurovascular structure of the retina precisely, it requires an intravenous injection that is harmful to the human body, as well as an expert photographer and a patient who remains strictly still. Unlike FA, OCT is non-invasive and provides in-depth retinal data. Therefore, clinicians usually use information from both imaging modalities for better diagnosis and evaluation of retinal diseases. A great number of papers have evaluated MA visual properties via intuitive, side-by-side comparison of FA and OCT images [4–8], whereas the information of both images can be combined using automatic and precise retinal image registration. Furthermore, by reducing subject-dependent errors, such registration can be more helpful for clinical purposes.

So far, many retinal image registration techniques have been proposed to align images of various modalities into a common coordinate system. From a technical point of view, these techniques can be classified into two categories: segmentation-based methods and non-segmentation-based ones [9]. In the former, the retinal image is partitioned into meaningful segments such as vessels and the optic disc, making the analysis of the image simpler [10]. In the latter, the full image content is processed using image processing techniques. A comprehensive review, analysis, and classification of retinal vessel segmentation algorithms and methodologies is provided in [11], and a survey of state-of-the-art techniques for extracting retinal vessels is presented in [12]. In [13], a fast and accurate registration algorithm is proposed based on Salient Feature Regions (SFR) followed by local rigid transformation. The alignment process is then augmented with a global second-order polynomial transformation. This method is only applicable to mono-modal fundus images. In [14], a registration algorithm for poor-quality multimodal retinal images using the Harris corner detector and the Partially Intensity Invariant Feature Descriptor (PIIFD) is presented. A multimodal registration of Spectral Domain OCT (SD-OCT) volumes and fundus photographs is suggested in [15], which uses the Features from Accelerated Segment Test (FAST) feature detector and the Histogram of Oriented Gradients (HOG) descriptor. In this method, the affine transformation is computed using RANdom SAmple Consensus (RANSAC) [16]. A multi-resolution Difference of Gaussian pyramid with Saddle detector (D-Saddle) is proposed in [17] to detect feature points in low-quality regions that contain vessels of varying contrast and size. In [18], a two-step framework is proposed to register multimodal retinal images: in the first step, HOG descriptor matching is applied to the mean phase image for transformation computation, and in the second step, a deformable registration method based on the Modality Independent Neighborhood Descriptor (MIND) is used to improve the registration accuracy. In [19], reliable feature point sets are extracted from model and target retinal images based on the SURF detector and PIIFD descriptor; then Mixture Feature and Structure Preservation (MFSP) is used to map the feature point sets and find exact point correspondences. An Adaptive RANSAC (A-RANSAC) method is proposed in [20] for multimodal retinal image registration, in which the threshold value is chosen so that the Root Mean Square Error (RMSE) and the number of removed matches are optimized simultaneously. It should be noted that all of the aforementioned methods belong to the non-segmentation-based category; the following methods belong to the segmentation-based one.

In [21], a registration of OCT projection images with color fundus photographs is proposed based on hierarchical local feature matching that searches for the local maximum of a similarity function. Reference [22] registers pairs of FA frames for the purpose of segmenting fluorescein leakage in patients with DME, where a two-step registration method performs global and local registration in turn. In the global registration step, after vessel extraction using an exploratory Dijkstra forest algorithm, Speeded Up Robust Features (SURF) are detected, extracted, and matched, and then the M-estimator SAmple Consensus (MSAC) [23] algorithm is applied for transformation computation. Thereafter, in the local registration step, intensity-based multi-resolution registration is performed on local patches to obtain optimal results. In [24], a joint vessel segmentation and deformable registration model is proposed based on a Convolutional Neural Network (CNN). In vessel segmentation, a style loss guides the model to generate segmentation maps and helps transform images of different modalities into consistent representations; in deformable registration, a content loss helps find dense correspondences for multimodal images. An image registration algorithm is proposed in [25] to trace changes in the retinal structure across modalities using vessel segmentation and automatic landmark detection: the segmentation of the vessels is done using a U-Net and the detection of vessel junctions is achieved with Mask R-CNN. In [26], a new method is presented to register SD-OCT and FA images; for this purpose, SLO images captured by the Heidelberg Spectralis HRA2/OCT device are used as intermediate images. To the best of our knowledge, only [26] registers OCT images with FA photographs via corresponding SLO images.

As a result of differences between multimodal retinal images, when common feature-based registration methods are applied with RANSAC or MSAC, the outlier matches are not removed completely. State-of-the-art mismatch removal techniques are proposed in [27–30]. An efficient approach is designed in [27] based on maintaining the local neighborhood structures of the potential true matches; the problem is formulated as a mathematical model, and a closed-form solution with linearithmic time and linear space complexity is derived. In [28], mismatch removal is converted to a two-class classification problem, and a general classifier is learned to determine the correctness of an arbitrary putative match. The method in [29] adaptively clusters the putative matches into several motion-consistent clusters together with an outlier/mismatch cluster; the classic Density-Based Spatial Clustering of Applications with Noise (DBSCAN) method is customized in the context of feature matching to implement spatial clustering, and an iterative clustering strategy is designed to promote the matching performance in case of severely degraded data. In [30], a feature-guided Gaussian Mixture Model (GMM) is proposed for the non-rigid registration of retinal images; the problem is formulated as the estimation of a feature-guided mixture of densities, where a GMM is fitted to one point set and the centers of the Gaussian densities, characterized by spatial positions associated with local appearance descriptors, are constrained to coincide with the other point set.

Due to differences in contrast, resolution, and brightness of multimodal retinal images, prevalent feature detection, extraction, and matching schemes do not result in perfect matches. Also, the images resulting from vessel extraction of image pairs are not exactly the same, and the accuracy of retinal image registration methods that use vessel segmentation is largely dependent on the accuracy of the segmentation. In addition, the relationship between retinal image pairs is usually modeled by an affine transformation, which cannot generate accurate alignments because the retinal surface is not planar.

In this paper, a novel method is proposed for registering FA and SD-OCT images using SLO photographs. To do so, after applying preprocessing techniques to the FA and SLO images, the vessels are extracted in both images and the gray-level images are converted to new black-and-white vessel-map images. Next, a new feature detector-descriptor method is applied to these vessel-map photographs to remove outlier matches completely. To this end, SURF features are first detected and extracted from both images. Likewise, the FAST-HOG framework is applied to both images to detect and extract a new feature set. Applying only one of these feature detector-descriptors separately with RANSAC or MSAC does not result in perfect matches. Therefore, SURF feature descriptors from the two images are matched using approximate nearest neighbors [31], and the same procedure is repeated with the HOG feature descriptors. Since this matching method may include some erroneous matches, the MSAC method is applied separately to the matched SURF and HOG features to refine them, but because of vessel segmentation errors and the different modalities, some outliers still remain. The two vectors of matched features resulting from the previous steps are then concatenated to enrich the matched features. After applying MSAC again to the concatenated features and obtaining perfect matches, a global rigid transformation is calculated and applied to the vessel-map and original images. Combining these features and applying MSAC three times guarantees complete elimination of outlier matches. In addition, to account for the non-planar surface of the retina, the transformed images are globally aligned again by applying a Gaussian model for the surface of the retina, which increases the accuracy of the previous step. This reduces the amount of deformation introduced by the subsequent local registration step; large deformations in multimodal image registration could distort the original image, especially in regions that contain no vessels. Eventually, the displacement field between the transformed and reference images is estimated using the method presented in [32] to register the two images precisely. Our dataset contains 36 image pairs from 21 subjects with diabetic retinopathy. Accurate registration of FA and SD-OCT images enables us to precisely locate diabetic retinopathy symptoms such as MAs and leakages in OCT B-scans and to study their morphological appearance.

The proposed method is explained in Section 2. Section 3 presents experimental results, and finally, the paper is concluded in Section 4.

2. Methods

Because of the different nature of the two modalities, direct registration of OCT images and FA photographs is not possible: one presents a depth view of the retina and the other shows its surface. On the other hand, the Heidelberg Spectralis HRA2/OCT device provides the capability of simultaneous SLO/OCT imaging by means of a single light source, which guarantees exact correspondence between the SLO image of the retinal surface and the cross-sectional OCT B-scans. In this paper, precise automatic registration of FA images with OCT data is achieved via registration of the FA images with the corresponding SLO photographs. A comprehensive block diagram of the proposed four-step method is shown in Fig. 1. As shown, the overall process is divided into four main sections: data acquisition, preprocessing, global registration, and local registration. In the following, the steps are explained in detail.

Fig. 1. Block diagram of overall process of registration.

2.1 Data acquisition

In this research, the dataset comprises 36 pairs of FA and SLO images from 21 subjects with diabetic retinopathy, where the SLO image pixels are in exact correspondence with the OCT B-scans (Fig. 2 and Fig. 3). The FA, OCT, and SLO images were captured with a Heidelberg Spectralis HRA2/OCT device at the Didavaran Eye Clinic, Isfahan. The FA and SLO images have the same size of 768×768 pixels, and the FA images were captured with two different fields of view (30 and 55 degrees). Here, the FA and SLO images are considered the moving (source) and fixed (target) images, respectively.

Fig. 2. (a) SLO image. (b) The OCT B-scan corresponding to the yellow line in the SLO image.

Fig. 3. Our data set includes 36 pairs of FA and SLO images. Odd columns illustrate FA images with different fields of view (30 and 55 degrees); even columns depict the SLO images corresponding to the column on their left.

2.2 Preprocessing

In this step, the contrast of the FA and SLO images is first enhanced using the Contrast Limited Adaptive Histogram Equalization (CLAHE) method [33]. Then the FA image intensities are complemented so that the image becomes similar to the SLO image. Next, the vessels of both the moving and fixed photographs are extracted. To do so, the method presented in [34] is applied to precisely segment the main vessel structure using the smoothed and original images; binarization is then performed using Otsu's threshold selection method [35], and small objects are removed from the binary image. Due to differences in contrast, resolution, and brightness of the multimodal retinal images, the binary images resulting from vessel extraction of the FA and SLO images are not exactly the same (Fig. 4).
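As an illustration, a minimal Python sketch of such a preprocessing chain (OpenCV and scikit-image) might look like the following; the CLAHE settings, the median-filter background subtraction used here as a crude stand-in for the vessel enhancement of [34], and the small-object size are assumptions made for this sketch rather than the exact choices of the paper.

```python
import cv2
import numpy as np
from skimage.morphology import remove_small_objects

def vessel_map(img_gray, is_fa, min_object_px=100):
    """Sketch: CLAHE enhancement, FA intensity complement, crude vessel
    enhancement, Otsu binarization, and small-object removal."""
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))     # assumed settings
    enhanced = clahe.apply(img_gray)
    if is_fa:
        enhanced = cv2.bitwise_not(enhanced)      # complement FA so it resembles SLO
    background = cv2.medianBlur(enhanced, 21)     # stand-in for the smoothing in [34]
    vessels = cv2.subtract(background, enhanced)  # dark vessels become bright ridges
    _, binary = cv2.threshold(vessels, 0, 255,
                              cv2.THRESH_BINARY + cv2.THRESH_OTSU)  # Otsu [35]
    mask = remove_small_objects(binary.astype(bool), min_size=min_object_px)
    return (mask * 255).astype(np.uint8)
```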

Fig. 4. Vessel Extraction. (a) FA image. (b) vessels extracted from FA. (c) SLO image. (d) vessels extracted from SLO.

2.3 Global registration

The global registration process brings the FA and SLO images into spatial alignment; here, the FA image is mapped into the space of the SLO photograph. For the registration scheme, it should be taken into account that the retinal surface is not flat [1,36,37]. The registration schemes in [36,37] use a quadratic model for the surface of the retina. Since these methods consider only small eye shifts and model them as translations and rotations, they cannot be applied directly to multimodal and multi-scale images. Also, the quadratic model does not adequately describe the retinal surface for the fields of view of our image dataset and leads to inaccurate results. On the other hand, the study in [1] describes the retina as a Gaussian surface, but it did not use this model in related applications. Therefore, a new two-step global registration is proposed that applies a Gaussian model to improve the global registration accuracy. In our proposed method, the first step aligns the two images globally using a new feature-based method and brings them to the same scale; then the Gaussian model is applied to calculate displacements and decrease the deformation effects in the subsequent local registration step. Rigid global registration is obtained using the following steps:

2.3.1 Control points detection

From another point of view, image registration techniques can be classified into two categories: intensity-based techniques and feature-based ones. In the former, the whole image or sub-images are registered, whereas the latter uses distinct image features as control points and matches corresponding features of the two images for registration. Control points are well-defined image pixels that can be detected robustly and are distributed all over the image. In the proposed method, features are detected, extracted, and matched to be used as the input of the transformation computation stage. Therefore, after vessel extraction, the control points should be detected in the vessel-map images. As mentioned before, due to inevitable errors in vessel segmentation algorithms, using a single feature detector-descriptor and MSAC alone does not lead to perfect matches. To address this problem, two feature detection algorithms are utilized: Features from Accelerated Segment Test (FAST) corner points [38] and Speeded Up Robust Features (SURF) [39].

2.3.1.1 FAST corner detector

The FAST corner detector examines a Bresenham circle of radius 3 containing 16 pixels around the candidate pixel (Fig. 5(a)). If the pixel satisfies either of the two following conditions, it is classified as a corner point:

$$\begin{array}{l} 1:\forall x \in S,{I_x} > {I_p} + T\\ 2:\forall x \in S,{I_x} < {I_p} - T \end{array}$$
where $S$ and ${I_x}$ denote a set of $N$ contiguous pixels of the Bresenham circle and the intensity of pixel $x$, respectively. Also, ${I_p}$ stands for the intensity of the candidate pixel and $T$ for the threshold. The parameters $T$ and $N$ are selected as a compromise between the number of corner points and the computational complexity; here $N$ can be 9, 10, 11, or 12. Also, a high-speed test is used to reject non-corner pixels quickly: unless at least three of the four pixels 1, 5, 9, and 13 of the Bresenham circle are all brighter than ${I_p} + T$ or all darker than ${I_p} - T$, the evaluated pixel cannot be a corner; otherwise all 16 pixels are examined [38] (Fig. 5(a)). Figures 5(b) and 5(c) show the FAST features detected in the vessel maps of the FA and SLO images, respectively.
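As a sketch, the segment-test criterion described above can be written directly; this is for illustration only, and the high-speed rejection test and non-maximum suppression of [38] are omitted.

```python
import numpy as np

# Offsets (row, col) of the 16 pixels of a Bresenham circle of radius 3.
CIRCLE = [(0, 3), (1, 3), (2, 2), (3, 1), (3, 0), (3, -1), (2, -2), (1, -3),
          (0, -3), (-1, -3), (-2, -2), (-3, -1), (-3, 0), (-3, 1), (-2, 2), (-1, 3)]

def is_fast_corner(img, r, c, T=20, N=9):
    """True if N contiguous circle pixels are all brighter than Ip+T or all
    darker than Ip-T. The candidate (r, c) is assumed >= 3 px from the border."""
    Ip = int(img[r, c])
    ring = [int(img[r + dr, c + dc]) for dr, dc in CIRCLE]
    ring = ring + ring[:N - 1]                  # wrap around for contiguity
    brighter = [v > Ip + T for v in ring]
    darker = [v < Ip - T for v in ring]
    for start in range(16):
        if all(brighter[start:start + N]) or all(darker[start:start + N]):
            return True
    return False
```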

2.3.1.2 SURF (Fast-Hessian) blob detector

Blobs are image regions that differ from surrounding regions in some image property such as color or brightness. The principles behind the SURF blob detector are based on those of the Scale-Invariant Feature Transform (SIFT) [40], but they differ in details. SURF works on integral images instead of the original ones and uses box filters to increase the speed of filtering. Using the integral image, the sum of values within a rectangle of any size can be computed by evaluating only the four corners of that rectangle. SURF employs the determinant of the Hessian matrix (Eq. (1)) to specify the location and scale of feature points.

$$H(x,\sigma ) = \left[ {\begin{array}{{cc}} {{L_{xx}}(x,\sigma )}&{{L_{xy}}(x,\sigma )}\\ {{L_{xy}}(x,\sigma )}&{{L_{yy}}(x,\sigma )} \end{array}} \right]$$
Here, ${L_{xx}}(x,\sigma )$ denotes the convolution of the Gaussian second-order derivative with the image at point $x$ and scale $\sigma$, and similarly for ${L_{xy}}(x,\sigma )$ and ${L_{yy}}(x,\sigma )$. To speed up the computation, approximations of the second-order derivatives are used instead. A point is considered a blob where $\det ({H_{approx}})$ of Eq. (2) attains a maximum.
$$\det ({H_{approx}}) = {D_{xx}}.{D_{yy}} - {(0.9{D_{xy}})^2}$$

Fig. 5. (a) FAST corner detector and Bresenham circle. (b) FAST corners in FA vessel map (green dots). (c) FAST corners in SLO vessel map (green dots).

Fig. 6. SURF features. (a) 150 strongest SURF features on SLO vessel map (green circles). (b) 150 strongest SURF features on FA vessel map (green circles).

In Eq. (2), $\det ({H_{approx}})$, ${D_{xx}}$, ${D_{yy}}$ and ${D_{xy}}$ are approximations of $\det (H)$, ${L_{xx}}$, ${L_{yy}}$ and ${L_{xy}}$, respectively [39]. Figure 6 shows the 150 strongest SURF features in both the SLO and FA vessel maps.
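For illustration, the blob response of Eq. (2) at a single scale can be sketched as follows; Gaussian second-derivative filters are used here as a stand-in for the box-filter approximations $D_{xx}$, $D_{yy}$, $D_{xy}$ that SURF evaluates on the integral image, so this is only an approximation of the detector, not its implementation.

```python
import numpy as np
from scipy import ndimage

def hessian_det_response(img, sigma):
    """Single-scale blob response of Eq. (2), using Gaussian second derivatives
    in place of SURF's box filters; maxima of this map indicate blob centers."""
    img = img.astype(np.float64)
    Dxx = ndimage.gaussian_filter(img, sigma, order=(0, 2))  # d2/dx2
    Dyy = ndimage.gaussian_filter(img, sigma, order=(2, 0))  # d2/dy2
    Dxy = ndimage.gaussian_filter(img, sigma, order=(1, 1))  # d2/dxdy
    return Dxx * Dyy - (0.9 * Dxy) ** 2
```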

2.3.2 Feature extraction

Once features have been detected, the local patch around each feature can be described by a compact representation called a feature vector. In the proposed method, the SURF (Fast-Hessian) and FAST detected features are described using the SURF and HOG feature descriptors, respectively.

2.3.2.1 SURF feature descriptor

To achieve a rotation-invariant descriptor, a main orientation is assigned to each previously detected SURF feature. To do so, Haar wavelet responses are calculated in the vertical and horizontal directions in a circular neighborhood of radius 6×s around the detected feature, where s denotes the scale of the feature, and appropriate Gaussian weights are applied. The main orientation is estimated by computing the sum of all wavelet responses within a sliding orientation window of 60 degrees. Then, a square region around the feature is aligned to the estimated orientation and split into $4 \times 4$ sub-regions; in each sub-region, the horizontal and vertical wavelet responses are summed into a vector $v = (\sum {{d_x},\sum {{d_y}} ,\sum {|{{d_x}} |} } ,\sum {|{{d_y}} |} )$, leading to a $64$-dimensional $(4 \times 4 \times 4)$ feature vector [39].
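The assembly of the 64-dimensional vector can be sketched as follows, assuming the oriented, Gaussian-weighted Haar responses $d_x$, $d_y$ have already been sampled on the standard 20×20 grid around the feature; that sampling and the orientation assignment are omitted here.

```python
import numpy as np

def surf_descriptor_from_responses(dx, dy):
    """Build the 64-D SURF vector from precomputed Haar responses dx, dy.
    dx, dy: 20x20 arrays, split into a 4x4 grid of 5x5 sub-regions."""
    desc = []
    for i in range(4):
        for j in range(4):
            sub_dx = dx[5 * i:5 * i + 5, 5 * j:5 * j + 5]
            sub_dy = dy[5 * i:5 * i + 5, 5 * j:5 * j + 5]
            desc += [sub_dx.sum(), sub_dy.sum(),
                     np.abs(sub_dx).sum(), np.abs(sub_dy).sum()]
    v = np.asarray(desc)
    return v / (np.linalg.norm(v) + 1e-12)   # unit length for contrast invariance
```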

2.3.2.2 HOG feature descriptor

Here, the HOG descriptor is used to describe the FAST features [41]. Individual gradients may be affected by noise, which is why a histogram of gradients is used to represent image patches, making the descriptor robust against noise. To this end, the image is first divided into blocks and each block is partitioned into cells. Thereafter, the gradient magnitude and orientation are calculated for each pixel within the cell. $N$ bins are considered, each representing an angle starting from 0 degrees and ending at $180 - \frac{180}{N}$ degrees with steps of $\frac{180}{N}$ degrees for unsigned gradients. Within the cell, the gradient magnitudes corresponding to the same orientation are summed, leading to an $N$-dimensional vector where each dimension contains the aggregate of the corresponding gradient magnitudes. Note that if a gradient orientation falls between two bins, the corresponding value is split proportionally between the adjacent bins. Finally, the feature vector is normalized so that it is not affected by lighting variations. In the proposed method, $N$ = 9, cell size = [8,8] pixels, and block size = [2,2] cells, so the HOG vector has 36 dimensions (9×2×2). Figure 7 visualizes HOG descriptors on the vessel-map images.
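A minimal sketch of extracting such a 36-dimensional descriptor around one FAST corner, using scikit-image's HOG implementation with the stated parameters, is given below; the 16×16 patch size (one 2×2-cell block of 8×8-pixel cells) is an assumption made for this illustration.

```python
import numpy as np
from skimage.feature import hog

def hog_descriptor_at(vessel_map, point, patch=16):
    """36-D (9 x 2 x 2) HOG descriptor of the patch centered on one FAST corner.
    vessel_map: 2-D uint8 image; point: (row, col), assumed away from the border."""
    r, c = point
    half = patch // 2
    region = vessel_map[r - half:r + half, c - half:c + half].astype(np.float64)
    return hog(region, orientations=9, pixels_per_cell=(8, 8),
               cells_per_block=(2, 2), block_norm='L2', feature_vector=True)
```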

Fig. 7. HOG visualization. (a) SLO vessel map. (b) HOG visualization of (a). (c) FA vessel map. (d) HOG visualization of (c).

2.3.3 Feature matching

After the FAST and SURF features are detected and described with the HOG and SURF descriptors, the HOG descriptors of the two binary vessel-map images are matched, and the same procedure is applied to the SURF descriptors. To do this, the method in [31] is used, which presents an automatic algorithm for approximate nearest neighbor search that, given the data set and desired precision, speeds up the matching process. According to [31], depending on the data set and required precision, two methods, namely searching hierarchical k-means trees with a priority search order [31] and multiple randomized kd-trees [42], can perform well. Afterwards, since the abovementioned matching may include some erroneous matches, the MSAC method is applied separately to the matched SURF and HOG features to refine them and obtain better matches. This results in a more precise inlier set of matched features, but as can be seen in Fig. 8(a), some outliers still remain. MSAC is a variant of RANSAC: the thresholding in RANSAC can be understood as a top-hat cost function, whereas MSAC uses a truncated quadratic cost function [43]. The steps of MSAC are as follows (a minimal sketch is given after the list):

  • a. Select a subset of two pairs from the matched features (which may contain incorrect matches) and compute the similarity transformation from this subset.
  • b. Apply the transformation to the remaining matched features of the moving image and calculate the distances between the transformed feature points and the corresponding feature points of the fixed image. The transformed features of the moving image whose distances from the corresponding features of the fixed image are less than the aforementioned threshold are considered inliers and denoted by ${S_i}$.
  • c. Repeat steps a and b $N$ times ($N$ = 1000) and take the ${S_i}$ with the largest number of members as the inliers. In this step, transformation computation is used only to refine the inlier matches; the overall transformation estimation is done in the next step.
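A minimal sketch of such an MSAC loop (two-point similarity model, truncated quadratic cost) is given below; the distance threshold and iteration count are illustrative values, and scikit-image's SimilarityTransform stands in for whichever transform estimator is used in practice.

```python
import numpy as np
from skimage.transform import SimilarityTransform

def msac_similarity(src, dst, thresh=5.0, n_iter=1000, rng=None):
    """MSAC sketch: similarity model from 2-point samples, truncated quadratic
    cost, returning the inlier mask of the best model.
    src, dst: (N, 2) arrays of putative matched points (moving, fixed)."""
    rng = np.random.default_rng(rng)
    best_cost, best_inliers = np.inf, None
    for _ in range(n_iter):
        idx = rng.choice(len(src), size=2, replace=False)
        model = SimilarityTransform()
        if not model.estimate(src[idx], dst[idx]):
            continue
        residuals = np.linalg.norm(model(src) - dst, axis=1)
        cost = np.sum(np.minimum(residuals ** 2, thresh ** 2))  # truncated quadratic
        if cost < best_cost:
            best_cost, best_inliers = cost, residuals < thresh
    return best_inliers
```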

Fig. 8. Final MSAC step refines the previously-matched features. (a) matched features after joining separately refined SURF and HOG matches (still contains few outliers). (b) obtaining perfect matches after final MSAC step.

2.3.4 Transformation computation

To compute the similarity transformation, which is the registration objective, the matched FAST and SURF features of the fixed and moving images are joined. Thereafter, the MSAC method is applied again and the final transformation is re-computed from all inliers associated with the largest ${S_i}$. Using SURF and FAST features together in this manner has two main advantages: first, it benefits from the properties of both feature detection-description frameworks, and second, refining the matched features with MSAC three times causes the erroneous matches to be removed completely. Figure 8 shows how the final MSAC step refines and perfects the previously matched features. The computed transformation is then applied to the original images. The proposed feature detection, description, and matching scheme is flexible and can be generalized: other feature detection, extraction, and matching techniques can be added to the scheme to increase the number of inlier matches, at the cost of longer execution time. In general, for $k$ separate feature detectors, MSAC should be applied $k + 1$ times. A sketch of this fusion step is given below.
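As a sketch of the fusion just described, the two separately refined match sets can be concatenated and passed through one final MSAC pass before the global similarity transform is estimated from all surviving inliers; the `msac` argument is assumed to be a routine such as the MSAC sketch given earlier.

```python
import numpy as np
from skimage.transform import SimilarityTransform

def fuse_and_refine(surf_src, surf_dst, hog_src, hog_dst, msac):
    """Concatenate the two separately refined match sets, run a third and final
    MSAC pass, then estimate the global similarity from all surviving inliers."""
    src = np.vstack([surf_src, hog_src])
    dst = np.vstack([surf_dst, hog_dst])
    inliers = msac(src, dst)                 # final outlier rejection
    final = SimilarityTransform()
    final.estimate(src[inliers], dst[inliers])
    return final, inliers
```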

2.3.5 Applying the Gaussian model for retina surface

As mentioned before, the retinal surface is not flat, and in the proposed method a Gaussian model is applied to improve the global registration accuracy. Figure 9 illustrates a point on the retinal surface expressed in two different 3D coordinate systems related to two different cameras, ${C_p}$ and ${C_q}$. The point is represented as ${P^T} = (X,Y,Z)$ and ${Q^T} = (X^{\prime},Y^{\prime},Z^{\prime})$ in the coordinate systems of ${C_p}$ and ${C_q}$, respectively. Assume the transformed FA and SLO images are denoted by ${I_p}$ and ${I_q}$, corresponding to the $P$ and $Q$ coordinate systems. The Gaussian equation that models the retinal surface and relates $X,Y$ and $Z$ in the coordinate system of ${C_p}$ is:

$$Z = {A_1}{e^{ - {{(\frac{{(X - {A_2})}}{{{A_3}}})}^2}}}{e^{ - {{(\frac{{(Y - {A_4})}}{{{A_5}}})}^2}}}$$
where ${A_1},{A_2},\ldots ,{A_5}$ are unknown parameters. Here, the aim is to derive the transformation that maps the image coordinates $p$ to $q$, using the transformation that relates $P$ and $Q$. Owing to the global registration of the previous step, a rigid transformation relates $P$ and $Q$ (Eq. (4)):
$${\boldsymbol Q} = {\boldsymbol RP} + {\boldsymbol t}$$
where $Q$ and $P$ are the point coordinates in the ${C_q}$ and ${C_p}$ systems, and $R$ and $t$ are the orthonormal rotation matrix and the translation vector, respectively:
$$\left[ {\begin{array}{{c}} {X^{\prime}}\\ {Y^{\prime}}\\ {Z^{\prime}} \end{array}} \right] = \left[ {\begin{array}{{ccc}} {{r_{11}}}&{{r_{12}}}&{{r_{13}}}\\ {{r_{21}}}&{{r_{22}}}&{{r_{23}}}\\ {{r_{31}}}&{{r_{32}}}&{{r_{33}}} \end{array}} \right]\left[ {\begin{array}{{c}} X\\ Y\\ Z \end{array}} \right] + \left[ {\begin{array}{{c}} {{t_x}}\\ {{t_y}}\\ {{t_z}} \end{array}} \right]$$

Fig. 9. Two different camera viewpoints of the retina.

Here, the weak perspective projection model adequately projects the 3D retinal surface onto the 2D image plane [36,37]. The projections of $P$ and $Q$ in images ${I_p}$ and ${I_q}$, in homogeneous coordinates, are as follows:

$$\left[ {\begin{array}{{c}} {wx}\\ {wy}\\ w \end{array}} \right] \cong {M_p}\left[ {\begin{array}{{c}} X\\ Y\\ Z\\ 1 \end{array}} \right]\quad and\quad \left[ {\begin{array}{{c}} {w^{\prime}x^{\prime}}\\ {w^{\prime}y^{\prime}}\\ {w^{\prime}} \end{array}} \right] \cong {M_q}\left[ {\begin{array}{{c}} {X^{\prime}}\\ {Y^{\prime}}\\ {Z^{\prime}}\\ 1 \end{array}} \right]$$
where ${M_p}$ and ${M_q}$ are weak perspective camera projection matrices:
$${M_p} = \left[ {\begin{array}{cccc} {{\alpha_x}}&0&0&{{c_x}}\\ 0&{{\alpha_y}}&0&{{c_y}}\\ 0&0&0&s \end{array}} \right]\quad and\quad {M_q} = \left[ {\begin{array}{cccc} {{{\alpha^{\prime}}_x}}&0&0&{{{c^{\prime}}_x}}\\ 0&{{{\alpha^{\prime}}_y}}&0&{{{c^{\prime}}_y}}\\ 0&0&0&{s^{\prime}} \end{array}} \right]$$
In Eq. (7), ${\alpha _x},{\alpha _y},{\alpha ^{\prime}_x},$ and ${\alpha ^{\prime}_y}$ are pixel dimension parameters, ${c_x},{c_y},{c^{\prime}_x},$ and ${c^{\prime}_y}$ denote the centers of projection in ${C_p}$ and ${C_q}$, while $s$ and $s^{\prime}$ are scaling parameters of the weak perspective projection.

From Eq. (6):

$$p=\left[\begin{array}{l} x \\ y \end{array}\right]=\left[\begin{array}{l} \frac{\alpha_{x} X+c_{x}}{s} \\ \frac{\alpha_{y} Y+c_{y}}{s} \end{array}\right] \rightarrow\left[\begin{array}{l} X \\ Y \end{array}\right]=\left[\begin{array}{c} \frac{s x-c_{x}}{\alpha_{x}} \\ \frac{s . y-c_{y}}{\alpha_{y}} \end{array}\right] \text { and } q=\left[\begin{array}{l} x^{\prime} \\ y^{\prime} \end{array}\right]=\left[\begin{array}{c} \frac{\alpha_{x}^{\prime} X^{\prime}+c_{x}^{\prime}}{s^{\prime}} \\ \frac{\alpha_{y}^{\prime} Y^{\prime}+c_{y}^{\prime}}{s^{\prime}} \end{array}\right]$$
Applying Eq. (8) to Eq. (3) results in:
$$Z = {a_1}{e^{ - {{(\frac{{(x - {a_2})}}{{{a_3}}})}^2}}}{e^{ - {{(\frac{{(y - {a_4})}}{{{a_5}}})}^2}}}$$
where ${a_1} = {A_1}$, ${a_2} = \frac{{{c_x} + ({\alpha _x} \times {A_2})}}{s}$ and so on.

Now, from Eq. (5), Eq. (8), and Eq. (9), $X^{\prime}$ and $Y^{\prime}$ are calculated in terms of $x$ and $y$. Finally, $x^{\prime}$ and $y^{\prime}$ can be obtained from Eq. (8) as follows:

$$\left\{ {\begin{array}{{c}} {x^{\prime} = {d_{11}} + {d_{12}}x + {d_{13}}y + {d_{14}}{e^{ - {{(\frac{{x - {d_{15}}}}{{{d_{16}}}})}^2}}}{e^{ - {{(\frac{{y - {d_{17}}}}{{{d_{18}}}})}^2}}}}\\ {y^{\prime} = {d_{21}} + {d_{22}}x + {d_{23}}y + {d_{24}}{e^{ - {{(\frac{{x - {d_{25}}}}{{{d_{26}}}})}^2}}}{e^{ - {{(\frac{{y - {d_{27}}}}{{{d_{28}}}})}^2}}}} \end{array}} \right.$$
The unknown parameters ${d_{11}},{d_{12}},\ldots ,{d_{18}},{d_{21}},\ldots ,{d_{28}}$ are calculated using a robust least squares method. To do so, the first step of the global registration is applied again to the transformed images, and the matched features are used as the points for finding the parameters ${d_{11}}$ through ${d_{28}}$. As shown in Fig. 10 (and also in Fig. 13 and Table 1), the proposed Gaussian model significantly improves the result of the first step compared to the quadratic model and decreases the deformation effects of the subsequent local registration step.
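A minimal sketch of fitting Eq. (10) with robust least squares is shown below; SciPy's soft-L1 loss stands in for whichever robust weighting is actually used, and the initial guesses are assumptions made for illustration.

```python
import numpy as np
from scipy.optimize import least_squares

def fit_gaussian_surface_model(p, q):
    """Fit Eq. (10): each output coordinate is a linear part plus a 2-D Gaussian
    bump with 8 parameters, fitted robustly to matched points.
    p, q: (N, 2) matched points in the transformed FA and SLO images."""
    x, y = p[:, 0], p[:, 1]

    def model(d, x, y):
        return (d[0] + d[1] * x + d[2] * y +
                d[3] * np.exp(-((x - d[4]) / d[5]) ** 2) *
                       np.exp(-((y - d[6]) / d[7]) ** 2))

    params = []
    for k in range(2):                       # k = 0 -> x', k = 1 -> y'
        d0 = np.array([0.0, 1.0 if k == 0 else 0.0, 0.0 if k == 0 else 1.0,
                       1.0, x.mean(), x.std() + 1.0, y.mean(), y.std() + 1.0])
        res = least_squares(lambda d: model(d, x, y) - q[:, k], d0, loss='soft_l1')
        params.append(res.x)
    return params                            # [d_1*, d_2*] of Eq. (10)
```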

Fig. 10. Qualitative evaluation of three global registration schemes (yellow rectangles highlight the differences). (a) global registration using the first step (false color). (b) global registration exploiting the quadratic model for the retina surface (false color). (c) global registration using the Gaussian model for the retina surface (false color). In the false-color method, the first and second images are shown in green and magenta, and the overlapped regions are depicted in white.

2.4 Local registration

Since the global registration aligns the images only approximately (Fig. 11(a), Fig. 11(b), Fig. 11(d) and Fig. 11(e)), local registration is required to estimate a displacement field and register the images accurately (Fig. 11(c) and Fig. 11(f)). For the local registration, the method proposed in [32] is applied. In this approach, one image, called the diffusing model, is locally deformed to become similar to the other, fixed image, named the scene. The diffusing model (the transformed FA image) is treated as a grid whose vertices act as particles, while the contours of the scene objects (SLO image objects) are assumed to form a membrane with effectors named demons. These demons are placed along the membrane, with vectors perpendicular to the contour and oriented from the inside of the image objects to the outside. In the grid model, each vertex is labeled as either inside or outside, and accordingly the demons locally apply forces that push the model into or out of the scene objects [44]; for example, the demon corresponding to a vertex with inside polarity pushes that vertex of the model toward the inside of the image object. The method applies the transformation iteratively: the magnitudes of the forces applied by the demons are constant within an iteration but decrease between iterations to ensure convergence. In other words, local registration using the diffusing model can be divided into two steps:

Fig. 11. Vessel maps and images after different steps of the proposed registration. (a) vessel maps after first step of global registration (false color). (b) vessel maps after applying Gaussian model to the first step of global registration (false color). (c) vessel maps after local registration step (false color) (arrows point out the differences in registration accuracy and yellow circles show the key points). (d-f) images correspond to the vessel maps (checker board). (g-i) zoomed images correspond to the ROI specified via red rectangle in second row (checker board). (j-l) zoomed images correspond to the ROI specified via magenta rectangle in second row (checker board). (m-o) zoomed images correspond to the ROI specified via yellow rectangle in second row (checker board). In checker board method, image is composed of alternating rectangular regions from two images.

2.4.1 Pre-computation of demons set

The demons should be extracted from the scene (SLO). They can be elicited from all image pixels, image contours, or edges. Each demon includes information on its spatial position, its direction from the inside to the outside, and its current displacement from the corresponding point of the model.

2.4.2 Iterative process

After applying an initial transformation to the model, the iterative process is started. Each iteration is composed of two steps:

  • a. Computing the associated elementary force for each demon, which depends on the demon direction and the polarity of the grid model at the point corresponding to the demon.
  • b. Computing the next transformation using the previous one and all elementary demon forces computed in step a.
In the proposed method, to increase the robustness and speed of the local registration process, the number of iterations at each multi-resolution pyramid level is set to 100, and three pyramid levels are considered. Figure 12 shows the data set images after the local registration step, zoomed around a Region of Interest (ROI).
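An illustrative multi-resolution demons run is sketched below using SimpleITK purely as an off-the-shelf stand-in for the implementation of [32]; the three shrink factors and 100 iterations per level mirror the setting described above, while the displacement-field smoothing value is an assumption.

```python
import SimpleITK as sitk

def multires_demons(fixed, moving, shrink_factors=(4, 2, 1), n_iter=100):
    """Sketch: Thirion demons [32] over a 3-level pyramid, 100 iterations/level.
    fixed, moving: 2-D SimpleITK images (e.g. SLO and globally registered FA)."""
    fixed = sitk.Cast(fixed, sitk.sitkFloat32)
    moving = sitk.Cast(moving, sitk.sitkFloat32)
    demons = sitk.DemonsRegistrationFilter()
    demons.SetNumberOfIterations(n_iter)
    demons.SetStandardDeviations(1.0)          # assumed displacement-field smoothing
    field = None
    for s in shrink_factors:
        f = sitk.Shrink(fixed, [s] * fixed.GetDimension())
        m = sitk.Shrink(moving, [s] * moving.GetDimension())
        if field is None:
            field = demons.Execute(f, m)
        else:
            field = demons.Execute(f, m, sitk.Resample(field, f))  # carry field to finer level
    warp = sitk.DisplacementFieldTransform(sitk.Cast(field, sitk.sitkVectorFloat64))
    warped = sitk.Resample(moving, fixed, warp, sitk.sitkLinear, 0.0)
    return warped, warp
```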

Fig. 12. All data set images (zoomed ROI) after the local registration step (checker board). Since registration of image pair 36 failed, this pair is not shown.

3. Results

In this paper, three standard and prevalent measures are considered to assess the performance of the proposed registration method: Root-Mean-Square (RMS) error, success rate, and registration speed.

3.1 RMS error

In mono-modal image registration, the pixel intensity values of the two images can be used to determine the distance between corresponding pixels. In this case, however, for multimodal images, distances are calculated according to Eq. (11), where $n$ and $||{{p_i}} ||$ stand for the number of landmarks and the Euclidean distance between the $i$-th pair of corresponding landmarks, respectively.

$$RMS_{error} = \sqrt {\displaystyle{1 \over n}\sum\limits_{i = 1}^n {{\left\| {p_i} \right\|}^2} } $$

Since vessel branches and crossovers are easy to locate even in multimodal retinal images, these points were chosen as landmarks in each image; here $n$ equals 15. The mean and standard deviation of the RMS error of the proposed method after the global and local registration steps are given in Table 1, and box-plots of the RMS errors of the global and local registration steps are shown in Fig. 13. The RMS errors were measured by two observers, and the differences between the mean and standard deviation of the RMS errors measured by them are negligible; nevertheless, the maximum values are reported in Table 1. Table 2 compares the mean RMS error of the proposed method with other methods. It can be seen from this table that, due to combining SURF and FAST features, applying the Gaussian model for the retinal surface, and the final local registration, the RMS error of the proposed method is dramatically reduced.
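For clarity, Eq. (11) amounts to the following computation over the manually selected landmark pairs (a sketch; coordinates are assumed to be in pixels):

```python
import numpy as np

def rms_error(landmarks_fixed, landmarks_registered):
    """Eq. (11): RMS of Euclidean distances between n corresponding landmarks
    (vessel branches/crossovers), here n = 15 per image pair."""
    d = np.linalg.norm(np.asarray(landmarks_fixed) - np.asarray(landmarks_registered), axis=1)
    return np.sqrt(np.mean(d ** 2))
```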


Table 1. Mean and maximum RMS errors of global and local registration

Fig. 13. The box-plot of RMS errors of global and local registration steps.


Table 2. Quantitative evaluation of mean and maximum RMS errors of different methods

3.2 Success rate

The success rate determines what percentage of image pairs are aligned correctly; registration success or failure is directly observable. The success rates of different methods are compared in Table 3. Figure 14 provides a qualitative evaluation of the proposed method.

Fig. 14. Qualitative evaluation of the proposed method. (a) matching features using SURF on image case 3. (b) matching features using the proposed method. (c) registration failure utilizing SURF features (in color). (d) registration using the proposed method. (e) matching features using FAST-HOG on image case 8. (f) matching features using the proposed method. (g) registration failure utilizing FAST-HOG (in color). (h) registration using the proposed method.


Table 3. Success rate of different methods on the dataset

3.3 Speed

The running time of the proposed method is compared with other methods in Table 4. It is worth mentioning that our experiments were done on a PC with a Windows 7 64-bit operating system, 4 GB of RAM, and an Intel Core i5 CPU at 2.5 GHz, whereas the other reported results were obtained on a PC with a Windows 7 64-bit operating system, 64 GB of RAM, and an Intel Xeon i5 CPU at 3.7 GHz.


Table 4. Running time of different methods

4. Discussions and conclusions

In this paper, a novel method is proposed to register OCT images with FA photographs accurately. This method uses SLO images as intermediates and includes four main steps, namely image acquisition, preprocessing, global registration, and local registration. Because SURF and FAST features are used together and the matches are refined three times, the proposed method inherits the advantages of both feature frameworks, resulting in an enhanced success rate. On the other hand, applying the Gaussian model for the retinal surface and performing local registration after global registration lead to a dramatic reduction in RMS error. Precise registration of FA and OCT images via SLO photographs enables us to exactly detect the location and morphological appearance of diabetic retinopathy symptoms such as MAs and leakages in OCT B-scans. In this research, after registering the FA and OCT images with the proposed method, the morphological appearance and location of MAs in the OCT B-scans are estimated by exploiting the same properties in the corresponding FA images. Using the fact that MAs in FA images appear as hyperfluorescent dots, and utilizing the positions of these dots in the FA photographs, we analyzed the location and morphological appearance of MAs in OCT B-scans (Fig. 15). For this purpose, 38 MAs located in 20 images were analyzed. In 8 out of 38 cases, the MAs had a complete or incomplete ring sign, whereas the remaining MAs showed no ring sign. Some leakages in OCT B-scans were observed as blob signs, whereas other leakages did not show any visual sign. Figure 16 illustrates the blob sign of a leakage in an FA image and the corresponding OCT B-scan. Several other studies have also analyzed the appearance of MAs in OCT B-scans [5,46].

Fig. 15. MAs in B-scans of diabetic retinopathy. (a) MA in FA image. (b) corresponding B-scan of the yellow line in (a). (c) zoomed complete ring sign in image (b). (d) MA in an FA image. (e) corresponding B-scan of the yellow line in (d). (f) zoomed incomplete ring sign in image (e). (g) MA in an FA image. (h) corresponding B-scan of the yellow line in (g). (i) MA with no sign.

Fig. 16. Leakage in a B-scan of diabetic retinopathy. (a) FA image with leakage; red arrow and circle show part of the leakage. (b) corresponding B-scan of the yellow line in (a); yellow arrows indicate the blob-shaped (cystoid) area corresponding to the red circle in (a). (c) zoomed image of the blob-shaped (cystoid) area in (b).

Funding

Office of Vice Chancellor for Research and Technology, University of Isfahan (1390734).

Acknowledgments

The authors would like to thank Prof. Sina Farsiu of the Duke Eye Center for his valuable ideas and comments during the initiation and development of this study.

Disclosures

The authors declare that there are no conflicts of interest related to this article.

References

1. R. Kafieh, H. Rabbani, and S. Kermani, “A review of algorithms for segmentation of optical coherence tomography from retina,” J. Med. Signals Sens. 3(1), 45 (2013). [CrossRef]  

2. K. Khaderi, K. Ahmed, J. Berry, L. Labriola, and R. Cornwell, “Retinal imaging modalities: advantages and limitations for clinical practice,” Retinal Physician 8 (2011).

3. H. G. Bezerra, M. A. Costa, G. Guagliumi, A. M. Rollins, and D. I. Simon, “Intracoronary optical coherence tomography: a comprehensive review: clinical and research applications,” JACC: Cardiovasc. Interv. 2(11), 1035–1046 (2009). [CrossRef]  

4. H. Wang, J. Chhablani, W. R. Freeman, C. K. Chan, I. Kozak, D.-U. Bartsch, and L. Cheng, “Characterization of diabetic microaneurysms by simultaneous fluorescein angiography and spectral-domain optical coherence tomography,” Am. J. Ophthalmol. 153(5), 861–867.e1 (2012). [CrossRef]  

5. T. Horii, T. Murakami, K. Nishijima, A. Sakamoto, M. Ota, and N. Yoshimura, “Optical coherence tomographic characteristics of microaneurysms in diabetic retinopathy,” Am. J. Ophthalmol. 150(6), 840–848.e1 (2010). [CrossRef]  

6. S. N. Lee, J. Chhablani, C. K. Chan, H. Wang, G. Barteselli, S. El-Emam, M. L. Gomez, I. Kozak, L. Cheng, and W. R. Freeman, “Characterization of microaneurysm closure after focal laser photocoagulation in diabetic macular edema,” Am. J. Ophthalmol. 155(5), 905–912.e2 (2013). [CrossRef]  

7. Y. Yamada, K. Suzuma, A. Fujikawa, T. Kumagami, and T. Kitaoka, “Imaging of laser-photocoagulated diabetic microaneurysm with spectral domain optical coherence tomography,” Retina 33(4), 726–731 (2013). [CrossRef]  

8. L. Yeung, V. C. Lima, P. Garcia, G. Landa, and R. B. Rosen, “Correlation between spectral domain optical coherence tomography findings and fluorescein angiography patterns in diabetic macular edema,” Ophthalmology 116(6), 1158–1167 (2009). [CrossRef]  

9. M. A. Viergever, J. A. Maintz, S. Klein, K. Murphy, M. Staring, and J. P. Pluim, “A survey of medical image registration–under review,” (Elsevier, 2016).

10. L. Barghout and L. Lee, “Perceptual information processing system,” (2004).

11. M. M. Fraz, P. Remagnino, A. Hoppe, B. Uyyanonvara, A. R. Rudnicka, C. G. Owen, and S. A. Barman, “Blood vessel segmentation methodologies in retinal images–a survey,” Comput. Meth. Prog. Bio. 108(1), 407–433 (2012). [CrossRef]  

12. C. L. Srinidhi, P. Aparna, and J. Rajan, “Recent advancements in retinal vessel segmentation,” J. Med. Syst. 41(4), 70 (2017). [CrossRef]  

13. J. Zheng, J. Tian, Y. Dai, K. Deng, and J. Chen, “Retinal image registration based on salient feature regions," in 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, (IEEE, 2009), 102–105.

14. J. Chen, J. Tian, N. Lee, J. Zheng, R. T. Smith, and A. F. Laine, “A partial intensity invariant feature descriptor for multimodal retinal image registration,” IEEE Trans. Biomed. Eng. 57(7), 1707–1718 (2010). [CrossRef]  

15. M. S. Miri, M. D. Abràmoff, Y. H. Kwon, and M. K. Garvin, “Multimodal registration of SD-OCT volumes and fundus photographs using histograms of oriented gradients,” Biomed. Opt. Express 7(12), 5252–5267 (2016). [CrossRef]  

16. M. A. Fischler and R. C. Bolles, “Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,” Commun. ACM 24(6), 381–395 (1981). [CrossRef]  

17. R. Ramli, M. Y. I. Idris, K. Hasikin, A. Karim, N. Khairiah, A. W. Abdul Wahab, I. Ahmedy, F. Ahmedy, N. A. Kadri, and H. Arof, “Feature-based retinal image registration using D-Saddle feature,” J. Healthc. Eng. 2017, 1–15 (2017). [CrossRef]  

18. Z. Li, F. Huang, J. Zhang, B. Dashtbozorg, S. Abbasi-Sureshjani, Y. Sun, X. Long, Q. Yu, B. ter Haar Romeny, and T. Tan, “Multi-modal and multi-vendor retina image registration,” Biomed. Opt. Express 9(2), 410–422 (2018). [CrossRef]

19. H. Tang, A. Pan, Y. Yang, K. Yang, Y. Luo, S. Zhang, and S. H. Ong, “Retinal image registration based on robust non-rigid point matching method,” J. Med. Imaging Hlth. Inform. 8(2), 240–249 (2018). [CrossRef]  

20. Z. Hossein-Nejad and M. Nasri, “A-RANSAC: Adaptive random sample consensus method in multimodal retinal image registration,” Biomed. Signal Process. Control 45, 325–338 (2018). [CrossRef]  

21. S. Niu, Q. Chen, H. Shen, L. de Sisternes, and D. L. Rubin, “Registration of SD-OCT en-face images with color fundus photographs based on local patch matching,” (2014).

22. H. Rabbani, M. J. Allingham, P. S. Mettu, S. W. Cousins, and S. Farsiu, “Fully automatic segmentation of fluorescein leakage in subjects with diabetic macular edema,” Invest. Ophthalmol. Visual Sci. 56(3), 1482–1492 (2015). [CrossRef]  

23. P. Torr and A. Zisserman, “Robust computation and parametrization of multiple view relations,” in Sixth International Conference on Computer Vision (IEEE Cat. No. 98CH36271), (IEEE, 1998), 727–732.

24. J. Zhang, C. An, J. Dai, M. Amador, D.-U. Bartsch, S. Borooah, W. R. Freeman, and T. Q. Nguyen, “Joint Vessel Segmentation and Deformable Registration on Multi-Modal Retinal Images Based on Style Transfer,” in 2019 IEEE International Conference on Image Processing (ICIP), (IEEE, 2019), 839–843.

25. M. Arikan, A. Sadeghipour, B. Gerendas, R. Told, and U. Schmidt-Erfurt, “Deep Learning Based Multi-modal Registration for Retinal Imaging,” in Interpretability of Machine Intelligence in Medical Image Computing and Multimodal Learning for Clinical Decision Support (Springer, 2019), pp. 75–82.

26. Z. G. Kamasi, M. Mokhtari, and H. Rabbani, “Non-rigid registration of Fluorescein Angiography and Optical Coherence Tomography via scanning laser ophthalmoscope imaging,” in 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), (IEEE, 2017), 4415–4418.

27. J. Ma, J. Zhao, J. Jiang, H. Zhou, and X. Guo, “Locality preserving matching,” Intl. J. Comput. Vis. 127(5), 512–531 (2019). [CrossRef]  

28. J. Ma, X. Jiang, J. Jiang, J. Zhao, and X. Guo, “LMR: Learning a two-class classifier for mismatch removal,” IEEE Trans. on Image Process. 28(8), 4045–4059 (2019). [CrossRef]  

29. X. Jiang, J. Ma, J. Jiang, and X. Guo, “Robust feature matching using spatial clustering with heavy outliers,” IEEE Trans. on Image Process. 29, 736–746 (2020). [CrossRef]  

30. J. Ma, J. Jiang, C. Liu, and Y. Li, “Feature guided Gaussian mixture model with semi-supervised EM and local geometric constraint for retinal image registration,” Inf. Sci. 417, 128–142 (2017). [CrossRef]  

31. M. Muja and D. G. Lowe, “Fast approximate nearest neighbors with automatic algorithm configuration,” VISAPP 2(1), 2 (2009).

32. J.-P. Thirion, “Image matching as a diffusion process: an analogy with Maxwell's demons,” Med. Image Anal. 2(3), 243–260 (1998). [CrossRef]  

33. S. M. Pizer, E. P. Amburn, J. D. Austin, R. Cromartie, A. Geselowitz, T. Greer, B. ter Haar Romeny, J. B. Zimmerman, and K. Zuiderveld, “Adaptive histogram equalization and its variations,” Comput. Vis. Graph. Image Process. 39(3), 355–368 (1987). [CrossRef]

34. T. Coye, “A novel retinal blood vessel segmentation algorithm for fundus images,” MATLAB Central File Exchange (Jan 2017) (2015).

35. N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Trans. Syst., Man, Cybern. 9(1), 62–66 (1979). [CrossRef]  

36. A. Can, C. V. Stewart, B. Roysam, and H. L. Tanenbaum, “A feature-based, robust, hierarchical algorithm for registering pairs of images of the curved human retina,” IEEE Trans. Pattern Anal. Machine Intell. 24(3), 347–364 (2002). [CrossRef]  

37. M. Golabbakhsh and H. Rabbani, “Vessel-based registration of fundus and optical coherence tomography projection images of retina using a quadratic registration model,” IET Image Processing 7(8), 768–776 (2013). [CrossRef]  

38. E. Rosten and T. Drummond, “Fusing points and lines for high performance tracking,” in Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, (IEEE, 2005), 1508–1515.

39. H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool, “Speeded-up robust features (SURF),” Comput. Vis. Image Understanding 110(3), 346–359 (2008). [CrossRef]  

40. D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” Intl. J. Comput. Vis. 60(2), 91–110 (2004). [CrossRef]  

41. N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05), (IEEE, 2005), 886–893.

42. C. Silpa-Anan and R. Hartley, “Optimised KD-trees for fast image descriptor matching,” in 2008 IEEE Conference on Computer Vision and Pattern Recognition, (IEEE, 2008), 1–8.

43. K. Lebeda, “Robust Sampling Consensus,” (2013).

44. T. Vercauteren, X. Pennec, A. Perchant, and N. Ayache, “Diffeomorphic demons: Efficient non-parametric image registration,” NeuroImage 45(1), S61–S72 (2009). [CrossRef]  

45. M. S. Miri, M. D. Abràmoff, K. Lee, M. Niemeijer, J.-K. Wang, Y. H. Kwon, and M. K. Garvin, “Multimodal segmentation of optic disc and cup from SD-OCT and color fundus photographs using a machine-learning graph-based approach,” IEEE Trans. Med. Imaging 34(9), 1854–1866 (2015). [CrossRef]  

46. M. Bolz, U. Schmidt-Erfurth, G. Deak, G. Mylonas, K. Kriechbaum, C. Scholda, and D. R. R. G. Vienna, “Optical coherence tomographic hyperreflective foci: a morphologic sign of lipid extravasation in diabetic macular edema,” Ophthalmology 116(5), 914–920 (2009). [CrossRef]  
