DOI: 10.1002/mp.17949 ISSN: 0094-2405

Use of a deep learning neural network to generate bone suppressed images for markerless lung tumor tracking

Jason Luce, Mandeep Kaur, Joseph Dingillo, Andrew Keeler, Mathias Lehmann, Daniel Morf, Liangjia Zhu, Hyejoo Kang, Ha Nguyen, Michal Walczak, Matthew M. Harkenrider, John C. Roeske

Abstract

Background

Markerless tumor tracking (MTT) is being considered for real‐time motion management of lung tumors. However, bony structures in conventional x‐ray images can obfuscate the tumor, increasing tracking difficulty. Bone suppression using dual energy subtraction (DES) can improve tumor visibility but requires additional hardware or software that is not currently available with commercial on‐board imaging (OBI) systems.

Purpose

This study compares DES images to synthetic DES (sDES) images generated by a U‐net neural network, examining both image quality and tracking results.

Methods

High (120 kV) and low (60 kV) energy image pairs were obtained over 180‐degree rotation using fast‐kV switching for a motion phantom and 20 lung cancer patients. DES images were generated offline using weighted logarithmic subtraction. A U‐net was then trained to transform 120 kV images into sDES images. Images from the phantom (2694 image pairs) and 20 patients (4499 image pairs), were divided into training, validation, and test sets consisting of 70%, 15%, and 15% of the images, and used for network training and evaluation. The similarity between sDES images and ground truth DES images were evaluated using histogram comparison, structural similarity index measure (SSIM), peak signal‐to‐noise ratio (PSNR), and the 2D correlation coefficient (2DCC). Separately, a template‐tracking algorithm was used to predict tumor location on patient and phantom sDES images. Since there was no ground truth location for the patient images, the predicted locations of the tumor in the HE and sDES images were compared against the predicted locations in the DES images. For the phantom images, tracking success rate (TSR) was defined as the percentage of images in which the predicted and ground truth tumor location differed by <2 mm, missing frames (MF) was defined as the percentage of images in which the tracking algorithm failed, and the mean absolute error (MAE) was also calculated from the differences between predicted and ground truth locations of the tumor.

Results

Histogram count comparisons showed good agreement between the pixel distribution of sDES and DES images. Average SSIM, PSNR, and 2DCC scores for sDES images were 0.80 ± 0.05, 28.9 ± 3.4, and 0.97 ± 0.03 for phantom images, and 0.85 ± 0.04, 26.2 ± 3.5, and 0.97 ± 0.03 for patient images. For the patient images, the median tracking difference was 0.5 mm for HE versus DES images, and 0.3 mm for sDES versus DES images (p < 0.01). Separately, the TSR, MF, and MAE tracking metrics for the DES and sDES phantom images were found to be statistically equivalent, with scores of 93.5%, 0.22%, and 0.95 mm versus 93.5%, 0.22%, and 1.03 mm, respectively.

Conclusion

SDES images were found to be equivalent to DES images for use in MTT. The image similarity metrics comparing sDES and DES images showed good agreement, and MTT with DES and sDES images resulted in similar tracking metrics. These results indicate a trained U‐net can be used to generate sDES images suitable for MTT using a single x‐ray exposure without the need for additional hardware or software.