TY - JOUR
T1 - Benchmarking of Deformable Image Registration for Multiple Anatomic Sites Using Digital Data Sets With Ground-Truth Deformation Vector Fields
AU - Shi, Liting
AU - Chen, Quan
AU - Barley, Susan
AU - Cui, Yunfeng
AU - Shang, Lu
AU - Qiu, Jianfeng
AU - Rong, Yi
N1 - Publisher Copyright:
© 2021 American Society for Radiation Oncology
PY - 2021/9/1
Y1 - 2021/9/1
N2 - Purpose: This study aimed to evaluate the accuracy of deformable image registration (DIR) algorithms using data sets with different levels of ground-truth deformation vector fields (DVFs) and to investigate the correlation between DVF errors and contour-based metrics. Methods and Materials: Nine pairs of digital data sets were generated through contour-controlled deformations based on 3 anonymized patients’ CTs (head and neck, thorax/abdomen, and pelvis) with low, medium, and high deformation intensity for each site using the ImSimQA software. Image pairs and their associated contours were imported to MIM-Maestro, Raystation, and Velocity systems, followed by DIR and contour propagation. The system-generated DVF and propagated contours were compared with the ground-truth data. The correlation between DVF errors and contour-based metrics was evaluated using the Pearson correlation coefficient (r), while their correlation with volumes were calculated using Spearman correlation coefficient (rho). Results: The DVF errors increased with increasing deformation intensity. All DIR algorithms performed well for esophagus, trachea, left femoral, right femoral, and urethral (mean and maximum DVF errors <2.50 mm and <4.27 mm, respectively; Dice similarity coefficient: 0.93-0.99). Brain, liver, left lung, and bladder showed large DVF errors for all 3 systems (dmax: 2.8-91.90 mm). The minimum and maximum DVF errors, conformity index, and Dice similarity coefficient were correlated with volumes (|rho|: 0.41-0.64), especially for very large or small structures (|rho|: 0.64-0.80). Only mean distance to agreement of Raystation and Velocity correlated with some indices of DVF errors (r: 0.70-0.78). Conclusions: Most contour-based metrics had no correlation with DVF errors. For adaptive radiation therapy, well-performed contour propagation does not directly indicate accurate dose deformation and summation/accumulation within each contour (determined by DVF accuracy). Tolerance values for DVF errors should vary as the acceptable accuracy for overall adaptive radiation therapy depends on anatomic site, deformation intensity, organ size, and so forth. This study provides benchmark tables for evaluating DIR accuracy in various clinical scenarios.
AB - Purpose: This study aimed to evaluate the accuracy of deformable image registration (DIR) algorithms using data sets with different levels of ground-truth deformation vector fields (DVFs) and to investigate the correlation between DVF errors and contour-based metrics. Methods and Materials: Nine pairs of digital data sets were generated through contour-controlled deformations based on 3 anonymized patients’ CTs (head and neck, thorax/abdomen, and pelvis) with low, medium, and high deformation intensity for each site using the ImSimQA software. Image pairs and their associated contours were imported to MIM-Maestro, Raystation, and Velocity systems, followed by DIR and contour propagation. The system-generated DVF and propagated contours were compared with the ground-truth data. The correlation between DVF errors and contour-based metrics was evaluated using the Pearson correlation coefficient (r), while their correlation with volumes were calculated using Spearman correlation coefficient (rho). Results: The DVF errors increased with increasing deformation intensity. All DIR algorithms performed well for esophagus, trachea, left femoral, right femoral, and urethral (mean and maximum DVF errors <2.50 mm and <4.27 mm, respectively; Dice similarity coefficient: 0.93-0.99). Brain, liver, left lung, and bladder showed large DVF errors for all 3 systems (dmax: 2.8-91.90 mm). The minimum and maximum DVF errors, conformity index, and Dice similarity coefficient were correlated with volumes (|rho|: 0.41-0.64), especially for very large or small structures (|rho|: 0.64-0.80). Only mean distance to agreement of Raystation and Velocity correlated with some indices of DVF errors (r: 0.70-0.78). Conclusions: Most contour-based metrics had no correlation with DVF errors. For adaptive radiation therapy, well-performed contour propagation does not directly indicate accurate dose deformation and summation/accumulation within each contour (determined by DVF accuracy). Tolerance values for DVF errors should vary as the acceptable accuracy for overall adaptive radiation therapy depends on anatomic site, deformation intensity, organ size, and so forth. This study provides benchmark tables for evaluating DIR accuracy in various clinical scenarios.
UR - http://www.scopus.com/inward/record.url?scp=85106339173&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85106339173&partnerID=8YFLogxK
U2 - 10.1016/j.prro.2021.02.012
DO - 10.1016/j.prro.2021.02.012
M3 - Article
C2 - 33722783
AN - SCOPUS:85106339173
SN - 1879-8500
VL - 11
SP - 404
EP - 414
JO - Practical Radiation Oncology
JF - Practical Radiation Oncology
IS - 5
ER -