Abstract
BACKGROUND AND PURPOSE: We evaluated modifications to our contrast-enhanced MR imaging grading system for symptomatic patients with suspected nasopharyngeal carcinoma, aimed at improving discrimination of early-stage cancer and benign hyperplasia. We evaluated a second non-contrast-enhanced MR imaging grading system for asymptomatic patients from nasopharyngeal carcinoma plasma screening programs.
MATERIALS AND METHODS: Dedicated nasopharyngeal MR imaging before (plain scan system) and after intravenous contrast administration (current and modified systems) was reviewed in patients from a nasopharyngeal carcinoma–endemic region, comprising 383 patients with suspected disease without nasopharyngeal carcinoma and 383 patients with nasopharyngeal carcinoma. The modified and plain scan systems refined primary tumor criteria, added a nodal assessment, and expanded the system from 4 to 5 grades. The overall combined sensitivity and specificity of the 3 systems were compared using the extended McNemar test (a χ2 value > 5.99 indicates significance).
RESULTS: The current, modified, and plain scan MR imaging systems yielded sensitivities of 99.74%, 97.91%, and 97.65%, respectively, and specificities of 63.45%, 89.56% and 86.42%, respectively. The modified system yielded significantly better performance than the current ( = 122) and plain scan ( = 6.1) systems. The percentages of patients with nasopharyngeal carcinoma in grades 1–2, grade 3, and grades 4–5 for the modified and plain scan MR imaging systems were 0.42% and 0.44%; 6.31% and 6.96%; and 90.36% and 87.79%, respectively. No additional cancers were detected after contrast administration in cases of a plain scan graded 1–2.
CONCLUSIONS: We propose a modified MR imaging grading system that improves diagnostic performance for nasopharyngeal carcinoma detection. Contrast was not valuable for low MR imaging grades, and the plain scan shows potential for use in screening programs.
ABBREVIATION:
- NPC
- nasopharyngeal carcinoma
The early diagnosis of nasopharyngeal carcinoma (NPC) is essential to future improvements in patient survival and reduction of the long-term adverse effects of aggressive treatment regimens.1 Recently, we found that a high percentage of patients with early-stage NPC could be identified by population screening in Hong Kong using persistently elevated plasma Epstein-Barr virus DNA (71% versus 20% compared with historical data).2 However, subsequent investigations must be able to detect these early cancers.
Cancers of the nasopharynx are notorious for being submucosal, located deep in the pharyngeal recess, or masked by adenoidal hyperplasia in the nasopharyngeal roof. Accordingly, endoscopic tumor detection is challenging at this site, not only for asymptomatic patients in the screening setting but also for symptomatic patients in the clinical setting. Several studies investigating patients with suspected NPC from endemic regions,3,4 as well as nonendemic regions,5 have shown that MR imaging can detect NPCs hidden from endoscopic view. In an MR imaging study of predominantly symptomatic patients referred to the outpatient clinic with suspected NPC, we found that MR imaging detects the 10% of tumors that are hidden from endoscopic view.3 This figure rises to 17% in asymptomatic patients screened for NPC using persistently elevated plasma Epstein-Barr virus DNA.4 In both studies, MR imaging detected tumors up to 3 years before they became visible endoscopically.4,6 MR imaging, therefore, has a complementary role in early NPC detection.
We have shown that the standardized MR imaging grading system used at our institution has a high sensitivity for NPC detection,3,4,7 which includes early-stage tumors hidden on the endoscopic examination that tend to form a focal mass in the roof or pharyngeal recess.3,4 However, some early-stage cancers confined within the nasopharynx are diffuse rather than focal. In these diffuse early-stage NPCs, the wall thickening is usually greater on one side of the nasopharynx, so discrimination of NPC from benign hyperplasia of the walls and adenoid relies heavily on finding asymmetry on bilateral nasopharyngeal MR imaging evaluations.3,4,8,9 Unfortunately, on scrutiny of the images for small early-stage tumors, benign hyperplasia is often slightly asymmetric. This characteristic is challenging in screened subjects with benign lesions and could produce false-positive findings leading to unnecessary alarm and resource usage.4 Moreover, in a study of a large group of stage T1 NPCs confined to the nasopharynx, we observed that a small number of these early-stage tumors were symmetrically diffuse and overlapped in appearance with benign hyperplasia.8 These early-stage tumors could potentially reduce the high sensitivity of MR imaging.
In the first part of the study, we modified the current contrast-enhanced MR imaging grading system, which can be used worldwide as a complementary investigation in symptomatic patients with suspected NPC when endoscopic examination findings are negative or indeterminate. The aim was to improve the specificity of MR imaging and maintain a high sensitivity, by adding focal expansion in asymmetric diffuse thickening, absent or external distortion of contrast-enhancing adenoidal septa, and the presence of metastatic nodes to indicate NPC. In the second part of the study, we evaluated a new plain scan MR imaging grading system involving a short protocol without intravenous contrast for use in screening programs in endemic regions such as east and southeast parts of Asia, where the incidence in middle-aged men is up to 35 per 100,000.10 Most patients with persistently elevated plasma or serology markers for NPC referred for investigation do not have NPC. In these patients, it would be advantageous to limit the use of gadolinium MR imaging contrast agents. Furthermore, a short plain scan has greater potential for use in screening programs in NPC endemic regions to prioritize or even select patients for endoscopic examination. Finally, observers with different levels of experience in head and neck MR imaging tested the modified and plain scan MR imaging systems.
MATERIALS AND METHODS
Patients
This retrospective evaluation of MR imaging grading systems in patients who underwent MR imaging for the staging of known NPC or investigation of suspected NPC was approved by the institutional review board (the Chinese University of Hong Kong), which waived the requirement for written informed consent. Nasopharyngeal MR imaging data obtained from 766 patients before and after intravenous contrast were reviewed. Patients did not have a history of head and neck cancer before they underwent the MR imaging for suspected or biopsy-proved NPC. The first group comprised 383 patients without NPC (304 men and 79 women; mean age, 52 years; age range, 18–83 years) who had undergone an endoscopic examination and MR imaging for suspected NPC (based on clinical symptoms, clinical signs, or elevated blood markers for NPC) between 2005 and 2016, but had not been diagnosed with head and neck cancer (minimum follow-up of 2 years). Details are shown in On-line Fig 1. These included 358 patients recruited for 2 previous prospective NPC-detection studies.3,4 The second group comprised a similar sample size of 383 patients referred for suspected NPC (based on clinical symptoms, clinical signs, or elevated blood markers for NPC) who had biopsy-proved undifferentiated NPC (282 men and 101 women; mean age, 53 years; age range, 19–92 years). These patients were randomly selected from consecutive patients with NPC scanned between 2005 and 2016 according to the expected T-stage11 distribution in a screened population: stage T1 (nasopharynx), 246 patients (64.2%); T1 (nasal cavity/oropharynx), 13 (3.4%); T2, 33 (8.6%); T3, 80 (20.9%); and T4, 11 (2.9%). These include 134 patients with T1 cancer reported previously.8
MR Imaging Acquisition
Targeted nasopharyngeal MR imaging was performed using a 1.5T or 3T whole-body MR imaging system (Philips Healthcare, Best, the Netherlands). During a scan duration of approximately 15–20 minutes, axial fat-suppressed T2-weighted, coronal T2-weighted, and axial T1-weighted images, and axial and coronal T1-weighted images were obtained after a bolus injection of 0.1 mmol per kilogram of body weight of gadoterate meglumine (Dotarem; Guerbet, Aulnay-sous-Bois, France).
MR Imaging Grading
The current MR imaging grading system3 is shown in the On-line Table. The modified and plain scan MR imaging grading systems are shown in Table 1 (which differ only in grading the adenoid because contrast-enhancing septa cannot be assessed on the plain scan) and are illustrated in On-line Figs 2 and 3, respectively. The diagnosis of a metastatic node was based on recognized imaging criteria shown in Table 1.12,13 The MR imaging data were graded by readers blinded to the diagnosis of NPC or no NPC. Initially, non-contrast-enhanced images (T2- and T1-weighted images) were graded according to the proposed plain scan MR imaging grading system. Subsequently, the full scan, including contrast-enhanced images (T2- and T1-weighted images pre- and post-contrast enhancement) was graded according to our current and proposed modified MR imaging grading systems.
Results were from the MR imaging data assessed by a radiologist with >20 years of experience in MR imaging of NPC (observer 1). The grading systems were tested by 2 further observers, a researcher with 5 years of experience in MR imaging of NPC (observer 2) and a general radiologist with a 1-year postqualification in radiology (after training on 50 patients not included the study analysis comprising 25 without and 25 with NPC with a distribution of T-stages similar to that in the study) (observer 3).
Statistical Analysis
The performance of MR imaging for NPC detection and classification of lesions as benign and malignant was assessed by calculating the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy based on groupings of grades. The concordance rates between MR imaging grading systems were analyzed according to Hawass.14 The McNemar test was used to test the sensitivity or specificity individually. A calculated χ2 value exceeding the critical value of = 3.84 indicated a significant difference. Moreover, the extended McNemar test was used to test the overall significance of the sensitivity and specificity with a fixed error rate; a calculated χ2 value exceeding the critical value of = 5.99 indicated a significant difference. The Youden index was applied if one system exhibited significant differences in sensitivity while the other exhibited significant differences in specificity. The highest Youden index denoted the system with the best performance.
The κ and weighted κ analyses were calculated to determine the interobserver agreement among 3 observers using 2 scales (grades 1–3 versus grades 4–5) and 3 scales (grades 1–2 versus grade 3 versus grades 4–5), respectively. κ values of ≤0.20, 0.21–0.40, 0.41–0.60, 0.61–0.80, and 0.81–1.00 indicated slight, fair, moderate, substantial, and almost perfect agreement, respectively.15 All statistical analyses were 2-sided, and a P value < .05 was a statistically significant difference. The analyses were performed using SPSS software (Version 25.0; IBM, Armonk, New York) and SAS software (Version 9.4; SAS Institute, Cary North Carolina).
RESULTS
The results for each grade using the current, modified, and plain scan MR imaging systems are shown in Table 2. Metastatic nodes were present in 62.6% (154/246) of patients with stage T1 primary tumors confined to the nasopharynx. All 6 cases with stage T1 NPC detected by nodal assessment alone had abnormal nodes in the upper internal jugular chain, whereas 7/9 patients with benign hyperplasia had abnormal nodes in the retropharyngeal group only. The percentage of patients with NPC in grades 1–2, 3, and 4–5 were 0.42% (1/240), 6.31% (7/111), and 90.36% (375/415), respectively, for the modified grading system, and 0.44% (1/225), 6.96% (8/115), 87.79% (374/426), respectively, for the plain scan MR imaging grading system.
The diagnostic performance using the current, modified, and plain scan MR imaging grading systems for grouped grades are presented in Table 3, and the statistical comparisons of the systems are shown in Table 4. The modified grading system yielded statistically significant better overall performance than the current and the plain scan grading systems. The modified grading system yielded statistically significant better overall performance using the indeterminate grade 3 to indicate benign versus malignant and without nodal assessment versus with nodal assessment.
Observer Results
The sensitivity, specificity, and accuracy using grades 4 and 5 to indicate malignancy (Table 1) for the plain scan grading system were 96.61%, 91.91%, and 94.26%, respectively, for observer 2, and 95.04%, 96.87%, and 95.95%, respectively, for observer 3; and for the modified grading system, they were 97.65%, 92.69%, and 95.17%, respectively, for observer 2 and 97.13%, 97.13%, and 97.13%, respectively, for observer 3. The respective interclass correlation coefficients (κ and weighted κ) for the modified systems, which were determined using 2 (grades 1–3 versus grades 4–5) and 3 scales (grades 1–2 versus grade 3 versus grades 4–5), were 0.87 and 0.81 between observers 1 and 2 and 0.88 and 0.76 between observers 1 and 3. The respective interclass correlation coefficients for the plain scan systems using 2 and 3 scales were 0.87 and 0.83 between observers 1 and 2 and 0.84 and 0.81 between observers 1 and 3.
DISCUSSION
We modified our MR imaging grading system for NPC detection to meet the challenge of using MR imaging to investigate patients with suspected NPC from both the clinical setting (symptomatic) and the screening setting (asymptomatic). We analyzed tumors with a T-stage distribution similar to that expected in a screening population—that is, a high percentage of early-stage primary tumors. Tumors with spread outside the nasopharynx were retained in the analysis to ensure that the grading system reflected the full range of T-stage tumors that may be encountered in both of these settings and to allow us to evaluate the new plain scan MR imaging grading system that would be advantageous in screening programs.
Current versus Modified Grading Systems for Detection of the Primary Tumor
The grading system was modified primarily to improve the specificity for primary tumor detection. The current grading system had a low specificity for NPC detection (63.5%) because of false-positive results from cases of benign hyperplasia with asymmetry (grade 3). Using grade 4 only in the current grading system (focal mass or extension beyond the nasopharynx) to indicate NPC greatly improved the specificity to 97.9%, but this was at the expense of sensitivity, which decreased from 99.7% to 87.5%.
The modified grading system yielded a statistically significant better overall performance and better balance between specificity and sensitivity (91.9% and 96.3%, respectively), producing high positive and negative predictive values (90.4% and 97.7%, respectively). This result was achieved mainly by the subdivision of asymmetric diffuse wall thickening into asymmetry without focal expansion, which is less indicative of NPC (new indeterminate grade 3), and into asymmetry with focal expansion, which indicates a higher risk of NPC (new grade 4). Moreover, a focal mass or extension beyond the nasopharynx was elevated to grade 5, thus expanding the 4-grade system to 5 grades. The refined criteria also incorporated absent contrast-enhancing septa8 and external distortion of contrast-enhancing septa in the adenoid into the new grades 4–5 to indicate suspicious or probable NPC.
Modified Grading System for NPC Detection with and without Metastatic Node Assessment
The incidence of regional spread to retropharyngeal and upper internal jugular chain nodes covered on the short MR imaging protocol was high for stage T1 tumors confined to the nasopharynx (62.6%) and is consistent with the reported incidence for early-stage NPC (71.1%).16 The inclusion of nodal assessment increased the sensitivity of the modified grading system by detecting 6 additional cases of NPC that would have been missed using the primary tumor criteria alone. However, nodal assessment also reduced the specificity because of an overlap in the size of enlarged reactive nodes and metastatic nodes. Most interesting, false-positive results for malignancy occurred in the retropharyngeal group despite using the 6-mm13 rather than 5-mm17 threshold for metastatic node diagnosis. The decrease in specificity outweighed the increase in sensitivity, but the difference in overall performance was small, so we believe that nodal assessment still has a role because it detects primary cancers that would otherwise be missed by MR imaging.
Modified versus Plain Scan Grading Systems for NPC Detection
As expected, the modified full-protocol grading system, which includes T2- and T1-weighted images before and after contrast enhancement, yielded better overall NPC diagnostic performance than the plain scan grading system. The plain scan grading system, nevertheless, performed well, with high sensitivity and specificity (97.7% and 86.4%, respectively) and high negative and positive predictive values (97.3% and 87.8%, respectively). Contrast-enhanced images detected only 1 additional primary cancer and, surprisingly, no additional spread beyond the nasopharynx; this finding is encouraging because it is an important MR imaging indicator of malignancy. Contrast-enhanced images did not detect any additional metastatic nodes. In a screening setting, avoidance of intravenous contrast would enable more rapid, less expensive scans and eliminate the need for intravenous MR imaging contrast agents in healthy patients with false-positive blood tests.
We suggest that for plain scan MR imaging graded 1 or 2, the risk of NPC is low (0.44%) and the patient can be reassured. Intravenous contrast is of limited benefit and can be withheld, especially in the absence of abnormal findings on endoscopy. On the other hand, for plain scan MR imaging graded 4 or 5, the risk of NPC is high (87.80%) and intravenous contrast is suggested irrespective of the endoscopy findings. For plain scan MR imaging graded 3, the risk of NPC is intermediate (6.96%) and the role of intravenous contrast is unclear, though results suggest that contrast is more beneficial when the MR imaging is graded by a less experienced assessor or to increase the confidence that a symmetric enlarged adenoid is due to benign hyperplasia (striped appearance with alternating septal and lymphoid columns).
Observer Results
This study aimed to produce a simple MR imaging grading system based on conventional sequences for a quick assessment by radiologists with differing levels of experience. Therefore, we did not include subtle abnormalities of malignancy such as loss of the white line sign8 or functional sequences such as diffusion-weighted imaging,18 which require quantification and use of thresholds that are not easily implemented across centers. Using our proposed modified and plain scan grading systems yielded substantial agreement between the most and the least experienced observers when assessing grades 1–2 versus grade 3 versus grades 4–5 on contrast MR imaging. All other agreement among the 3 observers was almost perfect. Therefore, we believe that these new MR imaging grading systems can be used by specialists and generalists. Most interesting, the number of false-positive cases increased with experience, which lowered the overall accuracy.
Limitations
The use of MR imaging in NPC detection is still in its infancy, so our results are from a single-center setting and include cases reported previously. Therefore, to ensure generalizability, the data should be validated at outside institutions, and we hope our proposed systems will be tested and developed by researchers in this field. Moreover, future studies should explore the potential use of only a non-contrast-enhanced MR imaging protocol and the plain scan grading system in screening settings. We envisage future refinements of the MR imaging grading systems, and our proposal regarding the use of intravenous contrast may evolve as advances in the circulating Epstein-Barr virus DNA evaluations reduce the number of healthy subjects referred for further investigation after a false-positive blood test.19 Finally, the grading systems are proposed for use only in patients with suspected NPC who have a dedicated nasopharyngeal MR imaging and not for use in a general population.
CONCLUSIONS
We refined our current MR imaging grading system for NPC detection, with the intent to meet the challenge of applying MR imaging not only in the clinical setting but also in a screening setting involving greater numbers of patients with early-stage primary cancers that must be discriminated from benign hyperplasia. We added focal expansion in diffuse asymmetric wall thickening, absent or external distortion of contrast-enhancing adenoidal septa, and the presence of metastatic nodes as indicators of malignancy. The modified MR imaging grading system improved the overall diagnostic performance of MR imaging when compared with the current MR imaging grading system and produced sensitivity and specificity rates of 96.3% and 91.9%, respectively. The modified MR imaging grading system was used by specialists and generalists to classify patients into low (grades 1, 2), indeterminate (grade 3), and high-risk (grades 4, 5) categories for NPC. Patients with a low-risk MR imaging grade can be spared an injection of intravenous contrast. We hope the grading system will help disseminate knowledge to other centers and act as the foundation for building a consensus on future guidelines for a Nasopharynx Imaging Reporting and Data System. Finally, the plain scan alone exhibited potential usefulness in future screening programs.
Footnotes
This work was supported partially by Kadoorie Charitable Foundation and the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. SEG_CUHK02, 14107216 and T12-404/11).
Disclosures: Rossa W.K. Chiu—RELATED: Grant: Research Grants Council of Hong Kong Special Administrative Region Government Theme-Based Research Grant (T12-401/16-W)*; UNRELATED: Consultancy: GRAIL; Grants/Grants Pending: contract research agreement from GRAIL*; Patents (Planned, Pending or Issued): both pending and issued patents on cancer diagnostics*; Royalties: from patents on cancer diagnostics*; Stock/Stock Options: GRAIL, Take2 Healthcare; Other: founder of Take2 Healthcare. K.C. Allen Chan—RELATED: Grant: GRAIL, Cirina, Comments: This work is partially supported by GRAIL/Cirina*; Consulting Fee or Honorarium: GRAIL, Comments: I am a consultant to GRAIL; UNRELATED: Board Membership: Take2 Healthcare, DRA, Comments: I am a director of Take2 Healthcare and DRA; Consultancy: GRAIL, Comments: I am a consultant of GRAIL; Grants/Grants Pending: GRAIL/Cirina, Comments: My institution received funding support from GRAIL/Cirina*; Patents (Planned, Pending or Issued): patents on molecular diagnostics; Royalties: GRAIL, Sequenom, Illumine, Take2, Xcelom; Stock/Stock Options: GRAIL, DRA, Take2 Healthcare, Comments: I hold equities. Wai Kei Jacky Lam—UNRELATED: Patents (Planned, Pending or Issued): enhancement of cancer screening using cell-free viral nucleic acids (pending); Stock/Stock Options: GRAIL. Edwin P. Hui—UNRELATED: Consultancy: Merck Sharp & Dohme; Payment for Lectures Including Service on Speakers Bureaus: Merck Serono, Merck Sharp & Dohme. Brigette Ma—UNRELATED: Grants/Grants Pending: Novartis, Boerhinger Ingelheim, Comments: research grant; Payment for Lectures Including Service on Speakers Bureaus: Merck Sharp & Dohme, Bristol-Myers Squibb.* Anthony Chan—UNRELATED: Consultancy: Merck Serono, Merck Sharp & Dohme, Cullinan Management Inc; Employment: The Chinese University of Hong Kong; Grants/grants pending: Boehringer Ingelheim, Bristol Myers Squibb, Amgen, Pfizer, Eli Lilly, Merck Serono, Merck Sharp & Dohme*. Yuk Ming Dennis Lo—RELATED: Grant: Kadoorie Charitable Foundation, GRAIL Inc, Li Ka Shing Foundation, Hong Kong Research Grants Council, Comments: Hong Kong Research Grants Council Theme-Based Research Grant (T12-401/16-W)*; Consulting fees or honorarium: GRAIL Inc, Decheng Capital*; Other: Take2 Holdings Limited, GRAIL Inc, Comments: Take2 Holdings Limited - Co-founder, Shareholder, Chairman of the Board, GRAIL Inc. - Scientific Co-founder, shareholder, member of Scientific Advisory Board; UNRELATED: Board membership: DRA Limited, Comments: Chairman of the Board; Patents (Planned, Pending or Issued): Illumina, Sequenom, Xcelom Limited, DRA Limited, Comments: Licensed patents or patent applications in the area of non-invasive prenatal testing; Royalties: Illumina, Sequenom, Xcelom Limited, DRA Limited, Comments: Royalties for licensed patents or patent applications in the area of non-invasive prenatal testing; Stock/stock options: DRA Limited; Other: KingMed Future, Comments: Service agreement in the area of non-invasive prenatal testing. *Money paid to the institution.
Indicates open access to non-subscribers at www.ajnr.org
References
- Received September 23, 2019.
- Accepted after revision December 14, 2019.
- © 2020 by American Journal of Neuroradiology