preoperative diagnosis of thyroid nodules using the bethesda system for reporting thyroid...

14
Preoperative diagnosis of thyroid nodules using the Bethesda System for Reporting Thyroid Cytopathology: a comprehensive review and meta-analysis Expert Rev. Endocrinol. Metab. Early online, 1–14 (2014) Brandon S Sheffield 1 , Hamid Masoudi 1 , Blair Walker 1 and Sam M Wiseman* 2 1 Department of Pathology and Laboratory Medicine, St. Paul’s Hospital and University of British Columbia, Vancouver, British Columbia, Canada 2 Department of Surgery, St. Paul’s Hospital and University of British Columbia, Vancouver, British Columbia, Canada *Author for correspondence: Tel.: +1 604 806 9108 Fax: +1 604 806 9957 [email protected] Fine-needle aspiration biopsy (FNAB) is the test of choice for the evaluation of nodules, arriving at a cancer diagnosis, and guiding surgical management. This review and meta-analysis aims to objectively evaluate the Bethesda System for Reporting Thyroid Cytopathology (BSRTC) based upon literature reports of histopathological outcomes following cytopathological diagnoses. Thirteen studies were reviewed and the risk of malignancy (ROM) for each of the BSRTC diagnostic categories were calculated as: Non-diagnostic 11–26%, Benign 4–9%, AUS/FLUS 19–38%, FN/SFN 27–40%, SFM 50–79%, and Malignant 98–100%. In typical clinical utilization, the sensitivity and specificity of thyroid FNAB diagnosis using the BSRTC were 96% and 46%, respectively. The BSRTC represents an important advance in standardizing thyroid FNAB cytopathological reporting. Close attention should be paid to the observation that the AUS-FLUS and FN-SFN DCs have overlapping ROMs, and the potential clinical implications of this finding on patient management. KEYWORDS: Bethesda • BSRTC • cytology • fine-needle biopsy • needle biopsy • thyroid cancer • thyroid neoplasm The thyroid nodule, a palpably discrete swell- ing in an otherwise normal thyroid gland, varies in incidence from 1 to 10% in iodine- sufficient populations [1]. This incidence rises to 67% when non-palpable lesions that are detected by high-resolution ultrasound are included [2]. Cancer is the primary concern for any patient who presents with a thyroid nod- ule, either palpable or incidentally detected by imaging, and as few as 3.9% [2] of these lesions may actually represent cancer in indi- viduals who lack any other clinical risk factors. The rising incidence of thyroid cancer that has been observed in western society over the last several decades [3] suggests that an increasing number of thyroid nodules have been under- going clinical evaluation. Papillary thyroid carcinoma accounts for 85% of all thyroid cancer cases and generally has an excellent prognosis with a 20-year cause-specific mortality rate that is less than 1% [4]. The mainstay of treatment for thyroid cancer is total or near-total resection of the thyroid gland, postoperative adjuvant radioac- tive iodine therapy and life-long thyroid hor- mone replacement with thyroid-stimulating hormone suppression [5]. By contrast, many benign nodules may be followed clinically. Given these two very different treatment algo- rithms for thyroid nodules, their accurate pre- operative cancer risk-stratification is of critical clinical importance. While history and physical examination, laboratory testing, and imaging are all impor- tant parts of the evaluation of an individual who presents with a thyroid nodule, it is fine- needle aspiration biopsy (FNAB) that is the single most useful test to either confirm or informahealthcare.com 10.1586/17446651.2014.887435 Ó 2014 Informa UK Ltd ISSN 1744-6651 1 Review Expert Review of Endocrinology & Metabolism Downloaded from informahealthcare.com by 139.173.54.11 on 02/17/14 For personal use only.

Upload: ubc

Post on 01-Dec-2023

0 views

Category:

Documents


0 download

TRANSCRIPT

Preoperative diagnosis ofthyroid nodules using theBethesda System for ReportingThyroid Cytopathology: acomprehensive review andmeta-analysisExpert Rev. Endocrinol. Metab. Early online, 1–14 (2014)

Brandon S Sheffield1,Hamid Masoudi1,Blair Walker1 andSam M Wiseman*2

1Department of Pathology and

Laboratory Medicine, St. Paul’s Hospital

and University of British Columbia,

Vancouver, British Columbia, Canada2Department of Surgery, St. Paul’s

Hospital and University of British

Columbia, Vancouver, British Columbia,

Canada

*Author for correspondence:

Tel.: +1 604 806 9108

Fax: +1 604 806 9957

[email protected]

Fine-needle aspiration biopsy (FNAB) is the test of choice for the evaluation of nodules, arrivingat a cancer diagnosis, and guiding surgical management. This review and meta-analysis aims toobjectively evaluate the Bethesda System for Reporting Thyroid Cytopathology (BSRTC) basedupon literature reports of histopathological outcomes following cytopathological diagnoses.Thirteen studies were reviewed and the risk of malignancy (ROM) for each of the BSRTCdiagnostic categories were calculated as: Non-diagnostic 11–26%, Benign 4–9%, AUS/FLUS19–38%, FN/SFN 27–40%, SFM 50–79%, and Malignant 98–100%. In typical clinical utilization,the sensitivity and specificity of thyroid FNAB diagnosis using the BSRTC were 96% and 46%,respectively. The BSRTC represents an important advance in standardizing thyroid FNABcytopathological reporting. Close attention should be paid to the observation that the AUS-FLUSand FN-SFN DCs have overlapping ROMs, and the potential clinical implications of this finding onpatient management.

KEYWORDS: Bethesda • BSRTC • cytology • fine-needle biopsy • needle biopsy • thyroid cancer • thyroid neoplasm

The thyroid nodule, a palpably discrete swell-ing in an otherwise normal thyroid gland,varies in incidence from 1 to 10% in iodine-sufficient populations [1]. This incidence risesto 67% when non-palpable lesions that aredetected by high-resolution ultrasound areincluded [2]. Cancer is the primary concern forany patient who presents with a thyroid nod-ule, either palpable or incidentally detected byimaging, and as few as 3.9% [2] of theselesions may actually represent cancer in indi-viduals who lack any other clinical risk factors.The rising incidence of thyroid cancer that hasbeen observed in western society over the lastseveral decades [3] suggests that an increasingnumber of thyroid nodules have been under-going clinical evaluation.

Papillary thyroid carcinoma accounts for85% of all thyroid cancer cases and generally

has an excellent prognosis with a 20-yearcause-specific mortality rate that is less than1% [4]. The mainstay of treatment for thyroidcancer is total or near-total resection of thethyroid gland, postoperative adjuvant radioac-tive iodine therapy and life-long thyroid hor-mone replacement with thyroid-stimulatinghormone suppression [5]. By contrast, manybenign nodules may be followed clinically.Given these two very different treatment algo-rithms for thyroid nodules, their accurate pre-operative cancer risk-stratification is of criticalclinical importance.

While history and physical examination,laboratory testing, and imaging are all impor-tant parts of the evaluation of an individualwho presents with a thyroid nodule, it is fine-needle aspiration biopsy (FNAB) that is thesingle most useful test to either confirm or

informahealthcare.com 10.1586/17446651.2014.887435 � 2014 Informa UK Ltd ISSN 1744-6651 1

Review

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

exclude a cancer diagnosis. The proportion of cases diagnosedwith cancer in thyroid surgical specimens prior to the wide-spread adoption of FNAB was reported to be 14% [6], andafter its introduction this proportion more than doubled androse to approximately 40% [7]. Over the past decade, the pro-portion of thyroid surgical specimens that are given a cancerdiagnosis has further risen to approximately 56% [8]. Althoughthe technique for FNAB has remained largely unchanged, inno-vations such as the widespread adoption of ultrasound guid-ance, liquid-based cytology preparations and improvedtechnical and interpretive skills, may in part explain this risingFNAB accuracy.

FNAB has significantly influenced the management of thy-roid nodules and will continue to be the test of choice forguiding the treatment of thyroid nodules in the foreseeablefuture. Despite its widespread utilization, FNAB is an imperfecttest, with established false-positive and false-negative rates thatare approximately 5% for each [9], and may lead to nodule over-treatment or undertreatment, respectively [10]. These apparentdeficiencies in the test characteristics of FNAB may be largelyattributable to specimens that are given an ‘indeterminate’ diag-nosis, or a cytopathological diagnosis that is associated with acancer risk, but is not definitively benign or malignant.

Terminology is a modifiable factor with regards to thereporting of thyroid cytopathology, and miscommunicationbetween the pathologist and the clinician may occur, especiallyfor indeterminate diagnoses. Surveys have shown that patholo-gists report thyroid cytopathology with very heterogeneous ter-minology [11,12], and that clinicians act variably based uponthese reports [11]. Apart from its impact on patient care, incon-sistent cytopathological nomenclature blurs the relationshipbetween FNAB diagnosis and outcome, hampers the analysis ofthe test itself as well as understanding of the pathophysiologyof the processes for which it has been utilized.

To address the inconsistencies and the limitations of the termi-nology employed for thyroid cytopathology, the National CancerInstitute hosted the 2007 ‘National Cancer Institute ThyroidFine Needle Aspiration State of the Science Conference’ that ledto the development and publication of The Bethesda System forReporting Thyroid Cytopathology (BSRTC) [13]. The BSRTCis composed of a set of 6 diagnostic categories (DCs) thatare each accompanied with descriptive information andphoto-micrographic examples for pathologists, as well as a risk ofmalignancy (ROM) and recommended course of action for clini-cians [14]. The 6 DCs are: non-diagnostic (including cyst fluid,acellular, or excessively bloody specimens), benign (includingfollicular nodules and thyroiditis), atypia of undetermined signif-icance or follicular lesion of undetermined significance(AUS-FLUS), follicular neoplasm or suspicious for a follicularneoplasm (FN-SFN), suspicious for malignancy (SFM) andmalignant (including papillary carcinoma, medullary carcinoma,undifferentiated thyroid cancer, and metastatic disease). For acomplete description, with definitions and criteria for each DCplease refer to BSRTC [13]. The cancer risk and recommendedmanagement associated with each of the DCs are as follows:

non-diagnostic (1–4% cancer risk, recommend repeat FNABwith ultrasound guidance), benign (0–3% cancer risk, recom-mend clinical follow-up), AUS-FLUS (5–15% cancer risk, rec-ommend repeat FNAB), FN-SFN (15–30% cancer risk,recommend surgical lobectomy), SFM (60–75% cancer risk, rec-ommend surgical lobectomy or near-total thyroidectomy), andmalignant (97–99% cancer risk, recommend near-total thyroid-ectomy) [13,14].

The objective of developing the BSRTC was to create a uni-form thyroid cytopathology reporting system that could facili-tate: effective communication between pathologists andclinicians, cytologic-histologic correlation of cases, research andallow for comparison of data between institutions. Since theBSRTC was first published in 2007, it has become adopted bymany centers and its clinical utility has been extensivelyscrutinized [15–18]. Controversies have arisen over the recom-mended management of certain DCs, such as the appropriatesurgical management of AUS-FLUS [19], or the need for rou-tine second-opinion on all indeterminate diagnoses (AUS-FLUS, FN-SFN, SFM) [20,21]. Some authors have even sug-gested revising the 6-DC system into a 4-DC system [22], orsubdividing the AUS-FLUS DC into two separate DCs [23].Overall, the BSRTC has been well received, with the majorityof controversy focused upon the AUS-FLUS DC [24,25].

This review and meta-analysis aims to evaluate institutionaldata published in the medical literature regarding the rate ofmalignancy for each of the 6 BSRTC DCs that have beenfollowed-up with a thyroid operation and a definitive histo-pathologic diagnosis. Though similar reviews have been under-taken in the past [15,16], our study represents a contemporaryand comprehensive review of the current published literaturethat has reported on the BSRTC.

Materials & methodsA review of the Medline database was carried out throughPubMed and Ovid in order to identify relevant studies formeta-analysis. Studies were identified based upon a comparisonof the cytological diagnosis with the histological diagnosis afterthyroid surgery, and only studies using BSRTC terminologyfor reporting the cytological diagnosis were included. Addi-tional inclusion criteria were English language publication, andpublication availability (search performed online 31 December2012). Thirteen studies meeting the above inclusion criteriawere identified and are summarized in TABLE 1 [8,26–37]. Publica-tions focusing only upon specific BSRTC DCs were excludedfrom the analysis [38]. Publications meeting the inclusion crite-ria, but presenting data in a manner such that extraction formeta-analysis was not possible, were also excluded [18,39].A recent report by Wong and Baloch [15] reviewed the experi-ence of six institutions with the BSRTC and all six of theirsource publications were included in the current review.Another recent review by Bongiovanni et al. [16] analyzed datareported by seven institutions using the BSRTC, and all butone of these studies were included in the current review.A single report was excluded because they utilized a system

Review Sheffield, Masoudi, Walker & Wiseman

doi: 10.1586/17446651.2014.887435 Expert Rev. Endocrinol. Metab.

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

‘similar to BSRTC’ and because the data were presented in amanner that it could not readily be assigned into BSRTCDCs [40].

The following elements were extracted from the includedreports: total number of FNAB cases, distribution of cytologi-cal diagnoses into the 6 DCs, total number of cases with

histological follow-up and their distribution across 6 DCs,the number of cases in each DC having malignant histologyat follow-up and the rate of surgical intervention. Statisticalanalysis was carried out, and tables and figures were gener-ated using Microsoft Excel (Microsoft Corporation, Red-mond, WA, USA). The mean ROM scores and 95% CIs

Table 1. Demographics of studies selected for review.

Study(year)

Institution Method Total FNABs Total FNABswith follow-uphistology

Ref.

1

(2012)

Charleston Area Medical

Center

Randomly selected 100 cases with

available histology performed blind,

retrospective cytology diagnosis using

BSRTC

99 99 [26]

2

(2012)

Ball Memorial Hospital Retrospective review of consecutive

FNAB cases

1382 221 [27]

3

(2012)

Cleveland Clinic Prospective series of consecutive FNAB

cases

1000 451 [28]

4

(2011)

Vanderbuilt University

Medical Center

Retrospective review of consecutive

surgical cases comparing outcomes

pre-BSRTC with those after

implementation of the BSRTC

282 282 [29]

5

(2011)

University of Alabama at

Birmingham

Retrospective review of consecutive

FNAB cases comparing outcomes pre-

BSRTC with those after implementation

of the BSRTC

562 110 [30]

6

(2010)

University of Virginia

Health System

Retrospective review of consecutive

FNAB cases with retrospective BSRTC

diagnosis based on previous descriptive

terminology

3080 882 [31]

7

(2010)

Yale-New Haven Hospital Prospective series of consecutive FNAB

cases

2468 378 [32]

8

(2011)

Istanbul University Retrospective review of consecutive

FNAB cases spanning implementation

of BSRTC; retrospective reclassification

of pre-BSRTC indeterminate diagnoses

442 442 [33]

9

(2010)

Baptist Hospital,

Homestead Hospital

Retrospective review of consecutive

FNAB cases with retrospective

reclassification according to BSRTC

7089 1331 [34]

10

(2011)

Konkuk Medical Center Prospective series of consecutive FNAB

cases, studying impact of ancillary

molecular testing on malignancy rate

865 193 [35]

11

(2012)

Multi-institutional

collaboration

Retrospective review of consecutive

cases from five institutions comparing

BSRTC to 5-tiered Italian system

3724 1361 [36]

12

(2007)

Long-Island Jewish

Medical Center

University of Texas

Medical Branch

Retrospective review of consecutive

FNAB cases and reclassification using

BSRTC

4703 1052 [37]

13

(2007)

Brigham and Women’s

Hospital

Prospective series of consecutive FNAB

cases

3589 1242 [8]

BSRTC: Bethesda System for Reporting Thyroid Cytopathology; FNAB: Fine-needle aspiration biopsy.

Preoperative diagnosis of thyroid nodules using the BSRTC Review

informahealthcare.com doi: 10.1586/17446651.2014.887435

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

were calculated (using a sample size of 13) based upon aver-ages of individually reported ROMs in the source material.Weighted means were additionally calculated, however, thisstatistic was not amenable to presentation with 95% CIs, andtherefore the average of individual ROMs was the preferredstatistic for evaluation.

Test characteristics, including sensitivity (SN) (true positives/[true positives + false negatives]), specificity (SP) (true nega-tives/[false positives + true negatives]), positive predictive value(PPV) (true positives/total positives), negative predictive value(NPV) (true negatives/total negatives) and both positive andnegative likelihood ratios (LR+ and LR-) (SN/[1 - SP] and[1 - SN]/SP), were computed for multiple cases interpretingdifferent combinations of DCs as being positive or negativetest results (95% CIs calculated using x ± [1.96 � sqrt(x � (1 - x)/n)] for SN, SP, PPV, NPV and exp [log(LR) ± (1.96 � standard error)] for LR+ and LR-). Post-testprobabilities were calculated for illustrative cases (post-testprobability = [p/(1 - p) � LR]/[1 + (p/[1 - p] � LR)], where

p = pre-test probability). Test characteristics were computedwith the aid of CatMaker software available through the Centrefor Evidence Based Medicine [41].

ResultsAll studies reported consecutive series of cytological diagnoseswith the exceptions of Richmond et al. [26], who evaluated arandom selection of 100 cases with available histological fol-low-up, and Broome and Solorzano [29], who reported on aconsecutive series of surgical cases. The remaining 11 studiesreported results from consecutive series of FNABs with histo-pathological follow-up and their distribution across the6 BSRTC DCs, is summarized in TABLE 2.

The broadest variation in DC utilization was within thebenign DC, with a frequency that ranged from 39.00% asreported by Wu et al. to 77.76% as reported by Crowe et al.The DC with the second-most common variation in propor-tion was the AUS-FLUS DC, with rates that ranged from0.80%, as reported by Bohacek et al., to 27.21% as reported

Table 2. Incidence of each Bethesda System for Reporting Thyroid Cytopathology diagnostic categoryin studies evaluated.

Study† TotalFNAB

Non-diagnostic I(%)

Benign II(%)

AUS-FLUS III(%)

FN-SFN IV(%)

Suspicious V(%)

Malignant VI(%)

2 1382 278

20.12%

539

39.00%

376

27.21%

116

8.39%

36

2.60%

37

2.68%

3 1000 56

5.60%

671

67.10%

8

0.80%

172

17.20%

24

2.40%

69

6.90%

5 562 16

2.85%

437

77.76%

71

12.63%

21

3.74%

2

0.36%

15

2.67%

6 3080 574

18.64%

1817

58.99%

104

3.38%

298

9.68%

71

2.31%

216

7.01%

7 2468 230

9.32%

1799

72.89%

89

3.61%

166

6.73%

39

1.58%

145

5.88%

8 442 36

8.14%

223

50.45%

25

5.66%

35

7.92%

42

9.50%

81

18.33%

9 7089 1671

23.57%

3829

54.01%

548

7.73%

606

8.55%

134

1.89%

301

4.25%

10 865 16

1.85%

504

58.27%

141

16.30%

10

1.16%

54

6.24%

140

16.18%

11 3724 110

2.95%

2064

55.42%

248

6.66%

886

23.79%

224

6.02%

192

5.16%

12 4703 488

10.38%

3036

64.55%

152

3.23%

544

11.57%

124

2.64%

359

7.63%

13 3589 269

7.50%

2361

65.78%

144

4.01%

328

9.14%

314

8.75%

173

4.82%

Total

Mean

(95% CI)

28,904 3744

10.08%

(9.83–10.33%)

17,280

60.39%

(60.23–60.54%)

1906

8.29%

(7.93–8.65%)

3182

9.80%

(9.58–10.03%)

1064

4.03%

(3.86–4.19%)

1728

7.41%

(7.16–7.66%)†Studies 1 and 4 are excluded as they do not report consecutive case series of FNAB.AUS: Atypia of undetermined significance; FLUS: Follicular lesion of undetermined significance; FN: Follicular neoplasm; FNAB: Fine-needle aspiration biopsy;SFN: Suspicious for a follicular neoplasm.

Review Sheffield, Masoudi, Walker & Wiseman

doi: 10.1586/17446651.2014.887435 Expert Rev. Endocrinol. Metab.

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

by Wu et al. The malignant and SFM DCs were diagnosedmost consistently. Not surprisingly, the majority of FNABswere diagnosed in the benign DC, 60.39 ± 0.15% (mean± 95% CI). The malignant DC comprised 7.41 ± 0.25% oftotal FNABs, and 22.12% of diagnoses were from one of theindeterminate DCs (AUS-FLUS, FN-SFN or SFM). FN-SFNwas most commonly diagnosed (9.80 ± 0.23%) and SFM wasleast commonly diagnosed, accounting for fewer than 5% oftotal FNAB diagnoses (TABLE 2).

Of all the histopatholigcal results available for review fromthe source publications, 27.05% of the cases had an initialbenign FNAB diagnosis, 16.82% of the cases had malignantcytology, 48.16% of cases had indeterminate cytology and7.97% of cases were non-diagnostic FNABs.

TABLE 3 summarizes the ROM for each of the 6 BSRTC DCsas they were presented in the 13 publications reviewed. Asexpected, the highest ROM was associated with a malignantcytopathological diagnosis (97.74–99.53% [95% CI]), and thelowest ROM was associated with a benign cytopathologicaldiagnosis (4.47–8.57%). The remaining 4 DCs had ROMsthat were between these two extremes. These observations arepresented in FIGURE 1, showing individual study means, overallmean ± 95% CI and the BSRTC expected ROM.

Considering thyroid FNAB that utilizes the BSRTC termi-nology as a diagnostic test, and applying follow-up histopatho-logical diagnosis as the gold standard, the test characteristicswere calculated. A positive or negative test result depends uponhow the indeterminate cases are grouped. For this reason, testcharacteristics were calculated with multiple interpretations ofboth positive and negative results. Two illustrative cases arepresented in FIGURE 2.

If only malignant FNABs are considered as being positivetest results, and only benign FNABs are considered negativetest results, and all other results (ND, AUS-FLUS, FN-SFN,SFM) are excluded, then the SN, SP, PPV and NPV ofBSRTC would be 92, 99, 99 and 94%, respectively. Con-versely, if malignant, SFM, FN-SFN, AUS-FLUS* DCs areconsidered to be positive test results, with only benign FNABsinterpreted as negative results, and all non-diagnostic cases areexcluded, the SN, SP, PPV, NPV are calculated as 96, 46,54 and 95%, respectively (FIGURE 2).

While the first case may represent the ideal scenario, the sec-ond case may more accurately reflect current clinical reality. Tovalidate the decision to exclude non-diagnostic FNABs fromthis calculation, the non-diagnostic population is interpretedsequentially as either positive or negative, showing that the testcharacteristics remain relatively stable with the SN and SPranging from 93 to 96% and 52 to 41%, respectively. Thus,the test characteristics are minimally impacted by the groupingof the non-diagnostic DC.

DiscussionBSRTC utilization

The 13 studies included in this review well illustrate how numer-ous institutions have adopted the BSRTC with variable results.These reports comprise data from Europe, Asia and North Amer-ica, and from both the academic and community hospital set-tings. It is clear that the utilization of the BSRTC DCs variessignificantly from one center to another (TABLE 2). Notable exam-ples include an extremely high rate of AUS-FLUS (27%)reported by Wu et al. [27], compared with the extremely low rateof this DC reported by Bohacek et al. [28] (0.8%). Non-diagnostic rates reported by Wu et al. [27], Jo et al. [31] andRenshaw [34] were 20, 19 and 24%, respectively, and were all sig-nificantly higher than the 10% overall group mean. These obser-vations may suggest that the BSRTC, and that the AUS-FLUSDC in particular, may not be consistently utilized by all institu-tions. This variation in DC incidence may have several possibleexplanations. It is possible that the incidence of the 6 DCs variesgeographically in different thyroid nodule patient populations.There may also be a referral bias, wherein academic institutionsmay report on a higher proportion of diagnostically challengingcases. Finally, it is possible that different pathologists, at differentcenters, have different experience and thresholds for making aspecific BSRTC diagnosis. Both Wu et al. [27] and Layfield et al.[42] have reported that there is a wide variation in individualpathologist utilization of the AUS-FLUS DC, and they havefound that individuals who lack specific cytopathology trainingmay be more likely to utilize this DC.

While the published description of the BSRTC does notspecify a target incidence for any of the specific DCs, the chap-ter dedicated to AUS-FLUS suggests that pathologists shouldaim for an incidence that is approximately 7% of allFNABs [43]. Our review found a mean incidence of 8.3% forthe AUS-FLUS DC. This observation is similar to the BSRTCrecommendation, and the mean does fall within this rangewhen outliers are excluded from the analysis (6.4% whenexcluding Wu et al. [27]). Apart from the 7% reporting recom-mendation for the AUS-FLUS DC, there is a paucity of evi-dence regarding the appropriate distribution of other BSRTCDCs. Krane et al. [44] have suggested the utilization of an AUS:malignant ratio as a benchmark to ensure laboratories achievean appropriate distribution of BSRTC DCs. Currently, thereexists no recommendation for, or system to facilitate, internalvalidation for laboratories newly adopting the BSRTC.

Cancer risk

Comparing the ROMs calculated in the current study withROMs reported for the BSRTC (FIGURE 1), our review found anincreased cancer risk for non-diagnostic DC (11–26% in currentreview compared with 1–4% reported by the BSRTC) and AUS-FLUS DC (19–38% in current review compared with the5–15% reported by the BSRTC). Our observed cancer risk isalso slightly higher for the benign and FN-SFN DCs. Both themalignant and SFM DCs have calculated ROMs that are consis-tent with those reported by the BSRTC (TABLE 1).

*The test characteristics for this specific scenario (positivetest = Malignant or SFM or FN-SFN or AUS-FLUS) are presented

here because these diagnoses are most likely to undergo surgery, basedon data collected in this study.

Preoperative diagnosis of thyroid nodules using the BSRTC Review

informahealthcare.com doi: 10.1586/17446651.2014.887435

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

Table

3.Riskofmalignancy

instudiesevaluatedwithsu

mmary

statistics.

Study

TotalFN

AB

withfollow-up

histology

Non-diagnostic

ITotalcases

Malignant

histology

(%)

BenignII

Totalcases

Malignant

histology

(%)

AUS/FLU

SIII

Totalcases

Malignant

histology

(%)

FN/SFN

/HCN

IVTotalcases

Malignant

histology

(%)

Susp

iciousV

Totalcases

Malignant

histology

(%)

MalignantVI

Totalcases

Malignant

histology

(%)

199

6 1 16.67%

54

6 11.11%

10

4 40.00%

25

10

40.00%

3 1 33.33%

1 1 100.00%

2221

21

3 14.29%

63

6 9.52%

51

11

21.57%

49

13

26.53%

18

12

66.67%

19

19

100.00%

3451

19

5 26.32%

173

12

6.94%

8 1 12.50%

160

33

20.63%

24

14

58.33%

67

65

97.01%

4282

11

0 0.00%

96

9 9.38%

82

16

19.51%

33

12

36.36%

29

15

51.72%

31

30

96.77%

5110

2 1 50.00%

35

3 8.57%

44

8 18.18%

16

3 18.75%

0 0 0.00%

13

13

100.00%

6882

135

12

8.89%

307

20

6.51%

53

9 16.98%

177

45

25.42%

56

39

69.64%

154

151

98.05%

7378

25

8 32.00%

82

8 9.76%

27

13

48.15%

102

35

34.31%

30

26

86.67%

112

112

100.00%

8442

36

9 25.00%

223

22

9.87%

25

9 36.00%

35

23

65.71%

42

34

80.95%

81

77

95.06%

91331

235

47

20.00%

361

6 1.66%

204

50

24.51%

287

80

27.87%

75

73

97.33%

169

169

100.00%

10

193

0 0 0.00%

8 0 0.00%

33

24

72.73%

7 3 42.86%

33

33

100.00%

112

112

100.00%

11

1361

28

8 28.57%

158

4 2.53%

132

19

14.39%

698

224

32.09%

183

137

74.86%

162

161

99.38%

AUS:Atypia

ofundeterm

ined

significance;FLUS:Follicularlesionofundeterm

ined

significance;FN

:Follicularneo

plasm

;FN

AB:Fine-needle

aspirationbiopsy;HCN:Hurthle

cellneo

plasm

;SFN

:Su

spiciousfora

follicularneo

plasm

.

Review Sheffield, Masoudi, Walker & Wiseman

doi: 10.1586/17446651.2014.887435 Expert Rev. Endocrinol. Metab.

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

A potential explanation for the elevated ROMs we observedin the current study is the bias introduced when only consider-ing cases that have undergone a thyroid operation. This is bestdemonstrated in the benign DC that we found had a 7% can-cer risk (TABLE 3), a higher cancer risk than was suggested by theBSRTC (0–3% cancer risk). Clinical follow-up is often recom-mended for benign thyroid nodules, leading to the inferencethat the 2176 cytologically benign cases that underwent anoperation must have had an alternative surgical indication.Unfortunately, these specific surgical indications were not elab-orated upon in the source publications, but may have included:symptomatic goiters, cosmesis, malignant imaging characteris-tics, thyroid cancer predisposing inherited conditions such asMEN2, surgery performed for a different or higher risk thyroidlesion or hyperthyroidism.

In order to determine the true cancer risk, we would need toapply the gold standard, or thyroid histopathology following sur-gery, to all thyroid nodules undergoing FNAB, which for benignpreoperative diagnoses, is not clinically feasible. In the currentstudy, histopathology after surgery was the only diagnostic goldstandard for definitive diagnosis that we utilized. Other possiblediagnostic gold standards could include histopathology afterautopsy, and possibly long-term clinical follow-up of benignlesions. Future studies may choose to utilize these or other crite-ria in order to better evaluate the true risk of cancer for DCs thatdo not necessarily require an operation. While this ‘surgicalfollow-up’ bias is most apparent for the non-diagnostic andbenign DCs, it should be absent in the malignant and SFMDCs, both of which require surgical treatment. The ROMs forthese categories were 98–100% and 50–79%, respectively, andthese observations are consistent with the reported BSRTCROMs. It is unknown what proportion of the elevated ROM,that is present in the AUS-FLUS category, may be attributed tothis bias.

Another potential study bias is the inclusion of papillarymicrocarcinomas (tumors less than 1 cm in greatest dimension)in the study populations. These were occasionally mentionedin the studies evaluated, and where the incidence of papillarymicrocarcinoma was stated separately from incidence of papil-lary carcinoma, both were interpreted as cancer in the currentreview. The majority of studies did not specifically state howmicrocarcinomas were interpreted when calculating their malig-nancy rates. According to a review of international autopsydata, subclinical papillary microcarcinomas may be present inup to 39% of individuals [45].

Notably, the ROM for the AUS-FLUS DC was calculated tobe 19–38%, which is higher than the 5–15% risk reported by theBSRTC. The ROM in this DC also overlaps with the ROM forthe BSRTC FN-SFN DC that was calculated to range from 27 to40%. These observations are best appreciated in FIGURE 1, wherethe individual study means and the overall means for each DCare presented. FIGURE 3 presents the ROM statistic for the AUS-FLUS DC as compared with the ROM statistic for FN-SFNDC. Two studies (Richmond et al. [27] [study 1] and Croweet al. [30] [study 5]), reported very similar ROM statistics forT

able

3.Riskofmalignancy

instudiesevaluatedwithsu

mmary

statistics(cont.).

Study

TotalFN

AB

withfollow-up

histology

Non-diagnostic

ITotalcases

Malignant

histology

(%)

BenignII

Totalcases

Malignant

histology

(%)

AUS/FLU

SIII

Totalcases

Malignant

histology

(%)

FN/SFN

/HCN

IVTotalcases

Malignant

histology

(%)

Susp

iciousV

Totalcases

Malignant

histology

(%)

MalignantVI

Totalcases

Malignant

histology

(%)

12

1052

46

5 10.87%

247

18

7.29%

52

10

19.23%

326

105

32.21%

105

68

64.76%

276

272

98.55%

13

1242

77

8 10.39%

369

6 1.63%

84

20

23.81%

268

74

27.61%

288

173

60.07%

156

152

97.44%

Total

8044

641

107

16.69%

2176

120

5.51%

805

194

24.10%

2183

660

30.23%

886

625

70.54%

1353

1334

98.60%

Mean

(95%

CI)

18.69%

(11.16–26.22%

)

6.52%

(4.47–8.57%

)

28.27%

(19.02–37.53%

)

33.10%

(26.56–39.65%)

64.95%

(50.41–79.49%)

98.64%

(97.74–99.53%)

AUS:Atypia

ofundeterm

ined

significance;FLUS:

Follicularlesionofundetermined

significance;FN

:Follicularneo

plasm

;FN

AB:Fine-needle

aspirationbiopsy;HCN:Hurthle

cellneo

plasm

;SFN

:Suspiciousfora

follicularneo

plasm

.

Preoperative diagnosis of thyroid nodules using the BSRTC Review

informahealthcare.com doi: 10.1586/17446651.2014.887435

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

0%

012345678910111213

10%

20%

30%

40%

50%

RO

M (

%)

Study

60%

70%

80%

90%

100%

0%

012345678910111213

10%

20%

30%

40%

50%

RO

M (

%)

Study

60%

70%

80%

90%

100%

0%

012345678910111213

10%

20%

30%

40%

50%

RO

M (

%)

Study

60%

70%

80%

90%

100%

0%

012345678910111213

10%

20%

30%

40%

50%

RO

M (

%)

Study

60%

70%

80%

90%

100%

0%

012345678910111213

10%

20%

30%

40%

50%

RO

M (

%)

Study

60%

70%

80%

90%

100%

0%

012345678910111213

10%

20%

30%

40%

50%

RO

M (

%)

Study

60%

70%

80%

90%

100%

AB D

C

FE

Figure

1.Fo

rest

plots

ofrisk

ofmalignancy

forthesixBethesd

aSystem

forReportingThyroid

Cytopathologydiagnostic

categories.

Vertical

linesshow

ROM

as

predictedbyBethesdaSystem

forReportingThyroid

Cytopathology.

Diamondsshow

individualstudies’reportedROM.Largecirclesshow

meanofindividually

reportedstudies

ROM

±95%

CI(n

=13).(A

)DCInon-diagnostic.(B)DCIIben

ign.(C)DCIIIatypia

ofundeterm

inedsignifican

ceorfollicularlesionofundeterm

ined

significance.(D

)DCIV

follicularneoplasm

orsuspiciousforfollicularneoplasm

.(E)DCVsuspiciousformalignancy.(F)DCVImalignant.

DCs:Diagnosticcategories;ROM:Riskofmalignan

cy.

Review Sheffield, Masoudi, Walker & Wiseman

doi: 10.1586/17446651.2014.887435 Expert Rev. Endocrinol. Metab.

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

these DCs. Furthermore, Theoharis et al. [32] (study 7) andKim et al. [35] (study 10) reported higher ROMs for the AUS-FLUS DC than for the FN-SFN DC. These observations raisedoubts regarding the validity of 5–15% ROM for the AUS-FLUS DC presented by the BSRTC. Additionally, the recom-mended clinical management for nodules in the AUS-FLUSDC is repeat FNAB, whereas for FN-SFN the recommendedmanagement is a diagnostic thyroid lobectomy. The major dif-ference in the clinical management of these DCs is interesting,since we found that they both have an almost equivalent cancerrisk. This discrepancy seems to have already been recognized bytreating clinicians. In the current review, all the included studieswere evaluated for the rate of surgical intervention for the AUS-FLUS DC. This statistic was available for 8 of 13 studiesreviewed and is presented in TABLE 4. These findings clearly sug-gest that a significant proportion of nodules diagnosed as AUS-FLUS undergo a diagnostic operation. Thus, not only is thecancer risk similar in the AUS-FLUS DC to other DCs, but itsclinical management may also be similar.

Test characteristics

As already discussed, measuring the test characteristics of thy-roid FNABs is challenging, as there is no clearly accepted

definition for a ‘positive’ or ‘negative’ result. Lewis et al. [12]

reported a wide variation in thyroid FNAB test characteristicspublished in the medical literature. The explanation for hetero-geneity in observed results was that for each calculation theindeterminate cytopathology results were treated variably, aseither being positive or negative results, or being excluded fromthe calculation entirely. In this review, we calculated test char-acteristics for thyroid FNABs with every possible combinationof interpretations as to what could be considered a positive ora negative test result, and all our calculations used final histopa-thology as the gold standard for diagnosis. Two of these com-binations are selected and shown in FIGURE 2 with Case Aillustrating an ‘ideal’ scenario, and Case B depicting the more‘real-world’ scenario. Overall, the cytopathological diagnosis ofthyroid nodules is generally quite reliable if the diagnosis iseither benign or malignant, and fortunately the majority (67%)of all FNABs are within these 2 DCs (data from this review).The remaining third of all thyroid nodules that are given anindeterminate diagnosis present the treating clinician with amanagement dilemma. Given that AUS-FLUS DC cases arelikely to undergo surgical intervention, perhaps it is more clini-cally appropriate that a ‘positive test’, or test result that suggeststhe need for surgical intervention, includes the malignant,

Positive = malignant; negative = benign

Histology 95% CIMalignant Benign Total Sensitivity 91.75% 90.84% 92.65%

FN

A

+ 1334 19 1353 Specificity 99.08% 98.77% 99.40%

- 120 2056 2176Pre-testprobability 41.20%†

Total 1454 2075 3529 PPV 98.60% 98.21% 98.98%NPV 94.49% 93.73% 95.24%

LR + 100.1973Post test + 98.60%‡

LR - 0.083294Post test - 5.51%‡

Positive = malignant, SFM, FN-SFN, AUS-FLUS; negative = benign

Histology 95% CIMalignant Benign Total Sensitivity 95.91% 95.46% 96.36%

FN

A

+ 2813 2414 5227 Specificity 46.00% 44.86% 47.13%

- 120 2056 2176Pre-testprobability 39.62%†

Total 2933 4470 7403 PPV 53.82% 52.68% 54.95%NPV 94.49% 93.97% 95.01%

LR + 1.775939Post test + 53.82%‡

LR - 0.088952Post test - 5.51%‡

A

B

Figure 2. Test characteristics of thyroid FNAB. (A) Positive FNAB interpreted as malignant (Bethesda VI). (B) Positive FNAB interpretedas malignant or SFM or FN-SFN or AUS-FLUS (Bethesda III, IV, V, VI).†Based on the population used to calculate the test characteristics.‡Based on the pre-test probability presented in each case. Post-test probabilities can be calculated using different pre-testprobabilities using an appropriate nomogram.AUS: Atypia of undetermined significance; FLUS: Follicular lesion of undetermined significance; FNAB: Fine-needle aspiration biopsy;SFM: Suspicious for malignancy.

Preoperative diagnosis of thyroid nodules using the BSRTC Review

informahealthcare.com doi: 10.1586/17446651.2014.887435

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

SFM, FN-SFN and AUS-FLUS DCs, since these DCs usuallyall lead to an operative intervention. The test characteristics ofFNABs that utilize the BSRTC under these conditions are pre-sented in FIGURE 2B.

If a FNAB was carried out (using test characteristics calcu-lated in FIGURE 2B) on three illustrative patients, one with a low,another with an intermediate and another with a high pre-testthyroid cancer risk, the post-test cancer risks are summarizedin FIGURE 4. In the patient with a low pre-test cancer risk (5%cancer risk), a positive FNAB result is unable to definitivelydiagnose cancer, while a negative FNAB is sufficient to excludecancer. In the intermediate-risk patient (40% pre-test cancer

risk, the approximate incidence of cancerin the follow-up histology specimens inthis study) even though a benign FNABdiagnosis may be able to exclude cancer(6% post-test cancer risk), a positive resultcannot definitively diagnose malignancy(54% post-test cancer risk). In the theoret-ical high-risk patient (80% pre-test cancerrisk, chosen for illustrative purposes),FNAB is unable to definitively diagnose orexclude cancer (88%, 26% post-test +/-cancer risk). As seen here, the test charac-teristics of FNAB break down when thetest is applied to the illustrative high-riskpatient, thus bringing the clinical utility ofthe BSRTC in this specific setting intoquestion. Based upon the calculationsshown in FIGURES 2B & 4, we can concludethat the only truly diagnostic use of preop-erative FNAB is to rule out cancer in alow-risk patient. For all other scenarios,the test leaves significant room for error. Itshould also be noted that in other clinical

scenarios, such as for the high-risk patient, FNAB may still beuseful in establishing less common diagnoses such as lymphomaor thyroid cancer in lymph node metastases, which have differentclinical management algorithms.

Future directions

Despite recent advances in medicine, thyroid nodules continueto present a diagnostic and management dilemma for patholo-gists, endocrinologists, surgeons and other clinicians who areinvolved in their treatment. Operative decision-making isheavily influenced by the FNAB diagnosis, which in itself isbased upon morphologic criteria. As demonstrated in thisreview, thyroid cytopathology has limitations and is unable todefinitively diagnose approximately a third of thyroid nodules.Future endeavors may include the further refinement of termi-nology and DCs, with updated cancer risk estimation as well asclinical management recommendations. More recently, toaddress the limitations of thyroid cytopathology much attentionhas also been paid to non-morphological elements as adjunctsto traditional cytomorphology. These non-morphological ele-ments include immunocytochemistry (ICC) and molecularscreening methodologies.

ICC is a derivation of the well-established technique ofimmunohistochemistry that is commonly applied to surgicalspecimens. ICC is an antibody-based assay that allows the cyto-pathologist to determine whether a given antigen or molecularmarker is present, its location within the cellular compartmentand its level of expression. A review of studies investigating theuse of immunocytochemical markers in thyroid FNABs haspreviously been reported by our group [46].

Molecular analysis is yet another diagnostic avenue that hasbeen explored in order to reduce the number of indeterminate

ROM (AUS-FLUS + FN-SFN )

0

1

2

3

4

5

6

7

8

9

10

11

12

13

0.00% 20.00% 40.00% 60.00% 80.00% 100.00%

ROM

Study

Figure 3. Risk of malignancy for both atypia of undetermined significancefollicular lesion of undetermined significance (squares) and follicular neoplasmor suspicious for a follicular neoplasm (triangles). Grand means are shown as study0 with error bars representing 95% CIs.AUS: Atypia of undetermined significance; FLUS: Follicular lesion of undeterminedsignificance; FN-SFN: Follicular neoplasm or suspicious for a follicular neoplasm;ROM: Risk of malignancy.

Table 4. Rate of surgical intervention for atypiaof undetermined significance-follicular lesion ofundetermined significance diagnosis†.

Study Percent treated surgically

3 100%

5 62%

6 52%

7 30.3%

9 37%

11 53.2%

12 40.6%

13 58.3%

Mean (95% CI) 54.18% (39.28–69.07%)†Includes only studies who explicitly stated the rate of surgical follow-up or thosepublishing data in a manner that facilitated extraction.

Review Sheffield, Masoudi, Walker & Wiseman

doi: 10.1586/17446651.2014.887435 Expert Rev. Endocrinol. Metab.

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

thyroid FNABs that undergo surgical intervention. This gener-ally involves the analysis of FNAB specimens using moleculartechniques such as the PCR. Variations of this technique canidentify mutations in oncogenes that are highly specific for thy-roid cancer such as BRAF. Nikiforov et al. [47] reported onhow indeterminate BSRTC DC cases could be reassigned tohigher- or lower-risk groups by evaluating a panel of genesusing a PCR-based test. While molecular changes such asBRAF mutations have proven highly specific for thyroid can-cer, targeted molecular assays have generally had limited successin accurately identifying benign thyroid lesions. Anothermolecular-based strategy that has been developed to boost thediagnostic accuracy of FNABs is nucleic acid array-based test-ing. These techniques usually evaluate the RNA content of aFNAB specimen from a cytologically indeterminate thyroidlesion. Using bioinformatic classification analyses, which com-pares the tested material to known benign and malignant refer-ence standards, these assays are able to differentiate benign andmalignant RNA-expression profiles. Alexander et al. [48]

recently reported the results of a gene-expression classifier thatwas used to evaluate cases with indeterminate BSRTC diagno-ses. For thyroid lesions with BSRTC diagnoses that were AUS-FLUS, FN-SFN and SFM, the assay had NPVs that were 95,94 and 85%, respectively. The ability to exclude cancer incytologically indeterminate specimens could potentially signifi-cantly reduce the number of diagnostic thyroid operations.

Unfortunately, neither ICC nor molecular analysis hasbecome routinely adopted into the clinical management ofthyroid nodules. Undoubtedly, the adoption of ICC andmolecular testing into the algorithm of thyroid nodule man-agement, as a means to overcome the limitations of cytopa-thology shows promise, but will importantly also need totake economic considerations into account as these tests maylead to increased healthcare-related costs. A recent report byour group [49] evaluated the cost–effectiveness of a thyroidcancer molecular diagnostic test used adjunctively withFNAB, showed a slight benefit in both cost and quality oflife, though these conclusions are drawn based upon theassumption that the molecular test had excellent performancecharacteristics (SN, SP >95%).

ConclusionThe BSRTC represents an important advance in the establish-ment of a standardized system for reporting thyroid cytopathol-ogy. It has already, and will continue to have a positive impacton patient care as well as research in the field of thyroid cancerdiagnosis. This review validates several elements of the BSRTC,including the cancer rate in malignant, SFM, FN-SFN, benignand non-diagnostic DCs, suggesting that these diagnoses arereproducible at different institutions, in different patient popu-lations and at different geographic locations. The currentreview also suggests that the risk of cancer is significantlyhigher than has been previously suggested for the AUS-FLUSDC, and its recommended BSRTC clinical managementshould be reconsidered. Most importantly, by computing the

test characteristics of FNABs, we have highlighted the need forcaution in making clinical decisions based solely upon thecytopathologic diagnosis.

Expert commentaryThyroid nodules are common and FNAB represents the singlemost important test for their diagnosis. The BSRTC representsan important advancement in the field because it provides astandardized system for the reporting of thyroid FNABs. How-ever, currently one-third of all FNABs are diagnosed as indeter-minate, with no clearly established clinical managementguidelines. Molecular analysis of these indeterminate aspirateshas become of increasing importance, and we are beginning tosee its early adoption in the clinic. The role of molecular diag-nostics is currently being filled at most centers by clinical judg-ment, a tried and tested, but not always an objective orscientific technique. In this review, the BSRTC has been vali-dated and the test characteristics of this system are reported tofacilitate its clinical utilization. The BSRTC will continue toprove useful in both the clinical management and study ofthyroid nodules.

Five-year viewThe BSRTC has already been adopted into routine clinicalpractice by many institutions worldwide. As individual centersreport and validate their experiences with the BSRTC, addi-tional institutions will continue to adopt the system, and fur-ther refinements may make it more widely accepted andadopted by other centers. It was the aim of this review to eval-uate the true cancer risk associated with each BSRTC DC sothat the medical team involved in thyroid nodule managementmay incorporate this information into their treatment plan. Asutilization of the BSRTC becomes more widespread, and avail-able outcome data for analysis increases, further refinement ofthe cancer risk and recommended clinical actions associatedwith each cytopathological DC will be possible. Expanded

Pre-FNAB cancer risk FNAB Post-FNAB cancer risk

5%9%

<1%

40%54%

6%

80%88%

26%

+-

+-

+-

Figure 4. Post-test cancer risk for three pre-test cancer risks†.†Based on likelihood ratios calculated in FIGURE 2B (FNAB isinterpreted as positive if: malignant, suspicious for malignancy,follicular neoplasm or suspicious for a follicular neoplasm, atypiaof undetermined significance-follicular lesion of undeterminedsignificance and interpreted as negative if benign).FNAB: Fine-needle aspiration biopsy.

Preoperative diagnosis of thyroid nodules using the BSRTC Review

informahealthcare.com doi: 10.1586/17446651.2014.887435

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

utilization of the BSRTC will also facilitate both intra- andinter-laboratory quality assurance endeavors, adding to thereproducibility of the system and allow further improvement inthe accuracy of FNAB.

Even though FNAB has remained the cornerstone of pre-operative thyroid nodule diagnostic evaluation, in the futurein addition to cytomorphological data, for indeterminatecases adjunctive ICC and/or molecular analysis will assistthe cytopathologist in establishing a definitive diagnosis.The current revised American Thyroid Association Guide-lines specifically state that molecular markers may beemployed in cases of indeterminate FNAB cytology [50]. Asthese novel techniques undergo further refinement and vali-dation, and with decreasing costs per test, their clinical util-ity will increase and they will emerge with a more definitiverole in the management algorithm of thyroid nodules. Theavailability of such specialized tests may also have an impact

on the performance and interpretation of thyroid FNABs atsmaller volume centers, and potentially lead to the migra-tion of diagnostic thyroid cytology to reference centers forcytopathological evaluation and ancillary testing. Thus, theongoing evolution and refinement of thyroid nodule cytopa-thology, of which the BSRTC represents an important earlystep, will undoubtedly lead to improved management andoutcomes of individuals diagnosed with both benign andmalignant disease.

Financial & competing interests disclosure

The authors have no relevant affiliations or financial involvement with

any organization or entity with a financial interest in or financial conflict

with the subject matter or materials discussed in the manuscript. This

includes employment, consultancies, honoraria, stock ownership or options,

expert testimony, grants or patents received or pending or royalties.

No writing assistance was utilized in the production of this manuscript.

Key issues

• Fine-needle aspiration biopsy is currently the best diagnostic test for the preoperative evaluation of a thyroid nodule.

• Clear communication of cytopathological results from the pathologist to the treating clinician(s) is of critical importance for optimal

patient management.

• The Bethesda System for Reporting Thyroid Cytopathology (BSRTC) provides a well-defined framework for communicating

cytopathological diagnoses from the pathologist to the clinician.

• The BSRTC consists of 6 diagnostic groups, each having an associated specific cancer risk, and estimates of these risks based upon

clinical reports have been calculated in this review.

• The atypia of undetermined significance or follicular lesion of undetermined significance diagnostic group confers a cancer risk that is

significantly higher than the cancer risk suggested by the BSRTC, and this finding must be considered when managing cases in this group.

• In typical clinical utilization, the BSRTC is highly sensitive (96%), but lacks specificity (46%) for thyroid cancer diagnosis.

• Decisions regarding the surgical management of cases with indeterminate cytopathology should incorporate the specific cytopathologic

diagnosis, as well as other relevant clinical parameters.

• Ancillary tests for accurately diagnosing indeterminate cases are being actively investigated, and are currently available at select

institutions, over time these modalities will become increasingly incorporated into the clinical management algorithm of thyroid nodules.

References

Papers of special note have been highlighted as:

• of interest

•• of considerable interest

1. Maitra A. The endocrine system. In:

Kumar V, Abbas AK, Fausto N, Aster JC,

editors. Pathologic basis of disease (1107-

1126). Elsevier; Philadelphia, PA, USA: 2010

• An excellent, concise textbook-chapter

review of thyroid pathophysiology.

2. Tan GH, Gharib H. Thyroid

incidentalomas: management approaches to

nonpalpable nodules discovered incidentally

on thyroid imaging. Ann Intern Med 1997;

126(1):226-331

3. Davies L, Welch HG. Increasing incidence

of thyroid cancer in the United States

1973-2002. JAMA 2006;295(18):2164-7

4. Randolph GW, Thompson GB,

Branovan DI, Tuttle M. Treatment of

thyroid cancer: 2007 - a basic review. Int J

Radiat Oncol Biol Phys 2007;69(2):92-7

5. Sosa JA, Udelsman R. Total thyroidectomy

for differentiated thyroid cancer. J Surg

Oncol 2006;94(8):701-7

6. Hamberger B, Gharib H, Melton LJ,

et al. Fine-needle aspiration biopsy of

thyroid nodules. Impact on thyroid

practice and cost of care. Am J Med 1982;

73(3):381-4

7. Hadi M, Gharib H, Goellner JR,

Heerden JA. Has fine-needle aspiration

biopsy changed thyroid practice? Endocr

Pract 1997;3(1):9-13

8. Yassa L, Cibas ES, Benson CB, et al.

Long-term assessment of a multidisciplinary

approach to thyroid nodule diagnostic

evaluation. Cancer 2007;111(1):508-16

• One of 13 articles included for meta-

analysis. This report is the basis for many

of the cancer risks published in

association with the Bethesda System for

Reporting Thyroid Cytopathology

(BSRTC).

9. Gharib H, Papini E, Valcavi R, et al.

American association of clinical

endocrinologists and associazione medici

endocrinology medical guidelines for clinical

practice or the diagnosis and management

of thyroid nodules: AACE/AME task force

on thyroid nodules. Endocr Pract 2006;

12(1):63-102

• Contains clinical management guidelines,

also see Reference 51.

Review Sheffield, Masoudi, Walker & Wiseman

doi: 10.1586/17446651.2014.887435 Expert Rev. Endocrinol. Metab.

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

10. Kitano M, Sugitani I, Toda K, et al.

Cytopathological review of patients that

underwent thyroidectomies based on the

diagnosis of papillary thyroid carcinoma by

fine needle aspiration cytology but were later

found to have benign tumors by

histopathology. Surg Today 2013;43(6):632-7

11. Redman R, Yoder BJ, Massoll NA.

Perceptions of diagnostic terminology and

cytopathologic reporting of fine-needle

aspiration biopsies of thyroid nodules:

a survey of clinicians and pathologists.

Thyroid 2006;16(10):1003-8

•• This is an interesting report, which

highlights the role for the BSRTC. This

study consists of a survey of pathologists

and clinicians, asking about diagnostic

terminology used, as well as asking

clinicians how they perceive and react to

specific terminology.

12. Lewis CM, Chang KP, Pitman M, et al.

Thyroid fine-needle aspiration biopsy:

variability in reporting. Thyroid 2009;19(7):

717-23

•• This report examines published test

characteristics of fine-needle aspiration

biopsy (FNAB). Heterogeneity in the

methods used in literature reporting of

thyroid FNAB test characteristics is nicely

demonstrated.

13. Ali SZ, Cibas ES. editors. The Bethesda

system for reporting thyroid cytopathology.

Springer; NY: USA; 2010

•• An essential reference describing the

BSRTC.

14. Baloch ZW, Alexander EK, Gharib H,

Raab SS. Overview of diagnostic

terminology and reporting. In: Ali SZ,

Cibas ES, editors. The Bethesda system for

reporting thyroid cytopathology. Springer;

NY, USA; 2010

15. Wong LQ, Baloch ZW. Analysis of the

Bethesda system for reporting thyroid

cytopathology and similar precursor thyroid

cytopathology reporting schemes. Adv Anat

Pathol 2012;19(5):313-19

• Review and meta-analysis examining data

sets from both before and after adoption

of the BSRTC, showing calculated test

characteristics for each individual report.

A nice comparison with the current

study.

16. Bongiovanni M, Spitale A, Faquin WC,

et al. The Bethesda system for reporting

thyroid cytopathology: a meta-analysis. Acta

Cytol 2012;56(1):333-9

• This meta-analysis of the BSRTC has

essentially the same results to the current

study, though different statistical

methodologies and inclusion criteria are

employed to arrive at similar conclusions.

17. Chen JC, Pace SC, Chen BA, et al. Yield of

repeat fine-needle aspiration biopsy and rate

of malignancy in patients with atypia or

follicular lesion of undetermined

significance: the impact of the Bethesda

system for reporting thyroid cytopathology.

Surgery 2012;152(1):1037-44

18. Rabaglia JL, Kabbani W, Wallace L, et al.

Effect of the Bethesda system for reporting

thyroid cytopathology on thyroidectomy

rates and malignancy risk in cytologically

indeterminate lesions. Surgery 2010;148(1):

1267-33

19. Teixeira GV, Chikota H, Teixeira T, et al.

Incidence of malignancy in thyroid nodules

determined to be follicular lesions of

undetermined significance on fine-needle

aspiration. World J Surg 2011;36(1):69-74

20. Park JH, Kim HK, Kang SW, et al. Second

opinion in thyroid fine-needle aspiration

biopsy by the Bethesda system. Endocr J

2012;59(3):205-12

21. Davidov T, Trooskin SZ, Shanker BA,

et al. Routine second-opinion cytopathology

review of thyroid fine needle aspiration

biopsies reduces diagnostic thyroidectomy.

Surgery 2010;148(1):1294-301

22. Walts AE, Bose S, Fan X, et al.

A simplified Bethesda system for reporting

thyroid cytopathology using only four

categories improves intra- and inter-observer

diagnostic agreement and provides

non-overlapping estimates of malignancy

risks. Diagn Cytopathol 2012;40(1):62-8

23. Horne MJ, Chhieng DC, Theoharis C,

et al. Thyroid follicular lesion of

undetermined significance: evaluation of the

risk of malignancy using the two-tier

sub-classification. Diagn Cytopathol 2012;

40(1):410-15

24. Bose S, Walts AE. Thyroid fine needle

aspirate: a post-Bethesda update. Adv Anat

Pathol 2012;19(3):160-9

25. Bongiovanni M, Krane JF, Cibas ES,

Faquin WC. The atypical thyroid

fine-needle aspiration: past, present, and

future. Cancer 2012;120(1):73-86

• A good review of the adoption of BSRTC

with focus on the atypia of undetermined

significance or follicular lesion of

undetermined significance category.

26. Richmond BK, O’Brien BA, Mangano W,

et al. The impact of implementation of the

Bethesda System for Reporting Thyroid

Cytopathology on the surgical treatment of

thyroid nodules. Am Surg 2012;78(6):

706-10

27. Wu HH, Rose C, Elsheikh TM. The

Bethesda system for reporting thyroid

cytopathology: an experience of 1,382 cases

in a community practice setting with the

implication for risk of neoplasm and risk of

malignancy. Diagn Cytopathol 2012;40(5):

399-403

28. Bohacek L, Milas M, Mitchell J, et al.

Diagnostic accuracy of surgeon-performed

ultrasound-guided fine-needle aspiration of

thyroid nodules. Ann Surg Oncol 2012;

19(1):45-51

29. Broome JT, Solorzano CC. The impact of

atypia/follicular lesion of undetermined

significance on the rate of malignancy in

thyroid fine-needle aspiration: Evaluation of

the Bethesda system for reporting thyroid

cytopathology. Surgery 2011;150(6):

1234-41

30. Crowe A, Linder A, Hameed O, et al. The

impact of implementation of the Bethesda

System for Reporting Thyroid Cytopathology

on the quality of reporting, “risk” ofmalignancy, surgical rate, and rate of frozen

sections requested for thyroid lesions. Cancer

Cytopathol 2011;119(1):315-21

31. Jo VY, Stelow EB, Dustin SM, Hanley KZ.

Malignancy risk for fine-needle aspiration of

thyroid lesions according to the Bethesda

System for Reporting Thyroid

Cytopathology. Am J Clin Pathol 2010;

134(1):450-6

32. Theoharis CGA, Schofield KM,

Hammers L, et al. The Bethesda thyroid

fine-needle aspiration classification system:

year 1 at an academic institution. Thyroid

2009;199(11):1215-23

33. Ozluk Y, Pehlivan E, Gulluoglu MG, et al.

The use of the Bethesda terminology in

thyroid fine-needle aspiration results in a

lower rate of surgery for non-malignant

nodules: a report from a reference center in

Turkey. Int J Surg Pathol 2011;19(6):

761-71

34. Renshaw AA. Should “atypical follicularcells” in thyroid fine-needle aspirates be

subclassified? Cancer Cytopathol 2010;

118(1):186-9

35. Kim SK, Hwant TS, Yoo YB, et al. Surgical

results of thyroid nodules according to a

management guideline based on the BRAF

(V600E) mutation status. J Clin Endocrinol

Metab 2011;96(1):658-64

36. Bongiovanni M, Crippa S, Baloch Z, et al.

Comparison of 5-tiered and 6-tiered

diagnostic systems for the reporting of

Preoperative diagnosis of thyroid nodules using the BSRTC Review

informahealthcare.com doi: 10.1586/17446651.2014.887435

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.

thyroid cytopathology. Cancer Cytopathol

2012;120(1):117-25

37. Yang J, Schnadig V, Logrono R,

Wasserman PG. Fine-needle aspiration of

thyroid nodules: a study of 4703 patients

with histologic clinical correlations. Cancer

Cytopathol 2007;111(1):306-15

38. VanderLaan PA, Marqusee E, Krane JF.

Features associated with locoregional spread

of papillary carcinoma correlate with

diagnostic category in the Bethesda system

for reporting thyroid cytopathology. Cancer

Cytopathol 2012;120(1):245-53

39. Marchevsky AM, Walts AE, Bose S, et al.

Evidence-based evaluation of the risks of

malignancy predicted by thyroid fine-needle

aspiration biopsies. Diagn Cytopathol 2010;

38(1):252-9

40. Nayar R, Ivanovic M. The indeterminate

thyroid fine-needle aspiration. Cancer

Cytopathol 2009;117(1):195-202

41. Badenoch D, Sackett D, Straus S, et al.

CATmaker �. The Centre for Evidence

Based Medicine. 2004. Available form

www.cebm.net/

•• The Centre for Evidence Based Medicine

is based out of Oxford University, the

website (above) provides excellent

reference material for physicians and

researchers. Specifically the CATmaker

program facilitates calculation of test

characteristics such as sensitivity

specificity, positive predictive value,

negative predictive value and both

positive and negative likelihood ratios.

42. Layfield LJ, Morton MJ, Cramer HM,

Hirschowitz S. Implications of the proposed

thyroid fine-needle aspiration category of

“follicular lesion of undetermined

significance”: a five-year multi-institutional

analysis. Diagn Cytopathol 2009;37(10):

710-14

• Interesting examination of variability in

reporting of the atypia of undetermined

significance or follicular lesion of

undetermined significance diagnostic

category.

43. Krane JF, Nayar R, Renshaw A. Atypia of

undetermined significance/follicular lesion of

undetermined significance. In: Ali SZ,

Cibas ES, editors. The Bethesda system for

reporting thyroid cytopathology. Springer;

NY, USA: 2010

44. Krane JF, VanderLaan PA, Faquin WC,

Renshaw AA. The atypia of undetermined

significance/follicular lesion of undetermined

significanc:Malignant ratio. Cancer

Cytopathol 2011;120(1):111-16

45. Kovacs GL, Gonda G, Vadasz G, et al.

Epidemiology of thyroid microcarcinoma

founding autopsy series conducted in areas

of different iodine intake. Thyroid 2005;

15(2):152-7

46. Griffith OL, Chiu CG, Gown AM, et al.

Biomarker panel diagnosis of thyroid cancer:

a critical review. Expert Rev Anticancer

Ther 2008;8(9):1399-413

47. Nikiforov YE, Ohori P, Hodak SP, et al.

Impact of mutational testing on the

diagnosis and management of patients with

cytologically indeterminate thyroid nodules:

a prospective analysis of 1056 FNA samples.

J Clin Endocrinol Metab 2011;96(11):

3390-7

• Interesting demonstration of PCR-based

molecular assay for thyroid cancer.

48. Alexander EK, Kennedy GC, Baloch ZW,

et al. Preoperative diagnosis of benign

thyroid nodules with indeterminate

cytology. N Eng J Med 2012;367(8):705-15

•• Demonstration of array-based molecular

testing. Results indicate that a microarray

with classifier can effectively identify

benign RNA expression signature in

thyroid FNAB specimens.

49. Najafzadeh M, Marra CA, Lynd LD,

Wiseman SM. Cost-effectiveness of using a

molecular diagnostic test to improve

preoperative diagnosis of thyroid cancer.

Value Health 2012;15(8):1005-13

•• Analysis of the cost and benefit of a novel

molecular assay. This paper shows

potential for a test used in adjunct with

FNAB to have a wide-scale impact on

patient care. Highlights both the cost of

the test as well as the test characteristics

that are rate limiting.

50. Cooper DS, Doherty GM, Haugen BR,

et al. Revised American Thyroid Association

management guidelines for patients with

thyroid nodules and differentiated thyroid

cancer. Thyroid 2009;19(11):1167-214

Review Sheffield, Masoudi, Walker & Wiseman

doi: 10.1586/17446651.2014.887435 Expert Rev. Endocrinol. Metab.

Exp

ert R

evie

w o

f E

ndoc

rino

logy

& M

etab

olis

m D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y 13

9.17

3.54

.11

on 0

2/17

/14

For

pers

onal

use

onl

y.