|
August 2005
Cover Story
Like the perennial debate over which team is better—the New York
Yankees or Boston Red Sox, the debate about FISH versus IHC for HER2 measurement
appears to be everlasting. Of course, in the case of HER2 detection, the
stakes are much higher.
As the anti-HER2 antibody trastuzumab becomes used in more clinical situations,
clinicians increasingly want to identify women with breast cancer who
are more likely to benefit from this expensive and often effective therapy.
Like baseball fans who cite voluminous statistics, those who advocate
measuring HER2 protein overexpression by immunohistochemistry, or IHC,
and those who favor HER2 gene amplification by fluorescence in situ hybridization,
or FISH, cite numerous arguments and extensive data to support the advantages
of their preferred method. A prototypical exchange might go like this:
FISH advocate: FISH directly visualizes the underlying genetic
abnormality.
IHC advocate: Whats important for carcinogenesis is HER2 protein
overexpression. It appears that in occasional cases overexpression can
occur without gene amplification.
FISH advocate: IHC gives false- positive results for some 3+
samples and has a gray zone of 2+ results.
IHC advocate: There is no compelling evidence that IHC 3+/FISH-negative
cases have poorer prognosis. Nor is there compelling evidence that IHC
2+/FISH-positive cases are more likely to benefit from trastuzumab. And
the few IHC 2+ cases can be reflexed to FISH.
FISH advocate: Interpreting IHC is too subjective. Surveys show
that IHC results vary when done by inexperienced laboratories.
IHC advocate: Interobserver variability with IHC among experienced
laboratories is very good. And FISH results, too, can vary when done by
inexperienced laboratories.
FISH advocate: More laboratories are using FISH, both as the
initial test and for confirmation of IHC results. FISH is especially helpful
for needle core biopsies, where IHC has artifacts.
IHC advocate: IHC is faster, less labor-intensive, and less
expensive. That’s why most laboratories use IHC as their initial screening
test.
The situation can indeed be confusing. What’s most important, says Kenneth
Bloom, MD, medical director at Clarient Inc. in Irvine, Calif., and clinical
professor of pathology at the University of Southern California, is that
IHC and FISH "are both great tests." Dr. Bloom does favor IHC. "I don’t
want to sound negative about FISH," he says. "I use it every day and it’s
a great test. But a test that is done under 100x power in a dark room
with no landmarks on the slide to me is not the epitome of a screening
test." In his experience, a large majority of laboratories use IHC as
the initial screen and then reflex to FISH in some cases, while about
30 percent of laboratories use FISH as their first test.
To D. Craig Allred, MD, professor of pathology in the Breast Center at
Baylor College of Medicine, Houston, it makes more sense financially to
do IHC for all primary screening. IHC done accurately provides a definitive
answer on 80 percent or more of cases in a rapid, cost-effective manner.
Several expert guidelines recommend referring IHC 2+ cases to FISH, and
reporting the one-third or so of samples that are amplified as positive.
"That’s probably a good algorithm," Dr. Allred says. "Doing FISH up front
would be no more accurate and would cost a lot more and take a lot more
time."
FDA-approved kits have made IHC easier, Dr. Allred notes. "If you follow
the manufacturer’s directions, you should get accurate results most of
the time."
Patrick Roche, PhD, senior director for reagent development at Ventana
Medical Systems Inc., favors protein testing as the first-line assay,
based on his past experience at the Mayo Clinic. "Intermediate protein
expression [IHC 2+] requires evaluation of gene status," he says. Thomas
Grogan, MD, founder and chief medical officer at Ventana and professor
of pathology at the University of Arizona Medical Center, agrees with
Dr. Allred: "Follow the recommended controls religiously," he advises.
He believes that instrumentation and automation can improve test standardization.
"Instrumentation allows people to get the same results from laboratory
to laboratory."
Ann Thor, MD, chair of the Department of Pathology at the University
of Oklahoma Health Science Center, also sees most laboratories doing IHC
for initial screening, with FISH for IHC 2+ cases. "There may be a few
institutions doing FISH for all cases," she says, "but given the cost
and reimbursement issues, probably not a lot. And there is no evidence
that you should use FISH on all cases."
A recent shift has been more reporting of nonadjusted data for the Abbott/Vysis
FISH kit. "Some institutions now give the centromeric score, the HER2
gene copy number, and the adjusted score [ratio]," Dr. Thor says. "There
has been some concern that adjusted score may miss some women who may
benefit from trastuzumab."
In the research setting, a developing trend in cooperative group clinical
trials is to measure "molecular partners" of the HER2 receptor. "Many
trials have moved beyond trastuzumab and HER2 testing to look at combining
traditional agents with newer reagents that target HER2 and a binding
partner," Dr. Thor says.
One pathologist who is using much more FISH is Atul Bhan, MD, director
of the immunopathology unit at Massachusetts General Hospital and professor
of pathology at Harvard Medical School. "We do FISH on all core biopsies,"
says Dr. Bhan. For excisional biopsies, use of FISH is based on whether
the specimen is IHC 2+. "We now have a tendency to do FISH on all cases
except those that are 3+," Dr. Bhan says. "As I see it, oncologists would
be happy if FISH was done at some time on all cases."
Raymond R. Tubbs, DO, chair of the Department of Clinical Pathology at
the Cleveland Clinic Foundation, thinks that the dominance of IHC for
initial screening "is likely to change in the near future as bright-field
assays become more mainstream and experience is gained with these assays."
Dr. Tubbs and colleagues have been developing such an assay, which has
the potential to show gene copy number and protein expression on the same
slide. "Most pathologists do not want to take on the additional instrumentation
and training of personnel required for FISH," he observes. "Having an
assay that fits in with how we use a microscope and with our workload
makes a lot of sense." In his view, bright-field measurement of HER2 genes
will eventually become "the method of choice."
To compare the performance of IHC and FISH, Dr. Bloom has done an extensive
analysis of the available data. He focuses on the FDA’s review when Genentech
submitted an amended claim to have patients qualified for trastuzumab
therapy by FISH. (IHC was the method used in clinical trials and the method
included in the original trastuzumab approval, though both FDA-approved
IHC assays differ from the predicate IHC assay.) The FDA’s review found
that an IHC 3+ score and FISH positivity were virtually identical in relative
risk reduction with chemotherapy plus trastuzumab, both for time to progression
and for overall survival. "So independently either FISH or IHC helped
you predict who would best respond to trastuzumab therapy," Dr. Bloom
says.
The FDA also looked at two discrepant categories, IHC 2+/FISH-positive
and IHC 3+/FISH-negative. FISH positivity did not confer any clinical
advantage for IHC 2+ cases, Dr. Bloom says. "That would argue that reflexing
IHC 2+ cases to FISH wouldn’t get you much of a benefit," he says. On
the other hand, IHC 3+ cases did benefit from addition of trastuzumab
in time to progression (though not in overall survival) even when they
were FISH-negative. Dr. Bloom cautions that each category contained small
numbers of patients, 32 and 43 respectively, out of 422 total patients,
so the data were not strong enough for statistically firm statements.
But they clearly don’t support the notion that FISH is more accurate.
So where did the idea come from that FISH is superior? Investigators
comparing the two methods often combine data from IHC 2+ and 3+, Dr. Bloom
finds. "Since IHC 2+ appears to have no significant benefit, that artificially
makes FISH look better." One trial of single-agent, first-line trastuzumab
therapy for metastatic breast cancer analyzed outcomes for IHC 2+ and
3+ separately. Response rates were 35 percent for IHC 3+ and 0 for IHC
2+, indicating that the IHC score clearly discriminated responders. For
FISH, response rates were 34 percent for FISH-positive specimens and seven
percent for those that were FISH-negative (Vogel CL, et al. J
Clin Oncol. 2002;20:719-726).
A study by the Cancer and Leukemia Group B demonstrated the equivalence
among IHC and FISH, as well as polymerase chain reaction, in determining
HER2 status for clinical purposes. Among patients with node-positive breast
cancer, all three methods demonstrated a similar prognostic relationship
for disease-free survival and overall survival. All methods were also
equivalent in predicting therapeutic response to high-dose doxorubicin
adjuvant therapy in patients with HER2-positive tumors; none of the methods—or
combinations of methods—was statistically superior (Dressler LG,
et al. J
Clin Oncol. 2005;23:4287-4297).
Based on his Mayo Clinic experience, Dr. Roche says that high-volume
laboratories experienced at doing both assays can achieve greater than
92 percent correlation between IHC 3+ and FISH amplification. Some people
believe that a few percent of IHC 3+/FISH-negative cases are true positives,
overexpressing HER2 protein by some mechanism other than gene amplification.
Further information about the reliability of IHC comes from the National
Surgical Adjuvant Breast Project B-31 trial, for which preliminary results
were announced this year at the meeting of the American Society of Clinical
Oncology. In this trial, trastuzumab is being evaluated in the adjuvant
setting for women with primary breast cancer and no evidence of metastatic
disease. Women who are HER2-positive are randomized to get standard chemotherapy
plus or minus trastuzumab. Patients initially became eligible for enrollment
after HER2 testing in their local hospitals. After the first 104 cases,
Dr. Bloom says, the central laboratory retested the blocks. "What they
found alarmed them big time," he says. About 20 percent of cases called
IHC 3+ by laboratories other than reference laboratories turned out to
be negative, with the miscall rate rising to 50 percent for laboratories
using a method other than Herceptest. "Their initial response was that
IHC is worthless," Dr. Bloom says.
But the NSABP set up a program to certify laboratories that do more than
100 cases per month who could show that their results correlated to FISH.
Many laboratories met this criterion and their results were accepted as
qualifying patients for enrollment. Review of the next 240 cases found
that the error rate for IHC 3+ from these certified laboratories was two
percent. "Basically this showed that once you know how to do the IHC assay,
you can do it very reliably and reproducibly," Dr. Bloom says. This finding
also shows the value of a certification mechanism.
When FISH-positive specimens from hospital laboratories were repeated
in the central laboratory, the discrepancy rate was 4.5 percent. "So the
argument that FISH is easier to do or more robust did not hold in a real-world
setting, although the concordance was still excellent," Dr. Bloom says.
Dr. Thor draws a similar conclusion about FISH. "Some data from large
cooperative groups such as NSABP suggest that small laboratories that
don’t perform FISH a lot may not perform as well as laboratories that
have been certified," she says, which suggests the value of certification
for FISH, too. "Experience and setting up proper controls are equally
important [to do FISH accurately] as with IHC.
"The most important factor in doing either test reproducibly is to follow
the guidelines with the kits as closely as possible," Dr. Thor continues.
She especially advises against shifting to more-sensitive secondary and
tertiary reagents, which can lead to calling normal levels of expression
as overexpression. "CAP guidelines are very useful in that regard," she
says. The CAP is now doing HER2 testing Surveys. "When we get those data,
they will be important information for pathologists. They will show us
how we’re doing."
At the University of Oklahoma Health Science Center, only two pathologists
read out IHC HER2 samples. "I do think it is important that whoever reads
HER2 needs to be very familiar with the criteria of various scores and
understand nuances of reporting," Dr.Thor says. "Reading HER2 is more
demanding than other IHC analytes, such as cytokeratin." It is equally
important to be familiar with scores and reporting when doing HER2 by
FISH. For both assays, positivity should be scored only on the invasive
part of the tumor, not on the in situ component. Two-thirds of insituductal
carcinomas are positive for HER2, whereas only one-third of invasive tissues
are.
Dr. Bloom predicts that FISH results will become more problematic as
more small laboratories bring up the assay. "Smaller laboratories are
getting pressured to set up FISH on their own," he says. Community oncologists
perceive FISH as being easy, he finds, "like connecting the dots." In
actuality, it is like any other staining test, except that DNA is more
stable than protein. He sees autofluorescence of native proteins as a
confounder. "Depending on your training level and experience, you can
easily overcall that autofluorescence as a positive gene signal."
Dr. Allred acknowledges the "groundswell for FISH."
"Certainly there is some push from oncologists to do FISH because of
the misconception that it is more accurate. If you get a chance to educate
them, you can often change their minds," he says. Dr. Bhan has experienced
"a drive for FISH from clinicians." To standardize results, he and a colleague,
Elena Brachtel, MD, do all the HER2 IHC at Massachusetts General Hospital.
By protocol, they do FISH on excisional biopsies only for IHC 2+ staining.
"But it does get changed by demands of clinicians," he says. "Our protocol
is getting a bit modified because of reports of a small number of cases
that don’t show overexpression by IHC but that may show gene amplification.
So the trend at our institution now is to do FISH on all cases at some
time during treatment."
Dr. Thor says some people simply do not believe the literature that suggests
that IHC is sensitive and specific. "I have done personal comparisons
of hundreds of cases and I have never found an IHC zero or 1+ sample that
had HER2 gene amplification." Use of FISH can be influenced by an oncologist
who is eager to use trastuzumab. "But," she repeats, "I don’t know any
data that says using FISH is justified for IHC 0 or 1+ cases."
A relative advantage of FISH is that it is not subject to edge, crush,
or retraction artifacts which can be prominent on core needle biopsies,
while IHC is subject to these artifacts. In edge artifact, the most common
confounder, tissue close to the edges of the specimen stains stronger
than in the center because antibody pools at the sides. "Some people see
3+ along the edge and 1+ in the middle and misinterpret the specimen as
3+ or 2+," Dr. Bloom says. "So they overcall HER2 results." The trend
over the last couple of years to do more testing on core biopsies has
exacerbated this problem. "We need to be very aware and seasoned in HER2
IHC interpretation," Dr. Bloom cautions. He suggests reflexing IHC 3+
scores on core biopsies to FISH in small-volume laboratories or repeating
the IHC on a lumpectomy specimen if it becomes available.
An issue specific to FISH is how to report results. Ideally, any FISH
assay would have a probe for chromosome 17 centromere and a probe for
the HER2 gene, according to Dr. Tubbs. That would enable both gene counting
and calculation of a ratio. "One option to having a CEP 17 probe is to
establish cutoffs that account for polysomic states," he says. That’s
what he is doing for the bright-field assay that is in development. "A
lot of work that we have done with Dr. Tubbs suggests that you don’t really
need a CEP 17 probe," says Ventana’s Dr. Roche. "Cases with low-level
amplification most often result from duplication of chromosome 17, not
from HER2 gene amplification. And they are usually associated with low-level
protein expression." Dr. Bhan at MGH says he doesn’t know of a single
outcome study that settles this question. Thus, he includes raw figures
in his report. Above the line is the ratio result, with values greater
than two being called amplified, while below the line are the two other
figures—number of HER2 genes visualized and number of copies of
chromosome 17. Clinicians can make their own interpretation if they wish.
Dr. Bloom believes that one way to improve IHC scoring on HER2 is to
use image analysis. (He discloses that he is working for a company that
has a vested interest in this technology.) "What we really try to do by
IHC is visually relate the intensity of the HER2 stain to the number of
HER2 receptors on the surface of the cell," he explains. Unfortunately,
while the human eye is good at reading patterns of staining, it is not
so good at detecting subtle changes in staining intensity. A comparison
showed that pathologists using image analysis achieve greater interobserver
agreement and better concordance with HER2 gene status (Bloom K, Harrington
D. Am
J Clin Pathol. 2004;121:620-630).
Dr. Grogan advocates automated performance of IHC or FISH as a way to
increase accuracy. "The advantages of automation are standardization and
transportability of results," he says. "The score should look the same
at every site. Then you can focus in on interpretation." Ventana has an
automated PMA-approved IHC test for HER2. Interlaboratory concordance
was very high with the automated IHC assay, according to Dr. Grogan.
As a potential solution to the either/or debate over IHC and FISH, Dr.
Tubbs predicts that the bright-field assay that he and several colleagues
are developing, which uses enzyme metallography to visualize HER2 genes,
"will very soon make it possible on one slide to use H&E to detect margins
and other tissue features, to do all the things done with IHC, such as
progesterone and estrogen receptors, ki-67 and HER2 protein expression,
and to do bright-field analysis for HER2 gene copy number." And all this
will be possible using a standard microscope. Introducing deposition of
metallic silver to localize HER2 genes "takes bright-field detection to
another level of sensitivity, making visualization of individual gene
copies possible," Dr. Tubbs says.
He and his colleagues recently reported two abstracts on the performance
of the assay, called EnzMet GenePro, one at the 2004 San Antonio Breast
Cancer Symposium and the other at the 2005 U.S. and Canadian Academy of
Pathology meeting. A paper reporting validation of the assay compared
with FISH and its reproducibility among observers is in press in the American
Journal of Surgical Pathology.
"We’re very enthusiastic about chromogenic gene detection," Dr. Grogan
says. "We’re now working on the launch of the chromogenic assay for detection
of the HER2 gene."
Assays that predict prognosis and response to therapy for breast cancer
will become more complicated in the future. "We still don’t know how trastuzumab
works," Dr. Bloom points out. "Our basic screening test is intended to
define who is eligible for trastuzumab therapy, not who will respond."
Predicting response is the next goal. Accumulating data point to key pathways
and downstream molecules that are altered by HER2 overexpression. "Assessing
downstream molecules will most likely be done by IHC," Dr. Bloom believes.
"Not all patients even with high levels of HER2 expression respond to
trastuzumab," Dr. Thor says. At most, HER2-positive tumors show a 50 percent
response rate to trastuzumab plus conventional chemo therapeutic agents.
"Other biological tests are being developed to see who will respond to
trastuzumab," she adds. Dr. Thor explains that HER2 (also called erbB2)
belongs to the receptor tyrosine kinase, or RTK, type 1 gene superfamily.
It can induce signaling independent of a ligand, particularly when in
a highly expressed state. It can also bind to other members of its family—epidermal
growth factor receptor (EGFR, erbB1), erbB3, and erbB4—whose activity
and binding affinity are modulated by ligands. FDA-approved drugs exist
against some of these other targets, including drugs that target more
than one member, such as gefitinib. Trials are underway to determine whether
patients get additional benefit using combined therapeutics.
Dr. Thor notes that EGFR is an especially important family member that
indirectly interacts with estrogen via its upregulation of critical growth
factor ligands that bind to EGFR. "Interactions between hormonal promotion
of tumor growth and RTK-associated promotion of tumor growth is an active
area of investigation," she says. Clinically, patients who have estrogen
receptor-positive cancers that are resistant to tamoxifen often have abnormalities
of EGFR and HER2/erbB2. "So this strong biological signaling link has
potential practical significance," she says.
"Probably four to five years from now," she continues, "we will be using
a whole different testing system." Like Dr. Bloom, she sees future tests
looking at downstream intermediates in the signaling pathway, rather than
the receptors themselves, to see whether the pathways are active. Or measurement
of downstream intermediates may be combined with one agreed-upon test
for HER2.
While agreeing that multi-analyte signatures will be the key to future
breast cancer evaluation, Dr. Tubbs foresees them not only in the form
of protein assays, but as genomic or gene expression signatures, including
expression arrays, array-based comparative genomic hybridization, or both.
Such molecular analyses will be in a "low-complexity" form, he predicts—a
few genes at a time, rather than hundreds or thousands. And such low-complexity
signatures may be readily adapted to conventional IHC and in situ hybridization
formats, but with a twist—slide-based cluster analysis.
He identifies another trend with breast cancer and other tumors: finding
a way to integrate the information we have now, along with low-complexity
genetic signatures, into a weighted pattern that correlates more precisely
with malignant potential, prognosis, and response to therapy. "We in the
pathology community need to think about how to do hierarchical clustering
of those markers," he notes. Dr. Tubbs points to the work of pathologist
David Huntsman, MD, of the University of British Columbia, on hierarchical
clustering of IHC biomarkers as an example of such work (Makretsov NA,
et al. Clin
Cancer Res. 2004;10:6143-6151). "I think that integrated hierarchical
cluster analysis is the future of biomarkers, rather than single analyte
interpretation in an isolated fashion," Dr. Tubbs says.
Summarizing the IHC/FISH debate, Dr. Allred says, "If done correctly,
each assay gives comparable results." And each has an area where it gives
low-level abnormal results, he adds. "Even in good laboratories, those
areas are relatively difficult to interpret. At the end of the day," he
concludes, "FISH is a surrogate for the true target, overexpression of
the protein. So to try to make the case that FISH is a more accurate test
doesn’t make sense."
Dr. Bloom agrees that IHC and FISH are very good tests and that both
require training and experience. For oncologists, he uses this analogy:
"If you asked a radiologist which is better, MRI or CT, the radiologist
would say they are complementary." Somehow, he says, "we need to alleviate
the fight between IHC and FISH in pathology.
"We’ve done a huge disservice in letting oncologists preach to us which
test we should use," Dr. Bloom contends. "As laboratory professionals
it is incumbent on us to step up and say we understand the tests we are
using and that we will choose the best methodology based on the submitted
sample." When he talks to oncologists, he finds most will accept that.
He has seen laboratorians who initially could not convince oncologists
that HER2 testing by IHC is reliable who then did both tests on the next
50 cases and showed a correlation so tight that the oncologists were convinced
they didn’t need to perform primary FISH testing or reflex all IHC-negative
cases. Unfortunately, Dr. Bloom adds, "Our passiveness has allowed oncologists
and others to dictate testing methods. Pathologists reflexing all IHC-negative
cases to FISH are shirking their responsibility as the laboratory expert.
I don’t think that serves the patients best."
William Check is a medical writer in Wilmette, Ill. |