MUTYH-associated polyposis (MAP), caused by biallelic mutations in MUTYH (formerly known as MYH), is characterized by a greatly increased lifetime risk of colorectal cancer (43% to almost 100% in the absence of timely surveillance). Although typically associated with ten to a few hundred colonic adenomatous polyps that are evident at a mean age of about 50 years, colonic cancer develops in some individuals with biallelic MUTYH mutations in the absence of polyposis (Wang et al. 2004). Duodenal adenomas are found in 17%-25% of individuals with MAP; the lifetime risk of duodenal cancer is about 4%. Also noted are a modestly increased risk for rather late-onset malignancies of the ovary, bladder, and skin, and some evidence for an increased risk for breast and endometrial cancer. More recently, thyroid abnormalities (multinodular goiter, single nodules, and papillary thyroid cancer) have been reported in some studies. Some affected individuals develop sebaceous gland tumors.
Biallelic mutations in MUTYH have been found to account for approximately 10% of polyposis patients, but <1% of all colorectal cancer (Halford et al. 2003; Wang, Baudhuin et al. 2004). The largest population study to date indicates that approximately 0.2% of all colorectal cancer is caused by biallelic mutations in MYH (Webb et al. 2006). It was demonstrated in the same study that monoallelic MUTYH mutations are not associated with an increased risk of colorectal cancer. The MAP phenotype is similar AFAP, with extra-colonic manifestations consisting of duodenal polyps but not intra-abdominal desmoids although occasionally patients may have up to several thousand polyps. Among Caucasians approximately 80% of mutations in MUTYH causing MAP are Y165C or G382D (Sieber, Lipton et al. 2003).
The E466X mutation is a common founder mutation among Pakistani populations. The family tree above shows 2 Pakistani brothers who were affected with colorectal cancer and polyps in their 30s due to this founder mutation. Y90X is a founder mutation in Indian populations (Sieber, Lipton et al. 2003).
Around 25–30% of polyposis cases with more than 20 polyps and without evidence of a dominant inheritance pattern, in whom genetic analysis has not identified an APC mutation, are due to bi-allelic mutations in the base excision repair (BER) gene, MUTYH. Polyps can be exclusively adenomatous or mixed adenomatous/hyperplastic. Since the mode of inheritance is autosomal recessive, lack of vertical transmission of the polyposis phenotype in the family should raise the possibility of MUTYH-associated polyposis (MAP). Siblings are at 25% risk of carrying bi-allelic deleterious mutations. Children of a bi-allelic carrier are at high risk if the other parent also carries at least one mutant allele. Large, systematic studies of MUTYH mutation frequency in colorectal cancer cases and controls suggest penetrance in bi-allelic carriers is very high, and probably >90%.
The heterozygote carrier frequency in the UK is around 2% and around 1:10 000 homozygous or compound heterozygotes for two MUTYH mutations. The proportion of polyposis syndromes due to MUTYH in clinical practice is less clear because studies have so far focused on selected research case series of multiple polyps that have been screened negative for APC mutations. In one study 4% of multiple polyp cases (3–100) and 8% of APC mutation negative polyposis cases carry MUTYH mutations.
Damaged DNA is repaired by several mechanisms, one of which involves a family of enzymes involved in base-excision repair (BER). The MUTYH gene (also known as MYH) encodes a DNA glycosylase involved in the repair of the oxidative lesion 8-oxoguanine, a by-product of cellular metabolism and oxidative damage of DNA.
The products of three BER repair genes, OGG1, MTH1 and MYH work together to prevent 8-oxo-G induced mutagenesis. Mutations in MUTYH cause an autosomal recessive colorectal cancer and polyposis syndrome MYH-associated polyposis (MAP; OMIM 608456) (Al-Tassan et al. 2002). Somatic mutations in the APC gene in polyps from individuals affected with MAP are almost invariably G to T transversions (Sieber et al. 2003), and it was by understanding the underlying DNA repair mechanism of this mutation, base-excision repair, that MUTYH was identified as a candidate-predisposition
gene. G to T transversion mutations were also identified in KRAS in codon 12 (Lipton et al. 2003). The adenoma to carcinoma pathway in MAP does not involve BRAF V600E, SMAD4 or TGFBIIR mutations, or microsatellite instability, and the cancers are near-diploid (Lipton, Halford et al. 2003). Thus, tumours with germline MUTYH mutations tend to follow a distinct pathway.
Colorectal surveillance & screening
Treatment of manifestations: Suspicious polyps identified on colonoscopy should be removed until polypectomy alone cannot manage the large size and density of the polyps, at which point either subtotal colectomy or proctocolectomy is performed. Duodenal polyps showing dysplasia or villous changes should be excised during endoscopy. Abnormal findings on thyroid ultrasound examination should be evaluated by a thyroid specialist to determine what combination of monitoring, surgery, and/or fine needle aspiration (FNA) is appropriate. Surveillance: Individuals with biallelic MUTYH germline mutations: Evaluation of relatives at risk: Offer molecular genetic testing for the familial mutations to all siblings of an individual with genetically confirmed MAP in order to reduce morbidity and mortality through early diagnosis and treatment.
Counselling: MAP is inherited in an autosomal recessive manner. At conception, each sib of an affected individual has a 25% chance of being affected, a 50% chance of being a carrier with a small increased risk of CRC, and a 25% chance of being unaffected and not a carrier. Carrier testing for at-risk family members and prenatal diagnosis for pregnancies at increased risk are possible if the disease-causing mutations in the family have been identified.
UK Recommendations: Large bowel surveillance colonoscopy every 2–3 years is recommended from age 25 years for patients who are biallelic MUTYH carriers (or homozygous carriers of other BER gene defects). Colonoscopy is the preferred modality because of the likelihood of polyps requiring polypectomy.
Experience is limited because the role of MUTYH and other BER genes has only relatively recently been demonstrated. Hence, available evidence comes from pooled descriptive experience and opinion. However, there is a substantial colorectal cancer risk for those who are bi-allelic carriers. Although indirect evidence suggests colonoscopic surveillance and polypectomy may be effective in colorectal cancer control, this has yet to be definitively determined. Indeed, we are not aware in the literature to date of any control subjects with bi-allelic MUTYH mutations who have reached the age of 55 years without developing colorectal cancer or polyposis . Hence, the risk may be sufficiently high to merit at least considering prophylactic colectomy and ileorectal anastomosis or even proctocolectomy and ileo-anal pouch if dense rectal polyposis is a feature. The patient should be counselled about the limited evidence available to guide decisions on either surveillance or pre-emptive surgical strategies
The hyperplastic polyp and serrated adenoma pathway
The first series of mixed hyperplastic-adenomatous polyps were described in 1990 (Longacre and Fenoglio-Preiser 1990), and have been an increasingly recognised phenomenon. Most hyperplastic polyps have no malignant potential, although there is now some have malignant potential, especially those with serrated architecture (sessile serrated adenomas – SSAs), large hyperplastic polyps, mixed polyps and polyps on the right side of the colon (Torlakovic et al. 2003).
Classification of hyperplastic polyps
A new understanding of the clinical relevance of hyperplastic polyps has emerged over the past decade (Young and Jass 2010). The simple hyperplastic polyp has itself been subclassified into a goblet cell variant and a microvesicular variant, the latter appear to be the precursors for serrated adenomas and thus colorectal cancer. Serrated adenomas usually have serrations low in the crypts which help differentiate them from hyperplastic polyps.
However, serrated polyps also include a broader spectrum of polyp subtypes ranging from these small common lesions to the recently described sessile serrated adenoma (SS
A), which is often large and proximal with abundant mucin secretion, exaggerated serration, and atypical architecture. Rarer serrated polyp subtypes with unequivocal dysplasia include traditional serrated adenoma (SA), which combines the dysplastic features of an adenoma with the architectural features of a hyperplastic polyp and the mixed polyp (MP) in which separate hyperplastic and dysplastic elements are combined within a single polyp (see figure). SSA, SA, and MP are described as “advanced serrated polyps” and comprise ∼5% of all serrated polyps retrieved in colonscopy patients. Importantly, these advanced serrated lesions show frequent BRAF mutation and widespread DNA methylation.
Endoscopic appearances of serrated adenomas
Serrated and hyperplastic polyps present endoscopic features that could help to differentiate them from adenomatous polyps. HPs appear flat and pale and are often covered by a thin film of mucus. They exhibit Kudo type 2 pit pattern typically. As they are not highly vascular they will appear pale compared to surrounding mucosa using narrow-band imaging (NBI). In addition, chromoendoscopy may be helpful in the endoscopic characterisation of these lesions.
Inherited Colorectal Cancer Syndrome?
An inherited hyperplastic polyposis syndrome (HPS) has also been increasingly recognised (Cohen et al. 1981; Sumner et al. 1981). It is now more commonly known as serrated polyposis syndrome. There is no sex predominance and the mean age at diagnosis is around 55 years. HPS has largely been considered a genetic disease, but the pattern of inheritance remains unknown: both autosomal recessive and autosomal dominant patterns have been suggested. Environmental factors could be partially responsible for the phenotypic differences and model the unknown pattern of inheritance. Smoking, being overweight and some drugs have been postulated as potential risk factors of HPs.
In HPS, multiple serrated polyps develop in the colorectum, and approximately 50% of cases present with at least one CRC (Ferrandez et al. 2004; Young and Jass 2006). Boparai et al (2011) have recently described an increased risk of CRC [relative risk (RR) = 5.4] and HPS (RR = 39) in first-degree relatives of probands diagnosed with HPS compared to the general population. Estimates for CRC risk associated with serrated polyposis may range from 7% to 50% and vary with phenotype.
Classification of the Syndrome
In the WHO criteria, Burt and Jass defined HPS as at least five HPs proximal to the sigmoid colon, two of which are > 1 cm diameter, or more than 30 HPs at any site in the large bowel (Burt 2000). Rashid et al, however, used a different classification system, in which HPS was defined as any person with more than 20 HPs, and separate classes were used for patients with large (>1 cm diameter) or multiple (5-10) HPs (Rashid et al. 2000). These differing classification systems reflect a syndrome which may be both genetically and phenotypically heterogeneous, but one which is becoming increasingly recognised.
The serrated pathway to colorectal cancer
Some evidence suggests that some but not all of these tumours develop along a ‘serrated pathway’ separate from the classical adenoma-carcinoma sequence (Sawyer et al. 2002; Spring, Zhao et al. 2006). This serrated pathway involves one group who accumulate BRAF V600E mutations and another separate pathway which involves KRAS mutations(Carvajal-Carmona et al. 2007). In addition the tumours often have methylation of the MLH1 promoter with subsequent microsatellite instability, and other genes such as P16, MGMT, or IGFBP7 may also be epigenetically inactivated. The CIMP phenotype identified by increased levels of methylation in the CpG island marker MINT31(Jass 2005).
HPS (sometimes known as the ‘serrated pathway syndrome’ or ‘serrated pathway syndrome’ (SPS) and sometimes ‘Jass Syndrome’) may, in fact, be a heterogeneous group of conditions leading to sporadic and inherited cases of colorectal neoplasia. There are two alternative clinical criteria for the diagnosis of HPS families (Burt 2000; Rashid, Houlihan et al. 2000). This syndrome is usually associated with somatic mutations in either BRAF or KRAS, but not both together (Carvajal-Carmona, Howarth et al. 2007), providing further evidence of molecular as well as phenotypic heterogeneity. BRAF mutations are associated with low-grade microsatellite instability due to methylation in CpG islands (CIMP)(Young, Jenkins et al. 2007). This may result in loss of expression of DNA repair genes MLH1 and MGMT (O(6)-methylguanine-DNA methyltransferase) in dysplastic mixed polyps from HPS patients, possibly as a result of promoter methylation (Oh et al. 2005).
Linkage analysis in a large family affected with hyperplastic polyposis syndrome demonstrated a maximum parametric LOD score of 2.71 on the short arm of chromosome 8 (8p.21; Monahan et al 2007). Another group have identified genetic linkage to chromosome 2q32.2-q33.3 with a LOD score of 2.07 (Roberts et al 2011 Fam Cancer).
As of this time however, there is no known causative germline mutation responsible for this condition, therefore genetic testing for predisposition is not possible. Because the natural history of HPS is poorly understood, colonoscopic screening guidelines have not been developed. Empirically we recommend 5 yearly colonoscopic screening from the age of the earliest known affected relative, or from 45 years of age.
Although inherited susceptibility is responsible for 30% of all CRC (Lichtenstein, Holm et al. 2000), high-penetrance mutations in APC, the mismatch repair (MMR) genes, MUTYH, SMAD4, BMPR1A and STK11 account for <5% of cases (Aaltonen et al. 2007). The nature of the residual inherited susceptibilityto CRC is at present undefined, but a model in which high-riskalleles account for all of the excess inherited risk seems improbable.It is likely that the remaining CRC inherited risk is largely accounted forby common, low penetrance alleles. These alleles may either predispose directly to colorectal tumourigenesis or may have an additive effect on predisposition. Candidate alleles studied include variants on known tumour suppressor genes, oncogenes, DNA repair genes, folate metabolising genes, and others.
The APC I1307K variant is present in about 6% of Ashkenazi Jews,but is much rarer in those of other ethnic groups. I1307K createsan A8 tract (eight consecutive adenine residues) which appears to be somatically unstable, leadingto frameshift mutations (Laken et al. 1997). The tumour risk associated with I1307K has been controversial, but most recent reports suggest that it has a relatively small effect (perhaps only 1.5-fold risk of colorectal cancer), suggesting that the A8 tract is only modestly hypermutable (Gryfe et al. 1999).
A number of other low-penetrance alleles have been found with varying degrees of evidence and importance (table 1.1). The ability to identify these genes and to understand their interactions with other relevant environmental and genetic factors remains important however. It will help to stratify an individual patient’s risk for entry into surveillance programs and to reveal causative factors, allowing more effective prevention strategies.
To date a number of genome-wide association studies have been performed in breast (Easton et al. 2007; Stacey et al. 2007; Stacey et al. 2008), lung(Amos et al. 2008), prostate (Gudmundsson et al. 2007; Gudmundsson et al. 2007; Eeles et al. 2008; Gudmundsson et al. 2008), melanoma (Gudbjartsson et al. 2008) as well as colorectal cancer (Broderick et al. 2007; Tomlinson, Webb et al. 2007; Jaeger, Webb et al. 2008; Tomlinson et al. 2008). Most of these studies have been published over the last 2 years. The odds ratios for the loci identified range from 1.1 to 1.75, the majority having an odds ratio <1.5 (Easton and Eeles 2008). There has been a certain amount of replication between these studies, particularly for the locus 8q24 which has been associated with risk of breast, prostate and colorectal cancer in separate studies. However results so far suggest that these loci account for a small proportion of the overall risk.
It is difficult to speculate on the true function of these risk alleles. There appears to be very little epistasis between the 28 loci identified in these 5 cancer types. None of these loci are involved in DNA repair, frequently a cause of susceptibility to higher penetrance loci. This may underlie why so many case control studies have failed to yield significant results consistently, as the underlying hypothesis may have been inaccurate. One might speculate that many of the associations may be driven through their effects on gene expression, particularly as many lie in gene-poor regions.
Most GWAS have not been empowered to detect the effects of polymorphisms with minor allele frequencies (MAFs) <0.05; such variants are therefore sometimes included in the rare variant class. More often, rare variants are considered to be subpolymorphic (MAF <0.01), with very rare or ‘private’ variants having MAF <0.001. Clearly much of the distinction between ‘common disease-common variant’ and ‘rare variant’ models is arbitrary. Nevertheless it is probably worth arbitrarily defining them in order to illustrate important differences between common and rare variants models, in terms of gene discovery and possible clinical relevance. For example, the significance of rare variants is such that they are likely to have more biological impact than common variants, having arisen more recently in evolutionary terms (Bodmer and Bonilla 2008).
Rare variants will not be detectable by population association studies based on the use of linked polymorphic markers, even with very large case/control cohort studies. This is because of low allelic frequency and individually small contributions to the overall inherited susceptibility of a disease. These variants are less common than those studied in association studies (i.e. minor allele frequency (MAF) <0.05) but not as rare as obvious mutations (MAF >0.01), although such mutations may also be identified. Finding rare variants requires nomination of candidate genes likely to have a role in disease aetiology, which are then directly screened for sequence variants which may affect protein function. This is known as the ‘common-disease/rare-variant’ hypothesis (Pritchard 2001).
So far there have been few rare variants identified in colorectal cancer, partially because candidate genes are not easily identified, and because there have only been a few studies performed. In one such study variants in APC I1307K and E1317Q, in AXIN1, CTNNB1, and the mismatch repair genes hMLH1 and hMSH2 were more common in 124 multiple adenoma cases than in controls (Fearnhead et al. 2004). Studies of other candidate genes have produced results of low or no significance however (Dallosso et al. 2008; Zogopoulos et al. 2008).
Labelling APC I1307K a rare variant may not be accurate, as the frequency of the polymorphism in the Ashkenazi population where it is present is 6%, thus potentially suitable for large association studies. This distinction underlines the arbitrary nature of how such polymorphisms are labelled as rare or common variants.
Although the population attributable risk (PAR) of rare variants may be relatively high, the relative influence of these common variants is low, with reported odds ratios below 2 and peaking at approximately 1.2 (Easton and Eeles 2008). Most rare variants have odds ratios a little higher than 2 but not above 5, with a mean of 3.7 in observations thus far (Bodmer and Bonilla 2008). Their individual contributions are small, and they do not give rise to familial concentrations of cases. As techniques improve to interrogate genetic sequence in an inexpensive, high-throughput and efficient manner this method of identifying variants is likely to generate a higher yield of significant results in the near future.
A candidate gene approach demonstrated rare novel low penetrance breast cancer predisposition loci in three genes, PALB2, BRIP1, and RAD51C. (Seal et al 2006; Rahman et al 2007; Meindl et al 2010). This discovery was assisted by the identification of breast cancer cases in Fanconi Anaemia pedigrees. In general however, it is not a simple task to prioritize candidates for rare variant studies. In the short term, it is likely that discovery efforts will be focused largely on sequencing candidate genes. Nevertheless, it is becoming feasible to sequence entire genomes to discover variants, due to decreased costs and increased efficiency of such methods. In a proof of principle study, complete exomic sequencing of a patient with familial pancreatic cancer identified a germline truncating mutation in PALB2 which appeared responsible for this individual’s predisposition to the disease (Jones et al 2009), although mutations in this gene are thought to be rare events in familial pancreatic cancer (Tischkowitz et al 2010).
The above mentioned rare variant loci for breast cancer in PALB2, BRIP1, and RAD51C were present in 10, 8 and 2 cases and 0, 1 and 0 controls respectively. Due to lack of power rare variants are difficult to validate by frequency alone in an association-type study. If we assume that a single variant or a set of related variants (for example, in the same gene) occurs at a general population frequency of 0.01–0.001, as many as 1000 unselected cases or controls will be required to detect with probability of about 0.7 more than one variant in a discovery screen (Bodmer & Tomlinson 2010).
Nevertheless, in principle the more common a variant is in the population the less its biological impact, thus allowing it to be passed on through generations without affecting reproductive ability. Rare variants are likely to reveal more about the pathophysiology of the disease process than common variants, as they are likely to have functional significance, as opposed to common variants which are probably in linkage disequilibrium with the causative mutations.
However it is more problematic to design useful studies of rare variants, as random variation identified cannot be readily assumed to be of functional significance, for example over 1500 variants of uncertain significance (VUSs) have been identified in BRCA1 using a sequencing based approach in breast cancer cases. The difficulty with rare variant discovery, particularly with whole exomic sequence analysis, will be to sort out the candidate functional variation from an almost overwhelming background of functionally irrelevant variation. The choice of targets will, in general, require some a priori assessment of functional effects. In silico biometric approaches have been developed with increasing predictive ability, although in vitro demonstration of effects are generally preferable in order to determine functional effects, for example simple effects on expression or protein truncation.
Studying a cohort of affected cases and subsequently examining a control set for variants identified can cause ascertainment bias. Thus it would be preferable to search for them in affected individuals and controls with equal rigour, and to use a statistical framework to determine whether variants are truly more common in the affected. These studies are likely to require extremely large and/or enriched data sets in order to identify and verify significant rare variants. Nevertheless it is becoming increasingly cost and time effective to perform even whole genome sequencing to determine genetic predisposition to both common and rare disease.
A copy number polymorphism (CNP) in MTUS1 was found to be associated with breast cancer predisposition (Frank et al. 2007), but not colorectal cancer (Monahan et al 2008). Recently, multiple studies have discovered an abundance of germline copy number variation (CNV) of DNA segments ranging from small to large chromosomal segments (e.g. Down syndrome results from trisomy 21), probably encompassing over 12% of the human genome (Redon et al. 2006). These include deletions, insertions, duplications and complex multi-site variants. The extent and role of these copy number polymorphisms (CNPs) is increasingly understood with the development of new techniques which allow us to identify such variation (Lupski 2007).
Many new CNPs have been identified from studies using whole genome SNP chips (Redon et al. 2006). However, the extent of linkage disequilibrium between SNPs and CNPs is unclear. The biological impact of these types of variation, for example on gene expression, is strikingly different. Expression profiles from SNPs and CNPs had little overlap (Stranger et al. 2007). Multiplex ligation-probe amplification (MLPA) has revealed complex whole exon duplications and deletions in APC which lead to the classic FAP phenotype (Schouten et al. 2002; McCart et al. 2006; Pagenstecher et al. 2007). High penetrance conditions such as FAP are rare whatever the type of mutation may be, e.g. point mutations or exon CNV. In theory, complex disease might be more susceptible to subtle, lower penetrance forms of variation which alter whole gene copy number without disabling gene function. In addition, the impact of individual CNPs may be even subtler, with disease phenotype being caused by combinations of low penetrance alleles.
Identification of significant CNPs is thus far hampered by the cost of performing such studies and the lack of techniques available. Genome wide association studies using SNPs are better at identifying deletion copy number variation that duplication (Locke et al. 2006). The new generation arrays (e.g. the Affymetrix 5.0 and 6.0, and Illumina 1 M) are being designed to offer the potential to simultaneously interrogate SNPs and CNPs in a single experiment. However, it may be that more comprehensive genome wide CNP maps are first required with the level of detail for CNPs that the Hapmap project provided for SNPs, before such genome wide CNP arrays are truly useful.
Much as SNPs can be either common or rare variants, so can CNPs. Using a comparative genomic hybridisation (aCGH) platform, a large study concluded that these CNVs are well tagged on existing SNP platforms and probably contribute little to disease predisposition (Craddock et al 2010). However this study was limited by the selection of CNVs and did not examine the impact of rare CNVs. While genome-wide association using common CNPs may be a potentially useful method to elucidate predisposition caused by such CNPs, this technique is not useful for such rare variants. The true role of these variants are as of yet of undetermined importance in human disease.
When a Mendelian cancer predisposition gene is first identified, much of the evidence of it’s linkage to the phenotype derives from the finding of several different variants in that gene that
Conversely the finding of a statistical association of low penetrance alleles with disease in association studies does not necessarily prove that the underlying variant has biological consequence such as causing low-penetrance predisposition. The likely disease-causing locus (with which the polymorphism is in linkage disequilibrium) has rarely been identified. IGF1 microsatellite and the TSER TYMS polymorphisms may be in linkage disequilibrium with a sequence variant which alters gene expression Monahan et al 2009). In a number of recent genome-wide and candidate gene association studies performed, the downstream effect of such variation on RNA and protein function is largely unknown. Nevertheless identification of a germline mutation in linkage disequilibrium with predisposition alleles has remained elusive and it is felt that allele-specific expression may be an important aetiological factor in colorectal cancer predisposition, particularly as many observed significant variants are not close to any known coding regions (Houlston et al. 2008; Valle et al. 2008). A SNP in SMAD7 whilst strongly associated with colorectal cancer risk was not found to alter expression of the gene despite lying in the 3’UTR region of the gene (Broderick et al. 2007). This study may have been limited by the effects of tissue-specific expression as it was performed on lymphoblastoid cell lines derived from cases. In contrast colorectal cancer associated locus 8q24 lies in a gene desert but contains regulatory elements of MYC, and this region preferentially binds TCF4 the primary target of the canonical Wnt signalling pathway (Tuupanen et al 2009; Pomerantz et al 2009).
Whilst association studies may not easily reveal germline mutations, quantitative and qualitative gene expression studies may be a useful direction for future studies.
Understanding proteomics may be used to yield information as to epistasis between genes as protein-protein interactions are amongst the most important determinants of interaction between genes. However, in variants identified to date there appears to be very little epistasis (Houlston et al. 2008). There have been some significant advances in the understanding of diseases such as Crohn’s disease (Parkes et al. 2007) and Coeliac disease (van Heel et al. 2007) due to the results of non-hypothesis driven association studies. A number of low-penetrance loci have been linked to specific biological pathways with likely biological relevance in these conditions. Five of the 10 SNPs identified by GWAS of colorectal cancer are in close LD with genes of the TGF/BMP signalling pathway including SMAD7, BMP2 and BMP4. In the next few years research is likely to reveal further advances in our understanding of the role of both common and rare low penetrance alleles in colorectal cancer by analysing the associated effects on expression and protein function, and by the identification of disease causing mutations.
Recently published data analysis from the CAPP2 study demonstrates significant modification of colorectal cancer risk in Lynch Syndrome patients by aspirin (Burn et al 2011). Thus even high penetrant syndromes may be modifiable by the environment. A priori, environmental agents are even more likely to modify lower penetrance genetic risk factors. An association of smoking-related cancers with polymorphisms at the cancer susceptibility locus 8q24 (identified by genome-wide association) has been suggested (Park et al. 2008). When the odds ratios for predisposition alleles are well below 1.5 there is a possibility of interaction (or bias) through an unmeasured environmental factor, as in the context of lung cancer risk and association with 15q which contains the nicotinic acetylcholine receptor (Chanock and Hunter 2008). Furthermore, the role of gene-environment interactions remains poorly defined and a reductionist approach to understanding the aetiology of colorectal neoplasia means that few such studies exist. Naturally common low penetrance susceptibility alleles will individually contribute little to overall risk, and it is likely that environmental ‘modification’ by smoking, exercise, body habitus, diet, etc. will provide a more complete explanation of what drives normal colonic crypts along the pathway to cancer. Indeed the odds ratios for environmental risk factors are comparable to many low penetrance alleles.
It is likely that combining data from genetic and environmental studies will provide clinicians with an increasingly powerful tool to understand and individual patient’s risk and tailor an appropriate management plan, whether this be colonoscopic screening, genetic testing, or lifestyle modification. It has been proposed that this data may be used in future in association studies in a two-step process whereby patients are first screened for epidemiological risk factors before entering the genotyping analysis (Murcray et al. 2009).
In 1997, the ColoRectal tumour Gene Identification(CoRGI) Study Consortium was formed to ascertain and collect biologicalsamples and data from families segregating colorectal cancer, in order to identify novel predisposition genes. This study led by Prof Ian Tomlinson has largely been undertaken in this laboratory by colleagues. Families and individuals are being collected with the following entry criteria;
Families were collected from centres throughout England, Scotland and Ireland.
CORGI 1 – Linkage Analysis: A genome wide linkage analysis has been performed on 69 families with a history of bowel cancer and/or polyps using the GeneChip Mapping 10K Xba 142 arrays containing 10 204SNP markers (Kemp et al. 2006). Families in this study had at least 2 individuals (except parent/child) affected. A maximum non-parametriclinkage statistic of 3.40 (P=0.0003) was identified at chromosomal region 3q21–q24. The Galway family is the largest pedigree with over 29 informative meioses, and a decision was taken for it to be studied separately (Chapters 3 and 4).
CORGI 1b A second similar set of 34 families has been collected. Linkage analysis was performed by colleagues which confirmed linkage at 3q22 (Papaemmanuil, Carvajal-Carmona et al. 2008).
CORGI 1c Approximately 100 families where siblings are affected are being collected for sib-pair analysis.
CORGI 2 – Genome Wide Association (GWA): CORGI 2 is a GWA study using an Illumina SNP platform on cases with the same entry criteria as CORGI 1 but without a family history. Colleagues initially genotyped 550,163 tag SNPs in 940 individuals with familial colorectal neoplasia and 965 controls using the Illumina Infinium platform. (Tomlinson, Webb et al. 2007). In CORGI 2b Approximately 42000 candidate SNPs with most significant association in CORGI 2 are being re-tested in a group of ~ 3000 colorectal cancer patients. Several loci which contain SNPs associated with colorectal cancer susceptibility (at 8q23, 10p14, 11q24, 15q13.3 and 18q21) have been recently identified by colleagues in this cohort (Broderick, Carvajal-Carmona et al. 2007; Tomlinson, Webb et al. 2007; Jaeger, Webb et al. 2008; Tenesa et al. 2008; Tomlinson, Webb et al. 2008). However no mutations have yet been identified at these loci with proven functional relevance.
CORGI 3 – Candidate gene screening: Genes in the CORGI 2 patient cohort are being screened for sequence abnormalities in functionally important genes such as those involved in DNA repair, the Wnt pathway, or other genes involved in the aetiology of colorectal neoplasia. Colleagues are also screening the patients included in CORGI 1 and CORGI 2 for gene mutations the loci identified by linkage or association respectively. Candidate genes EPHB1 and MBD4 have been screened for mutations at 3q21-24 in the CORGI 1 family set but none were found (Kemp, Carvajal-Carmona et al. 2006).
Because of the evidence from adenoma-to-carcinoma sequence model (Morson 1968; Fearon and Vogelstein 1990) the National Polyp Study (Winawer et al. 1993) and other prospective studies (Dove-Edwin et al. 2005; Dove-Edwin et al. 2006) we know that if polyps are removed during colonoscopy, cancer may be prevented. Thus colorectal cancer is one of the most preventable of all cancers, and some early evidence is emerging that colonoscopic screening may reduce colorectal cancer related mortality (Baxter et al. 2009). However, national colonoscopic screening programs are expensive, stretching the capacity of already busy services and therefore do not reach the whole population they target. In addition to lifestyle modification advice to reduce environmental risk factors, it may be possible to identify two groups of patients with inherited risk by understanding the underlying molecular aetiology.
(Copyright, Dr Kevin Monahan)