Research

rs4646903 — CYP1A1 *2A (MspI, T3801C)

Regulatory variant in the 3'-flanking region of CYP1A1 that increases gene inducibility, producing more carcinogen-activating enzyme in oral and airway tissues upon tobacco smoke or dietary PAH exposure

Moderate Risk Factor Share

Details

Gene
CYP1A1
Chromosome
15
Risk allele
G
Consequence
Regulatory
Inheritance
Codominant
Clinical
Risk Factor
Evidence
Moderate
Chip coverage
v3 v4 v5

Population Frequency

AA
64%
AG
32%
GG
4%

Ancestry Frequencies

east_asian
42%
latino
34%
south_asian
33%
african
24%
european
11%

See your personal result for CYP1A1

Upload your DNA data to find out which genotype you carry and what it means for you.

Upload your DNA data

Works with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.

CYP1A1 *2A — The Oral Tissue Induction Amplifier

Cytochrome P450 1A1 (CYP1A1) is the cell's primary weapon against polycyclic aromatic hydrocarbons (PAHs)11 polycyclic aromatic hydrocarbons (PAHs)
Flat, multi-ringed carbon compounds formed by incomplete combustion — found in cigarette smoke, grilled meat, chewing tobacco, and betel quid
— but also the enzyme that converts those compounds into reactive, DNA-damaging epoxides. Whether CYP1A1 is protective or dangerous depends on whether Phase II enzymes (like glutathione S-transferases) are present to quickly neutralize what CYP1A1 generates. The rs4646903 variant (*2A, MspI) sits in the 3'-flanking non-coding region of the gene and modulates how much CYP1A1 protein gets made when carcinogens arrive — making it especially relevant for oral tissues directly exposed to tobacco and dietary PAHs.

The Mechanism

Unlike rs1048943 (Ile462Val), which increases the catalytic speed of individual CYP1A1 enzyme molecules, rs4646903 acts at the level of gene inducibility22 inducibility
How strongly the gene responds when its transcription is switched on by the aryl hydrocarbon receptor (AHR) upon carcinogen exposure
. The T-to-C transition (T3801C in traditional numbering, reported as A→G on the genomic plus strand) in the 3'-flanking region is thought to increase mRNA stability or alter post-transcriptional regulation, resulting in higher CYP1A1 protein levels following AHR activation by PAHs or dioxins. In practical terms, when oral epithelial cells encounter tobacco smoke constituents or dietary charred compounds, carriers of the C (G on plus strand) allele produce more CYP1A1 enzyme — and therefore more reactive PAH epoxides — than carriers of the reference T (A) allele.

This variant often occurs on the same chromosome as the rs1048943 Ile462Val variant, forming the so-called *2B haplotype33 *2B haplotype
The combination of *2A (MspI, rs4646903) + *2C (Ile462Val, rs1048943) on the same chromosomal copy — both more enzyme quantity and higher enzyme activity, particularly common in East Asian and Latino populations
. When both variants co-occur, the effect is additive: more enzyme produced, and each molecule is more catalytically active.

The *2A variant has no effect on protein structure (it is non-coding), so its clinical relevance is entirely exposure-dependent. Without a PAH trigger — no tobacco, minimal charred food — there is little CYP1A1 to be induced, and the variant's effect is minimal.

The Evidence

Oral and oropharyngeal cancer. A HuGE-GSEC meta-analysis and pooled analysis44 HuGE-GSEC meta-analysis and pooled analysis
Varela-Lema L et al. Meta-analysis and pooled analysis of GSTM1 and CYP1A1 polymorphisms and oral and pharyngeal cancers: a HuGE-GSEC review. Genetics in Medicine, 2008
combining 30 publications (7,783 subjects) found that the MspI homozygous variant genotype (CC/m2m2) was associated with significantly increased oral and oropharyngeal cancer risk: meta-analytic OR 1.9 (95% CI 1.4–2.7), pooled adjusted OR 2.0 (95% CI 1.3–3.1). Notably, the association was significant even among never-smokers (adjusted OR 1.8, 95% CI 1.1–2.9), pointing to dietary PAH exposure as a contributing factor independent of tobacco.

A dedicated meta-analysis of oral squamous cell carcinoma55 meta-analysis of oral squamous cell carcinoma
Xie S et al. CYP1A1 MspI polymorphism and the risk of oral squamous cell carcinoma: Evidence from a meta-analysis. Molecular and Clinical Oncology, 2016
analysed 10 studies (1,505 cases, 1,967 controls) and found CC vs TT: OR 2.52 (95% CI 1.60–3.96). The association was driven by Asian populations; results in Caucasian and mixed-race groups were not significant.

A separate Asian-focused meta-analysis66 separate Asian-focused meta-analysis
Xu JL et al. Association of CYP1A1 MspI polymorphism with oral cancer risk in Asian populations. J Cell Mol Med, 2016
with 12 studies (1,925 cases, 2,335 controls) confirmed the risk concentrated in homozygous carriers (m2/m2): all three genetic models were statistically significant.

Head and neck cancer overall. A 2022 Scientific Reports meta-analysis77 2022 Scientific Reports meta-analysis
Mohammadi H et al. Association between the CYP1A1 MspI polymorphism and risk of head and neck cancer: a meta-analysis. Scientific Reports, 2022
found consistent elevation across five genetic models: allelic OR 1.28 (95% CI 1.09–1.51), homozygous OR 1.68 (95% CI 1.16–2.45), dominant OR 1.66 (95% CI 1.27–2.16).

Laryngeal cancer. A meta-analysis of Asian populations88 meta-analysis of Asian populations
Zeng W et al. CYP1A1 rs1048943 and rs4646903 polymorphisms associated with laryngeal cancer susceptibility among Asian populations. J Cell Mol Med, 2016
(10 studies, 748 cases, 1,558 controls) found GG vs AA: OR 1.53 (95% CI 1.31–2.21); G allele carriers vs AA: OR 1.33 (95% CI 1.04–1.71).

Gene-environment synergy with GSTM1. The interaction is most pronounced when the *2A variant is combined with GSTM1 null (absent Phase II detoxification). A Northeast Indian head and neck cancer study99 Northeast Indian head and neck cancer study
Choudhury JH et al. Tobacco carcinogen-metabolizing genes CYP1A1, GSTM1, and GSTT1 polymorphisms and their interaction with tobacco exposure influence the risk of head and neck cancer. Tumour Biol, 2015
found that carriers of CYP1A1 TC/CC + GSTM1 null genotypes had a 3.52-fold increased risk overall (P<0.001), rising to 6.42-fold in smokers. This Phase I/Phase II imbalance — more reactive intermediates generated, fewer cleared — is the central biological mechanism.

Practical Actions

The *2A variant's clinical significance is almost entirely dependent on PAH exposure. Carriers who avoid tobacco and minimize dietary PAH exposure from charred foods substantially reduce their effective risk. For those who cannot or will not eliminate these exposures entirely, supporting Phase II detoxification capacity (cruciferous vegetables, sulforaphane) may partially compensate for the imbalance. Oral cancer screening awareness is warranted for homozygous carriers with significant tobacco or betel nut exposure history.

Interactions

CYP1A1 rs1048943 (Ile462Val/*2C): These two variants frequently co-occur on the *2B haplotype. When both are present, the combination produces more CYP1A1 enzyme (rs4646903 effect) and each molecule has higher catalytic activity (rs1048943 effect). This double hit amplifies PAH activation capacity more than either variant alone. Compound action proposed: CYP1A1 AG or GG (rs4646903) + CYP1A1 CT or CC (rs1048943) — combined recommendation: maximize PAH avoidance and cruciferous vegetable intake; consider discussing oral cancer screening with a dentist or physician if there is any tobacco or betel nut use history.

GSTM1 null (rs71748309, structural deletion): The gene-environment synergy between CYP1A1 *2A and GSTM1 null is the most clinically important interaction. Both AG and GG carriers of rs4646903 who also carry GSTM1 null show dramatically higher oral and head-and-neck cancer risk, particularly in tobacco users (OR 6.42 in smokers). Compound action proposed: rs4646903 AG/GG + GSTM1 null — avoidance of all tobacco and regular cruciferous vegetable intake as highest-priority combined recommendation.

Genotype Interpretations

What each possible genotype means for this variant:

AA “Standard CYP1A1 Inducibility” Normal

Reference genotype — baseline CYP1A1 inducibility in oral tissues

The AA genotype (TT in papers) represents the wild-type CYP1A1 *1 allele at the MspI locus. CYP1A1 is still induced by tobacco smoke and dietary PAHs in this genotype — but at baseline levels rather than the elevated levels associated with the G allele variants.

Meta-analyses consistently use this genotype as the reference group against which variant genotypes are compared. The HuGE-GSEC review (Varela-Lema et al. 2008) and OSCC meta-analysis (Xie et al. 2016) both report odds ratios relative to TT (AA in plus-strand notation).

Standard recommendations for cancer prevention apply to everyone, including this genotype — but no genotype-specific interventions are indicated.

AG “Increased CYP1A1 Inducibility” Intermediate Caution

One G allele — moderately elevated CYP1A1 induction upon PAH exposure

Heterozygous AG (TC in papers) carriers produce an intermediate level of CYP1A1 induction. In the dominant genetic model used by the 2022 HNC meta-analysis (Liu et al.), AG+GG combined showed OR 1.66 (95% CI 1.27-2.16) for head and neck cancer — suggesting that even one G allele confers elevated risk in the overall population context.

The association is specifically meaningful in the context of PAH exposure. In never-smokers with low dietary PAH intake, the G allele has minimal substrate to act on and the risk increment is modest. The clinical significance rises substantially with tobacco use, particularly cigarette smoking, betel nut chewing, or hookah use.

If this variant co-occurs with GSTM1 null (no glutathione S-transferase M1 to neutralize reactive intermediates), the risk for head and neck cancer in tobacco users rises markedly.

GG “High CYP1A1 Inducibility” High Risk Warning

Two G alleles — substantially elevated CYP1A1 induction upon PAH exposure

With both CYP1A1 copies carrying the *2A variant, PAH-induced CYP1A1 expression is maximal. The HuGE-GSEC meta-analysis (Varela-Lema et al. 2008) found the CC homozygote associated with oral and pharyngeal cancer OR 1.9 (CI 1.4-2.7) in the meta-analysis, and pooled adjusted OR 2.0 (CI 1.3-3.1). The dedicated OSCC meta-analysis (Xie et al. 2016) found CC vs TT OR 2.52 (CI 1.60-3.96). For laryngeal cancer, GG vs AA OR was 1.53 (CI 1.31-2.21) in Asian populations.

Notably, the HuGE-GSEC pooled analysis found that the CC genotype was significantly associated even in never-smokers (adjusted OR 1.8, CI 1.1-2.9), implicating dietary PAH exposure from charred or smoked foods as a relevant independent risk factor. This is one of relatively few genetic risk findings where dietary carcinogen exposure alone is implicated.

When GG co-occurs with GSTM1 null genotype, the imbalance between Phase I activation (maximum) and Phase II detoxification (absent) is at its most extreme. Northeast Indian studies found combined CYP1A1 TC/CC + GSTM1 null raised head and neck cancer risk 6.42-fold in smokers.

If this variant also co-occurs with rs1048943 CC (Ile462Val *2C), forming the *2B haplotype, the effect is amplified further: more enzyme is made (rs4646903 effect) and each molecule has enhanced catalytic activity (rs1048943 effect).

Key References

PMID: 18496222

HuGE-GSEC meta-analysis of 30 studies (7,783 subjects) — CYP1A1 MspI homozygous variant (CC/m2m2) associated with oral and oropharyngeal cancer OR 1.9 (CI 1.4-2.7); pooled adjusted OR 2.0 (CI 1.3-3.1)

PMID: 27073686

Meta-analysis of 10 studies (1,505 cases, 1,967 controls) — CYP1A1 MspI CC vs TT associated with oral squamous cell carcinoma OR 2.52 (CI 1.60-3.96); significant in Asian populations

PMID: 35087125

2022 Scientific Reports meta-analysis of HNC: MspI allelic OR 1.28 (CI 1.09-1.51), homozygous OR 1.68 (CI 1.16-2.45), dominant OR 1.66 (CI 1.27-2.16)

PMID: 26578427

Zeng W et al. 2016 — Meta-analysis of laryngeal cancer — rs4646903 G/G vs A/A OR 1.53 (CI 1.31-2.21) in Asian populations; 10 studies, 748 cases, 1,558 controls

PMID: 27323067

Xu et al. 2016 — Meta-analysis of 12 Asian studies (1,925 cases, 2,335 controls); m1/m1 vs m2/m2 OR 0.46 (CI 0.30-0.70), confirming m2/m2 as the risk genotype for oral cancer

PMID: 25724184

Northeast India HNC study — CYP1A1 TC/CC + GSTM1 null genotypes combined raised HNC risk 3.52-fold (P<0.001), rising to 6.42-fold in smokers