rs8034191 — AGPHD1/CHRNA3 15q25.1 intergenic
Intronic variant near CHRNA3 at 15q25.1 that acts as an eQTL for nicotinic receptor genes and is independently associated with heavy smoking and lung cancer risk
Details
- Gene
- AGPHD1/CHRNA3
- Chromosome
- 15
- Risk allele
- C
- Consequence
- Intronic
- Inheritance
- Additive
- Clinical
- Risk Factor
- Evidence
- Strong
- Chip coverage
- v3 v4 v5
Population Frequency
Ancestry Frequencies
Related SNPs
Category
Brain & Mental HealthSee your personal result for AGPHD1/CHRNA3
Upload your DNA data to find out which genotype you carry and what it means for you.
Upload your DNA dataWorks with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.
The 15q25.1 Regulatory Variant: A Second Lever on Nicotine Dependence and Lung Cancer Risk
On chromosome 15, a busy stretch of DNA called 15q25.1 houses a cluster of nicotinic acetylcholine receptor genes — CHRNA5, CHRNA3, and CHRNB4 — plus several neighboring genes including AGPHD1 (also known as HYKK, hydroxylysine kinase). The rs8034191 variant sits in the third intron of AGPHD1, just upstream of the CHRNA3 gene that codes for the alpha-3 nicotinic receptor subunit. This locus has been among the most replicated findings in lung cancer and nicotine dependence genomics11 This locus has been among the most replicated findings in lung cancer and nicotine dependence genomics
Multiple GWAS identified chromosome 15q25 variants at p-values below 10⁻¹⁸ for lung cancer, and rs8034191 is one of two key tagging variants at this region — the other being rs1051730 in CHRNA3, which is already covered in GeneOps.
Although rs8034191 and rs1051730 are in high linkage disequilibrium (r²=0.91 in Europeans), they are not identical and are partially independent in non-European populations. Understanding rs8034191 adds meaningful information for people of African or Asian ancestry, where the LD structure differs, and helps characterize the regulatory — rather than coding — mechanisms at this locus.
The Mechanism
Unlike rs1051730, which is a synonymous coding change in CHRNA3, rs8034191 is a non-coding intronic variant. Its functional impact operates through gene regulation: rs8034191 acts as a lung cis- and trans-eQTL22 rs8034191 acts as a lung cis- and trans-eQTL
Expression quantitative trait loci alter how much mRNA a gene produces in specific tissues, influencing the expression of multiple nearby genes including CHRNA3, CHRNA5, HYKK, IREB2, and PSMA4 in lung tissue. The C allele is associated with altered expression of these genes independently of smoking status, meaning the regulatory effect is present regardless of tobacco exposure.
The variant also influences [DNA methylation patterns | Epigenetic modification of cytosine bases that silences or activates gene regions] at the IREB2, CHRNA3, and PSMA4 promoters, independently of smoking. IREB2 encodes iron-regulatory protein 2 and has been implicated in COPD pathogenesis. The combined eQTL and methylation effects suggest rs8034191 reshapes the entire transcriptional landscape of this genomic neighborhood.
Approximately 30% of the variant's effect on COPD development is mediated through smoking behavior (pack-years), while the remaining effect is direct — acting through the gene expression changes described above. This distinguishes rs8034191 from a purely behavioral risk factor.
The Evidence
A 14-study meta-analysis33 14-study meta-analysis
Covering 14,075 cases and 12,873 controls from published case-control studies found that the C allele confers OR=1.23 (95% CI: 1.08–1.40; P=0.002) for lung cancer under the allelic model. The effect is strongest in Caucasians (OR=1.22) and African Americans (OR=1.39) and absent in East Asians, where the C allele is rare (frequency ~3%).
A separate 13-study meta-analysis44 13-study meta-analysis
Testing three genetic models — dominant, additive, and recessive confirmed significant associations across all three models: dominant OR=1.34 (95% CI: 1.29–1.41), additive OR=1.61 (95% CI: 1.50–1.73), and recessive OR=1.41 (95% CI: 1.32–1.50). The recessive model finding — indicating that two copies of the C allele confer a disproportionate effect — is a distinctive feature of rs8034191 compared to rs1051730.
For COPD, a genome-wide association study of COPD55 genome-wide association study of COPD
Bergen et al., PLOS Genetics 2009, including subjects from multiple cohorts found rs8034191 significantly associated with COPD (OR=1.29, 95% CI: 1.18–1.41), with the population attributable risk of the C allele estimated at 12.2% for COPD in the general population and 14.3% in current smokers. An independent mediation analysis found that 30% of the COPD effect is mediated by smoking behavior66 30% of the COPD effect is mediated by smoking behavior
Measured as pack-years in path analysis models, meaning 70% operates through direct biological mechanisms.
For smoking behavior, a Canadian cohort study77 a Canadian cohort study
526 women in Northeastern Ontario found that women carrying the CC genotype had a 2.8-fold increased odds of being heavy smokers (>20 cigarettes/day) after adjusting for age, confirming the variant's strong influence on smoking quantity independent of other factors. The variant is also associated with nicotine and opioid dependence severity, demonstrating a pleiotropic role across addictive substances88 demonstrating a pleiotropic role across addictive substances
The same nicotinic receptor locus influences dependence on multiple drugs via overlapping neural circuits.
Practical Implications
If you carry one or two copies of the C allele, your baseline risk for heavy nicotine dependence and lung disease is elevated through both behavioral and biological pathways. The behavioral pathway: C allele carriers experience altered nicotinic receptor activity, making them more vulnerable to progressing from occasional to heavy smoking. The biological pathway: even independent of smoking, the eQTL effects on CHRNA3 and IREB2 expression appear to influence lung tissue susceptibility.
For smokers, the priority is cessation — the evidence consistently shows that quitting reduces lung cancer and COPD risk substantially regardless of genotype. For CC carriers who have never smoked, the direct biological effect (30% of the COPD risk) is still present but substantially lower in absolute terms than for smokers. Lung function monitoring is warranted for CC smokers, particularly spirometry to detect early obstructive patterns.
The variant is particularly relevant for people of African ancestry, where the LD structure between rs8034191 and rs1051730 is weaker, and the two variants provide partially independent information about risk.
Interactions
rs8034191 is in strong linkage disequilibrium with rs1051730 in CHRNA3 (r²=0.91 in Europeans), meaning most Europeans with the C allele also carry the rs1051730-A allele. In non-European populations, particularly African Americans, this correlation is weaker, making rs8034191 and rs1051730 partially independent risk indicators. The locus also contains rs16969968 in CHRNA5, which encodes the functionally impactful Asp398Asn amino acid change — rs16969968 is the strongest single functional variant at this locus, while rs8034191 captures primarily regulatory effects.
The three variants can be considered a risk haplotype: individuals who carry risk alleles at all three (rs16969968-A, rs1051730-A, rs8034191-C) are at substantially higher cumulative risk for nicotine dependence, lung cancer, and COPD than those carrying any single risk allele alone, though no published compound-genotype analyses exist to precisely quantify the triple-carrier odds ratio.
Genotype Interpretations
What each possible genotype means for this variant:
No elevated genetic risk from this 15q25.1 variant
You carry two copies of the reference T allele at rs8034191, meaning this particular regulatory variant does not add to your genetic risk for heavy smoking or lung cancer. About 40% of Europeans share this genotype. The 15q25.1 locus contains additional coding and regulatory variants in CHRNA3 and CHRNA5 that can independently affect risk and may be assessed separately.
Moderately elevated risk for heavy smoking and lung cancer via 15q25.1 regulation
The single C allele shifts the regulatory environment of your 15q25.1 locus. As an eQTL, it changes how much CHRNA3, CHRNA5, HYKK, and IREB2 are expressed in lung tissue. Unlike rs16969968 which alters the receptor protein directly, rs8034191 influences the quantity of nicotinic receptor subunits available, with downstream effects on nicotine response and lung tissue biology.
The 30% of the effect that operates independent of smoking is important: even if you never smoke, IREB2 and CHRNA3 expression changes from this variant may influence lung function over time. Spirometry tracking is a reasonable precaution for TC carriers who have significant smoking history.
Substantially elevated risk for heavy smoking, COPD, and lung cancer
The homozygous CC genotype concentrates the eQTL effects of both alleles. In lung tissue, this shifts CHRNA3, CHRNA5, HYKK, IREB2, and PSMA4 expression simultaneously and also changes DNA methylation patterns at several of these genes independently of smoking. IREB2 dysregulation has been mechanistically linked to emphysema through effects on cellular iron homeostasis and oxidative stress responses.
The recessive model showing OR=1.41 for lung cancer suggests a disproportionate effect in CC carriers compared to TC carriers — the two-allele dose is not simply twice the single-allele effect but may be amplified by the dual eQTL impact. This is consistent with the CC genotype producing a qualitatively different transcriptional state in lung epithelium.
For CC carriers of African or Latino ancestry, this variant may represent a more independent risk signal than for Europeans, since LD with rs1051730 is weaker in these populations. Your CC genotype at rs8034191 provides meaningful risk information beyond what rs16969968 and rs1051730 alone would indicate.
Key References
14-study meta-analysis (14,075 cases/12,873 controls) confirming C allele confers OR=1.23 for lung cancer; effect in Caucasians (OR=1.22) and African Americans (OR=1.39), not East Asians
13-study meta-analysis confirming rs8034191 association with lung cancer across three genetic models (dominant OR=1.34, additive OR=1.61)
Quantitative comparison of rs8034191 and rs1051730 at 15q25 — both similarly associated with lung cancer risk, partially mediated by smoking behavior
rs8034191 CC genotype associated with heavy smoking in women (age-adjusted OR=2.8)
Genome-wide association study of COPD identifying rs8034191 as susceptibility locus; C allele population attributable risk 12.2% for COPD
Erlich et al. 2010 — CHRNA5/CHRNA3 region (15q25.1) variants associated with nicotine and opioid dependence severity; supports the 15q25.1 locus as a pleiotropic dependence risk region; rs8034191 is a tagging variant for this locus