rs804279 — GATA4 GATA4/NEIL2 PCOS Susceptibility
Intergenic variant at the GATA4/NEIL2 locus on chromosome 8p23.1 associated with polycystic ovary syndrome susceptibility; the T allele tags regulatory variation linked to elevated androstenedione and PCOS risk (OR ~1.14), with the strongest effect in NIH-diagnosed PCOS
Details
- Gene
- GATA4
- Chromosome
- 8
- Risk allele
- T
- Clinical
- Risk Factor
- Evidence
- Moderate
Population Frequency
Category
Fertility & Ovarian FunctionSee your personal result for GATA4
Upload your DNA data to find out which genotype you carry and what it means for you.
Upload your DNA dataWorks with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.
GATA4/NEIL2 — A Chromosomal Neighbourhood Tied to PCOS Androgen Excess
On the short arm of chromosome 8, at position 8p23.1, a compact genomic
neighbourhood hosts two functionally distinct genes: GATA411 GATA4
GATA-binding
protein 4, a zinc-finger transcription factor essential for cardiac and
gonadal development; expressed in ovarian granulosa and theca cells where it
regulates steroidogenic gene expression
and NEIL222 NEIL2
nei-like DNA glycosylase 2, a base excision repair enzyme that
removes oxidised purines from DNA.
Between and around them sits rs804279, an intergenic variant that large-scale
genome-wide association studies have linked to polycystic ovary syndrome (PCOS)
susceptibility in women of European ancestry.
The Mechanism
The rs804279 variant lies approximately 6,400 bp upstream of GATA4 and 3,300 bp
from NEIL2 (within the NEIL2 RefSeqGene interval, NG_053043.1:g.1718). As an
intergenic regulatory variant, it does not change a protein sequence; instead, it
likely modulates the expression or chromatin accessibility of one or both flanking
genes in hormone-responsive tissues. GATA4 is a transcription factor expressed in
ovarian theca cells, where it drives expression of steroidogenic enzymes including
CYP17A1, the key enzyme in androgen biosynthesis. The locus also harbours
FDFT133 FDFT1
farnesyl-diphosphate farnesyltransferase 1, the first committed enzyme
in cholesterol biosynthesis that provides the substrate for all steroid hormones,
which sits approximately 29 kb downstream — giving this chromosomal neighbourhood
three converging mechanisms for steroid hormone biology.
The T allele is the globally common allele (~72% frequency) but is the risk allele for PCOS at this locus. This contrasts with the typical GWAS pattern where the minor allele carries risk, and implies that the protective A allele (reference, ~28% globally) represents a recent or population-specific buffering variant — or that regulatory activity at this locus is required for normal PCOS-associated androgen elevation, with the A allele conferring partial protection.
The Evidence
The key PCOS association at rs804279 was established in a
large European meta-analysis44 large European meta-analysis
Day FR et al. Large-scale genome-wide meta-analysis
of polycystic ovary syndrome suggests shared genetic architecture for different
diagnosis criteria. PLoS Genet 14:e1007813, 2018
combining 10,074 PCOS cases and 103,164 controls. Among 14 genome-wide significant
PCOS loci, rs804279 was notable for one specific reason: it was the only locus
showing significant heterogeneity across diagnostic criteria (heterogeneity
p=2.6×10⁻⁵). The effect was largest in women diagnosed by NIH criteria
(hyperandrogenism plus oligo-anovulation) and smallest in self-reported cases —
consistent with a variant that specifically tags the androgen-excess component of
PCOS rather than the full syndrome as broadly defined.
Computational functional analysis has categorised rs804279 among
variants with predicted functional relevance55 variants with predicted functional relevance
Prabhu BN et al. Conceptualization
of functional SNPs of PCOS genes: an in silico approach. J Endocrinol Invest
44:1783–1793, 2021 in the GATA4/NEIL2
region, supporting its role as a regulatory rather than neutral variant.
The evidence level is moderate: the locus has genome-wide significance in the Day 2018 meta-analysis, but functional characterisation of what rs804279 specifically does to GATA4 or NEIL2 expression remains incomplete. The diagnostic-criteria heterogeneity means the association is most reliable when PCOS is defined by the NIH standard (requiring both hyperandrogenism and cycle abnormality) rather than by self-report or broader Rotterdam criteria alone.
Practical Actions
Because this locus appears most strongly associated with the hyperandrogenic PCOS subtype, monitoring should focus on androgen biomarkers: free testosterone, DHEAS, and androstenedione are the most directly relevant, with AMH and LH:FSH ratio providing additional characterisation of the reproductive phenotype.
Inositol supplementation (myo-inositol) has the strongest evidence for improving ovarian responsiveness in PCOS with FSH resistance, acting downstream of the gonadotropin signalling cascade that GATA4 expression influences. Women with the TT genotype who have clinical signs (irregular cycles, hirsutism, acne, difficulty conceiving) have clear grounds for proactive evaluation.
Interactions
The GATA4/NEIL2 locus acts independently of the DENND1A (rs7852296) and LHCGR (rs13405728) PCOS susceptibility loci established in the same European meta-analyses. Women carrying risk alleles at multiple PCOS loci accumulate additive PCOS susceptibility. The GATA4 locus appears to operate primarily through the hyperandrogenic axis (linked to androstenedione and testosterone excess) while DENND1A operates through FSH receptor trafficking — distinct but complementary mechanisms both converging on follicular arrest and androgen excess. A compound interaction between homozygous TT at rs804279 and the DENND1A risk haplotype (rs7852296-AA or homozygous risk) would represent convergent failure of both steroidogenic regulation and FSH receptor recycling; this combination warrants investigation as a compound action (no published combined effect size yet).
Genotype Interpretations
What each possible genotype means for this variant:
Protective GATA4/NEIL2 genotype — reduced PCOS susceptibility from this locus
You carry two copies of the A allele at rs804279, the reference allele and the less common genotype globally (approximately 6–8% of people of European ancestry). This genotype is associated with reduced PCOS susceptibility from this locus compared to those carrying the T allele. The A allele confers relative protection against the androgen-excess PCOS subtype at chromosome 8p23.1. Overall PCOS risk depends on many additional genetic and environmental factors beyond this single locus.
One copy of the GATA4/NEIL2 risk allele — modestly elevated PCOS susceptibility
The GATA4/NEIL2 locus on chromosome 8p23.1 was identified as a PCOS susceptibility locus in the Day et al. 2018 meta-analysis (10,074 cases / 103,164 controls). Among 14 confirmed PCOS susceptibility loci, rs804279 uniquely showed significant heterogeneity across diagnostic criteria (Het P=2.6×10⁻⁵), with the strongest effect in NIH-diagnosed PCOS (requiring documented hyperandrogenism). This means the locus specifically tags the androgen-excess component of PCOS, not the metabolic or self-reported spectrum. GATA4 is expressed in ovarian cells and regulates steroidogenic enzyme transcription; the nearby FDFT1 gene encodes the first enzyme in cholesterol synthesis. Carriers of one T allele at rs804279 show intermediate susceptibility from this locus; clinical significance depends heavily on whether hyperandrogenic symptoms are present.
Homozygous for the GATA4/NEIL2 risk allele — elevated PCOS susceptibility from this locus
The Day et al. 2018 PCOS meta-analysis identified rs804279 as one of 14 genome-wide significant PCOS susceptibility loci in European women. The locus is uniquely diagnostic-criterion-dependent: the effect is largest in NIH-diagnosed PCOS (hyperandrogenism + oligo-anovulation), weakest in self-reported PCOS. This heterogeneity (Het P=2.6×10⁻⁵) suggests rs804279 tags the androgen-excess component of PCOS specifically. GATA4, located 6.4 kb from rs804279, is a transcription factor expressed in ovarian theca cells that drives CYP17A1 expression — the rate-limiting enzyme for androgen biosynthesis. FDFT1, the first committed step in cholesterol synthesis (the precursor pool for all steroid hormones), is located ~29 kb from the variant. Homozygous TT carriers show the highest regulatory effect from this locus on the GATA4/steroidogenic axis.
The T allele is the global majority allele (~72%), meaning this risk genotype is common. PCOS affects approximately 8–13% of women of reproductive age; the OR of ~1.14 per T allele means population-attributable risk from this locus is real but modest — the locus contributes to a polygenic risk architecture rather than acting as a deterministic switch. Combined with other PCOS loci (DENND1A, LHCGR, THADA, FSHB), TT at rs804279 contributes meaningfully to cumulative genetic PCOS risk.