Research

rs804279 — GATA4 GATA4/NEIL2 PCOS Susceptibility

Intergenic variant at the GATA4/NEIL2 locus on chromosome 8p23.1 associated with polycystic ovary syndrome susceptibility; the T allele tags regulatory variation linked to elevated androstenedione and PCOS risk (OR ~1.14), with the strongest effect in NIH-diagnosed PCOS

Moderate Risk Factor Share

Details

Gene
GATA4
Chromosome
8
Risk allele
T
Clinical
Risk Factor
Evidence
Moderate

Population Frequency

AA
8%
AT
40%
TT
52%

See your personal result for GATA4

Upload your DNA data to find out which genotype you carry and what it means for you.

Upload your DNA data

Works with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.

GATA4/NEIL2 — A Chromosomal Neighbourhood Tied to PCOS Androgen Excess

On the short arm of chromosome 8, at position 8p23.1, a compact genomic neighbourhood hosts two functionally distinct genes: GATA411 GATA4
GATA-binding protein 4, a zinc-finger transcription factor essential for cardiac and gonadal development; expressed in ovarian granulosa and theca cells where it regulates steroidogenic gene expression
and NEIL222 NEIL2
nei-like DNA glycosylase 2, a base excision repair enzyme that removes oxidised purines from DNA
. Between and around them sits rs804279, an intergenic variant that large-scale genome-wide association studies have linked to polycystic ovary syndrome (PCOS) susceptibility in women of European ancestry.

The Mechanism

The rs804279 variant lies approximately 6,400 bp upstream of GATA4 and 3,300 bp from NEIL2 (within the NEIL2 RefSeqGene interval, NG_053043.1:g.1718). As an intergenic regulatory variant, it does not change a protein sequence; instead, it likely modulates the expression or chromatin accessibility of one or both flanking genes in hormone-responsive tissues. GATA4 is a transcription factor expressed in ovarian theca cells, where it drives expression of steroidogenic enzymes including CYP17A1, the key enzyme in androgen biosynthesis. The locus also harbours FDFT133 FDFT1
farnesyl-diphosphate farnesyltransferase 1, the first committed enzyme in cholesterol biosynthesis that provides the substrate for all steroid hormones
, which sits approximately 29 kb downstream — giving this chromosomal neighbourhood three converging mechanisms for steroid hormone biology.

The T allele is the globally common allele (~72% frequency) but is the risk allele for PCOS at this locus. This contrasts with the typical GWAS pattern where the minor allele carries risk, and implies that the protective A allele (reference, ~28% globally) represents a recent or population-specific buffering variant — or that regulatory activity at this locus is required for normal PCOS-associated androgen elevation, with the A allele conferring partial protection.

The Evidence

The key PCOS association at rs804279 was established in a large European meta-analysis44 large European meta-analysis
Day FR et al. Large-scale genome-wide meta-analysis of polycystic ovary syndrome suggests shared genetic architecture for different diagnosis criteria. PLoS Genet 14:e1007813, 2018
combining 10,074 PCOS cases and 103,164 controls. Among 14 genome-wide significant PCOS loci, rs804279 was notable for one specific reason: it was the only locus showing significant heterogeneity across diagnostic criteria (heterogeneity p=2.6×10⁻⁵). The effect was largest in women diagnosed by NIH criteria (hyperandrogenism plus oligo-anovulation) and smallest in self-reported cases — consistent with a variant that specifically tags the androgen-excess component of PCOS rather than the full syndrome as broadly defined.

Computational functional analysis has categorised rs804279 among variants with predicted functional relevance55 variants with predicted functional relevance
Prabhu BN et al. Conceptualization of functional SNPs of PCOS genes: an in silico approach. J Endocrinol Invest 44:1783–1793, 2021
in the GATA4/NEIL2 region, supporting its role as a regulatory rather than neutral variant.

The evidence level is moderate: the locus has genome-wide significance in the Day 2018 meta-analysis, but functional characterisation of what rs804279 specifically does to GATA4 or NEIL2 expression remains incomplete. The diagnostic-criteria heterogeneity means the association is most reliable when PCOS is defined by the NIH standard (requiring both hyperandrogenism and cycle abnormality) rather than by self-report or broader Rotterdam criteria alone.

Practical Actions

Because this locus appears most strongly associated with the hyperandrogenic PCOS subtype, monitoring should focus on androgen biomarkers: free testosterone, DHEAS, and androstenedione are the most directly relevant, with AMH and LH:FSH ratio providing additional characterisation of the reproductive phenotype.

Inositol supplementation (myo-inositol) has the strongest evidence for improving ovarian responsiveness in PCOS with FSH resistance, acting downstream of the gonadotropin signalling cascade that GATA4 expression influences. Women with the TT genotype who have clinical signs (irregular cycles, hirsutism, acne, difficulty conceiving) have clear grounds for proactive evaluation.

Interactions

The GATA4/NEIL2 locus acts independently of the DENND1A (rs7852296) and LHCGR (rs13405728) PCOS susceptibility loci established in the same European meta-analyses. Women carrying risk alleles at multiple PCOS loci accumulate additive PCOS susceptibility. The GATA4 locus appears to operate primarily through the hyperandrogenic axis (linked to androstenedione and testosterone excess) while DENND1A operates through FSH receptor trafficking — distinct but complementary mechanisms both converging on follicular arrest and androgen excess. A compound interaction between homozygous TT at rs804279 and the DENND1A risk haplotype (rs7852296-AA or homozygous risk) would represent convergent failure of both steroidogenic regulation and FSH receptor recycling; this combination warrants investigation as a compound action (no published combined effect size yet).

Genotype Interpretations

What each possible genotype means for this variant:

AA “Protective Genotype” Normal

Protective GATA4/NEIL2 genotype — reduced PCOS susceptibility from this locus

You carry two copies of the A allele at rs804279, the reference allele and the less common genotype globally (approximately 6–8% of people of European ancestry). This genotype is associated with reduced PCOS susceptibility from this locus compared to those carrying the T allele. The A allele confers relative protection against the androgen-excess PCOS subtype at chromosome 8p23.1. Overall PCOS risk depends on many additional genetic and environmental factors beyond this single locus.

AT “One Risk Allele” Intermediate Caution

One copy of the GATA4/NEIL2 risk allele — modestly elevated PCOS susceptibility

The GATA4/NEIL2 locus on chromosome 8p23.1 was identified as a PCOS susceptibility locus in the Day et al. 2018 meta-analysis (10,074 cases / 103,164 controls). Among 14 confirmed PCOS susceptibility loci, rs804279 uniquely showed significant heterogeneity across diagnostic criteria (Het P=2.6×10⁻⁵), with the strongest effect in NIH-diagnosed PCOS (requiring documented hyperandrogenism). This means the locus specifically tags the androgen-excess component of PCOS, not the metabolic or self-reported spectrum. GATA4 is expressed in ovarian cells and regulates steroidogenic enzyme transcription; the nearby FDFT1 gene encodes the first enzyme in cholesterol synthesis. Carriers of one T allele at rs804279 show intermediate susceptibility from this locus; clinical significance depends heavily on whether hyperandrogenic symptoms are present.

TT “Homozygous Risk” High Risk Warning

Homozygous for the GATA4/NEIL2 risk allele — elevated PCOS susceptibility from this locus

The Day et al. 2018 PCOS meta-analysis identified rs804279 as one of 14 genome-wide significant PCOS susceptibility loci in European women. The locus is uniquely diagnostic-criterion-dependent: the effect is largest in NIH-diagnosed PCOS (hyperandrogenism + oligo-anovulation), weakest in self-reported PCOS. This heterogeneity (Het P=2.6×10⁻⁵) suggests rs804279 tags the androgen-excess component of PCOS specifically. GATA4, located 6.4 kb from rs804279, is a transcription factor expressed in ovarian theca cells that drives CYP17A1 expression — the rate-limiting enzyme for androgen biosynthesis. FDFT1, the first committed step in cholesterol synthesis (the precursor pool for all steroid hormones), is located ~29 kb from the variant. Homozygous TT carriers show the highest regulatory effect from this locus on the GATA4/steroidogenic axis.

The T allele is the global majority allele (~72%), meaning this risk genotype is common. PCOS affects approximately 8–13% of women of reproductive age; the OR of ~1.14 per T allele means population-attributable risk from this locus is real but modest — the locus contributes to a polygenic risk architecture rather than acting as a deterministic switch. Combined with other PCOS loci (DENND1A, LHCGR, THADA, FSHB), TT at rs804279 contributes meaningfully to cumulative genetic PCOS risk.