Research

rs5029937 — TNFAIP3

Intronic risk variant within TNFAIP3 intron 2 that independently increases susceptibility to rheumatoid arthritis and SLE through a distinct LD block from the nearby intergenic 6q23 signals, completing the three-signal risk model at the TNFAIP3 locus

Strong Risk Factor Share

Details

Gene
TNFAIP3
Chromosome
6
Risk allele
T
Clinical
Risk Factor
Evidence
Strong

Population Frequency

GG
74%
GT
24%
TT
2%

See your personal result for TNFAIP3

Upload your DNA data to find out which genotype you carry and what it means for you.

Upload your DNA data

Works with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.

TNFAIP3 Intron 2 — The Third Signal in the 6q23 Risk Cluster

The TNFAIP3 gene on chromosome 6q23 encodes A20, the primary negative regulator of NF-κB inflammatory signaling11 A20, the primary negative regulator of NF-κB inflammatory signaling
A20 is a ubiquitin-editing enzyme that terminates immune activation by deubiquitinating key signaling proteins; TNFAIP3 stands for TNF Alpha Induced Protein 3
. The 6q23 locus is one of the most replicated autoimmune susceptibility regions in the human genome, carrying three independently inherited risk signals that each contribute to RA and SLE susceptibility through distinct mechanisms. rs5029937 is the third and most proximally located of these signals — it sits within intron 2 of TNFAIP3 itself, in a separate linkage disequilibrium block from the intergenic variants rs6920220 and rs10499194 that lie approximately 150 kb upstream. The T allele, rare in Europeans (~3%) but substantially more common in African populations (~27%), marks a haplotype associated with elevated autoimmune disease risk, and individuals who carry it alongside the risk alleles at the upstream intergenic SNPs face the steepest combined risk at this locus.

The Mechanism

rs5029937 resides in a 71 kb LD block that is structurally distinct from the intergenic LD block containing rs6920220 and rs10499194. Its location within intron 2 of TNFAIP3 places it in a region that could influence pre-mRNA splicing, intronic regulatory elements, or enhancer activity within the gene body. The precise functional mechanism has not been resolved — like many intronic GWAS signals, rs5029937 may be a proxy for an unidentified causal variant within the same haplotype rather than directly functional itself. What is established is that it is statistically independent of the two upstream intergenic signals: conditional logistic regression22 conditional logistic regression
After conditioning on both rs6920220 and rs13207033 in multiple independent datasets, rs5029937 retained significant association
with P=0.02, and combining WTCCC data showed P values reaching 4.71×10⁻¹¹ — well into genome-wide significance territory.

The downstream functional consequence is presumed to impair A20-mediated NF-κB termination, consistent with the two upstream signals that reduce A20 transcription (rs6920220) and enzymatic activity (rs2230926 F127C). Whether rs5029937 impairs A20 expression, splicing efficiency, or interacts with intronic regulatory sequences that modulate tissue-specific expression remains under investigation.

The variant shows a notable association pattern by autoantibody status33 autoantibody status
rs5029937 was strongly associated with RF-positive and anti-CCP-positive RA cases but not with anti-CCP-negative RA in the original discovery cohort
, mirroring the biology of seropositive disease where NF-κB-driven autoantibody production plays a larger role.

The Evidence

The variant was identified as one of three independent signals at 6q23 in an analysis that applied conditional logistic regression to map the locus structure. The key three-signal paper by Orozco et al. (2009) established that when rs5029937 T allele carriers also carry the rs6920220 A allele and lack the rs10499194 protective T allele, the combined odds ratio reaches 1.8644 combined odds ratio reaches 1.86
4.4% of RA patients versus 2.3% of controls carried this highest-risk haplotype combination at 6q23
. This represents one of the clearest examples of multiple independent risk signals at a single locus acting additively to substantially amplify RA susceptibility beyond any individual signal's contribution.

The RA association was originally identified in a meta-analysis and imputation study at the TNFAIP3 locus55 meta-analysis and imputation study at the TNFAIP3 locus
Bates JS et al. 2009 — identified a 109 kb risk haplotype spanning the TNFAIP3 region, with rs5029937 among the contributing markers
using European populations.

Beyond RA, rs5029937 shows broader autoimmune pleiotropism consistent with A20's role across the immune system. A meta-analysis of 18,501 SLE patients and 30,435 controls66 meta-analysis of 18,501 SLE patients and 30,435 controls
Zhang MY et al. 2016 — pooled ORs with 95% CI across 23 studies testing TNFAIP3 polymorphisms including rs5029937, rs2230926, rs5029939, and rs3757173
found significant SLE association (P<0.001) in both European and Asian populations. A Korean case-control study77 Korean case-control study
133 SLE patients vs 422 healthy controls; Kim SK et al. Rheumatology 2014
found rs5029937 T allele conferred OR 2.13 (95% CI 1.25–3.65, P=0.02) for SLE, while interestingly not showing significant RA association in the same cohort — echoing the population-specific and disease-specific LD patterns seen across the 6q23 locus.

A population-level RA study from Iran found the T allele associated with RA with OR 2.61 for the allele comparison (95% CI 1.38–4.92, P=0.004), and the combined TT+GT genotype versus GG showed OR 3.46 (95% CI 1.49–8.08). While this was a small cohort (50 cases/50 controls), the effect direction is consistent with European discovery data.

The T allele is very rare in Europeans (~3%) and essentially absent in some East Asian populations (dbSNP ALFA data: 0% in the small East Asian sample), while reaching ~27% frequency in African populations and ~7% in Latino populations. This population stratification has important implications: the locus likely contributes more substantially to autoimmune disease burden in African ancestry populations than European GWAS samples suggest, and population-specific LD patterns may shift the disease-relevant haplotype tagging across populations.

The VITAL randomized controlled trial88 VITAL randomized controlled trial
25,871 participants randomized to vitamin D3 2000 IU/day, omega-3 1g/day, or placebo, followed for 5 years
is the most directly actionable intervention evidence for NF-κB pathway modulation: vitamin D reduced incident autoimmune disease by 22% (HR 0.78, P=0.05) and omega-3 reduced by 15%, with benefits persisting two years after supplementation ended. These interventions act on the NF-κB pathway that TNFAIP3/A20 regulates.

Interactions

rs5029937 is the third leg of the three-signal 6q23 locus model. The complete risk hierarchy at this locus requires integrating all three signals: rs6920220 (intergenic, reduces A20 transcription), rs10499194/rs13207033 (intergenic, protective haplotype when T allele present), and rs5029937 (intronic, risk when T allele present). Carriers of both risk signals (rs6920220 A allele and rs5029937 T allele) while lacking the protective allele at rs10499194 face the maximum 6q23 combined OR of 1.86.

The TNFAIP3 missense variant rs2230926 (F127C) impairs A20 catalytic activity through a completely distinct mechanism. Carriers of rs5029937 T allele who also carry rs2230926 G allele may face compounded A20 dysfunction — the intronic variant potentially reducing expression or splicing efficiency while the missense variant reduces enzymatic function.

PTPN22 R620W (rs2476601) intersects with TNFAIP3 in seropositive autoimmune disease risk: PTPN22 lowers T-cell activation thresholds while TNFAIP3 variants prolong NF-κB inflammatory signaling. The combination of rs5029937 T allele and PTPN22 risk allele may be particularly consequential for seropositive RA risk given both variants' preferential association with RF/anti-CCP-positive disease.

Genotype Interpretations

What each possible genotype means for this variant:

GG “No Intronic Risk Allele” Normal

Common genotype — no intronic TNFAIP3 risk contribution from this variant

You carry two copies of the common G allele at rs5029937. This is by far the most prevalent genotype — approximately 94% of Europeans, 91% of East Asians, and 54% of people of African descent share this genotype. The intronic T risk allele at this TNFAIP3 locus is absent, so this signal does not contribute to elevated autoimmune disease risk. Your overall TNFAIP3 locus risk may still be shaped by other independent signals at the locus, which are assessed separately.

GT “One Intronic Risk Allele” Intermediate Caution

One T allele — modest independent increase in RA and SLE susceptibility at this locus

The T allele at rs5029937 is located in intron 2 of TNFAIP3, within a 71 kb LD block structurally separate from the intergenic block harboring rs6920220 and rs10499194/rs13207033. Conditional regression analysis confirmed it remains significantly associated with RA (P=0.02) after accounting for both other signals — its risk is not simply a statistical shadow of those variants. The autoantibody-specific association (RF-positive, anti-CCP-positive RA but not seronegative RA) suggests this variant's effect is particularly relevant to the antibody-mediated arm of autoimmune disease.

In African populations where the T allele is substantially more common (~27%), this variant likely contributes meaningfully to autoimmune disease population risk in ways not fully captured by European-ancestry GWAS studies. The SLE association (OR 2.13 in Korean cohort, meta-analysis P<0.001) extends this variant's relevance beyond RA.

TT “Homozygous Intronic Risk Allele” High Risk Warning

Two T alleles — highest individual genotype risk at this intronic TNFAIP3 signal

TT homozygosity at rs5029937 means both copies of the intronic TNFAIP3 haplotype carry the risk-associated T allele. The independent contribution of this signal means the elevated risk is additive with the two upstream intergenic signals. The autoantibody specificity of this variant (strong association with RF-positive and anti-CCP-positive RA, no significant association with seronegative disease) means the primary concern is seropositive autoimmune disease.

SLE risk is also elevated: meta-analysis across 18,501 SLE patients and 30,435 controls found significant rs5029937 association with SLE (P<0.001) in European and Asian populations. The Korean study showing OR 2.13 for the T allele in SLE (heterozygous) suggests TT homozygotes face substantially higher SLE risk in susceptible populations.