rs2066847 — NOD2 3020insC (1007fs)
The most important Crohn's disease risk variant worldwide — a frameshift insertion in NOD2 that abolishes bacterial muramyl dipeptide sensing; homozygotes face 20–40x elevated Crohn's risk
Details
- Gene
- NOD2
- Chromosome
- 16
- Risk allele
- I
- Clinical
- Risk Factor
- Evidence
- Established
Population Frequency
Category
IBD & Mucosal ImmunitySee your personal result for NOD2
Upload your DNA data to find out which genotype you carry and what it means for you.
Upload your DNA dataWorks with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.
NOD2 3020insC — The Crohn's Disease Master Switch
Every bacterium carries muramyl dipeptide (MDP) in its cell wall — a molecular signature
that the NOD2 protein uses as an early warning alarm inside intestinal cells. When bacteria
breach the gut epithelium, NOD2 binds MDP11 NOD2 binds MDP
NOD2 directly binds MDP via its leucine-rich
repeat domain, triggering downstream NF-κB activation and antimicrobial
defense and triggers a tightly controlled
inflammatory response. The 3020insC variant (also written c.3019dup or L1007fsX1008) inserts
a single cytosine into a run of C's in exon 11, shifting the reading frame and producing a
truncated protein lacking the last 33 amino acids22 truncated protein lacking the last 33 amino acids
the truncation removes the terminal
10th leucine-rich repeat, the domain that directly contacts
MDP. The result is a protein that can no longer
sense bacteria — leaving the gut immune system functionally blind to microbial invasion.
The 3020insC frameshift was independently discovered in 200133 independently discovered in 2001
Ogura et al. and Hugot et al.
simultaneously identified NOD2 variants, including 3020insC, as the first specific genetic
risk factors for Crohn's disease by two groups
simultaneously and remains the single most important Crohn's disease susceptibility variant
in people of European descent. It is virtually absent in Asian populations, which helps
explain why Crohn's disease has different genetic architecture in East Asian compared to
European patients.
The Mechanism
The frameshift shifts the translation reading frame starting at codon 1007, replacing leucine
with proline and introducing a stop codon two residues later (Leu1007ProfsX1008). The resulting
truncated protein lacks the 10th leucine-rich repeat44 10th leucine-rich repeat
the NOD2 LRR domain forms a
horseshoe-shaped structure; the 10th repeat is critical for MDP
binding, which is required for MDP
recognition and NF-κB activation. This is a complete loss-of-function mutation55 loss-of-function mutation
unlike
R702W and G908R, which cause partial impairment, the frameshift abolishes MDP sensing
entirely at physiological concentrations.
Ordinarily, NOD2 in ileal Paneth cells coordinates two protective functions: it activates
NF-κB to initiate bacterial defense, and it downregulates excessive Toll-like receptor
(TLR) signaling66 downregulates excessive Toll-like receptor
(TLR) signaling
NOD2-mediated MDP recognition normally dampens TLR4 responses to prevent
inflammatory overactivation from commensal bacteria.
The 3020insC mutation eliminates both functions simultaneously. Without proper MDP sensing,
Paneth cells produce fewer cryptdins (defensins)77 cryptdins (defensins)
antimicrobial peptides secreted by Paneth
cells to control the bacterial load in the terminal ileum,
TLR responses are dysregulated, and the gut becomes susceptible to inappropriate
inflammatory responses against commensal bacteria — the hallmark of Crohn's disease.
The Evidence
The original 2001 discoveries were rapidly confirmed worldwide. A meta-analysis of 75
case-control studies88 meta-analysis of 75
case-control studies
18,727 Crohn's disease cases and 17,102 controls across multiple
populations established the effect sizes with
precision: heterozygous carriers (DI genotype) have an odds ratio of 3.8 (95% CI 3.4–4.3)
for Crohn's disease, and homozygous or compound heterozygous carriers face an OR of
approximately 34 — a risk elevation unmatched by virtually any other common complex disease
variant.
The genotype-phenotype correlation is equally striking. Among 1,066 Crohn's disease patients,
rs2066847 allele frequency was 15.6% in patients with aggressive disease versus 8.2% in
patients with mild disease99 Among 1,066 Crohn's disease patients,
rs2066847 allele frequency was 15.6% in patients with aggressive disease versus 8.2% in
patients with mild disease
p.Leu1007fsX1008 is a strong predictor of aggressive vs
mild disease course. A study of all 54 homozygous
carriers showed a uniformly aggressive phenotype1010 uniformly aggressive phenotype
all homozygotes who were active smokers
developed ileal stenosis requiring surgery:
87% had ileal stenosis, 68.5% had fistulas, and 72.2% required Crohn's-related surgery despite
immunosuppressive therapy in 87% of cases. Among homozygotes who were active smokers, every
single patient required surgery.
Critically, the mutation's association is highly specific to Crohn's disease1111 highly specific to Crohn's disease
no
significant association with ulcerative colitis, other IBD forms, or most other
inflammatory conditions. The variant is also
essentially absent in Asian populations, consistent with the known epidemiological differences
in Crohn's disease incidence and genetics between European and non-European populations.
Practical Implications
Most 3020insC heterozygous carriers (DI genotype) never develop Crohn's disease — penetrance is incomplete and requires additional genetic and environmental triggers. However, if symptoms of intestinal inflammation develop, the genotype argues for prompt, thorough evaluation with a gastroenterologist. Fecal calprotectin is a sensitive, non-invasive marker of intestinal inflammation that can be used for routine monitoring in carriers.
For homozygous or compound heterozygous carriers (those who also carry R702W [rs2066844] or G908R [rs2066845] on the other chromosome), the clinical picture is fundamentally different: if Crohn's disease develops, it is expected to be ileal, stricturing, and medically refractory. A "top-down" approach — early initiation of biologic therapy rather than step-up from aminosalicylates — is supported by the literature for these patients. Smoking cessation is particularly urgent, as the smoking-NOD2 interaction is specific to the 1007fs mutation and results in universally aggressive outcomes in homozygotes.
Interactions
The 3020insC variant is the most severe of the three major NOD2 mutations. Compound
heterozygotes — individuals carrying 3020insC on one chromosome and either R702W (rs2066844)
or G908R (rs2066845) on the other — face risk equivalent to homozygotes for 3020insC, with
an OR of ~34 and 98% specificity for complicated stricturing disease1212 OR of ~34 and 98% specificity for complicated stricturing disease
compound heterozygotes
and homozygotes have equivalent risk; both dramatically exceed simple
heterozygotes.
NOD2 also recruits ATG16L1 (rs2241880)1313 ATG16L1 (rs2241880)
ATG16L1 is essential for autophagy of intracellular
bacteria; NOD2 directly recruits the ATG16L1 complex to bacterial entry
sites to bacterial entry sites in the gut
epithelium. When both NOD2 function is abolished (3020insC) and ATG16L1 autophagy is impaired
(T300A variant), intracellular bacterial clearance fails through two convergent mechanisms,
amplifying Crohn's risk. Similarly, IL23R variants (rs11209026) that increase Th17
inflammatory responses compound NOD2 loss-of-function by driving the adaptive immune
arm of intestinal inflammation.
Genotype Interpretations
What each possible genotype means for this variant:
Normal bacterial sensing with population-average Crohn's disease risk
You have two copies of the reference (insertion-absent) allele, indicating normal NOD2 protein production and intact muramyl dipeptide sensing. Your ileal Paneth cells can appropriately recognize bacterial peptidoglycans, activate NF-κB responses, and produce antimicrobial defensins to maintain gut homeostasis. This is the most common genotype — approximately 97% of people of European descent and over 99% of East Asian and African populations carry this genotype.
Two copies of the 3020insC frameshift — very high Crohn's disease risk with aggressive expected phenotype
Both copies of your NOD2 gene carry the frameshift mutation, leaving your intestinal epithelium with essentially no functional MDP-sensing capacity. Without NOD2 signaling, Paneth cells cannot appropriately regulate the ileal bacterial environment, TLR responses are dysregulated against commensal bacteria, and the anti-inflammatory cytokine IL-10 response to MDP is abolished. This creates the most permissive genetic environment for Crohn's disease of any common variant combination. The disease phenotype in homozygotes is not merely more likely — it is qualitatively different: uniformly ileal, uniformly stricturing, and resistant to conventional step-up therapy. Gastroenterology involvement should be established proactively, before disease onset if possible, and biologic therapy should be considered early if disease develops rather than as a last resort.
One copy of the Crohn's disease frameshift variant — elevated but moderate risk
One of your two NOD2 gene copies carries the frameshift, so approximately half your intestinal NOD2 protein is non-functional. Your remaining intact copy provides substantial residual bacterial sensing, which is why most heterozygous carriers never develop IBD despite their elevated genetic risk. The variant confers incomplete penetrance requiring additional genetic hits (other NOD2 variants, ATG16L1, IL23R) and environmental triggers (specific gut microbiome composition, early life infections, smoking) for disease to manifest. Monitoring gut health proactively and promptly investigating any new gastrointestinal symptoms is the most actionable step for heterozygous carriers.