Research

rs2066847 — NOD2 3020insC (1007fs)

The most important Crohn's disease risk variant worldwide — a frameshift insertion in NOD2 that abolishes bacterial muramyl dipeptide sensing; homozygotes face 20–40x elevated Crohn's risk

Established Risk Factor Share

Details

Gene
NOD2
Chromosome
16
Risk allele
I
Clinical
Risk Factor
Evidence
Established

Population Frequency

DD
0%
DI
2%
II
98%

See your personal result for NOD2

Upload your DNA data to find out which genotype you carry and what it means for you.

Upload your DNA data

Works with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.

NOD2 3020insC — The Crohn's Disease Master Switch

Every bacterium carries muramyl dipeptide (MDP) in its cell wall — a molecular signature that the NOD2 protein uses as an early warning alarm inside intestinal cells. When bacteria breach the gut epithelium, NOD2 binds MDP11 NOD2 binds MDP
NOD2 directly binds MDP via its leucine-rich repeat domain, triggering downstream NF-κB activation and antimicrobial defense
and triggers a tightly controlled inflammatory response. The 3020insC variant (also written c.3019dup or L1007fsX1008) inserts a single cytosine into a run of C's in exon 11, shifting the reading frame and producing a truncated protein lacking the last 33 amino acids22 truncated protein lacking the last 33 amino acids
the truncation removes the terminal 10th leucine-rich repeat, the domain that directly contacts MDP
. The result is a protein that can no longer sense bacteria — leaving the gut immune system functionally blind to microbial invasion.

The 3020insC frameshift was independently discovered in 200133 independently discovered in 2001
Ogura et al. and Hugot et al. simultaneously identified NOD2 variants, including 3020insC, as the first specific genetic risk factors for Crohn's disease
by two groups simultaneously and remains the single most important Crohn's disease susceptibility variant in people of European descent. It is virtually absent in Asian populations, which helps explain why Crohn's disease has different genetic architecture in East Asian compared to European patients.

The Mechanism

The frameshift shifts the translation reading frame starting at codon 1007, replacing leucine with proline and introducing a stop codon two residues later (Leu1007ProfsX1008). The resulting truncated protein lacks the 10th leucine-rich repeat44 10th leucine-rich repeat
the NOD2 LRR domain forms a horseshoe-shaped structure; the 10th repeat is critical for MDP binding
, which is required for MDP recognition and NF-κB activation. This is a complete loss-of-function mutation55 loss-of-function mutation
unlike R702W and G908R, which cause partial impairment, the frameshift abolishes MDP sensing entirely at physiological concentrations
.

Ordinarily, NOD2 in ileal Paneth cells coordinates two protective functions: it activates NF-κB to initiate bacterial defense, and it downregulates excessive Toll-like receptor (TLR) signaling66 downregulates excessive Toll-like receptor (TLR) signaling
NOD2-mediated MDP recognition normally dampens TLR4 responses to prevent inflammatory overactivation from commensal bacteria
. The 3020insC mutation eliminates both functions simultaneously. Without proper MDP sensing, Paneth cells produce fewer cryptdins (defensins)77 cryptdins (defensins)
antimicrobial peptides secreted by Paneth cells to control the bacterial load in the terminal ileum
, TLR responses are dysregulated, and the gut becomes susceptible to inappropriate inflammatory responses against commensal bacteria — the hallmark of Crohn's disease.

The Evidence

The original 2001 discoveries were rapidly confirmed worldwide. A meta-analysis of 75 case-control studies88 meta-analysis of 75 case-control studies
18,727 Crohn's disease cases and 17,102 controls across multiple populations
established the effect sizes with precision: heterozygous carriers (DI genotype) have an odds ratio of 3.8 (95% CI 3.4–4.3) for Crohn's disease, and homozygous or compound heterozygous carriers face an OR of approximately 34 — a risk elevation unmatched by virtually any other common complex disease variant.

The genotype-phenotype correlation is equally striking. Among 1,066 Crohn's disease patients, rs2066847 allele frequency was 15.6% in patients with aggressive disease versus 8.2% in patients with mild disease99 Among 1,066 Crohn's disease patients, rs2066847 allele frequency was 15.6% in patients with aggressive disease versus 8.2% in patients with mild disease
p.Leu1007fsX1008 is a strong predictor of aggressive vs mild disease course
. A study of all 54 homozygous carriers showed a uniformly aggressive phenotype1010 uniformly aggressive phenotype
all homozygotes who were active smokers developed ileal stenosis requiring surgery
: 87% had ileal stenosis, 68.5% had fistulas, and 72.2% required Crohn's-related surgery despite immunosuppressive therapy in 87% of cases. Among homozygotes who were active smokers, every single patient required surgery.

Critically, the mutation's association is highly specific to Crohn's disease1111 highly specific to Crohn's disease
no significant association with ulcerative colitis, other IBD forms, or most other inflammatory conditions
. The variant is also essentially absent in Asian populations, consistent with the known epidemiological differences in Crohn's disease incidence and genetics between European and non-European populations.

Practical Implications

Most 3020insC heterozygous carriers (DI genotype) never develop Crohn's disease — penetrance is incomplete and requires additional genetic and environmental triggers. However, if symptoms of intestinal inflammation develop, the genotype argues for prompt, thorough evaluation with a gastroenterologist. Fecal calprotectin is a sensitive, non-invasive marker of intestinal inflammation that can be used for routine monitoring in carriers.

For homozygous or compound heterozygous carriers (those who also carry R702W [rs2066844] or G908R [rs2066845] on the other chromosome), the clinical picture is fundamentally different: if Crohn's disease develops, it is expected to be ileal, stricturing, and medically refractory. A "top-down" approach — early initiation of biologic therapy rather than step-up from aminosalicylates — is supported by the literature for these patients. Smoking cessation is particularly urgent, as the smoking-NOD2 interaction is specific to the 1007fs mutation and results in universally aggressive outcomes in homozygotes.

Interactions

The 3020insC variant is the most severe of the three major NOD2 mutations. Compound heterozygotes — individuals carrying 3020insC on one chromosome and either R702W (rs2066844) or G908R (rs2066845) on the other — face risk equivalent to homozygotes for 3020insC, with an OR of ~34 and 98% specificity for complicated stricturing disease1212 OR of ~34 and 98% specificity for complicated stricturing disease
compound heterozygotes and homozygotes have equivalent risk; both dramatically exceed simple heterozygotes
.

NOD2 also recruits ATG16L1 (rs2241880)1313 ATG16L1 (rs2241880)
ATG16L1 is essential for autophagy of intracellular bacteria; NOD2 directly recruits the ATG16L1 complex to bacterial entry sites
to bacterial entry sites in the gut epithelium. When both NOD2 function is abolished (3020insC) and ATG16L1 autophagy is impaired (T300A variant), intracellular bacterial clearance fails through two convergent mechanisms, amplifying Crohn's risk. Similarly, IL23R variants (rs11209026) that increase Th17 inflammatory responses compound NOD2 loss-of-function by driving the adaptive immune arm of intestinal inflammation.

Genotype Interpretations

What each possible genotype means for this variant:

II “Normal NOD2 Function” Normal

Normal bacterial sensing with population-average Crohn's disease risk

You have two copies of the reference (insertion-absent) allele, indicating normal NOD2 protein production and intact muramyl dipeptide sensing. Your ileal Paneth cells can appropriately recognize bacterial peptidoglycans, activate NF-κB responses, and produce antimicrobial defensins to maintain gut homeostasis. This is the most common genotype — approximately 97% of people of European descent and over 99% of East Asian and African populations carry this genotype.

DD “Homozygous Variant” High Risk Critical

Two copies of the 3020insC frameshift — very high Crohn's disease risk with aggressive expected phenotype

Both copies of your NOD2 gene carry the frameshift mutation, leaving your intestinal epithelium with essentially no functional MDP-sensing capacity. Without NOD2 signaling, Paneth cells cannot appropriately regulate the ileal bacterial environment, TLR responses are dysregulated against commensal bacteria, and the anti-inflammatory cytokine IL-10 response to MDP is abolished. This creates the most permissive genetic environment for Crohn's disease of any common variant combination. The disease phenotype in homozygotes is not merely more likely — it is qualitatively different: uniformly ileal, uniformly stricturing, and resistant to conventional step-up therapy. Gastroenterology involvement should be established proactively, before disease onset if possible, and biologic therapy should be considered early if disease develops rather than as a last resort.

DI “Heterozygous Carrier” Carrier Caution

One copy of the Crohn's disease frameshift variant — elevated but moderate risk

One of your two NOD2 gene copies carries the frameshift, so approximately half your intestinal NOD2 protein is non-functional. Your remaining intact copy provides substantial residual bacterial sensing, which is why most heterozygous carriers never develop IBD despite their elevated genetic risk. The variant confers incomplete penetrance requiring additional genetic hits (other NOD2 variants, ATG16L1, IL23R) and environmental triggers (specific gut microbiome composition, early life infections, smoking) for disease to manifest. Monitoring gut health proactively and promptly investigating any new gastrointestinal symptoms is the most actionable step for heterozygous carriers.