NCBI Summary
This gene encodes a C-type lectin that functions in cell adhesion and pathogen recognition. This receptor recognizes a wide range of evolutionarily divergent pathogens with a large impact on public health, including leprosy and tuberculosis mycobacteria, the Ebola, hepatitis C, HIV-1 and Dengue viruses, and the SARS-CoV acute respiratory syndrome coronavirus. The protein is organized into four distinct domains: a C-terminal carbohydrate recognition domain, a flexible tandem-repeat neck domain, a transmembrane region and an N-terminal cytoplasmic domain involved in internalization. This gene is closely related in terms of both sequence and function to a neighboring gene, CLEC4M (Gene ID: 10332), also known as L-SIGN. The two genes differ in viral recognition and expression patterns, with this gene showing high expression on the surface of dendritic cells. Polymorphisms in the neck region are associated with protection from HIV-1 infection, while single nucleotide polymorphisms in the promoter of this gene are associated with differing resistance and susceptibility to and severity of infectious disease, including rs4804803, which is associated with SARS severity. [provided by RefSeq, May 2020].
Protein
Protein (NP_066978)
CLEC4L - DC-SIGN
CD209 antigen (C-type lectin domain family 4 member L) (Dendritic cell-specific ICAM-3-grabbing non-integrin 1) (DC-SIGN) (DC-SIGN1) (CD antigen CD209)
CD209
CD209 molecule
Curated
C-type lectin
CLEC4L
C-type - Type II transmembrane receptors
a/b mixed / C-type lectin-like
0.614
Protein sequence and protein families (fasta) (404 amino acids) Download
MSDSKEPRLQQLGLLEEEQLRGLGFRQTRGYKSLAGCLGHGPLVLQLLSFTLLAGLLVQVSKVPSSISQEQSRQDAIYQNLTQLKAAVGELSEKSKLQEIYQELTQLKAAVGELPEKSKLQEIYQELTRLKAAVGELPEKSKLQEIYQELTWLKAAVGELPEKSKMQEIYQELTRLKAAVGELPEKSKQQEIYQELTRLKAAVGELPEKSKQQEIYQELTRLKAAVGELPEKSKQQEIYQELTQLKAAVERLCHPCPWEWTFFQGNCYFMSNSQRNWHDSITACKEVGAQLVVIKSAEEQNFLQLQSSRSNRFTWMGLSDLNQEGTWQWVDGSPLLPSFKQYWNRGEPNNVGEEDCAEFSGNGWNDDKCNLAKFWICKKSAASCSRDEEQFLSPAPATPNPPPA
Mol* PDB structure viewerUniLectin3D
Structural models
Model Confidence:
  •    Very high (pLDDT > 90)
  •    Confident (90 > pLDDT > 70)
  •    Low (70 > pLDDT > 50)
  •    Very low (pLDDT < 50)

  AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Oligomerization and Known Interactions
Homotetramer. Interacts with C1QBP; the interaction is indicative for a C1q:C1QBP:CD209 signaling complex. Interacts (via C-type lectin domain) with CEACAM1 (via Lewis X moieties); this interaction is regulated by the glycosylation pattern of CEACAM1 on cell types and regulates contact between dendritic cells and neutrophils (PubMed:16246332)

(Microbial infection) Interacts with HIV-1 and HIV-2 gp120 (PubMed:11799126, PubMed:12502850, PubMed:1518869)

(Microbial infection) Interacts with ebolavirus envelope glycoproteins (PubMed:12502850, PubMed:12504546)

(Microbial infection) Interacts with cytomegalovirus gB protein (PubMed:12433371, PubMed:22496863)

(Microbial infection) Interacts with HCV E2 protein (PubMed:15371595, PubMed:16816373)

(Microbial infection) Interacts with dengue virus major envelope protein E

(Microbial infection) Interacts with measles hemagglutinin

(Microbial infection) Interacts with herpes simplex virus 1 surface proteins

(Microbial infection) Interacts with Influenzavirus A hemagglutinin

(Microbial infection) Interacts with SARS-CoV spike glycoprotein

(Microbial infection) Interacts with Japanese encephalitis virus E protein

(Microbial infection) Interacts with Lassa virus Glycoprotein

(Microbial infection) Interacts with marburg virus glycoprotein

(Microbial infection) Interacts with Respiratory syncytial virus glycoprotein G

(Microbial infection) Interacts with Rift valley fever virus and uukuniemi virus envelope glycoprotein

(Microbial infection) Interacts with west-nile virus envelope glycoprotein

(Microbial infection) Interacts with whole M.bovis cells in a Ca(2+)-dependent and independent manner; in vitro experiments suggest it interacts with CH60.1 (groL1), DnaK, GADPH (gap) and LrpG (PubMed:21203928)
Annotation
Ligand
Glycan ligands from structural data
aMan16(aMan13)aMan16Man
Man(a1-6)[Man(a1-3)]Man(a1-6)Man
aMan12aMan13aMan
Man(a1-2)Man(a1-3)Man
bGal14(aFuc13)bGlcNAc13bGal14Glc
Gal(b1-4)[Fuc(a1-3)]GlcNAc(b1-3)Gal(b1-4)Glc
bGlcNAc12aMan13(bGlcNAc12aMan16)Man
GlcNAc(b1-2)Man(a1-3)[GlcNAc(b1-2)Man(a1-6)]Man
aMan12aMan
Man(a1-2)Man
Di-mannoside mimic, synthetic analog of trimannoside
Man
Expression
Functionality temporarily unavailable.
References
NCBI References (10 PubMed Identifiers)
  • Lectins enhance SARS-CoV-2 infection and influence neutralizing antibodies. [34464958]
  • SARS-CoV-2 Impairs Dendritic Cells and Regulates DC-SIGN Gene Expression in Tissues. [34502134]
  • CD209L/L-SIGN and CD209/DC-SIGN act as receptors for SARS-CoV-2. [32607506]
  • COVID-19, Renin-Angiotensin System and Endothelial Dysfunction. [32660065]
  • DC-SIGN, DC-SIGNR and LSECtin: C-type lectins for infection. [24156700]
  • CD209 (DC-SIGN) -336A>G promoter polymorphism and severe acute respiratory syndrome in Hong Kong Chinese. [20359516]
  • DC-SIGN. a related gene, DC-SIGNR. and CD23 form a cluster on 19p13. [10975799]
  • DC-SIGN, a dendritic cell-specific HIV-1-binding protein that enhances trans-infection of T cells. [10721995]
  • Identification of DC-SIGN, a novel dendritic cell-specific ICAM-3 receptor that supports primary immune responses. [10721994]
  • Sequence and expression of a membrane-associated C-type lectin that exhibits CD4-independent binding of human immunodeficiency virus envelope glycoprotein gp120. [1518869]
UniProt Main References (39 PubMed Identifiers)
  • A dendritic cell-specific intercellular adhesion molecule 3-grabbing nonintegrin (DC-SIGN)-related protein is highly expressed on human liver sinusoidal endothelial cells and promotes HIV-1 infection. [11257134]
  • Extensive repertoire of membrane-bound and soluble dendritic cell-specific ICAM-3-grabbing nonintegrin 1 (DC-SIGN1) and DC-SIGN2 isoforms. Inter-individual variation in expression of DC-SIGN transcripts. [11337487]
  • Complete sequencing and characterization of 21,243 full-length human cDNAs. [14702039]
  • The DNA sequence and biology of human chromosome 19. [15057824]
  • The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). [15489334]
  • DC-SIGN-ICAM-2 interaction mediates dendritic cell trafficking. [11017109]
  • A novel mechanism of carbohydrate recognition by the C-type lectins DC-SIGN and DC-SIGNR. Subunit organization and binding to multivalent ligands. [11384997]
  • Human cytomegalovirus binding to DC-SIGN is required for dendritic cell infection and target cell trans-infection. [12433371]
  • Identification of different binding sites in the dendritic cell-specific receptor DC-SIGN for intercellular adhesion molecule 3 and HIV-1. [11799126]
  • The dendritic cell-specific adhesion receptor DC-SIGN internalizes antigen for presentation to T cells. [11859097]
  • Show more
All isoforms of this gene containing a lectin domain
NP_001138365.1, NP_001138366.1, NP_001138368.1, NP_066978.1, NP_001138371.1, NP_001138367.1, NP_001138369.1
RNA
RNA (Transcript ID: NM_021155.4)
CD209 molecule, transcript variant 1
m7G-5')ppp(5'-ACACUGGGGGAGAGUGGGGUGACAUGAGUGACUCCAAGGAACCAAGACUGCAGCAGCUGGGCCUCCUGGAGGAGGAACAGCUGAGAGGCCUUGGAUUCCGACAGACUCGAGGAUACAAGAGCUUAGCAGGGUGUCUUGGCCAUGGUCCCCUGGUGCUGCAACUCCUCUCCUUCACGCUCUUGGCUGGGCUCCUUGUCCAAGUGUCCAAGGUCCCCAGCUCCAUAAGUCAGGAACAAUCCAGGCAAGACGCGAUCUACCAGAACCUGACCCAGCUUAAAGCUGCAGUGGGUGAGCUCUCAGAGAAAUCCAAGCUGCAGGAGAUCUACCAGGAGCUGACCCAGCUGAAGGCUGCAGUGGGUGAGCUUCCAGAGAAAUCUAAGCUGCAGGAGAUCUACCAGGAGCUGACCCGGCUGAAGGCUGCAGUGGGUGAGCUUCCAGAGAAAUCUAAGCUGCAGGAGAUCUACCAGGAGCUGACCUGGCUGAAGGCUGCAGUGGGUGAGCUUCCAGAGAAAUCUAAGAUGCAGGAGAUCUACCAGGAGCUGACUCGGCUGAAGGCUGCAGUGGGUGAGCUUCCAGAGAAAUCUAAGCAGCAGGAGAUCUACCAGGAGCUGACCCGGCUGAAGGCUGCAGUGGGUGAGCUUCCAGAGAAAUCUAAGCAGCAGGAGAUCUACCAGGAGCUGACCCGGCUGAAGGCUGCAGUGGGUGAGCUUCCAGAGAAAUCUAAGCAGCAGGAGAUCUACCAGGAGCUGACCCAGCUGAAGGCUGCAGUGGAACGCCUGUGCCACCCCUGUCCCUGGGAAUGGACAUUCUUCCAAGGAAACUGUUACUUCAUGUCUAACUCCCAGCGGAACUGGCACGACUCCAUCACCGCCUGCAAAGAAGUGGGGGCCCAGCUCGUCGUAAUCAAAAGUGCUGAGGAGCAGAACUUCCUACAGCUGCAGUCUUCCAGAAGUAACCGCUUCACCUGGAUGGGACUUUCAGAUCUAAAUCAGGAAGGCACGUGGCAAUGGGUGGACGGCUCACCUCUGUUGCCCAGCUUCAAGCAGUAUUGGAACAGAGGAGAGCCCAACAACGUUGGGGAGGAAGACUGCGCGGAAUUUAGUGGCAAUGGCUGGAACGACGACAAAUGUAAUCUUGCCAAAUUCUGGAUCUGCAAAAAGUCCGCAGCCUCCUGCUCCAGGGAUGAAGAACAGUUUCUUUCUCCAGCCCCUGCCACCCCAAACCCCCCUCCUGCGUAGCAGAACUUCACCCCCUUUUAAGCUACAGUUCCUUCUCUCCAUCCUUCGACCUUCACAAAAUCUCUGGGACUGUUCUUUGUCAGAUUCUUCCUCCUUUAGAAGGCUGGGUCCCAUUCUGUCCUUCUUGUCAUGCCUCCAAUUUCCCCUGGUGUAGAGCUUGUUUUUCUGGCCCAUCCUUGGAGCUUUAUGAGUGAGCUGGUGUGGGAUGCCUUUGGGGGUGGACUUGUGUUCCAAGAAUCCACUCUCUCUUCCUUUUGGAGAUUAGGAUAUUUGGGUUGCCAUGUGUAGCUGCUAUGUCCCCUGGGGCGUUAUCUUAUACAUGCAAACCUACCAUCUGUUCAACUUCCACCUACCACCUCCUGCACCCCUUUGAUCGGGGACUUACUGGUUGCAAGAGCUCAUUUUGCAGGCUGGAAGCACCAGGGAAUUAAUUCCCCCAGUCAACCAAUGGCACCCAGAGAGGGCAUGGAGGCUCCACGCAACCCCUUCCACCCCCACAUCUUCCUUUGUCUUAUACAUGGCUUCCAUUUGGCUGUUUCUAAGUUGUAUUCUUUAUUUUAUUAUUAUUAUUACUAUUUUUCGAGAUGGAGUUUCACUCUUGUCGCUCAGGCUGGAGUGCCAUGGCGCGAUCUUGGCUCACUGCAACCUCUGCCUCCCGGGUUCAAGUGAUUCUCCUGCCUCAGCCUCACGAGUAGCUGGAAUUACAGGCAGGCGCCACCAGACCCGGCUAAUUUUUUGUAUUUUUAGUACAGAUGGGGUUUCUCCGUGUUGGUCAGGCUGGUCUUGAACUCCCGACCUCAGAUGAUCUGCCCGCCUCGGCCUCCCAAAAUUGCUGGGAUUACAGGUGUGAGCCACCGCGCCUGGCCUAUUAUUUUUUGUAAGAAUAAAACAGGUUUAUUGGGAUUUGGGACUCUGAACAGUUCUGUCUCUACUACCUGAUCUCCUCCUACCACGACUUUGGGAUCUAGAGGAGCUUUGGCUCCGGCUGUGACGGCUCCGGCCGUUCUCACUGCGGCUGCACCGGCCCCCGCUGCGGUCACUAUUUCUUCCUCUGCUAGGUGAAUUGUGCCUCUCCUGGCUCUUUGACAUGUGCUAGUGAGAUUUCUUCCUUUUCCUUUCGGAUUCCCCAUUUCUUUUGUAGGAAUGGUCUGGACUAGGGUUCUCCUUCCCCGCAGCCUGUAGUAUUCAUCGUGGUGGCCCACCCUCUCUCUCCCCUUGGAGCUCUUGCCAAAGGAGGAGACAAGCAGAGGUCUCUAUUGGAUUUCUCAACACCUGAAGAAAGUUGCAGUGUUUUCCUCUUGGACAUUGUUGUAUUUCAAAUAAACCACAAAUCAUCAUUUUCCACCGAGCCACUGGGCAGAAUUCACACUGAAGCUGUCGUCCUGCGUACAUACCAUCGUCCGUUAAACAGAGAAAGAGCUGCUUGGCAUUCUUCUUCCGACUGGUACUGAACAUAUAUACUUGCCCCUCAGGUGAGGUUCCAAGUUGCAACUGACCUUGAACUGAAUCACUCUCCCCACGUUAUUUUUUAAUUACUAUUUUUUUUUAAAGAUGGGGUCUUGCUCUGUCGCCAGGCUGGAGUGCAGUGGCGCGAUCUAGGCUCACUGCAACUUCCGCCUCCCGGGUUCAAGCGAUUCUCCUGCCUCAGCCUCCCGAGUAGCUGGGACUCCACUAAAAGUACAAAAAUUAGCUGGGCGUGCACCACUGCGCCCAGCUAAUUCUUGUAUUUUUGGUAGAGACGGGGUUUCAACAUGUUGACCAGGAUGGUCUCGAUCUCUUGACCUCGUGAUUCGCCCGCCGCGUCCUCCCAAAGUGCUGGGAUUACAGGCCUGAGCCACCGCGCCCAGUCUCUCCCCACGUUCUUGAACUCGGGCAGCACAUCCUCACAGAAAUCUAGGAACUGUUGGUAGGUUUCUUCCUCGCUGUACUCCAGGCUUGCUUCGGAGUCAUAGUCAUCCCUCCUGCACUGCUCCUUUCCAAACACUGUAAACAUGCUUUUAAUAAGAAGGGUAGGACUGGAUGUUGGGAAAUCAUGUGAACAUCUAUCUCCAAAUCUGCAAGCUCCUGUUUUACUGUAGAAGGGACAAUUAACUCCAUCCUUCUCCAUGACUCUGAAAUCCAAGGGGGGGUUCCGGGUUUUGCCAUGUGGCGCCAUUUUCCAACUCAUUUUCAGCCUGAUCCAGCAUCUUCUGGACAGCUUCCGGUUUUUGUUUCUUCUGUCGUUUCUGUUCCUCCUCCUCUCUCUCUUUCCUCUGCUGUUCUUCCCAUUGUUCCUUUAACUUUCGCUCUUGUUCUUGCCGUUUUCUAGCCACCUCUUCCUUUUCCUUCUUUAUUCUGAAUUCUUCUUGUGCCUUCUGCUCUCUCAGCAACCACUCCUCAUGUAAUCUUUGCCUCUCUCUUCCCCAUAGCUUUUCUAGUUGUUGUUUUUCAAUAAAAGUGUCCUCCUCUUUCUGUGAGAGUCCUGAGUCCCUCAGUGGAGCAAGUUCCUGCUGGCGUUUCUUUCGUUUCUCCUUCUUCAGGGCGGCCCUGUACUUUUUGUGGCUUGGUUUCUCUGGAAAUGUCACCUUUUCGGGCGCAGCCAUCUUGCCGGCACCGCCCCGCCCCUCUAGUUGUAUCCUUUAUAAUAAACUGGUAAACAUUGUAACCGCAGAUUCAGCCCAAUCUGGUUCAACUUUGUGUAAUAAAAUGGCGAGUUGUUUUUCAGUUGUCGUGGACCCCCAGGUUGCAAGUUACAUACCCUGGGCAUGUCCAGAUGAACGAAGCGUGCAAAUCCACGUGGAACCUAAGUGCUCAGACCGAGGAACAGGGACUGAGUUAAGAAGUGGACACCACGUGGCAUGAUCCUUGAUCCAAUCAGAUUGAGCCCUGGCGUGAUCCAGUCAGAUCAAGCCUCCUGAAUCCCCUCAUUACAAGAUCCAAUCAUAUCAUGCCUCACUACCCUCUGUAUAUAAAAUCUGCCCCAGCCUCCAACUUGGAGAGACAGAUUUGGGCCAGACUCCUGUGUCCUUGCUUGGCUGCCUUGCAAUAAAUUUUUCUCUCUACAAAA-3'- Poly-A tail
  • Coding region
DNA
DNA (Gene ID: 30835)
CD209 molecule
strand -
DC-SIGN, hDC-SIGN, CDSIGN, DC-SIGN1, CLEC4L
NCBI CDS gene sequence (1215 bp)
5'-ATGAGTGACTCCAAGGAACCAAGACTGCAGCAGCTGGGCCTCCTGGAGGAGGAACAGCTGAGAGGCCTTGGATTCCGACAGACTCGAGGATACAAGAGCTTAGCAGGGTGTCTTGGCCATGGTCCCCTGGTGCTGCAACTCCTCTCCTTCACGCTCTTGGCTGGGCTCCTTGTCCAAGTGTCCAAGGTCCCCAGCTCCATAAGTCAGGAACAATCCAGGCAAGACGCGATCTACCAGAACCTGACCCAGCTTAAAGCTGCAGTGGGTGAGCTCTCAGAGAAATCCAAGCTGCAGGAGATCTACCAGGAGCTGACCCAGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCTGCAGGAGATCTACCAGGAGCTGACCCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCTGCAGGAGATCTACCAGGAGCTGACCTGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGATGCAGGAGATCTACCAGGAGCTGACTCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACCCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACCCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACCCAGCTGAAGGCTGCAGTGGAACGCCTGTGCCACCCCTGTCCCTGGGAATGGACATTCTTCCAAGGAAACTGTTACTTCATGTCTAACTCCCAGCGGAACTGGCACGACTCCATCACCGCCTGCAAAGAAGTGGGGGCCCAGCTCGTCGTAATCAAAAGTGCTGAGGAGCAGAACTTCCTACAGCTGCAGTCTTCCAGAAGTAACCGCTTCACCTGGATGGGACTTTCAGATCTAAATCAGGAAGGCACGTGGCAATGGGTGGACGGCTCACCTCTGTTGCCCAGCTTCAAGCAGTATTGGAACAGAGGAGAGCCCAACAACGTTGGGGAGGAAGACTGCGCGGAATTTAGTGGCAATGGCTGGAACGACGACAAATGTAATCTTGCCAAATTCTGGATCTGCAAAAAGTCCGCAGCCTCCTGCTCCAGGGATGAAGAACAGTTTCTTTCTCCAGCCCCTGCCACCCCAAACCCCCCTCCTGCGTAG-3'
NCBI CDS gene sequence with introns (location: 7743039.. 7747511) (4473 bp)Download
NCBI CDS gene sequence with introns, 5'UTR and 3'UTR (location: 7739993.. 7747534) (7542 bp)Download
NCBI gene sequence (location: [7739993.. 7747534 + 1000]) (8542 bp)Download
Cite How to cite