NCBI Summary
This gene is one of several genes encoding pulmonary-surfactant associated proteins (SFTPA) located on chromosome 10. Mutations in this gene and a highly similar gene located nearby, which affect the highly conserved carbohydrate recognition domain, are associated with idiopathic pulmonary fibrosis. The current version of the assembly displays only a single centromeric SFTPA gene pair rather than the two gene pairs shown in the previous assembly which were thought to have resulted from a duplication. [provided by RefSeq, Sep 2009].
Protein
Protein (NP_001092138)
SP-A2 / surfactant-associated protein A2
Pulmonary surfactant-associated protein A2 (PSP-A) (PSPA) (SP-A) (SP-A2) (35 kDa pulmonary surfactant-associated protein) (Alveolar proteinosis protein) (Collectin-5)
SFTPA2
surfactant protein A2
Undefined
Very low evidence
C-type lectin
SFTPA2
C-type - Collectins
a/b mixed / C-type lectin-like
Uncharacterised
0.353
Undefined
Protein sequence and protein families (fasta) (248 amino acids) Download
MWLCPLALTLILMAASGAACEVKDVCVGSPGIPGTPGSHGLPGRDGRDGVKGDPGPPGPMGPPGETPCPPGNNGLPGAPGVPGERGEKGEAGERGPPGLPAHLDEELQATLHDFRHQILQTRGALSLQGSIMTVGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIASFVKKYNTYAYVGLTEGPSPGDFRYSDGTPVNYTNWYRGEPAGRGKEQCVEMYTDGQWNDRNCLYSRLTICEF
No structure currently available in the PDB RCSB Databank.
Structural models
Model Confidence:
  •    Very high (pLDDT > 90)
  •    Confident (90 > pLDDT > 70)
  •    Low (70 > pLDDT > 50)
  •    Very low (pLDDT < 50)

  AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

SWISS-MODEL structural models
Modeller structural model (Homology modelling pipeline), Error: [0.86, 1.13] ÅDownload
The location of the lectin domain structural model is: 115-248
We infer [0.86, 1.13] Å as the interval of error of this structural model.
Template 1: 4WRE chain: A, P08427, NP_001257574.1, sequence identity: 70.1%, coverage: 100.0%, location in sequence: 108-248, (88-228 in PDB).
Template 2: 6BBD chain: A, Q9N1X4, NP_999275.1, sequence identity: 40.3%, coverage: 100.0%, location in sequence: 225-378, (205-358 in PDB).
Template 3: 6LFJ chain: A, Q9D8Q7, NP_081494.1, sequence identity: 25.4%, coverage: 92.5%, location in sequence: 42-174, (75-207 in PDB).
Show the alignment used for the construction of the structural model, Download.
Show the plot of DOPE energy score, Download.
Ligand
Glycan ligands from structural data
No crystal structures of complexes with glycan ligand.
References
NCBI References (10 PubMed Identifiers)
  • Human Surfactant Protein SP-A1 and SP-A2 Variants Differentially Affect the Alveolar Microenvironment, Surfactant Structure, Regulation and Function of the Alveolar Macrophage, and Animal and Human Survival Under Various Conditions. [34484180]
  • Identification of a Missense Mutation in the Surfactant Protein A2 Gene in a Chinese Family with Interstitial Lung Disease. [33181027]
  • Functional assessment and phenotypic heterogeneity of SFTPA1 and SFTPA2 mutations in interstitial lung diseases and lung cancer. [32855221]
  • Differences in the alveolar macrophage toponome in humanized SP-A1 and SP-A2 transgenic mice. [33141765]
  • Identification and functional characterization of a novel surfactant protein A2 mutation (p.N207Y) in a Chinese family with idiopathic pulmonary fibrosis. [32602668]
  • A scan of chromosome 10 identifies a novel locus showing strong association with late-onset Alzheimer disease. [16385451]
  • Pulmonary Fibrosis, Familial [20301408]
  • Characterization of a second human pulmonary surfactant-associated protein SP-A gene. [1372511]
  • Studies of the structure of lung surfactant protein SP-A. [2610270]
  • Isolation and characterization of cDNA clones for the 35-kDa pulmonary surfactant-associated protein. [3755136]
UniProt Main References (5 PubMed Identifiers)
  • Human SP-A1 (SFTPA1) variant-specific 3' UTRs and poly(A) tail differentially affect the in vitro translation of a reporter gene. [20693318]
  • The DNA sequence and comparative analysis of human chromosome 10. [15164054]
  • The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). [15489334]
  • Genetics of the hydrophilic surfactant proteins A and D. [9813381]
  • Genetic defects in surfactant protein A2 are associated with pulmonary fibrosis and lung cancer. [19100526]
All isoforms of this gene containing a lectin domain
NP_001092138.1, NP_001307742.1, XP_005270189.1, XP_011538426.1, XP_011538427.1, XP_016872097.1, NP_001307743.1, XP_005270185.1
RNA
RNA (Transcript ID: NM_001098668.4)
surfactant protein A2, transcript variant 1
m7G-5')ppp(5'-AACUUGGAGGCAGAGACCCAAGCAGCUGGAGGCUCUGUGUGUGGGUCGCUGAUUUCUUGGAGCCUGAAAAGAAGGAGCAGCGACUGGACCCAGAGCCAUGUGGCUGUGCCCUCUGGCCCUCACCCUCAUCUUGAUGGCAGCCUCUGGUGCUGCGUGCGAAGUGAAGGACGUUUGUGUUGGAAGCCCUGGUAUCCCCGGCACUCCUGGAUCCCACGGCCUGCCAGGCAGGGACGGGAGAGAUGGUGUCAAAGGAGACCCUGGCCCUCCAGGCCCCAUGGGUCCGCCUGGAGAAACACCAUGUCCUCCUGGGAAUAAUGGGCUGCCUGGAGCCCCUGGUGUCCCUGGAGAGCGUGGAGAGAAGGGGGAGGCUGGCGAGAGAGGCCCUCCAGGGCUUCCAGCUCAUCUAGAUGAGGAGCUCCAAGCCACACUCCACGACUUCAGACAUCAAAUCCUGCAGACAAGGGGAGCCCUCAGUCUGCAGGGCUCCAUAAUGACAGUAGGAGAGAAGGUCUUCUCCAGCAAUGGGCAGUCCAUCACUUUUGAUGCCAUUCAGGAGGCAUGUGCCAGAGCAGGCGGCCGCAUUGCUGUCCCAAGGAAUCCAGAGGAAAAUGAGGCCAUUGCAAGCUUCGUGAAGAAGUACAACACAUAUGCCUAUGUAGGCCUGACUGAGGGUCCCAGCCCUGGAGACUUCCGCUACUCAGAUGGGACCCCUGUAAACUACACCAACUGGUACCGAGGGGAGCCUGCAGGUCGGGGAAAAGAGCAGUGUGUGGAGAUGUACACAGAUGGGCAGUGGAAUGACAGGAACUGCCUGUACUCCCGACUGACCAUCUGUGAGUUCUGAGAGGCAUUUAGGCCAUGGGACAGGGAGGAUCCUGUCUGGCCUUCAGUUUCCAUCCCCAGGAUCCACUUGGUCUGUGAGAUGCUAGAACUCCCUUUCAACAGAAUUCACUUGUGGCUAUUAGAGCUGGAGGCACCCUUAGCCACUUCAUUCCCCUGAUGGGCCCUGACUCUUCCCCAUAAUCACUGACCAGCCUUGACACUCCCCUUGCAAACCAUCCCAGCACUGCACCCCAGGCAGCCACUCCUAGCCUUGGCCUUUGGCAUGAGAUGGAGGCCUCCUUAUUCCCCAUCUGGUCCAGUUCCUUCACUUACAGAUGGCAGCAGUGAGGCCUUGGGGUAGAAGGAUCCUCCAAAGUCACACAGAGUGCCUGCCUCCUGGUCCCCUCAGCUCUGCCUCUGCAGCCCACUGCCUGCCCAGUGCCAUCAGGAUGAGCAGUACCGGCCAAGCAUAAUGACAGAGAGAGGCAGAUUUCAGGGAAGCCCUGACUGUGUGGAGCUAAGGACACAGUGGAGAUUCUCUGGCACUCUGAGGUCUCUGUGGCAGGCCUGGUCAGGCUCUCCAGGUGGUCAGAGGGCCCAGUGGUGCCCCAGCACGGUGGUGCCCAAGCCAACCCUGUGACUGACAUGUACGAUUCACUCCUUUGAGUCUUUGGAUGCCAACUCAGCCCCCUGACCUGGAGGCAGCCGGCCAAGGCCUCUAGGGAAGAGCCCCCCACUGCAGACAUGACCCGAGUAACUUUCUGCUGAUGAACAAAUCUGCACCCCACUUCAGACCUCGGUGGGCAUUCACACCACCCCCCAUGCCACCGGCUCCACUUUCCCCUUUUAUUAAUACAUUCACCCAGAUAAUCAUUAAAAUUAACAUGUGCCAGGUCUUAGGAUGUGUCUUGGGGUGGGCACAGUACCCGGUGACUCUUGGGGAUAUUUAUUUAUUUUCCCUGAGCCUAUAUCUUCAUCUGUGAAAUGGGGAUAAAAAUACUUGUUGCUGUCACAAUUAUUACCAUCUCUCCAGCUAGCAAAAUUACUACCAGAGCCGUUACUACACACAAAGGCUAUUGACCGAGCACAUACCAUGUGCCACACACCUUGACAAAAUCUUUUAAUACAGUUUAUUAUGUACUAUUCAAUCUUUACACAAUGUCACGGGACCAGUAUUGUUUACCCAAUUUUUUAUAAGGACACUGAAGCUUAGAGGAGUGAAAUGUUUUGAGUGUUAUUUCAGAGAGCAAAUGGCAAAGACUGGAUCCAAACCCAUCUUCCUGGACCUGAAGUUCAUGCUCCCAGCCACCCCACCCCUGAGCUGAAUAAAGAUGAUUUAAGCAUAAUAAAUCGUUAGUGUGUUCACAUGAGUUUCCAUA-3'- Poly-A tail
  • Coding region
;
DNA
DNA (Gene ID: 729238)
surfactant protein A2
strand -
SP-A2, COLEC5
NCBI CDS gene sequence (747 bp)
5'-ATGTGGCTGTGCCCTCTGGCCCTCACCCTCATCTTGATGGCAGCCTCTGGTGCTGCGTGCGAAGTGAAGGACGTTTGTGTTGGAAGCCCTGGTATCCCCGGCACTCCTGGATCCCACGGCCTGCCAGGCAGGGACGGGAGAGATGGTGTCAAAGGAGACCCTGGCCCTCCAGGCCCCATGGGTCCGCCTGGAGAAACACCATGTCCTCCTGGGAATAATGGGCTGCCTGGAGCCCCTGGTGTCCCTGGAGAGCGTGGAGAGAAGGGGGAGGCTGGCGAGAGAGGCCCTCCAGGGCTTCCAGCTCATCTAGATGAGGAGCTCCAAGCCACACTCCACGACTTCAGACATCAAATCCTGCAGACAAGGGGAGCCCTCAGTCTGCAGGGCTCCATAATGACAGTAGGAGAGAAGGTCTTCTCCAGCAATGGGCAGTCCATCACTTTTGATGCCATTCAGGAGGCATGTGCCAGAGCAGGCGGCCGCATTGCTGTCCCAAGGAATCCAGAGGAAAATGAGGCCATTGCAAGCTTCGTGAAGAAGTACAACACATATGCCTATGTAGGCCTGACTGAGGGTCCCAGCCCTGGAGACTTCCGCTACTCAGATGGGACCCCTGTAAACTACACCAACTGGTACCGAGGGGAGCCTGCAGGTCGGGGAAAAGAGCAGTGTGTGGAGATGTACACAGATGGGCAGTGGAATGACAGGAACTGCCTGTACTCCCGACTGACCATCTGTGAGTTCTGA-3'
NCBI CDS gene sequence with introns (location: 79557209.. 79559483) (2275 bp)Download
NCBI CDS gene sequence with introns, 5'UTR and 3'UTR (location: 79555852.. 79560407) (4556 bp)Download
NCBI gene sequence (location: [79555852.. 79560407 + 1000]) (5556 bp)Download
Cite How to cite