NCBI Summary
This gene encodes a lung surfactant protein that is a member of a subfamily of C-type lectins called collectins. The encoded protein binds specific carbohydrate moieties found on lipids and on the surface of microorganisms. This protein plays an essential role in surfactant homeostasis and in the defense against respiratory pathogens. Mutations in this gene are associated with idiopathic pulmonary fibrosis. Alternate splicing results in multiple transcript variants. [provided by RefSeq, May 2010].
Protein
Protein (NP_005402)
SP-A1 - surfactant-associated protein A1
Pulmonary surfactant-associated protein A1 (PSP-A) (PSPA) (SP-A) (SP-A1) (35 kDa pulmonary surfactant-associated protein) (Alveolar proteinosis protein) (Collectin-4)
SFTPA1
surfactant protein A1
Undefined
Curated
C-type lectin
SFTPA1
C-type - Collectins
a/b mixed / C-type lectin-like
Man, Fuc, Gal / Glycosphingolipids
0.353
Protein sequence and protein families (fasta) (248 amino acids) Download
MWLCPLALNLILMAASGAVCEVKDVCVGSPGIPGTPGSHGLPGRDGRDGLKGDPGPPGPMGPPGEMPCPPGNDGLPGAPGIPGECGEKGEPGERGPPGLPAHLDEELQATLHDFRHQILQTRGALSLQGSIMTVGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIASFVKKYNTYAYVGLTEGPSPGDFRYSDGTPVNYTNWYRGEPAGRGKEQCVEMYTDGQWNDRNCLYSRLTICEF
No structure currently available in the PDB RCSB Databank.
Structural models
Model Confidence:
  •    Very high (pLDDT > 90)
  •    Confident (90 > pLDDT > 70)
  •    Low (70 > pLDDT > 50)
  •    Very low (pLDDT < 50)

  AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

SWISS-MODEL structural models
Modeller structural model (Homology modelling pipeline), Error: [0.86, 1.13] ÅDownload
The location of the lectin domain structural model is: 115-248
We infer [0.86, 1.13] Å as the interval of error of this structural model.
Template 1: 4WRE chain: A, P08427, NP_001257574.1, sequence identity: 70.1%, coverage: 100.0%, location in sequence: 108-248, (88-228 in PDB).
Template 2: 6BBD chain: A, Q9N1X4, NP_999275.1, sequence identity: 40.3%, coverage: 100.0%, location in sequence: 225-378, (205-358 in PDB).
Template 3: 6LFJ chain: A, Q9D8Q7, NP_081494.1, sequence identity: 25.4%, coverage: 92.5%, location in sequence: 42-174, (75-207 in PDB).
Show the alignment used for the construction of the structural model, Download.
Show the plot of DOPE energy score, Download.
Ligand
Glycan ligands from structural data
No crystal structures of complexes with glycan ligand.
References
NCBI References (10 PubMed Identifiers)
  • Human Surfactant Protein SP-A1 and SP-A2 Variants Differentially Affect the Alveolar Microenvironment, Surfactant Structure, Regulation and Function of the Alveolar Macrophage, and Animal and Human Survival Under Various Conditions. [34484180]
  • Functional assessment and phenotypic heterogeneity of SFTPA1 and SFTPA2 mutations in interstitial lung diseases and lung cancer. [32855221]
  • Differences in the alveolar macrophage toponome in humanized SP-A1 and SP-A2 transgenic mice. [33141765]
  • Specificity of lung surfactant protein SP-A for both the carbohydrate and the lipid moieties of certain neutral glycolipids. [1577827]
  • Characterization of a second human pulmonary surfactant-associated protein SP-A gene. [1372511]
  • An immunohistochemical study of bronchial cells producing surfactant protein A in the developing human fetal lung. [1935736]
  • Structural comparison of recombinant pulmonary surfactant protein SP-A derived from two human coding sequences: implications for the chain composition of natural human SP-A. [1986781]
  • The coding sequence for the 32,000-dalton pulmonary surfactant-associated protein A is located on chromosome 10 and identifies two separate restriction-fragment-length polymorphisms. [2884868]
  • Isolation and characterization of cDNA clones for the 35-kDa pulmonary surfactant-associated protein. [3755136]
  • Isolation and characterization of the human pulmonary surfactant apoprotein gene. [2995821]
UniProt Main References (12 PubMed Identifiers)
  • Human SP-A1 (SFTPA1) variant-specific 3' UTRs and poly(A) tail differentially affect the in vitro translation of a reporter gene. [20693318]
  • Complete sequencing and characterization of 21,243 full-length human cDNAs. [14702039]
  • The DNA sequence and comparative analysis of human chromosome 10. [15164054]
  • The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). [15489334]
  • Studies of the structure of lung surfactant protein SP-A. [2610270]
  • Genetics of the hydrophilic surfactant proteins A and D. [9813381]
  • Association between the surfactant protein A (SP-A) gene locus and respiratory-distress syndrome in the Finnish population. [10762543]
  • The Mycobacterium tuberculosis cell-surface glycoprotein apa as a potential adhesin to colonize target cells via the innate immune system pulmonary C-type lectin surfactant protein A. [17158455]
  • Identification of four novel DC-SIGN ligands on Mycobacterium bovis BCG. [21203928]
  • Surfactant protein A (SP-A)-mediated clearance of Staphylococcus aureus involves binding of SP-A to the staphylococcal adhesin eap and the macrophage receptors SP-A receptor 210 and scavenger receptor class A. [21123169]
  • Show more
All isoforms of this gene containing a lectin domain
NP_001158118.1, NP_001158117.1, NP_001158116.1, NP_001158119.1, NP_005402.3, XP_005270119.1, NP_001087239.2, XP_006718016.1
RNA
RNA (Transcript ID: NM_005411.5)
surfactant protein A1, transcript variant 1
m7G-5')ppp(5'-GACUUGGAGGCAGAGACCCAAGCAGCUGGAGGCUCUGUGUGUGGGUCGCUGAUUUCUUGGAGCCUGAAAAGAAAGUAACACAGCAGGGAUGAGGACAGAUGGUGUGAGUCAGUGAGAGCAGCGACUGGACCCAGAGCCAUGUGGCUGUGCCCUCUGGCCCUCAACCUCAUCUUGAUGGCAGCCUCUGGUGCUGUGUGCGAAGUGAAGGACGUUUGUGUUGGAAGCCCUGGUAUCCCCGGCACUCCUGGAUCCCACGGCCUGCCAGGCAGGGACGGGAGAGAUGGUCUCAAAGGAGACCCUGGCCCUCCAGGCCCCAUGGGUCCACCUGGAGAAAUGCCAUGUCCUCCUGGAAAUGAUGGGCUGCCUGGAGCCCCUGGUAUCCCUGGAGAGUGUGGAGAGAAGGGGGAGCCUGGCGAGAGGGGCCCUCCAGGGCUUCCAGCUCAUCUAGAUGAGGAGCUCCAAGCCACACUCCACGACUUUAGACAUCAAAUCCUGCAGACAAGGGGAGCCCUCAGUCUGCAGGGCUCCAUAAUGACAGUAGGAGAGAAGGUCUUCUCCAGCAAUGGGCAGUCCAUCACUUUUGAUGCCAUUCAGGAGGCAUGUGCCAGAGCAGGCGGCCGCAUUGCUGUCCCAAGGAAUCCAGAGGAAAAUGAGGCCAUUGCAAGCUUCGUGAAGAAGUACAACACAUAUGCCUAUGUAGGCCUGACUGAGGGUCCCAGCCCUGGAGACUUCCGCUACUCAGACGGGACCCCUGUAAACUACACCAACUGGUACCGAGGGGAGCCCGCAGGUCGGGGAAAAGAGCAGUGUGUGGAGAUGUACACAGAUGGGCAGUGGAAUGACAGGAACUGCCUGUACUCCCGACUGACCAUCUGUGAGUUCUGAGAGGCAUUUAGGCCAUGGGACAGGGAGGACGCUCUCUGGCCUUCGGCCUCCAUCCUGAGGCUCCACUUGGUCUGUGAGAUGCUAGAACUCCCUUUCAACAGAAUUCACUUGUGGCUAUUGGGACUGGAGGCACCCUUAGCCACUUCAUUCCUCUGAUGGGCCCUGACUCUUCCCCAUAAUCACUGACCAGCCUUGACACUCCCCUUGCAAACUCUCCCAGCACUGCACCCCAGGCAGCCACUCUUAGCCUUGGCCUUCGACAUGAGAUGGAGCCCUCCUUAUUCCCCAUCUGGUCCAGUUCCUUCACUUACAGAUGGCAGCAGUGAGGUCUUGGGGUAGAAGGACCCUCCAAAGUCACACAAAGUGCCUGCCUCCUGGUCCCCUCAGCUCUCUCUCUGCAACCCAGUGCCAUCAGGAUGAGCAAUCCUGGCCAAGCAUAAUGACAGAGAGAGGCAGACUUCGGGGAAGCCCUGACUGUGCAGAGCUAAGGACACAGUGGAGAUUCUCUGGCACUCUGAGGUCUCUGUGGCAGGCCUGGUCAGGCUCUCCAUGAGGUUAGAAGGCCAGGUAGUGUUCCAGCAGGGUGGUGGCCAAGCCAACCCCAUGAUUGAUGUGUACGAUUCACUCCUUUGAGUCUUUGAAUGGCAACUCAGCCCCCUGACCUGAAGACAGCCAGCCUAGGCCUCUAGGGUGACCUAGAGCCGCCUUCAGAUGUGACCCGAGUAACUUUCAACUGAUGAACAAAUCUGCACCCUACUUCAGAUUUCAGUGGGCAUUCACACCACCCCCCACACCACUGGCUCUGCUUUCUCCUUUCAUUAAUCCAUUCACCCAGAUAUUUCAUUAAAAUUAUCACGUGCCAGGUCUUAGGAUAUGUCGUGGGGUGGGCAAGGUAAUCAGUGACAGUUGAAGAUUUUUUUUUCCCAGAGCUUAUGUCUUCAUCUGUGAAAUGGGAAUAAGAUACUUGUUGCUGUCACAGUUAUUACCAUCCCCCCAGCUACCAAAAUUACUACCAGAACUGUUACUAUACACAGAGGCUAUUGACUGAGCACCUAUCAUUUGCCAAGAACCUUGACAAGCACUUCUAAUACAGCAUAUUAUGUACUAUUCAAUCUUUACACAAUGUCACGGGACCAGUAUUGUUUCCUCAUUUUUUAUAAGGACACUGAAGCUUGGAGGAGUUAAAUGUUUUGAGUAUUAUUCCAGAGAGCAAGUGGCAGAGGCUGGAUCCAAACCCAUCUUCCUGGACCUGAAGCUUAUGCUUCCAGCCACCCCACUCCUGAGCUGAAUAAAGAUGAUUUAAGCUUAAUAAAUCGUGAAUGUGUUCACA-3'- Poly-A tail
  • Coding region
;
DNA
DNA (Gene ID: 653509)
surfactant protein A1
strand +
SP-A, SP-A1, COLEC4
NCBI CDS gene sequence (747 bp)
5'-ATGTGGCTGTGCCCTCTGGCCCTCAACCTCATCTTGATGGCAGCCTCTGGTGCTGTGTGCGAAGTGAAGGACGTTTGTGTTGGAAGCCCTGGTATCCCCGGCACTCCTGGATCCCACGGCCTGCCAGGCAGGGACGGGAGAGATGGTCTCAAAGGAGACCCTGGCCCTCCAGGCCCCATGGGTCCACCTGGAGAAATGCCATGTCCTCCTGGAAATGATGGGCTGCCTGGAGCCCCTGGTATCCCTGGAGAGTGTGGAGAGAAGGGGGAGCCTGGCGAGAGGGGCCCTCCAGGGCTTCCAGCTCATCTAGATGAGGAGCTCCAAGCCACACTCCACGACTTTAGACATCAAATCCTGCAGACAAGGGGAGCCCTCAGTCTGCAGGGCTCCATAATGACAGTAGGAGAGAAGGTCTTCTCCAGCAATGGGCAGTCCATCACTTTTGATGCCATTCAGGAGGCATGTGCCAGAGCAGGCGGCCGCATTGCTGTCCCAAGGAATCCAGAGGAAAATGAGGCCATTGCAAGCTTCGTGAAGAAGTACAACACATATGCCTATGTAGGCCTGACTGAGGGTCCCAGCCCTGGAGACTTCCGCTACTCAGACGGGACCCCTGTAAACTACACCAACTGGTACCGAGGGGAGCCCGCAGGTCGGGGAAAAGAGCAGTGTGTGGAGATGTACACAGATGGGCAGTGGAATGACAGGAACTGCCTGTACTCCCGACTGACCATCTGTGAGTTCTGA-3'
NCBI CDS gene sequence with introns (location: 79611826.. 79614113) (2288 bp)Download
NCBI CDS gene sequence with introns, 5'UTR and 3'UTR (location: 79610939.. 79615443) (4505 bp)Download
NCBI gene sequence (location: [79610939 - 1000].. 79615443) (5505 bp)Download
Cite How to cite