NCBI Summary
This locus represents naturally occurring transcripts that splice the 5' exons of the POC1B (POC1 centriolar protein homolog B) gene on chromosome 12 to the GALNT4 (UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 4) gene, which is located within a POC1B intron. Alternative splicing results in two transcript variants, one of which encodes a fusion isoform that shares sequence identity with the products of each individual gene. [provided by RefSeq, Dec 2010].
Protein
Protein (NP_001186710)
CBM13-ppGalNAc-T4
Polypeptide N-acetylgalactosaminyltransferase 4 (EC 2.4.1.41) (Polypeptide GalNAc transferase 4) (GalNAc-T4) (pp-GaNTase 4) (Protein-UDP acetylgalactosaminyltransferase 4) (UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 4)
POC1B-GALNT4
POC1B-GALNT4 readthrough
Undefined
Curated
Ricin-like
R-Type Lectins
b-trefoil
Undefined
Undefined
Protein sequence and protein families (fasta) (575 amino acids) Download
MASATEDPVLERYFKGHKAAITSLDLSPNGKQLGAGRARELGSRRLSDLQKNTEDLSRPLYKKPPADSRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDRVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEGRPDVQMRTCDALDKNQIWSFEK
Mol* PDB structure viewerUniLectin3D
Structural models
Model Confidence:
  •    Very high (pLDDT > 90)
  •    Confident (90 > pLDDT > 70)
  •    Low (70 > pLDDT > 50)
  •    Very low (pLDDT < 50)

  AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.


Ligand
Glycan ligands from structural data
GalNAc, Glycopeptide
GalNAc
References
NCBI References (1 PubMed Identifiers)
  • Expression of conjoined genes: another mechanism for gene regulation in eukaryotes. [20967262]
UniProt Main References (6 PubMed Identifiers)
  • Cloning of a human UDP-N-acetyl-alpha-D-Galactosamine:polypeptide N-acetylgalactosaminyltransferase that complements other GalNAc-transferases in complete O-glycosylation of the MUC1 tandem repeat. [9804815]
  • Complete sequencing and characterization of 21,243 full-length human cDNAs. [14702039]
  • The finished DNA sequence of human chromosome 12. [16541075]
  • The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). [15489334]
  • The lectin domain of UDP-N-acetyl-D-galactosamine: polypeptide N-acetylgalactosaminyltransferase-T4 directs its glycopeptide specificities. [10984485]
  • The interdomain flexible linker of the polypeptide GalNAc transferases dictates their long-range glycosylation preferences. [29208955]
All isoforms of this gene containing a lectin domain
NP_001186711.1, NP_001186710.1
RNA
RNA (Transcript ID: NM_001199781.2)
POC1B-GALNT4 readthrough, transcript variant 1
m7G-5')ppp(5'-GACUGGGCCGCCGUUGUUGUAGUGACCGGGAGGCGCUGCCGCUCUGGGCAGACGGUUCCGGGAGCCGCACGGUCCCCUCUCCUUCCCCAUCCUCUCCCCUCCCCUCUCCGGGUUCCCCCACCCACAGGAGCCUUGGGCCGACCACUCCCCCGAUGGCCUCAGCCACGGAGGACCCCGUUCUGGAGCGUUAUUUCAAAGGCCACAAAGCUGCGAUCACCUCCUUGGACCUCAGCCCCAACGGCAAGCAACUUGGAGCCGGCCGUGCCAGGGAGCUGGGGUCAAGAAGGCUCUCAGACCUCCAGAAAAAUACGGAGGAUUUGUCUCGACCGCUUUAUAAGAAGCCCCCUGCAGAUUCCCGUGCACUUGGGGAGUGGGGGAAAGCCAGCAAACUCCAGCUCAACGAGGAUGAACUGAAGCAGCAAGAAGAACUCAUUGAGAGAUACGCCAUCAAUAUUUACCUCAGUGACAGGAUUUCCCUGCAUCGACACAUAGAGGAUAAAAGAAUGUAUGAGUGUAAGUCCCAGAAGUUCAACUAUAGGACACUUCCUACCACCUCUGUUAUCAUUGCUUUCUAUAACGAAGCCUGGUCGACUUUGCUCCGUACCAUUCACAGUGUUUUAGAAACUUCUCCUGCAGUUCUUUUGAAAGAGAUCAUCUUGGUGGAUGACUUGAGUGACAGAGUUUAUUUGAAGACACAACUUGAAACUUACAUCAGCAAUCUUGAUAGAGUACGCUUGAUUAGGACCAAUAAGCGAGAGGGGCUGGUUAGGGCCCGUCUGAUUGGGGCCACUUUCGCCACUGGGGACGUCCUCACUUUCCUGGAUUGUCACUGUGAGUGUAAUUCCGGUUGGCUGGAACCGCUUUUGGAAAGGAUUGGGAGAGAUGAAACAGCAGUUGUGUGUCCUGUUAUAGACACAAUUGAUUGGAAUACUUUUGAAUUCUAUAUGCAGAUAGGGGAGCCCAUGAUUGGUGGGUUUGACUGGCGUUUAACAUUUCAGUGGCAUUCUGUCCCCAAACAGGAAAGGGACAGGCGGAUAUCAAGAAUUGACCCCAUCAGAUCACCUACCAUGGCUGGAGGACUGUUUGCUGUCAGCAAGAAAUAUUUUCAGUACCUUGGAACGUAUGACACAGGAAUGGAAGUGUGGGGAGGUGAAAACCUUGAGCUGUCUUUUAGGGUGUGGCAGUGUGGUGGCAAAUUGGAGAUCCACCCGUGUUCCCACGUGGGCCAUGUGUUCCCCAAGCGGGCACCAUAUGCUCGCCCCAAUUUCCUACAGAAUACUGCUCGGGCAGCAGAAGUUUGGAUGGAUGAAUACAAAGAGCACUUCUACAAUAGAAACCCUCCAGCAAGAAAAGAAGCUUAUGGUGAUAUUUCUGAAAGAAAAUUACUACGAGAGCGGUUGAGAUGCAAGAGCUUUGACUGGUAUUUGAAAAACGUUUUUCCUAAUUUACAUGUUCCAGAGGAUAGACCAGGCUGGCAUGGGGCUAUUCGCAGUAGAGGGAUCUCGUCUGAAUGUUUAGAUUAUAAUUCUCCUGACAACAACCCCACAGGUGCUAACCUUUCACUGUUUGGAUGCCAUGGUCAAGGAGGCAAUCAAUUCUUUGAAUAUACUUCAAACAAAGAAAUAAGGUUUAAUUCUGUGACAGAGUUAUGUGCAGAGGUACCUGAGCAAAAAAAUUAUGUGGGAAUGCAAAAUUGUCCCAAAGAUGGGUUCCCUGUACCAGCAAACAUUAUUUGGCAUUUUAAAGAAGAUGGAACUAUUUUUCACCCACACUCAGGACUGUGUCUUAGUGCUUAUCGGACACCGGAGGGCCGACCUGAUGUACAAAUGAGAACUUGUGAUGCUCUAGAUAAAAAUCAAAUUUGGAGUUUUGAGAAAUAGAGCACAACAGCACUUUCGUCAUGAGCUGACAGUAGUGUCAAGAAAGUCAAAGAGCCUUAAGAGCCUCAGUGAAGAUUGUAUUUUAUUUUAUCAAAAGCCACCUAGCAGUCAUCUGUGGAGCACUGGAAAGCUGGGGUUCAUUUUGGUAUAUCACACUGAAACUGGGUACCCAGAGUGCUGCUGUUUAAUAUUUCACAAUGCCUUACUUAUUGGUUGUUUUAUAUAAGAGUUUUGUCAAUAUGGUCUCUUCUUAAAAGAAGUUGACUAUGAAUUGAAACACACAAAACAUUUAAGUGCCAGACUUAAUAUUAAAGAAUGUAAAGGUCCAAGUAAAAUGAGGUAUGAUUUAUGUUGAUGUGUAAGUUCACCGCACAUCCCACUUUUUAACAAAACUCAUGAAUGUGCAGUUUGAGCCAUUGCUAUUUUGAUUACAUAGAAUUUGUAUUUCUUUUUUAGCCAGCACAUUAAAUUUUAGAUUUUAUUUUUUAAUCUAAUUUUUUUCUAAUCAAAAAGAAAAUUGAGCUUAAGGCAAAAGGCCUGGUUUUAGAGAUAUGUGUAAUUGGAAGAGGGCAUUUGUUUGAGUGUGAGUUUGGAGGCCUUUUUAACAUGCAGACAUACCCAUAUUUAAAUGAAAUGGGGAGAUAUUUACAUUCCGUACUUUGUAAACUUGAGCUAUUGGACUUCACUGAUGUAUAUAUUAAUACCUCAGAUUCCUCUGAUUUUGUAAGCUGUCUUCUCUGUGAACGUGUUUGUGUGUGUAGGGCAUUUUCUGAUUGCACUUCCUUAAGUUAUGAAUGUACUAGAAAGGGACUCAUCCAGAAUACUAUGCCUCCCUUUGUUAAUGCUUAAUCAUUUAAAGUAAACACAAUUGAAGCCUCUCUGAAGUUAAACCCAACUAUGUUUAUUAAAAUGUGUGAAACUGAAAGUGGGCUAGGUUCUACCAAGGCUGUGGAACUCUCCUACGAGUUCUGCUGAUCAGGAAAUUUAAGAAUUUAUCUUAAAAAUGCAAGGAAAAAAGACUGCCUUGGCAAUUGUGAAUGGUGCUUUCAAUCUCCUAGCACCGAGCCUGGCACUUAGGCAGCUUUCAGUAAGUGGGUGAAUGAAUGACUGAAUGAAUGAAUGAAUGGCUCAGCUGAGGAAUGUAACUUUGGUCAAGUUAUUAUGAUGUGUUUGGGCUUAGUUUUCUCAUUGGUAAAAUGUGGGUGCUGGAUUGGAUCUUAAAGAUCCCUUCCAGCUCUGAAAUGCUGAUUGUACAGUAUAUUCUUCCCAGAUUGACUCACUGUGCAAUCUUUACAAUACUUUUUAUCUUUUCACUUUUGACAUAGGUAAUGUUGUUGAGCAGUUGAGCAAUGUUCAGUCCAGUUGUGAAGCUGGAGAAGAGAAAUGGGUUUUAAAAAUUAAGUGAGGGGAGGCCGGGUGCGGUGGCUCACGCAUGUAAUCCCAGCACUUUGGGAGGCCAAGGCAGGUGGAUCACGAGGUCAGGAGAUCCAGACCAUCCUGGCUAACAUGGUGAAACCUCGUCUCUACUAAAAAUACAAAAAAUUAGCCAGCUGUGGUGGCGGGCGCCUGUAGUCCCAGCUACUCAGGAGGCUGAGGCAGGAGAAUGGCGUGAACCUGUGAGGCAGAGCUUACAGUGAGCCGAGAUCGUGCCACUGCACUCCAGCCUGGGCGACAGAGCAAGACUCUGUCUCAAAAAAAAAAAAUAAUAAUAAAAUAAGUGAGCUGAACUCACCUGAAGUGGUUUACUUCUGUGGGUUAAGAAGUUCUAGUCAGUGUUCAUAGUCGUUUCGUUUUGAUAAUUGUUGAACCAAUUUUGUUUUUAAAACCUUUAGACUCUGAAAGUAAUAUUUUGACUAAGAAUGUAAAUAUUUCCAAACUAAAUUACUCGGGAAGUAAACGCUUUUUUUAAAAGUAUUUUUACUGGUUUUAUACCAAUAUUAUAUGCAGAAAUCACAGGAUGAAUUUAGAAUUAAAUCUCAAUUAGUUCACUUUGGCCUAGAUUUAUGAAAAAUGCAUGCCUCGUAAAGAGUCCACUGUAUUCACGAGUAAAGUUGCUUUUAGUGUUCACUUGAUGACUUGGAGAGUAGGAAUUUUGCAAAAUCUGAAUUUAAGGAAAUUCUUUAGGAUAACCAUUUCAAAAAAUAAAAUUGCUAUGCAAUCUUGAAUAUUUUCUCUUUUGCCUCGUAAAAUGAAAAUGCAUUCACAGUUUCUGUAAAUUAUUUAGCAGCCUUAAAGUUUAUCAAAAAAUUGUCCAGAUUCCACGUGCAGCAUGCUUGGCCCUGCAUUUAAUUUAAGAAGGAUUAAUAAUAAUGCUCUGAAUUUUUCGAAAGGGAUUCUCCUAAACCCACCCACUUCUCUUGCCCAGGCUGCUUUUUAAAAAUAUUUUUUUAUUUUUUACUUAUUUUUAAAUUUUCUCUUUUUAUUUAUUUUUGGUUUUCUUGUUAGCCACCUGUUAUAUGGGAGAACGAAAAUUGUUAUAUUUUGAAAGUACUUAUUACAUUAUUUUUAUUUUAGUAUCUUGAUGCUCCUGUCAAAAGGGAAAUGAGGCUUUUAAAAAUAAAGUACCUUAAUUCUUUAUUGACUUUUUGCCCUAAAUUGCUAGGUGUGACCCAGCAAUCUUUUAGGAAGAGAUUUUACAGUGGUGCUUUAUUUAUAUCAAUAAUCCAGUAUAGUUAGGCUGUUCAUUCCUCAUAAUAGAGUACAUAACAGAAAAGUGGGACUUUCACAUUUUCAUAUUUAGGCACGUUCCAAUUUAAUUCCAAAAAUACUCUGUAAUUCUACAUCUAAAAAAACCGAUUCCCUAAUUCGAAUUUAUUGGUACCAAAGCUCUCUUUGGCUAUAGACAAUUAAGAGUUGACCUUUUAAGUUAAUGUAUAUGCUUAAAAACAGUUUUAGGAAAAUAUUUGGUAGACAAAGAGUUUCAACUUUAAAUGUUCACUAUGUCAUUUAGUGUCCAACUUUACGGAUAGGUUGACUAUCUAAAUAGGCAUUUUUAGUCAUUAAAAAAAAUCUAGUCACCAGGAGGAUCCCUAUAACUCAAAAUAACUUGUUUGUAAAAGAAAAUUUGUUUACUUACCCAUUAGUAAGUUCCUGCAUAUUCAUUAUAAGAUGGCAAAUCAAACUUUUCUAGGAUGAAGACAGCUUAUUUUUAAGUUGUAUAGUCUUAGUUGGUUUAGGGUCUCAAUUUUAAUUAAUAAAAUACUUGGUUUUUAUUUGCUUGUCCUUUUGAAUUCCUGUUUUAAUAAUUUUAAAAUGAGCACAAAGAACGUUGAAGUUCAGAUUAAUCUCUUCUGAAUGAUGUUUUUUUCCUCUGUGAUGAGUUGUUUCUGACUUUUUUCCUUUUGUAUUUGUAAUGUUGAUUAAGAUGUAAAAUAAAAAGUGUGCCUGAUUAUUUUUGCAAA-3'- Poly-A tail
  • Coding region
;
DNA
POC1B-GALNT4 readthrough
strand -
NCBI CDS gene sequence (1728 bp)
5'-ATGGCCTCAGCCACGGAGGACCCCGTTCTGGAGCGTTATTTCAAAGGCCACAAAGCTGCGATCACCTCCTTGGACCTCAGCCCCAACGGCAAGCAACTTGGAGCCGGCCGTGCCAGGGAGCTGGGGTCAAGAAGGCTCTCAGACCTCCAGAAAAATACGGAGGATTTGTCTCGACCGCTTTATAAGAAGCCCCCTGCAGATTCCCGTGCACTTGGGGAGTGGGGGAAAGCCAGCAAACTCCAGCTCAACGAGGATGAACTGAAGCAGCAAGAAGAACTCATTGAGAGATACGCCATCAATATTTACCTCAGTGACAGGATTTCCCTGCATCGACACATAGAGGATAAAAGAATGTATGAGTGTAAGTCCCAGAAGTTCAACTATAGGACACTTCCTACCACCTCTGTTATCATTGCTTTCTATAACGAAGCCTGGTCGACTTTGCTCCGTACCATTCACAGTGTTTTAGAAACTTCTCCTGCAGTTCTTTTGAAAGAGATCATCTTGGTGGATGACTTGAGTGACAGAGTTTATTTGAAGACACAACTTGAAACTTACATCAGCAATCTTGATAGAGTACGCTTGATTAGGACCAATAAGCGAGAGGGGCTGGTTAGGGCCCGTCTGATTGGGGCCACTTTCGCCACTGGGGACGTCCTCACTTTCCTGGATTGTCACTGTGAGTGTAATTCCGGTTGGCTGGAACCGCTTTTGGAAAGGATTGGGAGAGATGAAACAGCAGTTGTGTGTCCTGTTATAGACACAATTGATTGGAATACTTTTGAATTCTATATGCAGATAGGGGAGCCCATGATTGGTGGGTTTGACTGGCGTTTAACATTTCAGTGGCATTCTGTCCCCAAACAGGAAAGGGACAGGCGGATATCAAGAATTGACCCCATCAGATCACCTACCATGGCTGGAGGACTGTTTGCTGTCAGCAAGAAATATTTTCAGTACCTTGGAACGTATGACACAGGAATGGAAGTGTGGGGAGGTGAAAACCTTGAGCTGTCTTTTAGGGTGTGGCAGTGTGGTGGCAAATTGGAGATCCACCCGTGTTCCCACGTGGGCCATGTGTTCCCCAAGCGGGCACCATATGCTCGCCCCAATTTCCTACAGAATACTGCTCGGGCAGCAGAAGTTTGGATGGATGAATACAAAGAGCACTTCTACAATAGAAACCCTCCAGCAAGAAAAGAAGCTTATGGTGATATTTCTGAAAGAAAATTACTACGAGAGCGGTTGAGATGCAAGAGCTTTGACTGGTATTTGAAAAACGTTTTTCCTAATTTACATGTTCCAGAGGATAGACCAGGCTGGCATGGGGCTATTCGCAGTAGAGGGATCTCGTCTGAATGTTTAGATTATAATTCTCCTGACAACAACCCCACAGGTGCTAACCTTTCACTGTTTGGATGCCATGGTCAAGGAGGCAATCAATTCTTTGAATATACTTCAAACAAAGAAATAAGGTTTAATTCTGTGACAGAGTTATGTGCAGAGGTACCTGAGCAAAAAAATTATGTGGGAATGCAAAATTGTCCCAAAGATGGGTTCCCTGTACCAGCAAACATTATTTGGCATTTTAAAGAAGATGGAACTATTTTTCACCCACACTCAGGACTGTGTCTTAGTGCTTATCGGACACCGGAGGGCCGACCTGATGTACAAATGAGAACTTGTGATGCTCTAGATAAAAATCAAATTTGGAGTTTTGAGAAATAG-3'
NCBI CDS gene sequence with introns (location: 89522813.. 89525895) (3083 bp)Download
NCBI CDS gene sequence with introns, 5'UTR and 3'UTR (location: 89519412.. 89526047) (6636 bp)Download
NCBI gene sequence (location: [89519412.. 89526047 + 1000]) (7636 bp)Download
How to cite: Schnider B., M'Rad Y., el Ahmadie J., de Brevern AG., Imberty A., Lisacek F., HumanLectome, an update of UniLectin for the annotation and prediction of human lectins, Nucleic Acids Reasearch doi.org/10.1093/nar/gkad905