Detail Information for IndEnz0002004584
IED ID IndEnz0002004584
Enzyme Type ID protease004584
Protein Name Mucin-5AC
MUC-5AC
Gastric mucin
Major airway glycoprotein
Mucin-5 subtype AC, tracheobronchial
Tracheobronchial mucin
TBM
Gene Name MUC5AC MUC5
Organism Homo sapiens (Human)
Taxonomic Lineage cellular organisms Eukaryota Opisthokonta Metazoa Eumetazoa Bilateria Deuterostomia Chordata Craniata Vertebrata Gnathostomata (jawed vertebrates) Teleostomi Euteleostomi Sarcopterygii Dipnotetrapodomorpha Tetrapoda Amniota Mammalia Theria Eutheria Boreoeutheria Euarchontoglires Primates Haplorrhini Simiiformes Catarrhini Hominoidea (apes) Hominidae (great apes) Homininae Homo Homo sapiens (Human)
Enzyme Sequence MSVGRRKLALLWALALALACTRHTGHAQDGSSESSYKHHPALSPIARGPSGVPLRGATVFPSLRTIPVVRASNPAHNGRVCSTWGSFHYKTFDGDVFRFPGLCNYVFSEHCGAAYEDFNIQLRRSQESAAPTLSRVLMKVDGVVIQLTKGSVLVNGHPVLLPFSQSGVLIQQSSSYTKVEARLGLVLMWNHDDSLLLELDTKYANKTCGLCGDFNGMPVVSELLSHNTKLTPMEFGNLQKMDDPTDQCQDPVPEPPRNCSTGFGICEELLHGQLFSGCVALVDVGSYLEACRQDLCFCEDTDLLSCVCHTLAEYSRQCTHAGGLPQDWRGPDFCPQKCPNNMQYHECRSPCADTCSNQEHSRACEDHCVAGCFCPEGTVLDDIGQTGCVPVSKCACVYNGAAYAPGATYSTDCTNCTCSGGRWSCQEVPCPGTCSVLGGAHFSTFDGKQYTVHGDCSYVLTKPCDSSAFTVLAELRRCGLTDSETCLKSVTLSLDGAQTVVVIKASGEVFLNQIYTQLPISAANVTIFRPSTFFIIAQTSLGLQLNLQLVPTMQLFMQLAPKLRGQTCGLCGNFNSIQADDFRTLSGVVEATAAAFFNTFKTQAACPNIRNSFEDPCSLSVENEKYAQHWCSQLTDADGPFGRCHAAVKPGTYYSNCMFDTCNCERSEDCLCAALSSYVHACAAKGVQLGGWRDGVCTKPMTTCPKSMTYHYHVSTCQPTCRSLSEGDITCSVGFIPVDGCICPKGTFLDDTGKCVQASNCPCYHRGSMIPNGESVHDSGAICTCTHGKLSCIGGQAPAPVCAAPMVFFDCRNATPGDTGAGCQKSCHTLDMTCYSPQCVPGCVCPDGLVADGEGGCITAEDCPCVHNEASYRAGQTIRVGCNTCTCDSRMWRCTDDPCLATCAVYGDGHYLTFDGQSYSFNGDCEYTLVQNHCGGKDSTQDSFRVVTENVPCGTTGTTCSKAIKIFLGGFELKLSHGKVEVIGTDESQEVPYTIRQMGIYLVVDTDIGLVLLWDKKTSIFINLSPEFKGRVCGLCGNFDDIAVNDFATRSRSVVGDVLEFGNSWKLSPSCPDALAPKDPCTANPFRKSWAQKQCSILHGPTFAACHAHVEPARYYEACVNDACACDSGGDCECFCTAVAAYAQACHEVGLCVSWRTPSICPLFCDYYNPEGQCEWHYQPCGVPCLRTCRNPRGDCLRDVRGLEGCYPKCPPEAPIFDEDKMQCVATCPTPPLPPRCHVHGKSYRPGAVVPSDKNCQSCLCTERGVECTYKAEACVCTYNGQRFHPGDVIYHTTDGTGGCISARCGANGTIERRVYPCSPTTPVPPTTFSFSTPPLVVSSTHTPSNGPSSAHTGPPSSAWPTTAGTSPRTRLPTASASLPPVCGEKCLWSPWMDVSRPGRGTDSGDFDTLENLRAHGYRVCESPRSVECRAEDAPGVPLRALGQRVQCSPDVGLTCRNREQASGLCYNYQIRVQCCTPLPCSTSSSPAQTTPPTTSKTTETRASGSSAPSSTPGTVSLSTARTTPAPGTATSVKKTFSTPSPPPVPATSTSSMSTTAPGTSVVSSKPTPTEPSTSSCLQELCTWTEWIDGSYPAPGINGGDFDTFQNLRDEGYTFCESPRSVQCRAESFPNTPLADLGQDVICSHTEGLICLNKNQLPPICYNYEIRIQCCETVNVCRDITRLPKTVATTRPTPHPTGAQTQTTFTTHMPSASTEQPTATSRGGPTATSVTQGTHTTLVTRNCHPRCTWTKWFDVDFPSPGPHGGDKETYNNIIRSGEKICRRPEEITRLQCRAKSHPEVSIEHLGQVVQCSREEGLVCRNQDQQGPFKMCLNYEVRVLCCETPRGCHMTSTPGSTSSSPAQTTPSTTSKTTETQASGSSAPSSTPGTVSLSTARTTPAPGTATSVKKTFSTPSPPPVPATSTSSMSTTAPGTSVVSSKPTPTEPSTSSCLQELCTWTEWIDGSYPAPGINGGDFDTFQNLRDE
GYTFCESPRSVQCRAESFPNTPLADLGQDVICSHTEGLICLNKNQLPPICYNYEIRIQCCETVNVCRDITRPPKTVATTRPTPHPTGAQTQTTFTTHMPSASTEQPTATSRGGPTATSVTQGTHTTPVTRNCHPRCTWTTWFDVDFPSPGPHGGDKETYNNIIRSGEKICRRPEEITRLQCRAKSHPEVSIEHLGQVVQCSREEGLVCRNQDQQGPFKMCLNYEVRVLCCETPKGCPVTSTPVTAPSTPSGRATSPTQSTSSWQKSRTTTLVTTSTTSTPQTSTTYAHTTSTTSAPTARTTSAPTTRTTSASPASTTSGPGNTPSPVPTTSTISAPTTSITSAPTTSTTSAPTSSTTSGPGTTPSPVPTTSITSAPTTSTTSAPTTSTTSARTSSTTSATTTSRISGPETTPSPVPTTSTTSATTTSTTSAPTTSTTSAPTSSTTSSPQTSTTSAPTTSTTSGPGTTPSPVPTTSTTSAPTTRTTSAPKSSTTSAATTSTTSGPETTPRPVPTTSTTSSPTTSTTSAPTTSTTSASTTSTTSGAGTTPSPVPTTSTTSAPTTSTTSAPISSTTSATTTSTTSGPGTTPSPVPTTSTTSAPTTSTTSGPGTTPSAVPTTSITSAPTTSTNSAPISSTTSATTTSRISGPETTPSPVPTASTTSASTTSTTSGPGTTPSPVPTTSTISVPTTSTTSASTTSTTSASTTSTTSGPGTTPSPVPTTSTTSAPTTSTTSAPTTSTISAPTTSTTSATTTSTTSAPTPRRTSAPTTSTISASTTSTTSATTTSTTSATTTSTISAPTTSTTLSPTTSTTSTTITSTTSAPISSTTSTPQTSTTSAPTTSTTSGPGTTSSPVPTTSTTSAPTTSTTSAPTTRTTSVPTSSTTSTATTSTTSGPGTTPSPVPTTSTTSAPTTRTTSAPTTSTTSAPTTSTTSAPTSSTTSATTTSTISVPTTSTTSVPGTTPSPVPTTSTISVPTTSTTSASTTSTTSGPGTTPSPVPTTSTTSAPTTSTTSAPTTSTISAPTTSTPSAPTTSTTLAPTTSTTSAPTTSTTSTPTSSTTSSPQTSTTSASTTSITSGPGTTPSPVPTTSTTSAPTTSTTSAATTSTISAPTTSTTSAPTTSTTSASTASKTSGLGTTPSPIPTTSTTSPPTTSTTSASTASKTSGPGTTPSPVPTTSTIFAPRTSTTSASTTSTTPGPGTTPSPVPTTSTASVSKTSTSHVSISKTTHSQPVTRDCHLRCTWTKWFDIDFPSPGPHGGDKETYNNIIRSGEKICRRPEEITRLQCRAESHPEVSIEHLGQVVQCSREEGLVCRNQDQQGPFKMCLNYEVRVLCCETPKGCPVTSTPVTAPSTPSGRATSPTQSTSSWQKSRTTTLVTTSTTSTPQTSTTSAPTTSTTSAPTTSTTSAPTTSTTSTPQTSISSAPTSSTTSAPTSSTISARTTSIISAPTTSTTSSPTTSTTSATTTSTTSAPTSSTTSTPQTSKTSAATSSTTSGSGTTPSPVTTTSTASVSKTSTSHVSVSKTTHSQPVTRDCHPRCTWTKWFDVDFPSPGPHGGDKETYNNIIRSGEKICRRPEEITRLQCRAKSHPEVSIEHLGQVVQCSREEGLVCRNQDQQGPFKMCLNYEVRVLCCETPKGCPVTSTSVTAPSTPSGRATSPTQSTSSWQKSRTTTLVTSSITSTTQTSTTSAPTTSTTPASIPSTTSAPTTSTTSAPTTSTTSAPTTSTTSTPQTTTSSAPTSSTTSAPTTSTISAPTTSTISAPTTSTTSAPTASTTSAPTSTSSAPTTNTTSAPTTSTTSAPITSTISAPTTSTTSTPQTSTISSPTTSTTSTPQTSTTSSPTTSTTSAPTTSTTSAPTTSTTSTPQTSISSAPTSSTTSAPTASTISAPTTSTTSFHTTSTTSPPTSSTSSTPQTSKTSAATSSTTSGSGTTPSPVPTTSTASVSKTSTSHVSVSKTTHSQPVTRDCHPRCTWTKWFDVDFPSPGPHGGDKETYNNII
RSGEKICRRPEEITRLQCRAESHPEVSIEHLGQVVQCSREEGLVCRNQDQQGPFKMCLNYEVRVLCCETPKGCPVTSTPVTAPSTPSGRATSPTQSTSSWQKSRTTTLVTTSTTSTPQTSTTSAPTTSTIPASTPSTTSAPTTSTTSAPTTSTTSAPTHRTTSGPTTSTTLAPTTSTTSAPTTSTNSAPTTSTISASTTSTISAPTTSTISSPTSSTTSTPQTSKTSAATSSTTSGSGTTPSPVPTTSTTSASTTSTTSAPTTSTTSGPGTTPSPVPSTSTTSAATTSTTSAPTTRTTSAPTSSMTSGPGTTPSPVPTTSTTSAPTTSTTSGPGTTPSPVPTTSTTSAPITSTTSGPGSTPSPVPTTSTTSAPTTSTTSASTASTTSGPGTTPSPVPTTSTTSAPTTRTTSASTASTTSGPGSTPSPVPTTSTTSAPTTRTTPASTASTTSGPGTTPSPVPTTSTTSASTTSTISLPTTSTTSAPITSMTSGPGTTPSPVPTTSTTSAPTTSTTSASTASTTSGPGTTPSPVPTTSTTSAPTTSTTSASTASTTSGPGTSLSPVPTTSTTSAPTTSTTSGPGTTPSPVPTTSTTSAPTTSTTSGPGTTPSPVPTTSTTPVSKTSTSHLSVSKTTHSQPVTSDCHPLCAWTKWFDVDFPSPGPHGGDKETYNNIIRSGEKICRRPEEITRLQCRAESHPEVNIEHLGQVVQCSREEGLVCRNQDQQGPFKMCLNYEVRVLCCETPRGCPVTSVTPYGTSPTNALYPSLSTSMVSASVASTSVASSSVASSSVAYSTQTCFCNVADRLYPAGSTIYRHRDLAGHCYYALCSQDCQVVRGVDSDCPSTTLPPAPATSPSISTSEPVTELGCPNAVPPRKKGETWATPNCSEATCEGNNVISLRPRTCPRVEKPTCANGYPAVKVADQDGCCHHYQCQCVCSGWGDPHYITFDGTYYTFLDNCTYVLVQQIVPVYGHFRVLVDNYFCGAEDGLSCPRSIILEYHQDRVVLTRKPVHGVMTNEIIFNNKVVSPGFRKNGIVVSRIGVKMYATIPELGVQVMFSGLIFSVEVPFSKFANNTEGQCGTCTNDRKDECRTPRGTVVASCSEMSGLWNVSIPDQPACHRPHPTPTTVGPTTVGSTTVGPTTVGSTTVGPTTPPAPCLPSPICQLILSKVFEPCHTVIPPLLFYEGCVFDRCHMTDLDVVCSSLELYAALCASHDICIDWRGRTGHMCPFTCPADKVYQPCGPSNPSYCYGNDSASLGALPEAGPITEGCFCPEGMTLFSTSAQVCVPTGCPRCLGPHGEPVKVGHTVGMDCQECTCEAATWTLTCRPKLCPLPPACPLPGFVPVPAAPQAGQCCPQYSCACNTSRCPAPVGCPEGARAIPTYQEGACCPVQNCSWTVCSINGTLYQPGAVVSSSLCETCRCELPGGPPSDAFVVSCETQICNTHCPVGFEYQEQSGQCCGTCVQVACVTNTSKSPAHLFYPGETWSDAGNHCVTHQCEKHQDGLVVVTTKKACPPLSCSLDEARMSKDGCCRFCPPPPPPYQNQSTCAVYHRSLIIQQQGCSSSEPVRLAYCRGNCGDSSSMYSLEGNTVEHRCQCCQELRTSLRNVTLHCTDGSSRAFSYTEVEECGCMGRRCPAPGDTQHSEEAEPEPSQEAESGSWERGVPVSPMH
Enzyme Length 5654
Uniprot Accession Number P98088
Absorption
Active Site
Activity Regulation
Binding Site
Calcium Binding
Catalytic Activity
DNA Binding
EC Number
Enzyme Function FUNCTION: Gel-forming glycoprotein of gastric and respiratory tract epithelia that protects the mucosa from infection and chemical damage by binding to inhaled microorganisms and particles that are subsequently removed by the mucociliary system (PubMed:14535999, PubMed:14718370). Interacts with H. pylori in the gastric epithelium, in Barrett's esophagus, and in gastric metaplasia of the duodenum (GMD) (PubMed:14535999). {ECO:0000269|PubMed:14535999, ECO:0000303|PubMed:14535999, ECO:0000303|PubMed:14718370}.
Temperature Dependency
pH Dependency
Pathway
Nucleotide Binding
Features Chain (1); Compositional bias (6); Disulfide bond (16); Domain (12); Erroneous termination (1); Frameshift (4); Glycosylation (58); Mutagenesis (2); Natural variant (1); Region (17); Repeat (9); Sequence conflict (82); Signal peptide (1); Site (1)
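The Features field follows a regular `Name (count)` pattern separated by semicolons, so it can be tallied programmatically. The following is a minimal sketch, assuming that layout holds for the whole field; the function name `parse_feature_counts` is hypothetical and not part of any database tooling.

```python
import re

# Value of the Features field, copied from this record.
FEATURES = (
    "Chain (1); Compositional bias (6); Disulfide bond (16); Domain (12); "
    "Erroneous termination (1); Frameshift (4); Glycosylation (58); "
    "Mutagenesis (2); Natural variant (1); Region (17); Repeat (9); "
    "Sequence conflict (82); Signal peptide (1); Site (1)"
)


def parse_feature_counts(field: str) -> dict[str, int]:
    """Split a 'Name (count); Name (count)' string into a name -> count mapping.

    Hypothetical helper: assumes every item ends with an integer in parentheses.
    """
    counts: dict[str, int] = {}
    for item in field.split(";"):
        item = item.strip()
        if not item:
            continue  # tolerate a trailing semicolon
        match = re.fullmatch(r"(.+?)\s*\((\d+)\)", item)
        if match is None:
            raise ValueError(f"Unexpected feature item: {item!r}")
        counts[match.group(1)] = int(match.group(2))
    return counts


feature_counts = parse_feature_counts(FEATURES)
```

Looking up `feature_counts["Glycosylation"]` then gives the number of annotated glycosylation sites without re-reading the raw string.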
Keywords 3D-structure;Direct protein sequencing;Disulfide bond;Glycoprotein;Reference proteome;Repeat;Secreted;Signal
Interact With
Induction
Subcellular Location SUBCELLULAR LOCATION: Secreted {ECO:0000269|PubMed:14718370}.
Modified Residue
Post Translational Modification PTM: C-, O- and N-glycosylated (PubMed:14718370). O-glycosylated on the second and last Thr of the Thr-/Ser-rich tandem repeats TTPSPVPTTSTTSA (PubMed:25939779, PubMed:14718370). One form of glycosylation is also known as Lewis B (LeB) blood group antigen, a tetrasaccharide consisting of N-acetylglucosamine having a fucosyl residue attached (PubMed:14535999). It acts as an epitope and antigen, functions as a receptor for H. pylori binding, and facilitates infection (PubMed:14535999). C-mannosylation in the Cys-rich subdomains may be required for proper folding of these regions and for export from the endoplasmic reticulum during biosynthesis (PubMed:14718370). {ECO:0000269|PubMed:14535999, ECO:0000269|PubMed:14718370, ECO:0000269|PubMed:25939779}.; PTM: Proteolytic cleavage in the C-terminal region is initiated early in the secretory pathway and does not involve a serine protease. The extent of cleavage is increased in the acidic parts of the secretory pathway. Cleavage generates a reactive group which could link the protein to a primary amide. {ECO:0000269|PubMed:16787389}.
Signal Peptide SIGNAL 1..27; /evidence=ECO:0000255
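The `1..27` range in the Signal Peptide field uses 1-based, inclusive residue positions, which map to a zero-based half-open slice in most programming languages. The sketch below illustrates that conversion on the first 40 residues of the Enzyme Sequence field; the helper names `parse_range` and `slice_inclusive` are hypothetical, chosen only for this example.

```python
import re

# First 40 residues of the Enzyme Sequence field in this record.
SEQ_PREFIX = "MSVGRRKLALLWALALALACTRHTGHAQDGSSESSYKHHP"
# Value of the Signal Peptide field in this record.
SIGNAL_FIELD = "SIGNAL 1..27; /evidence=ECO:0000255"


def parse_range(field: str) -> tuple[int, int]:
    """Extract the first 'start..end' pair from an annotation string."""
    match = re.search(r"(\d+)\.\.(\d+)", field)
    if match is None:
        raise ValueError(f"No start..end range found in {field!r}")
    return int(match.group(1)), int(match.group(2))


def slice_inclusive(seq: str, start: int, end: int) -> str:
    """Return residues start..end, where both bounds are 1-based and inclusive."""
    return seq[start - 1:end]


start, end = parse_range(SIGNAL_FIELD)
signal_peptide = slice_inclusive(SEQ_PREFIX, start, end)
# Residues after the annotated signal peptide (only as far as the 40-residue prefix reaches).
after_signal = SEQ_PREFIX[end:]
```

The same `slice_inclusive` call would work on the full sequence once it is joined into a single string.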
Structure 3D X-ray crystallography (3)
Cross Reference PDB 5AJN; 5AJO; 5AJP;
Mapped Pubmed ID 10330415; 10430883; 10504389; 10742600; 10753916; 11042166; 11062056; 11062147; 11283017; 11304796; 11821425; 11919081; 11988092; 11992401; 12042033; 12360467; 12391274; 12417511; 12464682; 12527922; 12652076; 12690113; 12820724; 12855678; 12972643; 1329093; 14527933; 14749330; 14988081; 15235131; 15466199; 15486459; 15531749; 15560372; 15563276; 15599692; 15620693; 15640347; 15687324; 16142311; 16142316; 16148149; 16151858; 16251127; 16251947; 16319059; 16409634; 16465045; 16475027; 16500622; 16540890; 16552336; 16596179; 16722930; 17113861; 17148666; 17203232; 17227128; 17237423; 17321686; 17330845; 17333267; 17356062; 17395013; 17401217; 17471237; 17543073; 17555715; 17621824; 17646388; 17659847; 17703412; 17891046; 17982272; 17991319; 18006877; 18027866; 18073139; 18163520; 18166592; 18167142; 18201532; 18254322; 18283638; 18285671; 18300795; 18424749; 18475301; 18537974; 18539955; 18676374; 18782111; 18782768; 18825309; 18848467; 18978302; 19059885; 19141355; 19168703; 19201815; 19201889; 19258923; 19266212; 19394453; 19501047; 19556605; 19558858; 19596978; 19617566; 19671252; 19718656; 19718741; 19723147; 19789190; 19841186; 19841867; 19856421; 19924550; 20117097; 20216230; 20338103; 20362698; 20368025; 20422702; 20464982; 20485009; 20503287; 20525715; 20639461; 20709182; 20713760; 20731025; 20734208; 20929551; 20953890; 20972463; 21092491; 21097527; 21131736; 21139981; 21146919; 21150319; 21197415; 21249315; 21275604; 21300824; 21348892; 21362305; 21418859; 21418911; 21455588; 21502330; 21512244; 21544845; 21556755; 21622856; 21685325; 21697763; 21773870; 21815153; 21857919; 21998660; 22078291; 22183981; 22213337; 22220206; 22239058; 22261707; 22269464; 22297031; 22303480; 22348416; 22388989; 22389405; 22391959; 22441738; 22461326; 22500101; 22610099; 22676183; 22691042; 22700966; 22748473; 22798432; 23084780; 23112547; 23113953; 23249391; 23252568; 23292004; 23392388; 23392769; 23464473; 23507963; 23527003; 23549814; 23581859; 23582017; 
23602830; 23619266; 23671562; 2373995; 23768102; 23801416; 23807779; 23828387; 23828549; 23836919; 23860410; 23953484; 24010879; 24027752; 24065389; 24102466; 24120646; 24332705; 24468034; 24487386; 24556756; 24598137; 24603585; 24643043; 24717945; 24720799; 24722639; 24810798; 24840724; 24887023; 24901072; 24901817; 24920497; 24935372; 24975020; 25036673; 25108707; 25151557; 25153226; 25166306; 25298197; 25376946; 25403854; 25559041; 25605164; 25638393; 25649981; 25704757; 25838093; 26057128; 26100173; 26161982; 26299896; 26318254; 26476272; 26722604; 26751774; 26867523; 26871672; 26881964; 26971129; 26980390; 27084849; 27183390; 27193208; 27297822; 27298226; 27324793; 27517516; 27610469; 27729120; 27776277; 27845339; 27845589; 27919041; 27992432; 28321513; 28505002; 28602698; 28762513; 28942146; 28946937; 29031476; 29286856; 29289532; 29408556; 29604272; 29705272; 29767240; 29802623; 29869461; 29878077; 29906464; 29943626; 30061200; 30089720; 30272559; 30552067; 30628655; 30727993; 30759005; 30837395; 30909132; 31128106; 31362570; 31466461; 31596609; 31922915; 32028958; 32076085; 32083303; 32098629; 32208937; 32400877; 32401676; 32776556; 32992527; 33111394; 33154304; 33245816; 33300069; 33332733; 33373033; 33760106; 33765317; 33963925; 34019912; 34130299; 34167450; 34171599; 34281463; 34288207; 34321852; 34534538; 34969769; 8333853; 8598452; 8611500; 8920913; 9295285; 9394011; 9417100; 9435216; 9915862; 9988682;
Motif
Gene Encoded By
Mass 585,570 Da
Kinetics
Metal Binding
Rhea ID
Cross Reference Brenda