Conserved domains on  [gi|1207182352|ref|XP_021333794|]
mucin-2-like isoform X1 [Danio rerio]

Protein Classification

SEA domain-containing protein (domain architecture ID 10475853)

SEA (found in Sea urchin sperm protein, Enterokinase, Agrin) domain-containing protein similar to vertebrate interphotoreceptor matrix proteoglycan 1 and human membrane mucins (mucin-12, -16 and -17)

List of domain hits

Name Accession Description Interval E-value
ser_rich_anae_1 super family  cl41472  serine-rich protein  1820-2150  4.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1820 SAQSTNTPQSVTTSQpiqtetqstsetqTTTEGQSTSTTsqlpTGTTQSTFSSSAQSTNTPQSSTtlqpiqteiQSTSET 1899
Cdd:NF033849   241 TGYGESVGHSTSQGQ-------------SHSVGTSESHS----VGTSQSQSHTTGHGSTRGWSHT---------QSTSES 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1900 QSTMTTSQLPTETTQSTISTSAQPTNTPQSVTTSPTTQTETQTTTEGQSTSTTSQLPTEATQSTFSTSAQPTNTPQSLTT 1979
Cdd:NF033849   295 ESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSS 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1980 SPTIQTEIQTTTEGQSTMTTSQLPtdttqstmttsqiPTETTQSAISTSaqpTNTPQSVTTSQPIQTEIQSTSETQSTMT 2059
Cdd:NF033849   375 VSSSESSSRSSSSGVSGGFSGGIA-------------GGGVTSEGLGAS---QGGSEGWGSGDSVQSVSQSYGSSSSTGT 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 2060 TSQLPTDTTQSTfsTSAQPTTTPQSVTTSQpiqteTQTTTEGQSTSTTSQLPTDTTQSTMTTSQLSTETTQSTISTSAQS 2139
Cdd:NF033849   439 SSGHSDSSSHST--SSGQADSVSQGTSWSE-----GTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTG 511
                          330
                   ....*....|.
gi 1207182352 2140 TNTPQSVTTSQ 2150
Cdd:NF033849   512 RSESQGTSLGT 522
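The summary line and alignment coordinates of a hit can be turned into numbers for downstream filtering. The following is a minimal sketch, assuming the summary line keeps the layout shown above; the function names `parse_summary` and `model_coverage` are illustrative and not part of any NCBI tool.

```python
import re

# Summary line copied verbatim from the hit above.
SUMMARY = "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.23e-07"


def parse_summary(line):
    """Extract the numeric fields from a CD-Search hit summary line.

    Assumes the field order Pssm-ID, Cd Length, Bit Score, E-value.
    """
    pattern = (
        r"Pssm-ID:\s*(?P<pssm_id>\d+)"
        r".*?Cd Length:\s*(?P<cd_length>\d+)"
        r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
        r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
    )
    match = re.search(pattern, line)
    if match is None:
        raise ValueError(f"unrecognized summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    return (model_end - model_start + 1) / cd_length


hit = parse_summary(SUMMARY)
# Model positions 241-522 are the first and last Cdd coordinates in the alignment.
coverage = model_coverage(241, 522, hit["cd_length"])
print(hit)
print(f"model coverage: {coverage:.1%}")
```

For this hit the alignment spans 282 of the 1122 model positions, roughly a quarter of NF033849.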
SEA  pfam01390  SEA domain  2905-2989  3.47e-04

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Its proposed function is regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.

Pssm-ID: 460188  Cd Length: 100  Bit Score: 42.22  E-value: 3.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 2905 IVFTSDLEDATTPAFQILAKLVEQECDKVYRM-KYGALFLRVIVLAFRAVNkiraveQNVLVDLEIVFNQNSTEQIPDNN 2983
Cdd:pfam01390   12 LQYTPDLGNPSSQEFKSLSRRIESLLNELFRNsSLRKQYIKSHVLRLRPDG------GSVVVDVVLVFRFPSTEPALDRE 85

                   ....*.
gi 1207182352 2984 DIVQTL 2989
Cdd:pfam01390   86 KLIEEI 91
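The page reports a bit score and E-value for the SEA alignment but not how many aligned positions are identical. A rough count can be made from the two alignment rows. The sketch below assumes that lowercase letters and `-` mark positions outside the aligned columns, so only columns where both rows are uppercase are compared; `aligned_identity` is a hypothetical helper name.

```python
# Query and model rows copied from the two SEA alignment blocks above and joined.
QUERY = (
    "IVFTSDLEDATTPAFQILAKLVEQECDKVYRM-KYGALFLRVIVLAFRAVNkiraveQNVLVDLEIVFNQNSTEQIPDNN"
    "DIVQTL"
)
MODEL = (
    "LQYTPDLGNPSSQEFKSLSRRIESLLNELFRNsSLRKQYIKSHVLRLRPDG------GSVVVDVVLVFRFPSTEPALDRE"
    "KLIEEI"
)


def aligned_identity(query, model):
    """Return (identical, aligned) counts over columns where both rows are uppercase.

    Assumption: lowercase residues and '-' are unaligned positions and are skipped.
    """
    if len(query) != len(model):
        raise ValueError("alignment rows must have the same number of columns")
    aligned = 0
    identical = 0
    for q, m in zip(query, model):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


identical, aligned = aligned_identity(QUERY, MODEL)
print(f"{identical} identical residues over {aligned} aligned columns "
      f"({identical / aligned:.1%})")
```

Percent identity computed this way is only a descriptive statistic; the E-value on the page comes from the position-specific scoring matrix, not from identity.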
 
ser_rich_anae_1  NF033849  serine-rich protein  1792-2110  9.92e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.01  E-value: 9.92e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1792 ENATTSETQSTMTTSQLPTDTTQSTFSTSAQSTNTPQSVTTSQpiqtetqstSETQTTTEGQSTSTTSQLPTGTTQSTFS 1871
Cdd:NF033849   249 HSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQST---------SESESTGQSSSVGTSESQSHGTTEGTST 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1872 SSAQSTNtpqSSTTLQPIQTEIQSTSETQSTMTTSQLPTETTQSTISTSAQPTNTPQSVTTSPTTQTETQTTTEGQSTSt 1951
Cdd:NF033849   320 TDSSSHS---QSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG- 395
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1952 tSQLPTEATQSTFSTSaQPTNTPQSLTTSPTIQTEIQTTTEGQSTMTTsqlptdttqstmttsqipTETTQSAISTSAQP 2031
Cdd:NF033849   396 -GIAGGGVTSEGLGAS-QGGSEGWGSGDSVQSVSQSYGSSSSTGTSSG------------------HSDSSSHSTSSGQA 455
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 2032 TNTPQSVTTSQPiQTEIQSTSETQST-MTTSQLPTDTTQSTFSTS---AQPTTTPQSVTTSQpiqtetqTTTEGQSTSTT 2107
Cdd:NF033849   456 DSVSQGTSWSEG-TGTSQGQSVGTSEsWSTSQSETDSVGDSTGTSesvSQGDGRSTGRSESQ-------GTSLGTSGGRT 527

                   ...
gi 1207182352 2108 SQL 2110
Cdd:NF033849   528 SGA 530
Chi1  COG3469  Chitinase [Carbohydrate transport and metabolism]  1793-1981  8.86e-04


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 8.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1793 NATTSETQSTMTTSQLPTDTTQSTFSTSAQSTNTPQSVTTSQPIQTETQSTSETQTTTEGQSTSTTSQLPTGTTQSTFSS 1872
Cdd:COG3469     21 TLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATS 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1873 SAQSTNTPQSSTTLQPiqTEIQSTSETQSTMTTSQLPTETTQSTISTSAQPTNTPQSVTTSPTTQTETQTTTEGQSTSTT 1952
Cdd:COG3469    101 TASGANTGTSTVTTTS--TGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTT 178
                          170       180
                   ....*....|....*....|....*....
gi 1207182352 1953 SQLPTEATQSTFSTSAQPTNTPQSLTTSP 1981
Cdd:COG3469    179 PSATTTATATTASGATTPSATTTATTTGP 207
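The query rows in these alignments are visibly dominated by serine and threonine. The sketch below quantifies that for the first 80-residue query row of the Chi1 alignment above. It is only an illustration: `residue_fraction` is a hypothetical helper, and the page itself does not report composition statistics.

```python
from collections import Counter

# First query row of the Chi1 alignment above (residues 1793-1872).
SEGMENT = (
    "NATTSETQSTMTTSQLPTDTTQSTFSTSAQSTNTPQSVTTSQPIQTETQSTSETQTTTEGQSTSTTSQLPTGTTQSTFSS"
)


def residue_fraction(sequence, residues):
    """Fraction of letters in `sequence` that belong to `residues` (case-insensitive)."""
    counts = Counter(ch for ch in sequence.upper() if ch.isalpha())
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no residues")
    return sum(counts[aa] for aa in residues.upper()) / total


st_fraction = residue_fraction(SEGMENT, "ST")
print(f"S+T fraction over {len(SEGMENT)} residues: {st_fraction:.1%}")
```

More than half of the residues in this row are serine or threonine, which is worth keeping in mind when reading hits to serine-rich or threonine-rich models from a search run with the low-complexity filter off.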
 
Chi1  COG3469  Chitinase [Carbohydrate transport and metabolism]  1908-2111  8.41e-03


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 8.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1908 LPTETTQSTISTSAQPTNTPQSVTTSPTTQTETQTTTEGQSTSTTSQLPTEATQSTFSTSAQPTNTPQSLTTSPTIQTEI 1987
Cdd:COG3469      9 SPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAA 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207182352 1988 QTTTEGQSTMTTSQLPTDTTQSTMTTSQIP-TETTQSAISTSAQPTNTPQSVTTSQPIQTEIQSTSETQSTMTTSQLPTD 2066
Cdd:COG3469     89 ATSTSATLVATSTASGANTGTSTVTTTSTGaGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTST 168
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1207182352 2067 TTQSTFSTSAQPTTTPQSVTTSQPIQTETQTTTEGQSTSTTSQLP 2111
Cdd:COG3469    169 TTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
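Several of the hit intervals on this page overlap one another. Merging them shows how many distinct regions of the query are actually covered. This is a minimal sketch using the five distinct intervals listed on the page; `merge_intervals` is an illustrative helper, not a CD-Search feature.

```python
# (name, start, end) for each distinct hit on the page; coordinates are 1-based, inclusive.
HITS = [
    ("ser_rich_anae_1", 1820, 2150),
    ("ser_rich_anae_1", 1792, 2110),
    ("SEA", 2905, 2989),
    ("Chi1", 1793, 1981),
    ("Chi1", 1908, 2111),
]


def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals into a sorted list of disjoint ones."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals((start, end) for _, start, end in HITS)
print(regions)  # [(1792, 2150), (2905, 2989)]
```

The four serine-rich and chitinase hits collapse into a single region, residues 1792-2150, with the SEA domain as the only separate region.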
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
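The E-value threshold determines which hits are reported. The sketch below applies a cutoff to the E-values of the distinct hits on this page. It assumes the comparison is inclusive (`<=`), which the page does not state, and `filter_hits` is a hypothetical helper.

```python
# (name, interval, e_value) for each distinct hit reported on the page.
HITS = [
    ("ser_rich_anae_1", "1820-2150", 4.23e-07),
    ("ser_rich_anae_1", "1792-2110", 9.92e-07),
    ("SEA", "2905-2989", 3.47e-04),
    ("Chi1", "1793-1981", 8.86e-04),
    ("Chi1", "1908-2111", 8.41e-03),
]


def filter_hits(hits, threshold):
    """Keep hits whose E-value is at or below `threshold` (inclusive by assumption)."""
    return [hit for hit in hits if hit[2] <= threshold]


reported = filter_hits(HITS, 0.01)   # threshold used for this search
strict = filter_hits(HITS, 1e-5)     # a stricter cutoff, for comparison
print(len(reported), [name for name, _, _ in strict])
```

At the page's threshold of 0.01 all five hits pass. At 1e-5 only the two ser_rich_anae_1 alignments remain, and the SEA hit that drives the protein classification would be dropped.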
