Conserved domains on  [gi|1907182167|ref|XP_036009078|]

mucin-6 isoform X1 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
387-550 2.32e-29

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.



Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 116.73  E-value: 2.32e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216   79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157

                    ....*.
gi 1907182167   545 FTTSMG 550
Cdd:smart00216  158 FRTPDG 163
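The summary line and alignment coordinates of this hit can be read programmatically. The following is a minimal sketch, not part of the NCBI output: the function names `parse_summary` and `model_coverage` are hypothetical, and the regular expression assumes the summary line keeps the field order shown above. It extracts the numeric fields and computes how much of the domain model the alignment spans (model positions 1 to 163 out of a Cd Length of 163).

```python
import re

# Summary line copied from the VWD (smart00216) hit above.
SUMMARY = ("Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  "
           "Bit Score: 116.73  E-value: 2.32e-29")

# Assumed field order: Pssm-ID, Cd Length, Bit Score, E-value.
_PATTERN = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Extract the numeric fields from a hit summary line (hypothetical helper)."""
    match = _PATTERN.search(line)
    if match is None:
        raise ValueError("not a recognised summary line")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the alignment (inclusive coordinates)."""
    return (model_end - model_start + 1) / cd_length


hit = parse_summary(SUMMARY)
query_span = 550 - 387 + 1                            # residues in the query interval
coverage = model_coverage(1, 163, hit["cd_length"])   # model positions 1..163
print(hit, query_span, coverage)
```

The query interval is one residue longer than the model span because the alignment contains gaps on both sides.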
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1058-1130 1.62e-28

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.



Pssm-ID: 214843  Cd Length: 76  Bit Score: 111.28  E-value: 1.62e-28
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167  1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
857-1019 3.80e-28

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.



Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 113.27  E-value: 3.80e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216    1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216   75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
                           170
                    ....*....|..
gi 1907182167  1008 NGNMKDDFETRS 1019
Cdd:smart00216  151 DGEPEDDFRTPD 162
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
45-194 2.31e-25

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.



Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 105.15  E-value: 2.31e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
589-661 6.25e-22

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.



Pssm-ID: 214843  Cd Length: 76  Bit Score: 92.40  E-value: 6.25e-22
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167   589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832    4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
303-358 1.00e-16

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.



Pssm-ID: 460351  Cd Length: 55  Bit Score: 76.66  E-value: 1.00e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
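The disulphide pattern in the pfam01826 description (1-7, 2-6, 3-5, 4-10 and 8-9) can be mapped onto the query residues of this hit. The sketch below assumes that the ten cysteines in the aligned query segment correspond one-to-one, in order, to the ten cysteines of the model. The resulting pairs are a prediction from the family description, not an experimentally determined bonding pattern. The function name `cysteine_positions` is hypothetical.

```python
# Aligned query segment of the TIL (pfam01826) hit, copied from the alignment above.
QUERY_START = 303
ALIGNED_QUERY = "CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC"

# Pairing of cysteine ordinals as stated in the pfam01826 description.
PAIRING = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def cysteine_positions(aligned, start):
    """Return query residue numbers of all cysteines in an aligned segment.

    Gap characters ('-') do not consume a residue number; lowercase letters
    (residues left unaligned by CD-Search) do.
    """
    positions = []
    residue = start
    for ch in aligned:
        if ch == "-":
            continue
        if ch.upper() == "C":
            positions.append(residue)
        residue += 1
    return positions


cys = cysteine_positions(ALIGNED_QUERY, QUERY_START)

# Apply the pattern only if the segment has exactly ten cysteines (the assumption above).
bonds = [(cys[a - 1], cys[b - 1]) for a, b in PAIRING] if len(cys) == 10 else []
for first, second in bonds:
    print(f"Cys{first} - Cys{second}")
```

The same function can be applied to the cd19941 TIL hits, provided their aligned query segments also contain ten cysteines.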
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
765-828 1.83e-13

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.



Pssm-ID: 410995  Cd Length: 55  Bit Score: 67.73  E-value: 1.83e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941      1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
246-298 5.93e-11

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them because of overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.



Pssm-ID: 462584  Cd Length: 68  Bit Score: 60.86  E-value: 5.93e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182167  246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742   18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
4506-4585 3.69e-10

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.



Pssm-ID: 214482  Cd Length: 82  Bit Score: 58.95  E-value: 3.69e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  4506 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4584
Cdd:smart00041    5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78

                    .
gi 1907182167  4585 P 4585
Cdd:smart00041   79 P 79
PHA03247 super family cl33720
large tegument protein UL36; Provisional
3979-4401 2.84e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 2.84e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3979 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 4043
Cdd:PHA03247  2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4044 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 4121
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 4200
Cdd:PHA03247  2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4201 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 4272
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4273 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4351
Cdd:PHA03247  2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4352 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247  2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
665-722 1.20e-07

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.



Pssm-ID: 410995  Cd Length: 55  Bit Score: 51.16  E-value: 1.20e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
2614-2835 1.60e-06

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 54.37  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2614 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2693
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2694 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2773
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2774 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
360-404 1.99e-06

von Willebrand factor (vWF) type C domain;



Pssm-ID: 214565  Cd Length: 67  Bit Score: 47.94  E-value: 1.99e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1907182167   360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
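The interval of this VWC_out hit (360-404) runs into the interval of the first VWD hit in the list (387-550). The sketch below counts the shared residues between two inclusive intervals. The function name `overlap` and the choice of hits to compare are illustrative assumptions; the intervals themselves are taken from the hit list.

```python
def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


# Query intervals copied from the hit list.
HITS = {
    "TIL pfam01826": (303, 358),
    "VWC_out smart00215": (360, 404),
    "VWD smart00216": (387, 550),
}

vwc = HITS["VWC_out smart00215"]
for name, interval in HITS.items():
    if name != "VWC_out smart00215":
        print(name, overlap(vwc, interval))
```

Here the VWC_out and VWD intervals share residues 387 to 404, while the preceding TIL hit ends before the VWC_out hit begins.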
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
3523-3723 5.91e-06

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.45  E-value: 5.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1907182167 3683 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3723
Cdd:COG3469    184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
1657-1878 6.33e-06

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.45  E-value: 6.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
2918-3108 2.40e-05

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 2.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2997
Cdd:COG3469     38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2998 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:COG3469    118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907182167 3078 KHTTgvsletsvqTTIASPTPSAPQTSLATH 3108
Cdd:COG3469    194 TTPS---------ATTTATTTGPPTPGLPKH 215
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1727-2176 2.65e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.69  E-value: 2.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109  480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109  551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109  625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109  701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109  781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2988-3443 7.72e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 7.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2988 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 3059
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3060 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 3133
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3134 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 3213
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3214 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3290
Cdd:PHA03247  2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3439
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015

                   ....
gi 1907182167 3440 PSAP 3443
Cdd:PHA03247  3016 ETDP 3019
2A1904 super family cl36772
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2341-2695 2.18e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


The actual alignment was detected with superfamily member TIGR00927:

Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 47.68  E-value: 2.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2341 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2415
Cdd:TIGR00927   75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2416 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2486
Cdd:TIGR00927  149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2487 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2566
Cdd:TIGR00927  223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2567 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2639
Cdd:TIGR00927  302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2640 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2695
Cdd:TIGR00927  382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
3806-4026 2.22e-03

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3806 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3885
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3886 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3965
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 3966 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 4026
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
ROM1 super family cl34999
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
1407-1625 5.64e-03

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


The actual alignment was detected with superfamily member COG5422:

Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 42.96  E-value: 5.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422     28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422    108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182167 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422    187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
 
Name Accession Description Interval E-value
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
387-550 2.32e-29

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 116.73  E-value: 2.32e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216   79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157

                    ....*.
gi 1907182167   545 FTTSMG 550
Cdd:smart00216  158 FRTPDG 163
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
398-550 4.45e-29

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 115.55  E-value: 4.45e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167  478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1058-1130 1.62e-28

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 111.28  E-value: 1.62e-28
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167  1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
857-1019 3.80e-28

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 113.27  E-value: 3.80e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216    1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216   75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
                           170
                    ....*....|..
gi 1907182167  1008 NGNMKDDFETRS 1019
Cdd:smart00216  151 DGEPEDDFRTPD 162
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
1062-1129 5.66e-26

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them owing to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 103.61  E-value: 5.66e-26
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742    1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
45-194 2.31e-25

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 105.15  E-value: 2.31e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
45-193 1.11e-24

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 103.25  E-value: 1.11e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167    45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216   12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
                            90       100       110       120       130       140       150
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167   120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216   90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
589-661 6.25e-22

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 92.40  E-value: 6.25e-22
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167   589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832    4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
592-661 1.42e-21

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them owing to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 90.90  E-value: 1.42e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742    1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
869-1019 1.80e-20

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 90.89  E-value: 1.80e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094   78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
303-358 1.00e-16

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 76.66  E-value: 1.00e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
303-358 8.47e-16

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 74.28  E-value: 8.47e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
765-828 1.83e-13

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 67.73  E-value: 1.83e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941      1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
765-828 2.53e-13

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 67.03  E-value: 2.53e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826    1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
246-298 5.93e-11

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them owing to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 60.86  E-value: 5.93e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182167  246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742   18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
4506-4585 3.69e-10

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 58.95  E-value: 3.69e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  4506 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4584
Cdd:smart00041    5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78

                    .
gi 1907182167  4585 P 4585
Cdd:smart00041   79 P 79
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
246-299 4.02e-09

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 55.81  E-value: 4.02e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....
gi 1907182167   246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832   25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
3979-4401 2.84e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 2.84e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3979 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 4043
Cdd:PHA03247  2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4044 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 4121
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 4200
Cdd:PHA03247  2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4201 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 4272
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4273 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4351
Cdd:PHA03247  2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4352 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247  2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
665-722 1.20e-07

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 51.16  E-value: 1.20e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
4076-4392 1.34e-06

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 55.00  E-value: 1.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4076 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 4145
Cdd:TIGR00927   68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4146 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 4202
Cdd:TIGR00927  148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4203 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 4270
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4271 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4348
Cdd:TIGR00927  306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 4349 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4392
Cdd:TIGR00927  386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2614-2835 1.60e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 54.37  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2614 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2693
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2694 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2773
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2774 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
360-404 1.99e-06

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214565  Cd Length: 67  Bit Score: 47.94  E-value: 1.99e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1907182167   360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3523-3723 5.91e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.45  E-value: 5.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1907182167 3683 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3723
Cdd:COG3469    184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1657-1878 6.33e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.45  E-value: 6.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
665-722 8.32e-06

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 45.84  E-value: 8.32e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167  665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2918-3108 2.40e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 2.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2997
Cdd:COG3469     38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2998 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:COG3469    118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907182167 3078 KHTTgvsletsvqTTIASPTPSAPQTSLATH 3108
Cdd:COG3469    194 TTPS---------ATTTATTTGPPTPGLPKH 215
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1727-2176 2.65e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.69  E-value: 2.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109  480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109  551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109  625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109  701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109  781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2450-2846 3.41e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.30  E-value: 3.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2450 TTTAATQTKSSFSTDRTSTPhlSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTTSGttSSPQTPRTTHPFSTVAVS 2529
Cdd:pfam05109  428 TTTSPTLNTTGFAAPNTTTG--LPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGTTSG--ASPVTPSPSPRDNGTESK 502
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2530 NTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPfssTSSVTPTSEVIITPTPQHTLssaststtmgnilPTTIGQTGS 2609
Cdd:pfam05109  503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSP---TLGKTSPTSAVTTPTPNATS-------------PTPAVTTPT 566
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2610 PHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 2689
Cdd:pfam05109  567 PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2690 PRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTG 2769
Cdd:pfam05109  647 RPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEV 724
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2770 GLT-GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT----KHTTGVSLETSVQTTI---ASPTPSAPQTSLATHLPFSST 2841
Cdd:pfam05109  725 NVTkGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSttggKHTTGHGARTSTEPTTdygGDSTTPRTRYNATTYLPPSTS 804

                   ....*
gi 1907182167 2842 SAVTP 2846
Cdd:pfam05109  805 SKLRP 809
PHA03247 PHA03247
large tegument protein UL36; Provisional
2988-3443 7.72e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 7.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2988 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 3059
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3060 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 3133
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3134 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 3213
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3214 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3290
Cdd:PHA03247  2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3439
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015

                   ....
gi 1907182167 3440 PSAP 3443
Cdd:PHA03247  3016 ETDP 3019
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
4144-4378 1.05e-04

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 48.73  E-value: 1.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4144 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 4218
Cdd:COG5422     59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4219 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 4298
Cdd:COG5422    134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4299 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4373
Cdd:COG5422    211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286

                   ....*
gi 1907182167 4374 PTIHM 4378
Cdd:COG5422    287 MRLQL 291
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
2260-3880 1.27e-04

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 48.61  E-value: 1.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2260 TTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 2339
Cdd:COG3210     80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2340 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2419
Cdd:COG3210    160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2420 VASTSMplmTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2499
Cdd:COG3210    240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2500 GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVII 2579
Cdd:COG3210    317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2580 TPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 2659
Cdd:COG3210    397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2660 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLF 2739
Cdd:COG3210    477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2740 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2819
Cdd:COG3210    557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2820 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2899
Cdd:COG3210    637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2900 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2970
Cdd:COG3210    717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2971 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 3050
Cdd:COG3210    797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3051 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 3130
Cdd:COG3210    877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3131 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 3210
Cdd:COG3210    957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3211 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 3290
Cdd:COG3210   1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:COG3210   1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3450
Cdd:COG3210   1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3451 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3530
Cdd:COG3210   1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3531 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3610
Cdd:COG3210   1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3611 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3690
Cdd:COG3210   1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3691 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3770
Cdd:COG3210   1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3771 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3850
Cdd:COG3210   1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
                         1610      1620      1630
                   ....*....|....*....|....*....|
gi 1907182167 3851 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3880
Cdd:COG3210   1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2341-2695 2.18e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 47.68  E-value: 2.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2341 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2415
Cdd:TIGR00927   75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2416 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2486
Cdd:TIGR00927  149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2487 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2566
Cdd:TIGR00927  223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2567 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2639
Cdd:TIGR00927  302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2640 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2695
Cdd:TIGR00927  382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2303-2521 2.97e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.05  E-value: 2.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2303 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2382
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2383 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVMYTTTAATQTKSSF 2461
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2462 STDRTSTPHlsqSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2521
Cdd:COG3469    159 ATGGTTTTS---TTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
2631-3110 6.53e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 6.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2631 STDRT-STPTSAPHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHT 2707
Cdd:PHA03247  2563 APDRSvPPPRPAPRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAA 2635
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2708 TGLPSGTSVQTTTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSP 2787
Cdd:PHA03247  2636 NEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2788 QTPRTTHPFSTVAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAstst 2867
Cdd:PHA03247  2703 PPPPTPEPAPHALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP---- 2760
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2868 ttgnilPTTigqTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGT 2947
Cdd:PHA03247  2761 ------PTT---AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2948 PVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQ 3027
Cdd:PHA03247  2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERP 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3028 STPI----PATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQ 3102
Cdd:PHA03247  2909 PQPQapppPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREA 2988

                   ....*...
gi 1907182167 3103 TSLATHLP 3110
Cdd:PHA03247  2989 PASSTPPL 2996
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2169-2584 7.36e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 7.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2169 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSA 2248
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2249 STSTTTGnilPTTIGKTGSPHTSVPviyTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 2328
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2329 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2407
Cdd:pfam05109  576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2408 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2482
Cdd:pfam05109  650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2483 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTT 2561
Cdd:pfam05109  730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
                          410       420
                   ....*....|....*....|...
gi 1907182167 2562 HLPFSSTSSVTPTSEVIITPTPQ 2584
Cdd:pfam05109  810 RWTFTSPPVTTAQATVPVPPTSQ 832
PHA03247 PHA03247
large tegument protein UL36; Provisional
1758-2222 9.77e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 9.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247  2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247  2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
360-395 1.47e-03

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 39.72  E-value: 1.47e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907182167  360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093    1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1492-1875 1.50e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.14  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154  223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154  285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154  364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154  440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
Hamartin pfam04388
Hamartin protein; This family includes the hamartin protein which is thought to function as a ...
4109-4392 1.70e-03

Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.


Pssm-ID: 461287 [Multi-domain]  Cd Length: 730  Bit Score: 44.66  E-value: 1.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4109 VTTPPNTSTpvTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTG--TPPvhTTSGT--TSS 4184
Cdd:pfam04388  276 PTASPYTDQ--QSSYGSSTSTPSSTPRLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGmtTPP--TSPGMvpTTP 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4185 PQTPHST-HPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTS 4263
Cdd:pfam04388  352 SELSPSSsHLSSRGSSPPEAAGEATPETTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPLSKQAP 431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4264 RPLSTSLPTTIKGTGTP--QTPVSDINTT----------------------SATTQAHSSFPTTR------TSTSHLSLP 4313
Cdd:pfam04388  432 TNPNSRGLLEPPGDKSSvtLSELPDFIKDlalssedsvegaeeeaaisqelSEITTEKNETDCSRggldmpFSRTMESLA 511
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4314 SSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASPPSSAPTFvSPTAASTVISSalPTIHMTPTPSSRPTSSTGL 4392
Cdd:pfam04388  512 GSQRSRNRIASYCSSTSQSDSHGPATTPESKPSALAEDGLRRTKSC-SFKQSFTPIEQ--PIESSDDCPTDEQDGENGL 587
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3806-4026 2.22e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3806 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3885
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3886 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3965
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 3966 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 4026
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3191-3405 2.72e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3191 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3270
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3271 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3349
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3350 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3405
Cdd:COG3469    159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1961-2179 2.82e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2917-3241 2.97e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.83  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2917 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2990
Cdd:TIGR00927   91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2991 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 3056
Cdd:TIGR00927  169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3057 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3136
Cdd:TIGR00927  246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3137 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 3206
Cdd:TIGR00927  326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907182167 3207 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 3241
Cdd:TIGR00927  405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
4114-4342 4.18e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 4.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4114 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 4193
Cdd:NF033849   250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4194 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 4268
Cdd:NF033849   330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 4269 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4342
Cdd:NF033849   410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
1669-2011 5.27e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.06  E-value: 5.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927   73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927  308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927  387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3047-3410 5.40e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 5.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3047 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 3126
Cdd:pfam03154  186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3127 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 3206
Cdd:pfam03154  265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3207 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 3282
Cdd:pfam03154  338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3283 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3354
Cdd:pfam03154  413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3355 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3410
Cdd:pfam03154  488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3360-3720 5.63e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 5.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3360 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3439
Cdd:pfam03154  169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3440 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3519
Cdd:pfam03154  236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3520 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3599
Cdd:pfam03154  300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3600 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3665
Cdd:pfam03154  380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 3666 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3720
Cdd:pfam03154  460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
1407-1625 5.64e-03

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 42.96  E-value: 5.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422     28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422    108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182167 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422    187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2784-3100 8.80e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.60  E-value: 8.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2784 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2847
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2848 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2919
Cdd:pfam05109  502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2920 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2999
Cdd:pfam05109  580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3000 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:pfam05109  656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
                          330       340
                   ....*....|....*....|...
gi 1907182167 3078 KHTTGVSLETSVQTTIASPTPSA 3100
Cdd:pfam05109  731 PPKNATSPQAPSGQKTAVPTVTS 753
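
Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". For readers who want to tabulate the hits, the following is a minimal Python sketch of how such a line could be parsed. The function name `parse_summary` and the regular expression are illustrative assumptions based only on the lines shown here; they are not part of any NCBI tool or API, and other CD-Search output formats may differ.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this
# listing. The bracketed tag (e.g. "[Multi-domain]") is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    # Summary line copied from the Herpes_BLLF1 hit above.
    line = ("Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  "
            "Bit Score: 42.60  E-value: 8.80e-03")
    print(parse_summary(line))
```

Lines that do not follow this layout yield None rather than raising, so the function can be applied to every line of the listing and the non-matching ones skipped.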
 
Name Accession Description Interval E-value
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
387-550 2.32e-29

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 116.73  E-value: 2.32e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216   79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157

                    ....*.
gi 1907182167   545 FTTSMG 550
Cdd:smart00216  158 FRTPDG 163
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
398-550 4.45e-29

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 115.55  E-value: 4.45e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167  478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1058-1130 1.62e-28

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 111.28  E-value: 1.62e-28
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167  1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
857-1019 3.80e-28

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 113.27  E-value: 3.80e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216    1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216   75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
                           170
                    ....*....|..
gi 1907182167  1008 NGNMKDDFETRS 1019
Cdd:smart00216  151 DGEPEDDFRTPD 162
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
1062-1129 5.66e-26

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 103.61  E-value: 5.66e-26
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742    1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
45-194 2.31e-25

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 105.15  E-value: 2.31e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167   45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
45-193 1.11e-24

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 103.25  E-value: 1.11e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167    45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216   12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
                            90       100       110       120       130       140       150
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167   120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216   90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
589-661 6.25e-22

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 92.40  E-value: 6.25e-22
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167   589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832    4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
592-661 1.42e-21

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 90.90  E-value: 1.42e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742    1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
869-1019 1.80e-20

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 90.89  E-value: 1.80e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094   78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
303-358 1.00e-16

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 76.66  E-value: 1.00e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
303-358 8.47e-16

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 74.28  E-value: 8.47e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
765-828 1.83e-13

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 67.73  E-value: 1.83e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941      1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
765-828 2.53e-13

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 67.03  E-value: 2.53e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826    1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
246-298 5.93e-11

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 60.86  E-value: 5.93e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182167  246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742   18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
4506-4585 3.69e-10

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 58.95  E-value: 3.69e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167  4506 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4584
Cdd:smart00041    5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78

                    .
gi 1907182167  4585 P 4585
Cdd:smart00041   79 P 79
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
246-299 4.02e-09

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 55.81  E-value: 4.02e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....
gi 1907182167   246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832   25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
3979-4401 2.84e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 2.84e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3979 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 4043
Cdd:PHA03247  2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4044 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 4121
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 4200
Cdd:PHA03247  2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4201 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 4272
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4273 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4351
Cdd:PHA03247  2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4352 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247  2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3898-4357 4.14e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 60.18  E-value: 4.14e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3898 PLFSTLSVTPTTEGLNTPT-SPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPvtytttaatqtkssfsTDRTSTPHLSQ 3976
Cdd:PHA03307    54 TVVAGAAACDRFEPPTGPPpGPGTEAPANESRSTPTWSLSTLAPASPAREGSP----------------TPPGPSSPDPP 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3977 SSTVTPTQPTPIPATTNSPMTTvglTGTPVVHTPSGTSSIAHTPHTthslPTAASSSTTLSTAPQFRTSEQSTttfPTPS 4056
Cdd:PHA03307   118 PPTPPPASPPPSPAPDLSEMLR---PVGSPGPPPAASPPAAGASPA----AVASDAASSRQAALPLSSPEETA---RAPS 187
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4057 APQTSLVTSLPPFSTSSVSPTdeihitstnPHTVSSVSMSRPVSTILQTtievttpPNTSTPVTHSTSATTEAQGSFSTE 4136
Cdd:PHA03307   188 SPPAEPPPSTPPAAASPRPPR---------RSSPISASASSPAPAPGRS-------AADDAGASSSDSSSSESSGCGWGP 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4137 RTSTSyLSHPssttvhqstaGPVITSIKSTMGVTGTPPvhttsGTTSSPQTPHSTHPISTAAISRttGISGTPFRTPMKT 4216
Cdd:PHA03307   252 ENECP-LPRP----------APITLPTRIWEASGWNGP-----SSRPGPASSSSSPRERSPSPSP--SSPGSGPAPSSPR 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4217 TITFPTPSSLQTSMATLfppfSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdintTSATTQA 4296
Cdd:PHA03307   314 ASSSSSSSRESSSSSTS----SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPS------SPAASAG 383
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4297 HSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASP-PSSAP 4357
Cdd:PHA03307   384 RPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPwPGSPP 445
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
665-722 1.20e-07

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 51.16  E-value: 1.20e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167  665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
4076-4392 1.34e-06

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 55.00  E-value: 1.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4076 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 4145
Cdd:TIGR00927   68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4146 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 4202
Cdd:TIGR00927  148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4203 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 4270
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4271 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4348
Cdd:TIGR00927  306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 4349 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4392
Cdd:TIGR00927  386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2614-2835 1.60e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 54.37  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2614 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2693
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2694 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2773
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2774 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
360-404 1.99e-06

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214565  Cd Length: 67  Bit Score: 47.94  E-value: 1.99e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1907182167   360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3523-3723 5.91e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.45  E-value: 5.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1907182167 3683 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3723
Cdd:COG3469    184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1657-1878 6.33e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.45  E-value: 6.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2601-2794 7.70e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.06  E-value: 7.70e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2601 PTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTT 2680
Cdd:COG3469     26 AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGA 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2681 SGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPA 2760
Cdd:COG3469    106 NTGTSTVTTT----STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1907182167 2761 TTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2794
Cdd:COG3469    182 TTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
665-722 8.32e-06

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 45.84  E-value: 8.32e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167  665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2635-2858 1.44e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 51.29  E-value: 1.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2635 TSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 2714
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2715 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2794
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 2795 PFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfSSTSAVTPTSEVIITPTPQH 2858
Cdd:COG3469    156 TETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT----PSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2918-3108 2.40e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 2.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2997
Cdd:COG3469     38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2998 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:COG3469    118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907182167 3078 KHTTgvsletsvqTTIASPTPSAPQTSLATH 3108
Cdd:COG3469    194 TTPS---------ATTTATTTGPPTPGLPKH 215
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1727-2176 2.65e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.69  E-value: 2.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109  480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109  551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109  625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109  701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109  781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
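
A note on reading the hit summary: each hit carries a line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, and the alignment rows give the first and last model positions that were matched. The following is a minimal Python sketch, not part of the CD-Search output, that parses such a line and estimates what fraction of the model the alignment spans. The helper names `parse_summary` and `model_coverage` are hypothetical, and the field layout is assumed from the summary lines shown on this page.

```python
import re

# Assumed layout, taken from the summary lines on this page:
# "Pssm-ID: <int> [<hit type>]  Cd Length: <int>  Bit Score: <float>  E-value: <float>"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one hit summary line into a dict (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "hit_type": match["hit_type"],  # None when the bracketed tag is absent
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(first, last, cd_length):
    """Fraction of the model spanned by positions first..last (inclusive)."""
    return (last - first + 1) / cd_length


# Values copied from the pfam05109 hit at query interval 1727-2176:
# the Cdd rows run from model position 422 to 832.
hit = parse_summary(
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  "
    "Bit Score: 50.69  E-value: 2.65e-05"
)
coverage = model_coverage(422, 832, hit["cd_length"])
print(f"{hit['pssm_id']}: {coverage:.1%} of the model spanned")
```

The span is measured on the model coordinates, so gaps inside the alignment are not subtracted; it is an upper bound on the aligned fraction.
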
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2450-2846 3.41e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.30  E-value: 3.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2450 TTTAATQTKSSFSTDRTSTPhlSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTTSGttSSPQTPRTTHPFSTVAVS 2529
Cdd:pfam05109  428 TTTSPTLNTTGFAAPNTTTG--LPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGTTSG--ASPVTPSPSPRDNGTESK 502
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2530 NTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPfssTSSVTPTSEVIITPTPQHTLssaststtmgnilPTTIGQTGS 2609
Cdd:pfam05109  503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSP---TLGKTSPTSAVTTPTPNATS-------------PTPAVTTPT 566
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2610 PHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 2689
Cdd:pfam05109  567 PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2690 PRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTG 2769
Cdd:pfam05109  647 RPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEV 724
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2770 GLT-GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT----KHTTGVSLETSVQTTI---ASPTPSAPQTSLATHLPFSST 2841
Cdd:pfam05109  725 NVTkGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSttggKHTTGHGARTSTEPTTdygGDSTTPRTRYNATTYLPPSTS 804

                   ....*
gi 1907182167 2842 SAVTP 2846
Cdd:pfam05109  805 SKLRP 809
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2645-2856 4.91e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.37  E-value: 4.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2645 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2724
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTT----GSVVVAASGSAGSGTGTTAASSTAATSSTT 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2725 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2804
Cdd:COG3469     77 STTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2805 KHTTGVsleTSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTP 2856
Cdd:COG3469    157 ETATGG---TTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1644-1833 5.87e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.37  E-value: 5.87e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1644 PTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTT 1723
Cdd:COG3469     26 AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGA 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1724 SGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPA 1803
Cdd:COG3469    106 NTGTSTVTTT----STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|
gi 1907182167 1804 ITNSLmTTGGLTGTPPVHTTSGTTSSPQTP 1833
Cdd:COG3469    182 TTTAT-ATTASGATTPSATTTATTTGPPTP 210
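
A note on reading the alignment rows: a rough identity count can be taken from any query/model row pair. The sketch below is illustrative only and is not part of the CD-Search output. It assumes that `-` marks a gap and that lowercase letters mark residues not aligned to the model, which is how the rows on this page appear to be laid out. The function name `count_identities` is hypothetical.

```python
def count_identities(query_row, model_row):
    """Return (identical, aligned) column counts for two alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Row pair copied from the last block of the COG3469 hit at 1644-1833.
query_row = "ITNSLmTTGGLTGTPPVHTTSGTTSSPQTP"
model_row = "TTTAT-ATTASGATTPSATTTATTTGPPTP"
identical, aligned = count_identities(query_row, model_row)
print(f"{identical} identical of {aligned} aligned columns")
```

In threonine- and serine-rich repeat regions such as this one, identity counts against low-complexity models should be read with caution, since matches can reflect composition rather than homology.
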
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3495-3682 7.65e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 48.98  E-value: 7.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3495 TGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSS 3574
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3575 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLM 3654
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA 187
                          170       180
                   ....*....|....*....|....*...
gi 1907182167 3655 TTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469    188 TTASGATTPSATTTATTTGPPTPGLPKH 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
2988-3443 7.72e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 7.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2988 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 3059
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3060 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 3133
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3134 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 3213
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3214 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3290
Cdd:PHA03247  2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3439
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015

                   ....
gi 1907182167 3440 PSAP 3443
Cdd:PHA03247  3016 ETDP 3019
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
4144-4378 1.05e-04

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 48.73  E-value: 1.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4144 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 4218
Cdd:COG5422     59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4219 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 4298
Cdd:COG5422    134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4299 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4373
Cdd:COG5422    211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286

                   ....*
gi 1907182167 4374 PTIHM 4378
Cdd:COG5422    287 MRLQL 291
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
2260-3880 1.27e-04

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 48.61  E-value: 1.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2260 TTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 2339
Cdd:COG3210     80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2340 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2419
Cdd:COG3210    160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2420 VASTSMplmTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2499
Cdd:COG3210    240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2500 GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVII 2579
Cdd:COG3210    317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2580 TPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 2659
Cdd:COG3210    397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2660 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLF 2739
Cdd:COG3210    477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2740 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2819
Cdd:COG3210    557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2820 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2899
Cdd:COG3210    637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2900 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2970
Cdd:COG3210    717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2971 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 3050
Cdd:COG3210    797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3051 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 3130
Cdd:COG3210    877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3131 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 3210
Cdd:COG3210    957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3211 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 3290
Cdd:COG3210   1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:COG3210   1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3450
Cdd:COG3210   1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3451 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3530
Cdd:COG3210   1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3531 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3610
Cdd:COG3210   1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3611 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3690
Cdd:COG3210   1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3691 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3770
Cdd:COG3210   1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3771 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3850
Cdd:COG3210   1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
                         1610      1620      1630
                   ....*....|....*....|....*....|
gi 1907182167 3851 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3880
Cdd:COG3210   1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2341-2695 2.18e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 47.68  E-value: 2.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2341 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2415
Cdd:TIGR00927   75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2416 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2486
Cdd:TIGR00927  149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2487 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2566
Cdd:TIGR00927  223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2567 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2639
Cdd:TIGR00927  302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2640 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2695
Cdd:TIGR00927  382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1623-1821 2.26e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.44  E-value: 2.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1623 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1702
Cdd:COG3469     12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTN-----FPTHSGPQSSLST 1777
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVsgtetATGGTTTTSTTTT 171
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1907182167 1778 HLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVH 1821
Cdd:COG3469    172 TTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2303-2521 2.97e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.05  E-value: 2.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2303 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2382
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2383 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVMYTTTAATQTKSSF 2461
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2462 STDRTSTPHlsqSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2521
Cdd:COG3469    159 ATGGTTTTS---TTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2873-3067 4.25e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.28  E-value: 4.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2873 LPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSiKPTMSSTGTPVVHT 2952
Cdd:COG3469     22 LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA-AAATSTSATLVATS 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2953 TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIP 3032
Cdd:COG3469    101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1907182167 3033 ATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 3067
Cdd:COG3469    181 ATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
4042-4320 4.26e-04

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 46.81  E-value: 4.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4042 FRTSEQSTTTFPTPSAPQTSLVTSLPPfsTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTH 4121
Cdd:COG5422     17 FGAPRKSDAFVSKQLLPPRRLQRKLNP--ISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITH 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 STSAT--TEAQGSFS---TERTSTSYLSHPSSTTVHQSTAGPvitsikstmgvTGTPpvhttSGTTSSPQTPHSTHPIST 4196
Cdd:COG5422     95 SPSATssTSSLNSNDgdqFSPASDSLSFNPSSTQSRKDSGPG-----------DGSP-----VQKRKNPLLPSSSTHGTH 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4197 AAISrTTGISGTPFRTPMKTTiTFPTPSSLQTSMATLFPPF--STSVMSSTEIFNTP---TNPHSVSSASTSRPLSTSLP 4271
Cdd:COG5422    159 PPIV-FTDNNGSHAGAPNARS-RKEIPSLGSQSMQLPSPHFrqKFSSSDTSNGFSYPsirKNSRHSSNSMPSFPHSSTAV 236
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1907182167 4272 TTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTL 4320
Cdd:COG5422    237 LLKRHSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYPALLSRVAVEF 285
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
4165-4402 5.49e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.28  E-value: 5.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4165 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSS 4244
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4245 TEIFNTPtnphsvsSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 4324
Cdd:COG3469     82 ATAAAAA-------ATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 4325 RSASTLQYTPTPSSVShspllTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV 4402
Cdd:COG3469    155 GTETATGGTTTTSTTT-----TTTSASTTPSATTTATATTASG-----------ATTPSATTTATTTGPPTPGLPKHV 216
PHA03247 PHA03247
large tegument protein UL36; Provisional
3818-4391 5.71e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 5.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3818 PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqSSLSTHL 3897
Cdd:PHA03247  2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP-SPAANEP 2638
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3898 PLFSTLSVTPTTEGLNTPTSPH-SLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTT------------AATQTKSSF 3964
Cdd:PHA03247  2639 DPHPPPTVPPPERPRDDPAPGRvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpppppptpePAPHALVSA 2718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3965 STDRTSTPHLSQSSTVTPTQPTPiPATTNSPMTTVGLTGTPVVHTPSGTSSI------AHTPHTTHSLPTAASSSTTLST 4038
Cdd:PHA03247  2719 TPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPappaapAAGPPRRLTRPAVASLSESRES 2797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4039 APQFRTSEQSTT--TFPTPSAPQTSLVTSL--PPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVStilqttievTTPPN 4114
Cdd:PHA03247  2798 LPSPWDPADPPAavLAPAAALPPAASPAGPlpPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR---------RRPPS 2868
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4115 TSTPVTHSTSAtteaqgsfsteRTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHT-TSGTTSSPQTPHSTHP 4193
Cdd:PHA03247  2869 RSPAAKPAAPA-----------RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPP 2937
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4194 ISTAAISRTTGISGTPfrtpmkttitFPTPSSLQTSMATLFPPfstsvmssteifNTPTNPHSVSSASTSRPLSTSLPTT 4273
Cdd:PHA03247  2938 RPQPPLAPTTDPAGAG----------EPSGAVPQPWLGALVPG------------RVAVPRFRVPQPAPSREAPASSTPP 2995
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4274 IKGTGTPQTpvsdinTTSATTQAHSSFPTTRtstshlslPSSMTSTLTPAS----RSASTLQYTPTPSSVSHSPLLTTPT 4349
Cdd:PHA03247  2996 LTGHSLSRV------SSWASSLALHEETDPP--------PVSLKQTLWPPDdtedSDADSLFDSDSERSDLEALDPLPPE 3061
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*....
gi 1907182167 4350 ASPPSSAPTFVSPTAAStviSSALPTIHMTPTP-------SSRPTSSTG 4391
Cdd:PHA03247  3062 PHDPFAHEPDPATPEAG---ARESPSSQFGPPPlsanaalSRRYVRSTG 3107
PHA03247 PHA03247
large tegument protein UL36; Provisional
2631-3110 6.53e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 6.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2631 STDRT-STPTSAPHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHT 2707
Cdd:PHA03247  2563 APDRSvPPPRPAPRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAA 2635
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2708 TGLPSGTSVQTTTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSP 2787
Cdd:PHA03247  2636 NEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2788 QTPRTTHPFSTVAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAstst 2867
Cdd:PHA03247  2703 PPPPTPEPAPHALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP---- 2760
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2868 ttgnilPTTigqTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGT 2947
Cdd:PHA03247  2761 ------PTT---AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2948 PVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQ 3027
Cdd:PHA03247  2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERP 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3028 STPI----PATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQ 3102
Cdd:PHA03247  2909 PQPQapppPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREA 2988

                   ....*...
gi 1907182167 3103 TSLATHLP 3110
Cdd:PHA03247  2989 PASSTPPL 2996
PHA03247 PHA03247
large tegument protein UL36; Provisional
4165-4463 6.59e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 6.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4165 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPfRTPMKTTITfPTPSSLqTSMATLFPPFST---SV 4241
Cdd:PHA03247  2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPP-QRPRRRAAR-PTVGSL-TSLADPPPPPPTpepAP 2712
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4242 MSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLT 4321
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4322 PASRSASTLQYTPTPSSVSHSPLLTTPTASPPssAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247  2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4402 VPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLPTSA 4463
Cdd:PHA03247  2871 PAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2169-2584 7.36e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 7.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2169 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSA 2248
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2249 STSTTTGnilPTTIGKTGSPHTSVPviyTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 2328
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2329 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2407
Cdd:pfam05109  576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2408 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2482
Cdd:pfam05109  650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2483 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTT 2561
Cdd:pfam05109  730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
                          410       420
                   ....*....|....*....|...
gi 1907182167 2562 HLPFSSTSSVTPTSEVIITPTPQ 2584
Cdd:pfam05109  810 RWTFTSPPVTTAQATVPVPPTSQ 832
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1486-3107 8.01e-04

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 45.91  E-value: 8.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1486 TAPLITSITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQTPRTTHP 1565
Cdd:COG3210     41 GSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGGIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTA 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1566 FSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTTGNILPT 1645
Cdd:COG3210    121 ASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGA 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1646 TIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSG 1725
Cdd:COG3210    201 LINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGT 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1726 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAIT 1805
Cdd:COG3210    281 NATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAG 360
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1806 NSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTS 1885
Cdd:COG3210    361 SGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGT 440
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1886 SVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSA 1965
Cdd:COG3210    441 VTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAG 520
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1966 VTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTprTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQ 2045
Cdd:COG3210    521 GGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASG--SNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGG 598
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2046 SSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTS 2125
Cdd:COG3210    599 TVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTV 678
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2126 TPHLSQSSTVTPTQSTPIPATTNS----LMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV 2201
Cdd:COG3210    679 TSGATGGTTGTTLNAATGGTLNNAgntlTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLS 758
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2202 QTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSASTSTTTGNILPT----TIGKTGSPHTSVPVIYT 2277
Cdd:COG3210    759 IGLTANTTASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSggtiTINTATTGLTGTGDTTS 838
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2278 TSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTV 2357
Cdd:COG3210    839 GAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTA 918
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2358 AVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEG 2437
Cdd:COG3210    919 TGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILV 998
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2438 TRPPHTSVPVMYTTTAATQTKSSFSTDRT----STPHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSS 2513
Cdd:COG3210    999 AGNSGTTASTTGGSGAIVAGGNGVTGTTGtasaTGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNG 1078
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2514 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVIITPTPQHTLSSASTS 2593
Cdd:COG3210   1079 GGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGA 1158
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2594 TTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTG 2673
Cdd:COG3210   1159 SSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSA 1238
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2674 TPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNT 2753
Cdd:COG3210   1239 GQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGI 1318
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2754 QSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA 2833
Cdd:COG3210   1319 GGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNA 1398
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2834 THLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTS 2913
Cdd:COG3210   1399 GRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVG 1478
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2914 APHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTT 2993
Cdd:COG3210   1479 GAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGA 1558
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2994 NFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVA 3073
Cdd:COG3210   1559 AGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNG 1638
                         1610      1620      1630
                   ....*....|....*....|....*....|....
gi 1907182167 3074 VSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 3107
Cdd:COG3210   1639 GEGVLALVAGGNTTNGTTLSGAVNGAGNGWAVDL 1672
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2501-2832 8.74e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.91  E-value: 8.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2501 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVIIT 2580
Cdd:pfam03154  186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2581 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2660
Cdd:pfam03154  265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2661 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPqsslSTH- 2735
Cdd:pfam03154  338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHp 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2736 -----LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2810
Cdd:pfam03154  413 pplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGI 492
                          330       340
                   ....*....|....*....|..
gi 1907182167 2811 SLETSVQTTIASPTPSAPQTSL 2832
Cdd:pfam03154  493 QPPSSASVSSSGPVPAAVSCPL 514
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3523-3746 9.07e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 9.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 3683 PFSTVAVSNTKHTTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 3746
Cdd:COG3469    156 TETATGGTTTTSTTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
1758-2222 9.77e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 9.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247  2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247  2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2398-2585 1.08e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.13  E-value: 1.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2398 STLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPvmYTTTAATQTKSSFSTDRTSTPHLSQSSTV 2477
Cdd:COG3469     33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT--ATAAAAAATSTSATLVATSTASGANTGTS 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2478 TPTQ-STPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQ 2556
Cdd:COG3469    111 TVTTtSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA 190
                          170       180
                   ....*....|....*....|....*....
gi 1907182167 2557 TSLTThlpfSSTSSVTPTSEVIITPTPQH 2585
Cdd:COG3469    191 SGATT----PSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2920-3131 1.45e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 1.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2920 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2999
Cdd:COG3469     14 GASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTS 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3000 GPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 3079
Cdd:COG3469     94 ATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 3080 TTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 3131
Cdd:COG3469    168 TTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
360-395 1.47e-03

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 39.72  E-value: 1.47e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907182167  360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093    1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1492-1875 1.50e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.14  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154  223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154  285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154  364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154  440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1827-2242 1.67e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 1.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1827 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 1906
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1907 STSTTTGnilPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 1986
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1987 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2065
Cdd:pfam05109  576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2066 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2140
Cdd:pfam05109  650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2141 TPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 2219
Cdd:pfam05109  730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
                          410       420
                   ....*....|....*....|...
gi 1907182167 2220 HLPFSSTSSVTPTSKVIITPTPQ 2242
Cdd:pfam05109  810 RWTFTSPPVTTAQATVPVPPTSQ 832
Hamartin pfam04388
Hamartin protein; This family includes the hamartin protein which is thought to function as a ...
4109-4392 1.70e-03

Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.


Pssm-ID: 461287 [Multi-domain]  Cd Length: 730  Bit Score: 44.66  E-value: 1.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4109 VTTPPNTSTpvTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTG--TPPvhTTSGT--TSS 4184
Cdd:pfam04388  276 PTASPYTDQ--QSSYGSSTSTPSSTPRLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGmtTPP--TSPGMvpTTP 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4185 PQTPHST-HPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTS 4263
Cdd:pfam04388  352 SELSPSSsHLSSRGSSPPEAAGEATPETTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPLSKQAP 431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4264 RPLSTSLPTTIKGTGTP--QTPVSDINTT----------------------SATTQAHSSFPTTR------TSTSHLSLP 4313
Cdd:pfam04388  432 TNPNSRGLLEPPGDKSSvtLSELPDFIKDlalssedsvegaeeeaaisqelSEITTEKNETDCSRggldmpFSRTMESLA 511
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4314 SSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASPPSSAPTFvSPTAASTVISSalPTIHMTPTPSSRPTSSTGL 4392
Cdd:pfam04388  512 GSQRSRNRIASYCSSTSQSDSHGPATTPESKPSALAEDGLRRTKSC-SFKQSFTPIEQ--PIESSDDCPTDEQDGENGL 587
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
830-887 1.77e-03

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214565  Cd Length: 67  Bit Score: 39.85  E-value: 1.77e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167   830 CDYAGVSYPGGFELHTDCKTCTCSQGRWTCQlSTQCPSTCVLYGEGHIITFDGQRFVF 887
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCT-KVWCGPKPCLLHNLSGECPLGQGCVP 57
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
4181-4460 1.86e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.52  E-value: 1.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4181 TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMatlfPPFSTSVMSSTEIFNTPTNPHSVSSA 4260
Cdd:pfam05109  392 TVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAA----PNTTTGLPSSTHVPTNLTAPASTGPT 467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4261 STSRPLSTSLPTTIKGTGTPQTPVSDI----------NTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL 4330
Cdd:pfam05109  468 VSTADVTSPTPAGTTSGASPVTPSPSPrdngteskapDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSA 547
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4331 QYTPTPSSVSHSPLLTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV--PTFSSF 4408
Cdd:pfam05109  548 VTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVT-----------TPTPNATSPTVGETSPQANTTNHTlgGTSSTP 616
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4409 SSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4460
Cdd:pfam05109  617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMP 668
VWC smart00214
von Willebrand factor (vWF) type C domain;
360-395 1.96e-03

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 39.42  E-value: 1.96e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1907182167   360 CMLNGMVYGPGEITKT-ACQTCQCTMGRW-TCTKQPCP 395
Cdd:smart00214    1 CVHNGRVYNDGETWKPdPCQICTCLDGTTvLCDPVECP 38
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3977-4224 2.00e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 2.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3977 SSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTThSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPS 4056
Cdd:COG3469      3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSV-VVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4057 APQTSLVTSLPPFSTSSVSPTDeihitstnphtvssvsmsrPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTE 4136
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTAS-------------------GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATS 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4137 RTSTSYLSHPSSTTVHqstagpvitsikstmGVTGTPPVHTTSGTTSSPQTPhSTHPISTAAISRTTGISGTPFRTPMKT 4216
Cdd:COG3469    143 SAGSTTTTTTVSGTET---------------ATGGTTTTSTTTTTTSASTTP-SATTTATATTASGATTPSATTTATTTG 206

                   ....*...
gi 1907182167 4217 TITFPTPS 4224
Cdd:COG3469    207 PPTPGLPK 214
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3806-4026 2.22e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3806 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3885
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3886 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3965
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 3966 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 4026
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3968-4366 2.34e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3968 RTSTPHLSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQ 4047
Cdd:pfam03154  168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4048 STTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPV-THSTSAT 4126
Cdd:pfam03154  248 PLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGqSQQRIHT 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4127 TEAQGSFSTERTSTSYLSHPSSTTVhqstagPVItsikstmgvtgTPPVHTTSGTTSSPQT-PHSTHPISTAAISRTTGI 4205
Cdd:pfam03154  328 PPSQSQLQSQQPPREQPLPPAPLSM------PHI-----------KPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNL 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4206 SGTPFRTPMKTTITFPTPSS-------LQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTT--IKG 4276
Cdd:pfam03154  391 PPPPALKPLSSLSTHHPPSAhppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHpfVPG 470
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4277 TGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSvshspllTTPTASPPSSA 4356
Cdd:pfam03154  471 GPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPES-------PPPPPRSPSPE 543
                          410
                   ....*....|.
gi 1907182167 4357 PTFV-SPTAAS 4366
Cdd:pfam03154  544 PTVVnTPSHAS 554
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2933-3107 2.42e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2933 VSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 3011
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3012 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 3089
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
                          170
                   ....*....|....*...
gi 1907182167 3090 QTTIASPTPSAPQTSLAT 3107
Cdd:COG3469    161 GGTTTTSTTTTTTSASTT 178
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2670-2888 2.52e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2670 SSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPlfSTLSVTPTTE 2749
Cdd:COG3469      5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAAT--SSTTSTTATA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2750 GLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQ 2829
Cdd:COG3469     83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGG 162
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2830 TSlathlpfsSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVP 2888
Cdd:COG3469    163 TT--------TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
4147-4374 2.54e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4147 SSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTpfrtpmkttiTFPTPSSL 4226
Cdd:COG3469      4 VSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATS 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4227 QTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTS 4306
Cdd:COG3469     74 STTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 4307 TSHLSLPSSMTSTLTPASRSASTLQYTPTPSsvshspllTTPTASPPSSAPTFVSPTAASTVISSALP 4374
Cdd:COG3469    154 SGTETATGGTTTTSTTTTTTSASTTPSATTT--------ATATTASGATTPSATTTATTTGPPTPGLP 213
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
3815-4188 2.58e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.80  E-value: 2.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3815 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3894
Cdd:pfam17823   45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3895 THLPLFSTLSVTPTTEGLNTPTSphslSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHL 3974
Cdd:pfam17823  125 SAAQSLPAAIAALPSEAFSAPRA----AACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAAS 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3975 SQSSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFP 4053
Cdd:pfam17823  201 SAPATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRL 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4054 TPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQG 4131
Cdd:pfam17823  281 SPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSA 360
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 4132 --------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 4188
Cdd:pfam17823  361 spvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3191-3405 2.72e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3191 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3270
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3271 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3349
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3350 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3405
Cdd:COG3469    159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3548-3722 2.79e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3548 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 3626
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3627 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 3704
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
                          170
                   ....*....|....*...
gi 1907182167 3705 QTTIASPTPSAPQTSLAT 3722
Cdd:COG3469    161 GGTTTTSTTTTTTSASTT 178
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1961-2179 2.82e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2917-3241 2.97e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.83  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2917 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2990
Cdd:TIGR00927   91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2991 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 3056
Cdd:TIGR00927  169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3057 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3136
Cdd:TIGR00927  246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3137 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 3206
Cdd:TIGR00927  326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907182167 3207 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 3241
Cdd:TIGR00927  405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1913-2104 3.07e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1913 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 1992
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1993 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 2070
Cdd:COG3469    102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1907182167 2071 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 2104
Cdd:COG3469    182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3143-3334 3.07e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3143 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 3222
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3223 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 3300
Cdd:COG3469    102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1907182167 3301 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 3334
Cdd:COG3469    182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2328-2537 3.28e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2328 SSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfSTLSVTPTTE 2407
Cdd:COG3469      5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST-----AATSSTTSTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2408 GLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPA 2487
Cdd:COG3469     80 ATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETA 159
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2488 TTNSlmTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2537
Cdd:COG3469    160 TGGT--TTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGP 207
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3901-4068 3.55e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3901 STLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTV 3980
Cdd:COG3469     33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTV 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3981 TPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQF---RTSEQSTTTFPTPSA 4057
Cdd:COG3469    113 TTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSastTPSATTTATATTASG 192
                          170
                   ....*....|.
gi 1907182167 4058 PQTSLVTSLPP 4068
Cdd:COG3469    193 ATTPSATTTAT 203
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
4114-4342 4.18e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 4.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4114 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 4193
Cdd:NF033849   250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4194 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 4268
Cdd:NF033849   330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 4269 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4342
Cdd:NF033849   410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
3672-4130 4.83e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 4.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3672 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3751
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3752 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3827
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3828 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3907
Cdd:pfam05109  579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3908 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3987
Cdd:pfam05109  658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3988 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 4065
Cdd:pfam05109  738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4066 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 4130
Cdd:pfam05109  815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
1669-2011 5.27e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.06  E-value: 5.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927   73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927  308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927  387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
COG5099 COG5099
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ...
3963-4342 5.38e-03

RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227430 [Multi-domain]  Cd Length: 777  Bit Score: 43.20  E-value: 5.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3963 SFSTDRTSTPHLSQSSTVTpTQPTPIPATTNSPMTTVGlTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTapqf 4042
Cdd:COG5099     38 STPNSFSPIPSKASSSATF-TLNLPINNSVNHKITSSS-SSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPST---- 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4043 rtseqSTTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHI-TSTNPHTVSSVSMSRPVSTILQttiEVTTPPNTSTPVTH 4121
Cdd:COG5099    112 -----SSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSnKLPLPNPNHSNSATTNQSGSSF---INTPASSSSQPLTN 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 STSATTEAQGSFSTERTSTSYLSHPSSTTVhqSTAGPVITSIkstmGVTGTPPVHTTSGTTSSPQTPHsTHPISTAAISR 4201
Cdd:COG5099    184 LVVSSIKRFPYLTSLSPFFNYLIDPSSDSA--TASADTSPSF----NPPPNLSPNNLFSTSDLSPLPD-TQSVENNIILN 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4202 TTGISGTPFRTPMKTTI--TFPTPSSLQTSMATLFPP-FSTSVMSSTEIFNT----PTNPHSVSSASTSRPLSTSLPTTI 4274
Cdd:COG5099    257 SSSSINELTSIYGSVPSirNLRGLNSALVSFLNVSSSsLAFSALNGKEVSPTgspsTRSFARVLPKSSPNNLLTEILTTG 336
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 4275 KGTGTPQTPVSDINTTSATTQAHSsfpttrTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHS 4342
Cdd:COG5099    337 VNPPQSLPSLLNPVFLSTSTGFSL------TNLSGYLNPNKNLKKNTLSSLSNLGYSSNVPSPSSSES 398
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1703-1877 5.38e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.82  E-value: 5.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNF-PTHSGPQSSLSTHLPL 1781
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1782 FSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 1859
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
                          170
                   ....*....|....*...
gi 1907182167 1860 QTTIASPTPSAPQTSLAT 1877
Cdd:COG3469    161 GGTTTTSTTTTTTSASTT 178
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1896-2073 5.38e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.82  E-value: 5.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1896 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1975
Cdd:COG3469     29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1976 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2055
Cdd:COG3469    109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
                          170
                   ....*....|....*...
gi 1907182167 2056 STLSVTPTTEGLNTPTSP 2073
Cdd:COG3469    189 TASGATTPSATTTATTTG 206
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3047-3410 5.40e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 5.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3047 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 3126
Cdd:pfam03154  186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3127 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 3206
Cdd:pfam03154  265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3207 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 3282
Cdd:pfam03154  338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3283 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3354
Cdd:pfam03154  413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3355 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3410
Cdd:pfam03154  488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1690-1901 5.62e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.82  E-value: 5.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1690 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 1769
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1770 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 1849
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 1850 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSSVTPTSEVIITPTPQH 1901
Cdd:COG3469    162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3360-3720 5.63e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 5.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3360 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3439
Cdd:pfam03154  169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3440 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3519
Cdd:pfam03154  236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3520 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3599
Cdd:pfam03154  300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3600 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3665
Cdd:pfam03154  380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 3666 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3720
Cdd:pfam03154  460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
1407-1625 5.64e-03

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 42.96  E-value: 5.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422     28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422    108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182167 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422    187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
PHA03247 PHA03247
large tegument protein UL36; Provisional
4183-4460 8.20e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 8.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4183 SSPQTPHSTHPISTAAISRTTGISGTPF----RTPMKTTITFP---TPSSLQTSMATLFPPFSTSVMSSTEIF--NTPTN 4253
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQsarpRAPVDDRGDPRgpaPPSPLPPDTHAPDPPPPSPSPAANEPDphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4254 PHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdinttSATTQA--HSSFPTTRTSTSHLSLPSSMTSTLTPASRSA-STL 4330
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQA-------SSPPQRprRRAARPTVGSLTSLADPPPPPPTPEPAPHALvSAT 2719
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4331 QYTPTPSSVSHSPLLTTPTASPPSSAPTFVSPTAASTVISSALPTIHMTPTPSSRPTS---------STGLLSTSKTTSH 4401
Cdd:PHA03247  2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgpprrltrpAVASLSESRESLP 2799
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4402 VPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4460
Cdd:PHA03247  2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
4049-4271 8.67e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 8.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4049 TTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTE 4128
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4129 AQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSpqtphsthpiSTAAISRTTGISGT 4208
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTS----------GASATSSAGSTTTT 150
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 4209 PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLP 4271
Cdd:COG3469    151 TTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
4272-4402 8.78e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 42.38  E-value: 8.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4272 TTIKGTGTPQTP--VSDINTTSATTQAHSSFPTTRTSTShlslpSSMTSTLTPAsrsastlqyTPTPSSVSHSPLLTTPT 4349
Cdd:PLN02217   548 AWIPGKGVPYIPglFAGNPGSTNSTPTGSAASSNTTFSS-----DSPSTVVAPS---------TSPPAGHLGSPPATPSK 613
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 4350 ASPPSSAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSHV 4402
Cdd:PLN02217   614 IVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMV 666
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2784-3100 8.80e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.60  E-value: 8.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2784 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2847
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2848 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2919
Cdd:pfam05109  502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2920 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2999
Cdd:pfam05109  580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3000 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:pfam05109  656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
                          330       340
                   ....*....|....*....|...
gi 1907182167 3078 KHTTGVSLETSVQTTIASPTPSA 3100
Cdd:pfam05109  731 PPKNATSPQAPSGQKTAVPTVTS 753
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2800-3003 9.13e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 9.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2800 AVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQ 2879
Cdd:COG3469     12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2880 TGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSS 2959
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSS-TAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT 170
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1907182167 2960 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQS 3003
Cdd:COG3469    171 TTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
4098-4309 9.36e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 9.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4098 PVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPpvhT 4177
Cdd:COG3469     11 TAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA---A 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4178 TSGTTSSPQTPHSTHPiSTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSV 4257
Cdd:COG3469     88 AATSTSATLVATSTAS-GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTT 166
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4258 SSASTSRPLSTSLPTTIKGTGTPQTPVSdinTTSATTQAHSSFPTTRTSTSH 4309
Cdd:COG3469    167 STTTTTTSASTTPSATTTATATTASGAT---TPSATTTATTTGPPTPGLPKH 215
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
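
The E-value threshold above can be checked against the per-hit summary lines in this listing. The following is a minimal sketch, not part of any NCBI tool: the names `HIT_LINE`, `parse_hit_line` and `passes_threshold` are hypothetical, the pattern assumes the "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." layout exactly as printed on this page, and treating the threshold as inclusive is an assumption.

```python
import re

# Assumed layout of a per-hit summary line, copied from this listing:
# "Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.55e-03"
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),  # e.g. "Multi-domain"; None when absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.55e-03"
    hit = parse_hit_line(line)
    print(hit, passes_threshold(hit))
```

Every hit shown in this part of the listing has an E-value between 3.55e-03 and 9.36e-03, so all of them fall under the 0.01 threshold.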