|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
387-550 |
2.32e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.32e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182167 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1058-1130 |
1.62e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 111.28 E-value: 1.62e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
857-1019 |
3.80e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.80e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182167 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
45-194 |
2.31e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods. :
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.31e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
6.25e-22 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 92.40 E-value: 6.25e-22
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
1.00e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. :
Pssm-ID: 460351 Cd Length: 55 Bit Score: 76.66 E-value: 1.00e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
1.83e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 67.73 E-value: 1.83e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
5.93e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826. :
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.86 E-value: 5.93e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4506-4585 |
3.69e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers. :
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 3.69e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4506 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4584
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182167 4585 P 4585
Cdd:smart00041 79 P 79
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
3979-4401 |
2.84e-08 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.72 E-value: 2.84e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3979 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 4043
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4044 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 4121
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 4200
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4201 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 4272
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4273 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4351
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4352 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.20e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 51.16 E-value: 1.20e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
2614-2835 |
1.60e-06 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.37 E-value: 1.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2614 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2693
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2694 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2773
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2774 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
1.99e-06 |
|
von Willebrand factor (vWF) type C domain; :
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.94 E-value: 1.99e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182167 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
3523-3723 |
5.91e-06 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 5.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182167 3683 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3723
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
1657-1878 |
6.33e-06 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 6.33e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
2918-3108 |
2.40e-05 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 2.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2997
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2998 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:COG3469 118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
|
170 180 190
....*....|....*....|....*....|.
gi 1907182167 3078 KHTTgvsletsvqTTIASPTPSAPQTSLATH 3108
Cdd:COG3469 194 TTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1727-2176 |
2.65e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 50.69 E-value: 2.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109 480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109 551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109 625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109 701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109 781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
2988-3443 |
7.72e-05 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.55 E-value: 7.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2988 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 3059
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3060 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 3133
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3134 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 3213
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3214 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3290
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3439
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182167 3440 PSAP 3443
Cdd:PHA03247 3016 ETDP 3019
|
|
| 2A1904 super family |
cl36772 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2341-2695 |
2.18e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds] The actual alignment was detected with superfamily member TIGR00927:
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 47.68 E-value: 2.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2341 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2415
Cdd:TIGR00927 75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2416 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2486
Cdd:TIGR00927 149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2487 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2566
Cdd:TIGR00927 223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2567 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2639
Cdd:TIGR00927 302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2640 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2695
Cdd:TIGR00927 382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
3806-4026 |
2.22e-03 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3806 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3885
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3886 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3965
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 3966 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 4026
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| ROM1 super family |
cl34999 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
5.64e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms]; The actual alignment was detected with superfamily member COG5422:
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 42.96 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182167 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
387-550 |
2.32e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.32e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182167 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
398-550 |
4.45e-29 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 115.55 E-value: 4.45e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1058-1130 |
1.62e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 111.28 E-value: 1.62e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
857-1019 |
3.80e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.80e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182167 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
1062-1129 |
5.66e-26 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 103.61 E-value: 5.66e-26
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
45-194 |
2.31e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.31e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
45-193 |
1.11e-24 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 103.25 E-value: 1.11e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216 12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216 90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
6.25e-22 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 92.40 E-value: 6.25e-22
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
592-661 |
1.42e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 90.90 E-value: 1.42e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742 1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
869-1019 |
1.80e-20 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 90.89 E-value: 1.80e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094 78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
1.00e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 76.66 E-value: 1.00e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
303-358 |
8.47e-16 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 74.28 E-value: 8.47e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
1.83e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 67.73 E-value: 1.83e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
765-828 |
2.53e-13 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 67.03 E-value: 2.53e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826 1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
5.93e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.86 E-value: 5.93e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4506-4585 |
3.69e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 3.69e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4506 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4584
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182167 4585 P 4585
Cdd:smart00041 79 P 79
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
246-299 |
4.02e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 55.81 E-value: 4.02e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832 25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3979-4401 |
2.84e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.72 E-value: 2.84e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3979 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 4043
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4044 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 4121
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 4200
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4201 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 4272
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4273 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4351
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4352 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.20e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 51.16 E-value: 1.20e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
4076-4392 |
1.34e-06 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 55.00 E-value: 1.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4076 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 4145
Cdd:TIGR00927 68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4146 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 4202
Cdd:TIGR00927 148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4203 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 4270
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4271 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4348
Cdd:TIGR00927 306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 4349 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4392
Cdd:TIGR00927 386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2614-2835 |
1.60e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.37 E-value: 1.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2614 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2693
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2694 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2773
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2774 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
1.99e-06 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.94 E-value: 1.99e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182167 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3523-3723 |
5.91e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 5.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182167 3683 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3723
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1657-1878 |
6.33e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 6.33e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
665-722 |
8.32e-06 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 45.84 E-value: 8.32e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2918-3108 |
2.40e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 2.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2997
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2998 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:COG3469 118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
|
170 180 190
....*....|....*....|....*....|.
gi 1907182167 3078 KHTTgvsletsvqTTIASPTPSAPQTSLATH 3108
Cdd:COG3469 194 TTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1727-2176 |
2.65e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 50.69 E-value: 2.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109 480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109 551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109 625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109 701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109 781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2450-2846 |
3.41e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 50.30 E-value: 3.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2450 TTTAATQTKSSFSTDRTSTPhlSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTTSGttSSPQTPRTTHPFSTVAVS 2529
Cdd:pfam05109 428 TTTSPTLNTTGFAAPNTTTG--LPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGTTSG--ASPVTPSPSPRDNGTESK 502
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2530 NTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPfssTSSVTPTSEVIITPTPQHTLssaststtmgnilPTTIGQTGS 2609
Cdd:pfam05109 503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSP---TLGKTSPTSAVTTPTPNATS-------------PTPAVTTPT 566
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2610 PHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 2689
Cdd:pfam05109 567 PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2690 PRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTG 2769
Cdd:pfam05109 647 RPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEV 724
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2770 GLT-GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT----KHTTGVSLETSVQTTI---ASPTPSAPQTSLATHLPFSST 2841
Cdd:pfam05109 725 NVTkGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSttggKHTTGHGARTSTEPTTdygGDSTTPRTRYNATTYLPPSTS 804
|
....*
gi 1907182167 2842 SAVTP 2846
Cdd:pfam05109 805 SKLRP 809
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2988-3443 |
7.72e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.55 E-value: 7.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2988 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 3059
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3060 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 3133
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3134 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 3213
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3214 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3290
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3439
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182167 3440 PSAP 3443
Cdd:PHA03247 3016 ETDP 3019
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
4144-4378 |
1.05e-04 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 48.73 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4144 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 4218
Cdd:COG5422 59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4219 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 4298
Cdd:COG5422 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4299 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4373
Cdd:COG5422 211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286
|
....*
gi 1907182167 4374 PTIHM 4378
Cdd:COG5422 287 MRLQL 291
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
2260-3880 |
1.27e-04 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 48.61 E-value: 1.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2260 TTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 2339
Cdd:COG3210 80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2340 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2419
Cdd:COG3210 160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2420 VASTSMplmTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2499
Cdd:COG3210 240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2500 GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVII 2579
Cdd:COG3210 317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2580 TPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 2659
Cdd:COG3210 397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2660 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLF 2739
Cdd:COG3210 477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2740 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2819
Cdd:COG3210 557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2820 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2899
Cdd:COG3210 637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2900 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2970
Cdd:COG3210 717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2971 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 3050
Cdd:COG3210 797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3051 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 3130
Cdd:COG3210 877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3131 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 3210
Cdd:COG3210 957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3211 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 3290
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:COG3210 1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3450
Cdd:COG3210 1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3451 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3530
Cdd:COG3210 1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
|
1290 1300 1310 1320 1330 1340 1350 1360
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3531 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3610
Cdd:COG3210 1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
|
1370 1380 1390 1400 1410 1420 1430 1440
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3611 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3690
Cdd:COG3210 1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
|
1450 1460 1470 1480 1490 1500 1510 1520
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3691 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3770
Cdd:COG3210 1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
|
1530 1540 1550 1560 1570 1580 1590 1600
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3771 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3850
Cdd:COG3210 1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
|
1610 1620 1630
....*....|....*....|....*....|
gi 1907182167 3851 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3880
Cdd:COG3210 1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2341-2695 |
2.18e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 47.68 E-value: 2.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2341 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2415
Cdd:TIGR00927 75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2416 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2486
Cdd:TIGR00927 149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2487 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2566
Cdd:TIGR00927 223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2567 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2639
Cdd:TIGR00927 302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2640 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2695
Cdd:TIGR00927 382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2303-2521 |
2.97e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 2.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2303 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2382
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2383 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVMYTTTAATQTKSSF 2461
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2462 STDRTSTPHlsqSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2521
Cdd:COG3469 159 ATGGTTTTS---TTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2631-3110 |
6.53e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 6.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2631 STDRT-STPTSAPHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHT 2707
Cdd:PHA03247 2563 APDRSvPPPRPAPRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAA 2635
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2708 TGLPSGTSVQTTTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSP 2787
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2788 QTPRTTHPFSTVAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAstst 2867
Cdd:PHA03247 2703 PPPPTPEPAPHALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP---- 2760
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2868 ttgnilPTTigqTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGT 2947
Cdd:PHA03247 2761 ------PTT---AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2948 PVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQ 3027
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERP 2908
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3028 STPI----PATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQ 3102
Cdd:PHA03247 2909 PQPQapppPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREA 2988
|
....*...
gi 1907182167 3103 TSLATHLP 3110
Cdd:PHA03247 2989 PASSTPPL 2996
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2169-2584 |
7.36e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 7.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2169 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSA 2248
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2249 STSTTTGnilPTTIGKTGSPHTSVPviyTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 2328
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2329 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2407
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2408 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2482
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2483 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTT 2561
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182167 2562 HLPFSSTSSVTPTSEVIITPTPQ 2584
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1758-2222 |
9.77e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 9.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247 2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247 2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| VWC |
pfam00093 |
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ... |
360-395 |
1.47e-03 |
|
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.
Pssm-ID: 278520 Cd Length: 57 Bit Score: 39.72 E-value: 1.47e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1907182167 360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093 1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1492-1875 |
1.50e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.14 E-value: 1.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154 223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154 285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154 364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154 440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| Hamartin |
pfam04388 |
Hamartin protein; This family includes the hamartin protein which is thought to function as a ... |
4109-4392 |
1.70e-03 |
|
Hamartin protein; This family includes the hamartin protein which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein pfam03542. Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation either TSC1 or TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin pfam03542. These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.
Pssm-ID: 461287 [Multi-domain] Cd Length: 730 Bit Score: 44.66 E-value: 1.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4109 VTTPPNTSTpvTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTG--TPPvhTTSGT--TSS 4184
Cdd:pfam04388 276 PTASPYTDQ--QSSYGSSTSTPSSTPRLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGmtTPP--TSPGMvpTTP 351
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4185 PQTPHST-HPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTS 4263
Cdd:pfam04388 352 SELSPSSsHLSSRGSSPPEAAGEATPETTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPLSKQAP 431
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4264 RPLSTSLPTTIKGTGTP--QTPVSDINTT----------------------SATTQAHSSFPTTR------TSTSHLSLP 4313
Cdd:pfam04388 432 TNPNSRGLLEPPGDKSSvtLSELPDFIKDlalssedsvegaeeeaaisqelSEITTEKNETDCSRggldmpFSRTMESLA 511
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4314 SSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASPPSSAPTFvSPTAASTVISSalPTIHMTPTPSSRPTSSTGL 4392
Cdd:pfam04388 512 GSQRSRNRIASYCSSTSQSDSHGPATTPESKPSALAEDGLRRTKSC-SFKQSFTPIEQ--PIESSDDCPTDEQDGENGL 587
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3806-4026 |
2.22e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3806 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3885
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3886 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3965
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 3966 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 4026
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3191-3405 |
2.72e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3191 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3270
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3271 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3349
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3350 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3405
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1961-2179 |
2.82e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2917-3241 |
2.97e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.83 E-value: 2.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2917 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2990
Cdd:TIGR00927 91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2991 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 3056
Cdd:TIGR00927 169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3057 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3136
Cdd:TIGR00927 246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3137 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 3206
Cdd:TIGR00927 326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182167 3207 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 3241
Cdd:TIGR00927 405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
4114-4342 |
4.18e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.46 E-value: 4.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4114 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 4193
Cdd:NF033849 250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4194 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 4268
Cdd:NF033849 330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 4269 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4342
Cdd:NF033849 410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
1669-2011 |
5.27e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.06 E-value: 5.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927 73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927 151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927 308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927 387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3047-3410 |
5.40e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.22 E-value: 5.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3047 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 3126
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3127 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 3206
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3207 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 3282
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3283 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3354
Cdd:pfam03154 413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3355 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3410
Cdd:pfam03154 488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3360-3720 |
5.63e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.22 E-value: 5.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3360 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3439
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3440 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3519
Cdd:pfam03154 236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3520 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3599
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3600 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3665
Cdd:pfam03154 380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 3666 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3720
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
5.64e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 42.96 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182167 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2784-3100 |
8.80e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.60 E-value: 8.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2784 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2847
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2848 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2919
Cdd:pfam05109 502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2920 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2999
Cdd:pfam05109 580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3000 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:pfam05109 656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
|
330 340
....*....|....*....|...
gi 1907182167 3078 KHTTGVSLETSVQTTIASPTPSA 3100
Cdd:pfam05109 731 PPKNATSPQAPSGQKTAVPTVTS 753
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
387-550 |
2.32e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.32e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182167 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
398-550 |
4.45e-29 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 115.55 E-value: 4.45e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1058-1130 |
1.62e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 111.28 E-value: 1.62e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
857-1019 |
3.80e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.80e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182167 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
1062-1129 |
5.66e-26 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 103.61 E-value: 5.66e-26
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
45-194 |
2.31e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.31e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
45-193 |
1.11e-24 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 103.25 E-value: 1.11e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216 12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216 90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
6.25e-22 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 92.40 E-value: 6.25e-22
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
592-661 |
1.42e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 90.90 E-value: 1.42e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742 1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
869-1019 |
1.80e-20 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 90.89 E-value: 1.80e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094 78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
1.00e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 76.66 E-value: 1.00e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
303-358 |
8.47e-16 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 74.28 E-value: 8.47e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
1.83e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 67.73 E-value: 1.83e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
765-828 |
2.53e-13 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 67.03 E-value: 2.53e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826 1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
5.93e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.86 E-value: 5.93e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4506-4585 |
3.69e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 3.69e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4506 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4584
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182167 4585 P 4585
Cdd:smart00041 79 P 79
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
246-299 |
4.02e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 55.81 E-value: 4.02e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832 25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3979-4401 |
2.84e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.72 E-value: 2.84e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3979 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 4043
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4044 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 4121
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 4200
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4201 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 4272
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4273 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4351
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4352 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
3898-4357 |
4.14e-08 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 60.18 E-value: 4.14e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3898 PLFSTLSVTPTTEGLNTPT-SPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPvtytttaatqtkssfsTDRTSTPHLSQ 3976
Cdd:PHA03307 54 TVVAGAAACDRFEPPTGPPpGPGTEAPANESRSTPTWSLSTLAPASPAREGSP----------------TPPGPSSPDPP 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3977 SSTVTPTQPTPIPATTNSPMTTvglTGTPVVHTPSGTSSIAHTPHTthslPTAASSSTTLSTAPQFRTSEQSTttfPTPS 4056
Cdd:PHA03307 118 PPTPPPASPPPSPAPDLSEMLR---PVGSPGPPPAASPPAAGASPA----AVASDAASSRQAALPLSSPEETA---RAPS 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4057 APQTSLVTSLPPFSTSSVSPTdeihitstnPHTVSSVSMSRPVSTILQTtievttpPNTSTPVTHSTSATTEAQGSFSTE 4136
Cdd:PHA03307 188 SPPAEPPPSTPPAAASPRPPR---------RSSPISASASSPAPAPGRS-------AADDAGASSSDSSSSESSGCGWGP 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4137 RTSTSyLSHPssttvhqstaGPVITSIKSTMGVTGTPPvhttsGTTSSPQTPHSTHPISTAAISRttGISGTPFRTPMKT 4216
Cdd:PHA03307 252 ENECP-LPRP----------APITLPTRIWEASGWNGP-----SSRPGPASSSSSPRERSPSPSP--SSPGSGPAPSSPR 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4217 TITFPTPSSLQTSMATLfppfSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdintTSATTQA 4296
Cdd:PHA03307 314 ASSSSSSSRESSSSSTS----SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPS------SPAASAG 383
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4297 HSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASP-PSSAP 4357
Cdd:PHA03307 384 RPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPwPGSPP 445
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.20e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 51.16 E-value: 1.20e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
4076-4392 |
1.34e-06 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 55.00 E-value: 1.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4076 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 4145
Cdd:TIGR00927 68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4146 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 4202
Cdd:TIGR00927 148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4203 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 4270
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4271 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4348
Cdd:TIGR00927 306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 4349 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4392
Cdd:TIGR00927 386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2614-2835 |
1.60e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.37 E-value: 1.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2614 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2693
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2694 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2773
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2774 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
1.99e-06 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.94 E-value: 1.99e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182167 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3523-3723 |
5.91e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 5.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182167 3683 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3723
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1657-1878 |
6.33e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 6.33e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2601-2794 |
7.70e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.06 E-value: 7.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2601 PTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTT 2680
Cdd:COG3469 26 AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGA 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2681 SGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPA 2760
Cdd:COG3469 106 NTGTSTVTTT----STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182167 2761 TTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2794
Cdd:COG3469 182 TTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
665-722 |
8.32e-06 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 45.84 E-value: 8.32e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2635-2858 |
1.44e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 51.29 E-value: 1.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2635 TSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 2714
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2715 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2794
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 2795 PFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfSSTSAVTPTSEVIITPTPQH 2858
Cdd:COG3469 156 TETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT----PSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2918-3108 |
2.40e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 2.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2997
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2998 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:COG3469 118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
|
170 180 190
....*....|....*....|....*....|.
gi 1907182167 3078 KHTTgvsletsvqTTIASPTPSAPQTSLATH 3108
Cdd:COG3469 194 TTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1727-2176 |
2.65e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 50.69 E-value: 2.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109 480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109 551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109 625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109 701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109 781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2450-2846 |
3.41e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 50.30 E-value: 3.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2450 TTTAATQTKSSFSTDRTSTPhlSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTTSGttSSPQTPRTTHPFSTVAVS 2529
Cdd:pfam05109 428 TTTSPTLNTTGFAAPNTTTG--LPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGTTSG--ASPVTPSPSPRDNGTESK 502
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2530 NTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPfssTSSVTPTSEVIITPTPQHTLssaststtmgnilPTTIGQTGS 2609
Cdd:pfam05109 503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSP---TLGKTSPTSAVTTPTPNATS-------------PTPAVTTPT 566
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2610 PHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 2689
Cdd:pfam05109 567 PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2690 PRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTG 2769
Cdd:pfam05109 647 RPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEV 724
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2770 GLT-GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT----KHTTGVSLETSVQTTI---ASPTPSAPQTSLATHLPFSST 2841
Cdd:pfam05109 725 NVTkGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSttggKHTTGHGARTSTEPTTdygGDSTTPRTRYNATTYLPPSTS 804
|
....*
gi 1907182167 2842 SAVTP 2846
Cdd:pfam05109 805 SKLRP 809
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2645-2856 |
4.91e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 4.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2645 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2724
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTT----GSVVVAASGSAGSGTGTTAASSTAATSSTT 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2725 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2804
Cdd:COG3469 77 STTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 2805 KHTTGVsleTSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTP 2856
Cdd:COG3469 157 ETATGG---TTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1644-1833 |
5.87e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 5.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1644 PTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTT 1723
Cdd:COG3469 26 AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGA 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1724 SGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPA 1803
Cdd:COG3469 106 NTGTSTVTTT----STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|
gi 1907182167 1804 ITNSLmTTGGLTGTPPVHTTSGTTSSPQTP 1833
Cdd:COG3469 182 TTTAT-ATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3495-3682 |
7.65e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 48.98 E-value: 7.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3495 TGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSS 3574
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3575 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLM 3654
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA 187
|
170 180
....*....|....*....|....*...
gi 1907182167 3655 TTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469 188 TTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2988-3443 |
7.72e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.55 E-value: 7.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2988 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 3059
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3060 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 3133
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3134 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 3213
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3214 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3290
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3439
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182167 3440 PSAP 3443
Cdd:PHA03247 3016 ETDP 3019
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
4144-4378 |
1.05e-04 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 48.73 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4144 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 4218
Cdd:COG5422 59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4219 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 4298
Cdd:COG5422 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4299 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4373
Cdd:COG5422 211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286
|
....*
gi 1907182167 4374 PTIHM 4378
Cdd:COG5422 287 MRLQL 291
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
2260-3880 |
1.27e-04 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 48.61 E-value: 1.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2260 TTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 2339
Cdd:COG3210 80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2340 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2419
Cdd:COG3210 160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2420 VASTSMplmTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2499
Cdd:COG3210 240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2500 GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVII 2579
Cdd:COG3210 317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2580 TPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 2659
Cdd:COG3210 397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2660 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLF 2739
Cdd:COG3210 477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2740 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2819
Cdd:COG3210 557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2820 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2899
Cdd:COG3210 637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2900 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2970
Cdd:COG3210 717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2971 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 3050
Cdd:COG3210 797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3051 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 3130
Cdd:COG3210 877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3131 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 3210
Cdd:COG3210 957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3211 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 3290
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3291 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3370
Cdd:COG3210 1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3371 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3450
Cdd:COG3210 1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3451 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3530
Cdd:COG3210 1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
|
1290 1300 1310 1320 1330 1340 1350 1360
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3531 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3610
Cdd:COG3210 1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
|
1370 1380 1390 1400 1410 1420 1430 1440
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3611 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3690
Cdd:COG3210 1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
|
1450 1460 1470 1480 1490 1500 1510 1520
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3691 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3770
Cdd:COG3210 1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
|
1530 1540 1550 1560 1570 1580 1590 1600
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3771 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3850
Cdd:COG3210 1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
|
1610 1620 1630
....*....|....*....|....*....|
gi 1907182167 3851 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3880
Cdd:COG3210 1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2341-2695 |
2.18e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 47.68 E-value: 2.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2341 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2415
Cdd:TIGR00927 75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2416 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2486
Cdd:TIGR00927 149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2487 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2566
Cdd:TIGR00927 223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2567 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2639
Cdd:TIGR00927 302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2640 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2695
Cdd:TIGR00927 382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1623-1821 |
2.26e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.44 E-value: 2.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1623 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1702
Cdd:COG3469 12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTN-----FPTHSGPQSSLST 1777
Cdd:COG3469 92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVsgtetATGGTTTTSTTTT 171
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1907182167 1778 HLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVH 1821
Cdd:COG3469 172 TTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2303-2521 |
2.97e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 2.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2303 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2382
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2383 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVMYTTTAATQTKSSF 2461
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2462 STDRTSTPHlsqSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2521
Cdd:COG3469 159 ATGGTTTTS---TTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2873-3067 |
4.25e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.28 E-value: 4.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2873 LPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSiKPTMSSTGTPVVHT 2952
Cdd:COG3469 22 LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA-AAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2953 TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIP 3032
Cdd:COG3469 101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
|
170 180 190
....*....|....*....|....*....|....*
gi 1907182167 3033 ATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 3067
Cdd:COG3469 181 ATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
4042-4320 |
4.26e-04 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 46.81 E-value: 4.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4042 FRTSEQSTTTFPTPSAPQTSLVTSLPPfsTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTH 4121
Cdd:COG5422 17 FGAPRKSDAFVSKQLLPPRRLQRKLNP--ISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITH 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 STSAT--TEAQGSFS---TERTSTSYLSHPSSTTVHQSTAGPvitsikstmgvTGTPpvhttSGTTSSPQTPHSTHPIST 4196
Cdd:COG5422 95 SPSATssTSSLNSNDgdqFSPASDSLSFNPSSTQSRKDSGPG-----------DGSP-----VQKRKNPLLPSSSTHGTH 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4197 AAISrTTGISGTPFRTPMKTTiTFPTPSSLQTSMATLFPPF--STSVMSSTEIFNTP---TNPHSVSSASTSRPLSTSLP 4271
Cdd:COG5422 159 PPIV-FTDNNGSHAGAPNARS-RKEIPSLGSQSMQLPSPHFrqKFSSSDTSNGFSYPsirKNSRHSSNSMPSFPHSSTAV 236
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1907182167 4272 TTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTL 4320
Cdd:COG5422 237 LLKRHSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYPALLSRVAVEF 285
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
4165-4402 |
5.49e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.28 E-value: 5.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4165 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSS 4244
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4245 TEIFNTPtnphsvsSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 4324
Cdd:COG3469 82 ATAAAAA-------ATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 4325 RSASTLQYTPTPSSVShspllTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV 4402
Cdd:COG3469 155 GTETATGGTTTTSTTT-----TTTSASTTPSATTTATATTASG-----------ATTPSATTTATTTGPPTPGLPKHV 216
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3818-4391 |
5.71e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 5.71e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3818 PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqSSLSTHL 3897
Cdd:PHA03247 2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP-SPAANEP 2638
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3898 PLFSTLSVTPTTEGLNTPTSPH-SLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTT------------AATQTKSSF 3964
Cdd:PHA03247 2639 DPHPPPTVPPPERPRDDPAPGRvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpppppptpePAPHALVSA 2718
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3965 STDRTSTPHLSQSSTVTPTQPTPiPATTNSPMTTVGLTGTPVVHTPSGTSSI------AHTPHTTHSLPTAASSSTTLST 4038
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPappaapAAGPPRRLTRPAVASLSESRES 2797
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4039 APQFRTSEQSTT--TFPTPSAPQTSLVTSL--PPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVStilqttievTTPPN 4114
Cdd:PHA03247 2798 LPSPWDPADPPAavLAPAAALPPAASPAGPlpPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR---------RRPPS 2868
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4115 TSTPVTHSTSAtteaqgsfsteRTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHT-TSGTTSSPQTPHSTHP 4193
Cdd:PHA03247 2869 RSPAAKPAAPA-----------RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPP 2937
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4194 ISTAAISRTTGISGTPfrtpmkttitFPTPSSLQTSMATLFPPfstsvmssteifNTPTNPHSVSSASTSRPLSTSLPTT 4273
Cdd:PHA03247 2938 RPQPPLAPTTDPAGAG----------EPSGAVPQPWLGALVPG------------RVAVPRFRVPQPAPSREAPASSTPP 2995
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4274 IKGTGTPQTpvsdinTTSATTQAHSSFPTTRtstshlslPSSMTSTLTPAS----RSASTLQYTPTPSSVSHSPLLTTPT 4349
Cdd:PHA03247 2996 LTGHSLSRV------SSWASSLALHEETDPP--------PVSLKQTLWPPDdtedSDADSLFDSDSERSDLEALDPLPPE 3061
|
570 580 590 600
....*....|....*....|....*....|....*....|....*....
gi 1907182167 4350 ASPPSSAPTFVSPTAAStviSSALPTIHMTPTP-------SSRPTSSTG 4391
Cdd:PHA03247 3062 PHDPFAHEPDPATPEAG---ARESPSSQFGPPPlsanaalSRRYVRSTG 3107
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2631-3110 |
6.53e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 6.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2631 STDRT-STPTSAPHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHT 2707
Cdd:PHA03247 2563 APDRSvPPPRPAPRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAA 2635
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2708 TGLPSGTSVQTTTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSP 2787
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2788 QTPRTTHPFSTVAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAstst 2867
Cdd:PHA03247 2703 PPPPTPEPAPHALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP---- 2760
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2868 ttgnilPTTigqTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGT 2947
Cdd:PHA03247 2761 ------PTT---AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2948 PVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQ 3027
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERP 2908
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3028 STPI----PATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQ 3102
Cdd:PHA03247 2909 PQPQapppPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREA 2988
|
....*...
gi 1907182167 3103 TSLATHLP 3110
Cdd:PHA03247 2989 PASSTPPL 2996
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
4165-4463 |
6.59e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 6.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4165 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPfRTPMKTTITfPTPSSLqTSMATLFPPFST---SV 4241
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPP-QRPRRRAAR-PTVGSL-TSLADPPPPPPTpepAP 2712
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4242 MSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLT 4321
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4322 PASRSASTLQYTPTPSSVSHSPLLTTPTASPPssAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSH 4401
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4402 VPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLPTSA 4463
Cdd:PHA03247 2871 PAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2169-2584 |
7.36e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 7.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2169 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSA 2248
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2249 STSTTTGnilPTTIGKTGSPHTSVPviyTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 2328
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2329 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2407
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2408 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2482
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2483 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTT 2561
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182167 2562 HLPFSSTSSVTPTSEVIITPTPQ 2584
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
1486-3107 |
8.01e-04 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 45.91 E-value: 8.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1486 TAPLITSITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQTPRTTHP 1565
Cdd:COG3210 41 GSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGGIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTA 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1566 FSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTTGNILPT 1645
Cdd:COG3210 121 ASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGA 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1646 TIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSG 1725
Cdd:COG3210 201 LINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGT 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1726 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAIT 1805
Cdd:COG3210 281 NATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAG 360
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1806 NSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTS 1885
Cdd:COG3210 361 SGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGT 440
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1886 SVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSA 1965
Cdd:COG3210 441 VTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAG 520
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1966 VTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTprTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQ 2045
Cdd:COG3210 521 GGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASG--SNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGG 598
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2046 SSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTS 2125
Cdd:COG3210 599 TVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTV 678
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2126 TPHLSQSSTVTPTQSTPIPATTNS----LMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV 2201
Cdd:COG3210 679 TSGATGGTTGTTLNAATGGTLNNAgntlTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLS 758
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2202 QTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSASTSTTTGNILPT----TIGKTGSPHTSVPVIYT 2277
Cdd:COG3210 759 IGLTANTTASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSggtiTINTATTGLTGTGDTTS 838
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2278 TSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTV 2357
Cdd:COG3210 839 GAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTA 918
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2358 AVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEG 2437
Cdd:COG3210 919 TGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILV 998
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2438 TRPPHTSVPVMYTTTAATQTKSSFSTDRT----STPHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSS 2513
Cdd:COG3210 999 AGNSGTTASTTGGSGAIVAGGNGVTGTTGtasaTGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNG 1078
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2514 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVIITPTPQHTLSSASTS 2593
Cdd:COG3210 1079 GGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGA 1158
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2594 TTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTG 2673
Cdd:COG3210 1159 SSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSA 1238
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2674 TPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNT 2753
Cdd:COG3210 1239 GQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGI 1318
|
1290 1300 1310 1320 1330 1340 1350 1360
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2754 QSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA 2833
Cdd:COG3210 1319 GGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNA 1398
|
1370 1380 1390 1400 1410 1420 1430 1440
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2834 THLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTS 2913
Cdd:COG3210 1399 GRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVG 1478
|
1450 1460 1470 1480 1490 1500 1510 1520
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2914 APHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTT 2993
Cdd:COG3210 1479 GAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGA 1558
|
1530 1540 1550 1560 1570 1580 1590 1600
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2994 NFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVA 3073
Cdd:COG3210 1559 AGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNG 1638
|
1610 1620 1630
....*....|....*....|....*....|....
gi 1907182167 3074 VSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 3107
Cdd:COG3210 1639 GEGVLALVAGGNTTNGTTLSGAVNGAGNGWAVDL 1672
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2501-2832 |
8.74e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.91 E-value: 8.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2501 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVIIT 2580
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2581 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2660
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2661 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPqsslSTH- 2735
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHp 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2736 -----LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2810
Cdd:pfam03154 413 pplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGI 492
|
330 340
....*....|....*....|..
gi 1907182167 2811 SLETSVQTTIASPTPSAPQTSL 2832
Cdd:pfam03154 493 QPPSSASVSSSGPVPAAVSCPL 514
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3523-3746 |
9.07e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.51 E-value: 9.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3523 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3602
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3603 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3682
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 3683 PFSTVAVSNTKHTTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 3746
Cdd:COG3469 156 TETATGGTTTTSTTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1758-2222 |
9.77e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 9.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247 2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247 2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2398-2585 |
1.08e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.13 E-value: 1.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2398 STLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPvmYTTTAATQTKSSFSTDRTSTPHLSQSSTV 2477
Cdd:COG3469 33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT--ATAAAAAATSTSATLVATSTASGANTGTS 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2478 TPTQ-STPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQ 2556
Cdd:COG3469 111 TVTTtSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA 190
|
170 180
....*....|....*....|....*....
gi 1907182167 2557 TSLTThlpfSSTSSVTPTSEVIITPTPQH 2585
Cdd:COG3469 191 SGATT----PSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2920-3131 |
1.45e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2920 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2999
Cdd:COG3469 14 GASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTS 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3000 GPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 3079
Cdd:COG3469 94 ATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 3080 TTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 3131
Cdd:COG3469 168 TTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
|
|
| VWC |
pfam00093 |
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ... |
360-395 |
1.47e-03 |
|
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.
Pssm-ID: 278520 Cd Length: 57 Bit Score: 39.72 E-value: 1.47e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1907182167 360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093 1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1492-1875 |
1.50e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.14 E-value: 1.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154 223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154 285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154 364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154 440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1827-2242 |
1.67e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.91 E-value: 1.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1827 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 1906
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1907 STSTTTGnilPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 1986
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1987 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2065
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2066 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2140
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2141 TPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 2219
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182167 2220 HLPFSSTSSVTPTSKVIITPTPQ 2242
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Hamartin |
pfam04388 |
Hamartin protein; This family includes the hamartin protein which is thought to function as a ... |
4109-4392 |
1.70e-03 |
|
Hamartin protein; This family includes the hamartin protein which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein pfam03542. Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation either TSC1 or TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin pfam03542. These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.
Pssm-ID: 461287 [Multi-domain] Cd Length: 730 Bit Score: 44.66 E-value: 1.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4109 VTTPPNTSTpvTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTG--TPPvhTTSGT--TSS 4184
Cdd:pfam04388 276 PTASPYTDQ--QSSYGSSTSTPSSTPRLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGmtTPP--TSPGMvpTTP 351
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4185 PQTPHST-HPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTS 4263
Cdd:pfam04388 352 SELSPSSsHLSSRGSSPPEAAGEATPETTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPLSKQAP 431
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4264 RPLSTSLPTTIKGTGTP--QTPVSDINTT----------------------SATTQAHSSFPTTR------TSTSHLSLP 4313
Cdd:pfam04388 432 TNPNSRGLLEPPGDKSSvtLSELPDFIKDlalssedsvegaeeeaaisqelSEITTEKNETDCSRggldmpFSRTMESLA 511
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4314 SSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASPPSSAPTFvSPTAASTVISSalPTIHMTPTPSSRPTSSTGL 4392
Cdd:pfam04388 512 GSQRSRNRIASYCSSTSQSDSHGPATTPESKPSALAEDGLRRTKSC-SFKQSFTPIEQ--PIESSDDCPTDEQDGENGL 587
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
830-887 |
1.77e-03 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 39.85 E-value: 1.77e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 830 CDYAGVSYPGGFELHTDCKTCTCSQGRWTCQlSTQCPSTCVLYGEGHIITFDGQRFVF 887
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCT-KVWCGPKPCLLHNLSGECPLGQGCVP 57
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
4181-4460 |
1.86e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.52 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4181 TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMatlfPPFSTSVMSSTEIFNTPTNPHSVSSA 4260
Cdd:pfam05109 392 TVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAA----PNTTTGLPSSTHVPTNLTAPASTGPT 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4261 STSRPLSTSLPTTIKGTGTPQTPVSDI----------NTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL 4330
Cdd:pfam05109 468 VSTADVTSPTPAGTTSGASPVTPSPSPrdngteskapDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSA 547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4331 QYTPTPSSVSHSPLLTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV--PTFSSF 4408
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVT-----------TPTPNATSPTVGETSPQANTTNHTlgGTSSTP 616
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4409 SSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4460
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMP 668
|
|
| VWC |
smart00214 |
von Willebrand factor (vWF) type C domain; |
360-395 |
1.96e-03 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214564 Cd Length: 59 Bit Score: 39.42 E-value: 1.96e-03
10 20 30
....*....|....*....|....*....|....*...
gi 1907182167 360 CMLNGMVYGPGEITKT-ACQTCQCTMGRW-TCTKQPCP 395
Cdd:smart00214 1 CVHNGRVYNDGETWKPdPCQICTCLDGTTvLCDPVECP 38
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3977-4224 |
2.00e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 2.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3977 SSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTThSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPS 4056
Cdd:COG3469 3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSV-VVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4057 APQTSLVTSLPPFSTSSVSPTDeihitstnphtvssvsmsrPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTE 4136
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTAS-------------------GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATS 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4137 RTSTSYLSHPSSTTVHqstagpvitsikstmGVTGTPPVHTTSGTTSSPQTPhSTHPISTAAISRTTGISGTPFRTPMKT 4216
Cdd:COG3469 143 SAGSTTTTTTVSGTET---------------ATGGTTTTSTTTTTTSASTTP-SATTTATATTASGATTPSATTTATTTG 206
|
....*...
gi 1907182167 4217 TITFPTPS 4224
Cdd:COG3469 207 PPTPGLPK 214
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3806-4026 |
2.22e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3806 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3885
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3886 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3965
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182167 3966 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 4026
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3968-4366 |
2.34e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 2.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3968 RTSTPHLSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQ 4047
Cdd:pfam03154 168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHP 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4048 STTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPV-THSTSAT 4126
Cdd:pfam03154 248 PLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGqSQQRIHT 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4127 TEAQGSFSTERTSTSYLSHPSSTTVhqstagPVItsikstmgvtgTPPVHTTSGTTSSPQT-PHSTHPISTAAISRTTGI 4205
Cdd:pfam03154 328 PPSQSQLQSQQPPREQPLPPAPLSM------PHI-----------KPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNL 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4206 SGTPFRTPMKTTITFPTPSS-------LQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTT--IKG 4276
Cdd:pfam03154 391 PPPPALKPLSSLSTHHPPSAhppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHpfVPG 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4277 TGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSvshspllTTPTASPPSSA 4356
Cdd:pfam03154 471 GPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPES-------PPPPPRSPSPE 543
|
410
....*....|.
gi 1907182167 4357 PTFV-SPTAAS 4366
Cdd:pfam03154 544 PTVVnTPSHAS 554
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2933-3107 |
2.42e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2933 VSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 3011
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3012 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 3089
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182167 3090 QTTIASPTPSAPQTSLAT 3107
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2670-2888 |
2.52e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2670 SSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPlfSTLSVTPTTE 2749
Cdd:COG3469 5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAAT--SSTTSTTATA 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2750 GLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQ 2829
Cdd:COG3469 83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGG 162
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2830 TSlathlpfsSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVP 2888
Cdd:COG3469 163 TT--------TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
4147-4374 |
2.54e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4147 SSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTpfrtpmkttiTFPTPSSL 4226
Cdd:COG3469 4 VSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATS 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4227 QTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTS 4306
Cdd:COG3469 74 STTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 4307 TSHLSLPSSMTSTLTPASRSASTLQYTPTPSsvshspllTTPTASPPSSAPTFVSPTAASTVISSALP 4374
Cdd:COG3469 154 SGTETATGGTTTTSTTTTTTSASTTPSATTT--------ATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3815-4188 |
2.58e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.80 E-value: 2.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3815 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3894
Cdd:pfam17823 45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3895 THLPLFSTLSVTPTTEGLNTPTSphslSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHL 3974
Cdd:pfam17823 125 SAAQSLPAAIAALPSEAFSAPRA----AACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAAS 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3975 SQSSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFP 4053
Cdd:pfam17823 201 SAPATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRL 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4054 TPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQG 4131
Cdd:pfam17823 281 SPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSA 360
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 4132 --------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 4188
Cdd:pfam17823 361 spvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3191-3405 |
2.72e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3191 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3270
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3271 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3349
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3350 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3405
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3548-3722 |
2.79e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3548 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 3626
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3627 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 3704
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182167 3705 QTTIASPTPSAPQTSLAT 3722
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1961-2179 |
2.82e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2917-3241 |
2.97e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.83 E-value: 2.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2917 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2990
Cdd:TIGR00927 91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2991 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 3056
Cdd:TIGR00927 169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3057 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3136
Cdd:TIGR00927 246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3137 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 3206
Cdd:TIGR00927 326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182167 3207 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 3241
Cdd:TIGR00927 405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1913-2104 |
3.07e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.59 E-value: 3.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1913 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 1992
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1993 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 2070
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182167 2071 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 2104
Cdd:COG3469 182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3143-3334 |
3.07e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.59 E-value: 3.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3143 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 3222
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3223 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 3300
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182167 3301 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 3334
Cdd:COG3469 182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2328-2537 |
3.28e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.59 E-value: 3.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2328 SSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfSTLSVTPTTE 2407
Cdd:COG3469 5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST-----AATSSTTSTT 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2408 GLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPA 2487
Cdd:COG3469 80 ATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETA 159
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2488 TTNSlmTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2537
Cdd:COG3469 160 TGGT--TTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGP 207
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3901-4068 |
3.55e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.59 E-value: 3.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3901 STLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTV 3980
Cdd:COG3469 33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTV 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3981 TPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQF---RTSEQSTTTFPTPSA 4057
Cdd:COG3469 113 TTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSastTPSATTTATATTASG 192
|
170
....*....|.
gi 1907182167 4058 PQTSLVTSLPP 4068
Cdd:COG3469 193 ATTPSATTTAT 203
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
4114-4342 |
4.18e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.46 E-value: 4.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4114 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 4193
Cdd:NF033849 250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4194 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 4268
Cdd:NF033849 330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 4269 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4342
Cdd:NF033849 410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3672-4130 |
4.83e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 43.37 E-value: 4.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3672 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3751
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3752 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3827
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3828 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3907
Cdd:pfam05109 579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3908 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3987
Cdd:pfam05109 658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3988 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 4065
Cdd:pfam05109 738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4066 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 4130
Cdd:pfam05109 815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
1669-2011 |
5.27e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.06 E-value: 5.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927 73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927 151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927 308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927 387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
3963-4342 |
5.38e-03 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 43.20 E-value: 5.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3963 SFSTDRTSTPHLSQSSTVTpTQPTPIPATTNSPMTTVGlTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTapqf 4042
Cdd:COG5099 38 STPNSFSPIPSKASSSATF-TLNLPINNSVNHKITSSS-SSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPST---- 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4043 rtseqSTTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHI-TSTNPHTVSSVSMSRPVSTILQttiEVTTPPNTSTPVTH 4121
Cdd:COG5099 112 -----SSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSnKLPLPNPNHSNSATTNQSGSSF---INTPASSSSQPLTN 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4122 STSATTEAQGSFSTERTSTSYLSHPSSTTVhqSTAGPVITSIkstmGVTGTPPVHTTSGTTSSPQTPHsTHPISTAAISR 4201
Cdd:COG5099 184 LVVSSIKRFPYLTSLSPFFNYLIDPSSDSA--TASADTSPSF----NPPPNLSPNNLFSTSDLSPLPD-TQSVENNIILN 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4202 TTGISGTPFRTPMKTTI--TFPTPSSLQTSMATLFPP-FSTSVMSSTEIFNT----PTNPHSVSSASTSRPLSTSLPTTI 4274
Cdd:COG5099 257 SSSSINELTSIYGSVPSirNLRGLNSALVSFLNVSSSsLAFSALNGKEVSPTgspsTRSFARVLPKSSPNNLLTEILTTG 336
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182167 4275 KGTGTPQTPVSDINTTSATTQAHSsfpttrTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHS 4342
Cdd:COG5099 337 VNPPQSLPSLLNPVFLSTSTGFSL------TNLSGYLNPNKNLKKNTLSSLSNLGYSSNVPSPSSSES 398
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1703-1877 |
5.38e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.82 E-value: 5.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNF-PTHSGPQSSLSTHLPL 1781
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1782 FSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 1859
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182167 1860 QTTIASPTPSAPQTSLAT 1877
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1896-2073 |
5.38e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.82 E-value: 5.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1896 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1975
Cdd:COG3469 29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1976 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2055
Cdd:COG3469 109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
|
170
....*....|....*...
gi 1907182167 2056 STLSVTPTTEGLNTPTSP 2073
Cdd:COG3469 189 TASGATTPSATTTATTTG 206
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3047-3410 |
5.40e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.22 E-value: 5.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3047 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 3126
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3127 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 3206
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3207 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 3282
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3283 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3354
Cdd:pfam03154 413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182167 3355 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3410
Cdd:pfam03154 488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1690-1901 |
5.62e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.82 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1690 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 1769
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1770 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 1849
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1907182167 1850 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSSVTPTSEVIITPTPQH 1901
Cdd:COG3469 162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3360-3720 |
5.63e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.22 E-value: 5.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3360 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3439
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3440 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3519
Cdd:pfam03154 236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3520 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3599
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3600 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3665
Cdd:pfam03154 380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182167 3666 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3720
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
5.64e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 42.96 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182167 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
4183-4460 |
8.20e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 8.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4183 SSPQTPHSTHPISTAAISRTTGISGTPF----RTPMKTTITFP---TPSSLQTSMATLFPPFSTSVMSSTEIF--NTPTN 4253
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQsarpRAPVDDRGDPRgpaPPSPLPPDTHAPDPPPPSPSPAANEPDphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4254 PHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdinttSATTQA--HSSFPTTRTSTSHLSLPSSMTSTLTPASRSA-STL 4330
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQA-------SSPPQRprRRAARPTVGSLTSLADPPPPPPTPEPAPHALvSAT 2719
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4331 QYTPTPSSVSHSPLLTTPTASPPSSAPTFVSPTAASTVISSALPTIHMTPTPSSRPTS---------STGLLSTSKTTSH 4401
Cdd:PHA03247 2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgpprrltrpAVASLSESRESLP 2799
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182167 4402 VPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4460
Cdd:PHA03247 2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
4049-4271 |
8.67e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 8.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4049 TTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTE 4128
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4129 AQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSpqtphsthpiSTAAISRTTGISGT 4208
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTS----------GASATSSAGSTTTT 150
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 4209 PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLP 4271
Cdd:COG3469 151 TTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
4272-4402 |
8.78e-03 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 42.38 E-value: 8.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4272 TTIKGTGTPQTP--VSDINTTSATTQAHSSFPTTRTSTShlslpSSMTSTLTPAsrsastlqyTPTPSSVSHSPLLTTPT 4349
Cdd:PLN02217 548 AWIPGKGVPYIPglFAGNPGSTNSTPTGSAASSNTTFSS-----DSPSTVVAPS---------TSPPAGHLGSPPATPSK 613
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1907182167 4350 ASPPSSAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSHV 4402
Cdd:PLN02217 614 IVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMV 666
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2784-3100 |
8.80e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.60 E-value: 8.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2784 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2847
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2848 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2919
Cdd:pfam05109 502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2920 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2999
Cdd:pfam05109 580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 3000 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 3077
Cdd:pfam05109 656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
|
330 340
....*....|....*....|...
gi 1907182167 3078 KHTTGVSLETSVQTTIASPTPSA 3100
Cdd:pfam05109 731 PPKNATSPQAPSGQKTAVPTVTS 753
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2800-3003 |
9.13e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 9.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2800 AVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQ 2879
Cdd:COG3469 12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 2880 TGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSS 2959
Cdd:COG3469 92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSS-TAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT 170
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1907182167 2960 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQS 3003
Cdd:COG3469 171 TTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
4098-4309 |
9.36e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 9.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4098 PVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPpvhT 4177
Cdd:COG3469 11 TAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA---A 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182167 4178 TSGTTSSPQTPHSTHPiSTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSV 4257
Cdd:COG3469 88 AATSTSATLVATSTAS-GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTT 166
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182167 4258 SSASTSRPLSTSLPTTIKGTGTPQTPVSdinTTSATTQAHSSFPTTRTSTSH 4309
Cdd:COG3469 167 STTTTTTSASTTPSATTTATATTASGAT---TPSATTTATTTGPPTPGLPKH 215
|
|
|