|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-143 |
8.34e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization. :
Pssm-ID: 461094 Cd Length: 117 Bit Score: 277.38 E-value: 8.34e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 145207988 98 QEHQQQVAQAVERAKQVTMTELNAIIGVrglpnlpltQQQLQAQHL 143
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ---------QQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
495-780 |
3.32e-43 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.32e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 495 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 573
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 574 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 653
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 654 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 731
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 145207988 732 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 780
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
261-482 |
5.66e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 5.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 261 DVSNEDPA-----TPRVSPAHSPPENGLD-KARGLKKDAPTSPASVASSSSTPSSKTKDlghNDKSSTPGLKSNTPTPRN 334
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTEsKAPDMTSPTSAVTTPTPNATSPTPAVTTP---TPNATSPTLGKTSPTSAV 548
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 335 DAPTP-GTSTTPGLrSMPGKPPGMDPIGIMA--SALRTPiTLTSSYP-----APFAMMSHHEMNGSLTSP---------- 396
Cdd:pfam05109 549 TTPTPnATSPTPAV-TTPTPNATIPTLGKTSptSAVTTP-TPNATSPtvgetSPQANTTNHTLGGTSSTPvvtsppknat 626
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 397 -SAYAGLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVP 472
Cdd:pfam05109 627 sAVTTGQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRP 703
|
250
....*....|
gi 145207988 473 FPHDALAGPG 482
Cdd:pfam05109 704 GTTSQASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-143 |
8.34e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 277.38 E-value: 8.34e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 145207988 98 QEHQQQVAQAVERAKQVTMTELNAIIGVrglpnlpltQQQLQAQHL 143
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ---------QQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
495-780 |
3.32e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.32e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 495 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 573
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 574 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 653
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 654 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 731
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 145207988 732 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 780
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
494-779 |
3.73e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.79 E-value: 3.73e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 494 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 572
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 573 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 652
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 653 RSWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVST 728
Cdd:cd00200 160 KLWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 145207988 729 GKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 779
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
617-656 |
2.42e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.69 E-value: 2.42e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 145207988 617 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 656
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
516-778 |
2.11e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 516 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 595
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 596 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 665
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 666 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 742
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 145207988 743 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 778
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
619-656 |
3.64e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.64e-06
10 20 30
....*....|....*....|....*....|....*...
gi 145207988 619 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 656
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
261-482 |
5.66e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 5.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 261 DVSNEDPA-----TPRVSPAHSPPENGLD-KARGLKKDAPTSPASVASSSSTPSSKTKDlghNDKSSTPGLKSNTPTPRN 334
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTEsKAPDMTSPTSAVTTPTPNATSPTPAVTTP---TPNATSPTLGKTSPTSAV 548
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 335 DAPTP-GTSTTPGLrSMPGKPPGMDPIGIMA--SALRTPiTLTSSYP-----APFAMMSHHEMNGSLTSP---------- 396
Cdd:pfam05109 549 TTPTPnATSPTPAV-TTPTPNATIPTLGKTSptSAVTTP-TPNATSPtvgetSPQANTTNHTLGGTSSTPvvtsppknat 626
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 397 -SAYAGLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVP 472
Cdd:pfam05109 627 sAVTTGQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRP 703
|
250
....*....|
gi 145207988 473 FPHDALAGPG 482
Cdd:pfam05109 704 GTTSQASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-143 |
8.34e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 277.38 E-value: 8.34e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 145207988 98 QEHQQQVAQAVERAKQVTMTELNAIIGVrglpnlpltQQQLQAQHL 143
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ---------QQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
495-780 |
3.32e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.32e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 495 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 573
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 574 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 653
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 654 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 731
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 145207988 732 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 780
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
494-779 |
3.73e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.79 E-value: 3.73e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 494 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 572
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 573 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 652
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 653 RSWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVST 728
Cdd:cd00200 160 KLWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 145207988 729 GKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 779
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
462-780 |
8.07e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 151.99 E-value: 8.07e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 462 VSADGQMQPVPFPHDALAGPGIPRHARQINTLSHGEVVCAVTISNPTRHVYTGGKGCVKIWDISQPGSKSPISQLdclnR 541
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 542 DNYIRSCKLLPDGRTLIVGGEASTLTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLV 621
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 622 RQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMESSNVEVLH- 697
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDl 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 698 HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISADDKYIVTGSGDKKAT 776
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 145207988 777 VYEV 780
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
488-740 |
4.40e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 149.68 E-value: 4.40e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 488 RQINTLS-HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 565
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 566 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWT 645
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 646 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGK 723
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDlATGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 145207988 724 WFVSTGKDNLLNAWRTP 740
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
589-780 |
2.63e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 106.65 E-value: 2.63e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 589 CYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ-HD 667
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 668 FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 745
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 145207988 746 --FQSkESSSVLSCDISADDKYIVTGSGDKKATVYEV 780
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
620-780 |
1.01e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 1.01e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 620 LVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 696
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 697 H-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISADDKYIVTGSGDK 773
Cdd:cd00200 79 DlETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 145207988 774 KATVYEV 780
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
488-617 |
3.97e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 71.87 E-value: 3.97e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 488 RQINTLS-HGEVVCAVTISNPTRHVYTGGKGC-VKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 565
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 145207988 566 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHN 617
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
617-656 |
2.42e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.69 E-value: 2.42e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 145207988 617 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 656
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
516-778 |
2.11e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 516 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 595
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 596 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 665
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 666 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 742
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 145207988 743 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 778
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
619-656 |
3.64e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.64e-06
10 20 30
....*....|....*....|....*....|....*...
gi 145207988 619 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 656
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
575-614 |
2.25e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 2.25e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 145207988 575 TPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWD 614
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
580-661 |
3.32e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 40.44 E-value: 3.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 580 AELTSSAPACYALAISPDAKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 652
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 145207988 653 RSWDLREGR 661
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
261-482 |
5.66e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 5.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 261 DVSNEDPA-----TPRVSPAHSPPENGLD-KARGLKKDAPTSPASVASSSSTPSSKTKDlghNDKSSTPGLKSNTPTPRN 334
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTEsKAPDMTSPTSAVTTPTPNATSPTPAVTTP---TPNATSPTLGKTSPTSAV 548
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 335 DAPTP-GTSTTPGLrSMPGKPPGMDPIGIMA--SALRTPiTLTSSYP-----APFAMMSHHEMNGSLTSP---------- 396
Cdd:pfam05109 549 TTPTPnATSPTPAV-TTPTPNATIPTLGKTSptSAVTTP-TPNATSPtvgetSPQANTTNHTLGGTSSTPvvtsppknat 626
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145207988 397 -SAYAGLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVP 472
Cdd:pfam05109 627 sAVTTGQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRP 703
|
250
....*....|
gi 145207988 473 FPHDALAGPG 482
Cdd:pfam05109 704 GTTSQASGPG 713
|
|
|