|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-132 |
3.05e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization. :
Pssm-ID: 461094 Cd Length: 117 Bit Score: 278.15 E-value: 3.05e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 1039792720 98 QEHQQQVAQAVERAKQVTMTELNAIIG--QQLQAQHL 132
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGqqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
481-766 |
3.43e-43 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.43e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 481 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 559
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 560 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 639
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 640 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 717
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1039792720 718 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 766
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
250-468 |
1.47e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 250 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 319
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 320 TPRNDAPTPGTSTTPGLRSMP--GKPPGMDPIASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------SAYA 386
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsAVTT 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 387 GLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDA 463
Cdd:pfam05109 632 GQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQ 708
|
....*
gi 1039792720 464 LAGPG 468
Cdd:pfam05109 709 ASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-132 |
3.05e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 278.15 E-value: 3.05e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 1039792720 98 QEHQQQVAQAVERAKQVTMTELNAIIG--QQLQAQHL 132
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGqqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
481-766 |
3.43e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.43e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 481 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 559
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 560 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 639
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 640 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 717
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1039792720 718 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 766
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
480-765 |
5.77e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.02 E-value: 5.77e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 480 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 558
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 559 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 638
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 639 RSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGK 716
Cdd:cd00200 160 KLWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSE 239
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1039792720 717 DNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 765
Cdd:cd00200 240 DGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
603-642 |
2.68e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.68e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1039792720 603 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 642
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
502-764 |
2.06e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 502 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 581
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 582 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 651
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 652 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 728
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1039792720 729 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 764
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
605-642 |
3.91e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.91e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1039792720 605 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 642
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
250-468 |
1.47e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 250 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 319
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 320 TPRNDAPTPGTSTTPGLRSMP--GKPPGMDPIASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------SAYA 386
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsAVTT 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 387 GLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDA 463
Cdd:pfam05109 632 GQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQ 708
|
....*
gi 1039792720 464 LAGPG 468
Cdd:pfam05109 709 ASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-132 |
3.05e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 278.15 E-value: 3.05e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 1039792720 98 QEHQQQVAQAVERAKQVTMTELNAIIG--QQLQAQHL 132
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGqqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
481-766 |
3.43e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.43e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 481 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 559
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 560 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 639
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 640 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 717
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1039792720 718 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 766
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
480-765 |
5.77e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.02 E-value: 5.77e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 480 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 558
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 559 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 638
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 639 RSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGK 716
Cdd:cd00200 160 KLWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSE 239
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1039792720 717 DNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 765
Cdd:cd00200 240 DGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
448-766 |
8.31e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 151.99 E-value: 8.31e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 448 VSADGQMQPVPFPHDALAGPGIPRHARQINTLSHGEVVCAVTISNPTRHVYTGGKGCVKIWDISQPGSKSPISQLdclnR 527
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 528 DNYIRSCKLLPDGRTLIVGGEASTLTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLV 607
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 608 RQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMESSNVEVLH- 683
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDl 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 684 HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISADDKYIVTGSGDKKAT 762
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 1039792720 763 VYEV 766
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
474-726 |
4.68e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 149.68 E-value: 4.68e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 474 RQINTLS-HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 551
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 552 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWT 631
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 632 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGK 709
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDlATGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1039792720 710 WFVSTGKDNLLNAWRTP 726
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
575-766 |
3.47e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 106.27 E-value: 3.47e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 575 CYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ-HD 653
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 654 FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 731
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 1039792720 732 --FQSkESSSVLSCDISADDKYIVTGSGDKKATVYEV 766
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
606-766 |
1.32e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 83.92 E-value: 1.32e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 606 LVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 682
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 683 H-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISADDKYIVTGSGDK 759
Cdd:cd00200 79 DlETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 1039792720 760 KATVYEV 766
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
474-603 |
4.15e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 71.87 E-value: 4.15e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 474 RQINTLS-HGEVVCAVTISNPTRHVYTGGKGC-VKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 551
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1039792720 552 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHN 603
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
603-642 |
2.68e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.68e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1039792720 603 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 642
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
502-764 |
2.06e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 502 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 581
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 582 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 651
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 652 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 728
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1039792720 729 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 764
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
605-642 |
3.91e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.91e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1039792720 605 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 642
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
250-468 |
1.47e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 250 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 319
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 320 TPRNDAPTPGTSTTPGLRSMP--GKPPGMDPIASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------SAYA 386
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsAVTT 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 387 GLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDA 463
Cdd:pfam05109 632 GQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQ 708
|
....*
gi 1039792720 464 LAGPG 468
Cdd:pfam05109 709 ASGPG 713
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
561-600 |
2.32e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.14 E-value: 2.32e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1039792720 561 TPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWD 600
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
566-647 |
3.34e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 40.44 E-value: 3.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039792720 566 AELTSSAPACYALAISPDAKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 638
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 1039792720 639 RSWDLREGR 647
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
|