|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-133 |
6.56e-91 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization. :
Pssm-ID: 461094 Cd Length: 117 Bit Score: 280.08 E-value: 6.56e-91
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 568961195 98 QEHQQQVAQAVERAKQVTMTELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
480-765 |
2.25e-43 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.00 E-value: 2.25e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 480 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 558
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 559 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 638
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 639 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 716
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961195 717 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 765
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
251-467 |
5.62e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 251 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 320
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 321 TPRNDAPTPGTSTTPGLRSMP--GKppgMDPIGIMASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------S 387
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGK---TSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsA 628
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 388 AYAGLHNI----PSQMSAAAAAAAAAYGRSPMVGFDPHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDA 462
Cdd:pfam05109 629 VTTGQHNItsssTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQ 708
|
....*
gi 568961195 463 LAGPG 467
Cdd:pfam05109 709 ASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-133 |
6.56e-91 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 280.08 E-value: 6.56e-91
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 568961195 98 QEHQQQVAQAVERAKQVTMTELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
480-765 |
2.25e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.00 E-value: 2.25e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 480 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 558
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 559 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 638
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 639 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 716
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961195 717 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 765
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
479-764 |
4.46e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.41 E-value: 4.46e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 479 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 557
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 558 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 637
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 638 RSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGK 715
Cdd:cd00200 160 KLWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSE 239
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961195 716 DNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 764
Cdd:cd00200 240 DGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
602-641 |
2.54e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.54e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568961195 602 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 641
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
501-763 |
2.05e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.05e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 501 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 580
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 581 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 650
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 651 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 727
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 568961195 728 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 763
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
604-641 |
3.79e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.79e-06
10 20 30
....*....|....*....|....*....|....*...
gi 568961195 604 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 641
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
251-467 |
5.62e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 251 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 320
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 321 TPRNDAPTPGTSTTPGLRSMP--GKppgMDPIGIMASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------S 387
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGK---TSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsA 628
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 388 AYAGLHNI----PSQMSAAAAAAAAAYGRSPMVGFDPHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDA 462
Cdd:pfam05109 629 VTTGQHNItsssTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQ 708
|
....*
gi 568961195 463 LAGPG 467
Cdd:pfam05109 709 ASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-133 |
6.56e-91 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 280.08 E-value: 6.56e-91
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 568961195 98 QEHQQQVAQAVERAKQVTMTELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
480-765 |
2.25e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.00 E-value: 2.25e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 480 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 558
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 559 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 638
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 639 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 716
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961195 717 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 765
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
479-764 |
4.46e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.41 E-value: 4.46e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 479 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 557
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 558 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 637
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 638 RSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGK 715
Cdd:cd00200 160 KLWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSE 239
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961195 716 DNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 764
Cdd:cd00200 240 DGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
447-765 |
5.74e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.37 E-value: 5.74e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 447 VSADGQMQPVPFPHDALAGPGIPRHARQINTLSHGEVVCAVTISNPTRHVYTGGKGCVKIWDISQPGSKSPISQLdclnR 526
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 527 DNYIRSCKLLPDGRTLIVGGEASTLTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLV 606
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 607 RQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMESSNVEVLH- 682
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDl 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 683 HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISADDKYIVTGSGDKKAT 761
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 568961195 762 VYEV 765
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
473-725 |
2.86e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 150.45 E-value: 2.86e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 473 RQINTLS-HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 550
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 551 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWT 630
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 631 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGK 708
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDlATGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 568961195 709 WFVSTGKDNLLNAWRTP 725
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
574-765 |
3.00e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 106.65 E-value: 3.00e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 574 CYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ-HD 652
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 653 FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 730
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 568961195 731 --FQSkESSSVLSCDISADDKYIVTGSGDKKATVYEV 765
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
605-765 |
1.12e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 1.12e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 605 LVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 681
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 682 H-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISADDKYIVTGSGDK 758
Cdd:cd00200 79 DlETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 568961195 759 KATVYEV 765
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
473-602 |
3.38e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 72.25 E-value: 3.38e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 473 RQINTLS-HGEVVCAVTISNPTRHVYTGGKGC-VKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 550
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 568961195 551 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHN 602
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
602-641 |
2.54e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.54e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568961195 602 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 641
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
501-763 |
2.05e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.05e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 501 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 580
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 581 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 650
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 651 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 727
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 568961195 728 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 763
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
604-641 |
3.79e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.79e-06
10 20 30
....*....|....*....|....*....|....*...
gi 568961195 604 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 641
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
560-599 |
2.25e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 2.25e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568961195 560 TPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWD 599
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
565-646 |
3.30e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 40.44 E-value: 3.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 565 AELTSSAPACYALAISPDAKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 637
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 568961195 638 RSWDLREGR 646
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
251-467 |
5.62e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 251 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 320
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 321 TPRNDAPTPGTSTTPGLRSMP--GKppgMDPIGIMASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------S 387
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGK---TSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsA 628
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961195 388 AYAGLHNI----PSQMSAAAAAAAAAYGRSPMVGFDPHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDA 462
Cdd:pfam05109 629 VTTGQHNItsssTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQ 708
|
....*
gi 568961195 463 LAGPG 467
Cdd:pfam05109 709 ASGPG 713
|
|
|