|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
24-103 |
3.05e-64 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 209.20 E-value: 3.05e-64
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 24 FKFTISESCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLNAICAQVIPFLS 103
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
462-747 |
2.19e-44 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 164.70 E-value: 2.19e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 462 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAa 540
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 541 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 620
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 621 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKD 698
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1191017746 699 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 747
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
462-746 |
4.79e-39 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 146.33 E-value: 4.79e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 462 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCRLLPDGRTLIVGGEASTLSIWDLaa 540
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 541 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 620
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 621 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPD-KYQLHLHESCVLSLKFAHCGKWFVSTGKD 698
Cdd:cd00200 161 LWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSED 240
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1191017746 699 NLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 746
Cdd:cd00200 241 GTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
497-745 |
2.74e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 54.32 E-value: 2.74e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 497 KSPVSQLDCLNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAAPtprIKAELTSSAPACYALAISPDSKVCF------- 569
Cdd:PLN00181 471 KADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRSKLSGICWnsyiksq 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 570 --SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-C 645
Cdd:PLN00181 548 vaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 646 PTGEWLAVGMENSNVEVLHVTKPdkyQLHL-----HESCVLSLKFAHCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSS 720
Cdd:PLN00181 628 ESGRSLAFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHS 703
|
250 260 270
....*....|....*....|....*....|..
gi 1191017746 721 VLS-------CDISVDDKYIVTGSGDKKATVY 745
Cdd:PLN00181 704 FMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
584-623 |
4.65e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.54 E-value: 4.65e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1191017746 584 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 623
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
586-623 |
5.70e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.49 E-value: 5.70e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1191017746 586 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 623
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
24-103 |
3.05e-64 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 209.20 E-value: 3.05e-64
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 24 FKFTISESCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLNAICAQVIPFLS 103
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
462-747 |
2.19e-44 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 164.70 E-value: 2.19e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 462 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAa 540
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 541 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 620
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 621 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKD 698
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1191017746 699 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 747
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
429-747 |
3.32e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 155.84 E-value: 3.32e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 429 VSADGQMQPVPFPPDALIGPGIPRHARQINTLNHGEVVCAVTISNPTRHVYTGGKGCVKVWDISHPGNKSPVSQLdclnR 508
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 509 DNYIRSCRLLPDGRTLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLV 588
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 589 RQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMENSNVEVLHV 665
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 666 -TKPDKYQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISVDDKYIVTGSGDKKAT 743
Cdd:COG2319 234 aTGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 1191017746 744 VYEV 747
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
455-707 |
2.84e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 153.14 E-value: 2.84e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 455 RQINTLN-HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEAST 532
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 533 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWT 612
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 613 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMENSNVEVLHV-TKPDKYQLHLHESCVLSLKFAHCGK 690
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLaTGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1191017746 691 WFVSTGKDNLLNAWRTP 707
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
462-746 |
4.79e-39 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 146.33 E-value: 4.79e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 462 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCRLLPDGRTLIVGGEASTLSIWDLaa 540
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 541 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 620
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 621 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPD-KYQLHLHESCVLSLKFAHCGKWFVSTGKD 698
Cdd:cd00200 161 LWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSED 240
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1191017746 699 NLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 746
Cdd:cd00200 241 GTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
510-747 |
1.39e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.98 E-value: 1.39e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 510 NYIRSCRLLPDGRTLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVR 589
Cdd:cd00200 10 GGVTCVAFSPDGKLLATGSGDGTIKVWDLE--TGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 590 QFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEV--LHVT 666
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTlRGHTDWVNSVAFSPDGTFVASSSQDGTIKLwdLRTG 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 667 KPdKYQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDKKATV 744
Cdd:cd00200 168 KC-VATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLgtLRGHE-NGVNSVAFSPDGYLLASGSEDGTIRV 245
|
...
gi 1191017746 745 YEV 747
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
587-747 |
3.45e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 94.32 E-value: 3.45e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 587 LVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMENSNVEVL 663
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 664 HVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDK 740
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 1191017746 741 KATVYEV 747
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
455-584 |
1.21e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 73.41 E-value: 1.21e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 455 RQINTLN-HGEVVCAVTISNPTRHVYTGGKGC-VKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEAST 532
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1191017746 533 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHN 584
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
497-745 |
2.74e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 54.32 E-value: 2.74e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 497 KSPVSQLDCLNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAAPtprIKAELTSSAPACYALAISPDSKVCF------- 569
Cdd:PLN00181 471 KADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRSKLSGICWnsyiksq 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 570 --SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-C 645
Cdd:PLN00181 548 vaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 646 PTGEWLAVGMENSNVEVLHVTKPdkyQLHL-----HESCVLSLKFAHCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSS 720
Cdd:PLN00181 628 ESGRSLAFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHS 703
|
250 260 270
....*....|....*....|....*....|..
gi 1191017746 721 VLS-------CDISVDDKYIVTGSGDKKATVY 745
Cdd:PLN00181 704 FMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
584-623 |
4.65e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.54 E-value: 4.65e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1191017746 584 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 623
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
586-623 |
5.70e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.49 E-value: 5.70e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1191017746 586 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 623
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
547-628 |
1.09e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 41.98 E-value: 1.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 547 AELTSSAPACYALAISPDSKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTV 619
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 1191017746 620 RSWDLREGR 628
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
542-581 |
1.09e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 1.09e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1191017746 542 TPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWD 581
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| COG5276 |
COG5276 |
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function ... |
477-623 |
8.89e-03 |
|
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function unknown];
Pssm-ID: 444087 [Multi-domain] Cd Length: 320 Bit Score: 38.77 E-value: 8.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1191017746 477 HVYTG-GKGCVKVWDISHPGNKSPVSQLDCLNRDNYirscRLLPDGRTLIVGGEAST-LSIWDLAAPT-PRIKAELTSSA 553
Cdd:COG5276 31 YAYVAgGSNGLAIVDVSDPANPVLVGSLPTPGGTWR----DVKVSGDYLYVASEGSEgLQIFDISDPAnPKLVGRYDTGG 106
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1191017746 554 PACYALAISpDSKVCFSCCSDGNIAVWDLHNQT---LVRQFQgHTDGASCIDISNDGTKLWTGGLDNTVRSWD 623
Cdd:COG5276 107 SGAHNIAVD-GNYAYVAGGSDNGLVIVDISDPTnpvLVGRYS-LPGQAYLHDVQVVGDYAYVADWEDGLVIVD 177
|
|
|