|
Name |
Accession |
Description |
Interval |
E-value |
| Hira |
pfam07569 |
TUP1-like enhancer of split; The Hira proteins are found in a range of eukaryotes and are ... |
696-878 |
3.70e-59 |
|
TUP1-like enhancer of split; The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain pfam00400.
Pssm-ID: 462211 Cd Length: 221 Bit Score: 201.69 E-value: 3.70e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 696 LHCTGPYVMALTAAATLSVWDVHRQVVVVKEESLHSILSGS---------DMTVSQILLTQHGIPVMNLSDGKAYCFNPS 766
Cdd:pfam07569 17 LECSGSYLLAVTSVGLLYVWDIKKQKALLPPVSLAPLLDSSsrysdkltrAPTITSASLTSNGVPIVTLSNGDGYLYDKS 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 767 LSTWNLVSDKQDSLA-QCADFRNSLPSQDAMLCSGPLAIIQGRTSN--SGRQAARLFSVPHVV----------QQETTLA 833
Cdd:pfam07569 97 LETWLRISDSWWALGsQYWDSTGSSRSSSQSSAAGILSFLERKTNEelLRKGRGRLLQRLAKTllmkegfenfETVVTLA 176
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 1907116672 834 YLENQVAAALTLQSSHEYRHWLLLYARYLVNEGFEYRLREICKDL 878
Cdd:pfam07569 177 HLENRLAAALLLGSPDEYRHWLLMYAKRLAEEGLKGRLRELCKEL 221
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2-270 |
2.92e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.42 E-value: 2.92e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 2 YLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVKF 81
Cdd:COG2319 176 LLASGSDDGTVRLW-------------------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWD-LAT 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 82 PEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSAham 161
Cdd:COG2319 236 GKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLT------GHSGGVNSVAFSPDGKLLASG--- 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 162 nNSGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifkkkqkNGSSTkpscpycccAVGSKDRSLSVWLTCLKRPLVVI 241
Cdd:COG2319 307 -SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSP--------DGKTL---------ASGSDDGTVRLWDLATGELLRTL 368
|
250 260
....*....|....*....|....*....
gi 1907116672 242 HElFDKSIMDISWTLNGLGILVCSMDGSV 270
Cdd:COG2319 369 TG-HTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2-285 |
4.87e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 4.87e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 2 YLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNAVKf 81
Cdd:COG2319 134 TLASGSADGTVRLW-------------------DLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT- 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 82 PEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSAHAm 161
Cdd:COG2319 194 GKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT------GHSGSVRSVAFSPDGRLLASGSA- 266
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 162 nnsGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifkkkqkNGSStkpscpyccCAVGSKDRSLSVWlTCLKRPLVVI 241
Cdd:COG2319 267 ---DGTVRLWDLATGELLRTLTGHSGGVNSVAFSP--------DGKL---------LASGSDDGTVRLW-DLATGKLLRT 325
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 1907116672 242 HELFDKSIMDISWTLNGLGILVCSMDGSVAFLDFSQDELGDPLS 285
Cdd:COG2319 326 LTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT 369
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2-285 |
5.05e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 5.05e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 2 YLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNAVKf 81
Cdd:COG2319 92 LLASASADGTVRLW-------------------DLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLAT- 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 82 PEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSAHAm 161
Cdd:COG2319 152 GKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT------GHTGAVRSVAFSPDGKLLASGSA- 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 162 nnsGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifkkkqkNGSStkpscpyccCAVGSKDRSLSVWLTCLKRPLVVI 241
Cdd:COG2319 225 ---DGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP--------DGRL---------LASGSADGTVRLWDLATGELLRTL 284
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 1907116672 242 HELFDkSIMDISWTLNGLGILVCSMDGSVAFLDFSQDELGDPLS 285
Cdd:COG2319 285 TGHSG-GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 327
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2-270 |
2.19e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 121.67 E-value: 2.19e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 2 YLASGGDDKLIMVWKRAT------------------YIGPSTVFGSSG-----KLANVEQWRCVSILRSHSGDVMDVAWS 58
Cdd:cd00200 23 LLATGSGDGTIKVWDLETgellrtlkghtgpvrdvaASADGTYLASGSsdktiRLWDLETGECVRTLTGHTSYVSSVAFS 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 59 PHDAWLASCSVDNTVVIWNAVKFpEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSItkpfdecg 138
Cdd:cd00200 103 PDGRILSSSSRDKTIKVWDVETG-KCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATL-------- 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 139 gTTH---VLRLSWSPDGHYLVSAhamnNSGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNP--KIFkkkqkngsstkpsc 213
Cdd:cd00200 174 -TGHtgeVNSVAFSPDGEKLLSS----SSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPdgYLL-------------- 234
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 1907116672 214 pycccAVGSKDRSLSVWLTCLKRPLVVIHElFDKSIMDISWTLNGLGILVCSMDGSV 270
Cdd:cd00200 235 -----ASGSEDGTIRVWDLRTGECVQTLSG-HTNSVTSLAWSPDGKRLASGSADGTI 285
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
24-285 |
1.40e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 107.30 E-value: 1.40e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 24 STVFGSSGKLANVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPV 103
Cdd:COG2319 53 AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWD-LATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 104 GKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSAHamnnSGPTAQIIEREGWKTNMDFV 183
Cdd:COG2319 132 GKTLASGSADGTVRLWDLATGKLLRTLT------GHSGAVTSVAFSPDGKLLASGS----DDGTVRLWDLATGKLLRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 184 GHRKAVTVVKFNP--KIFkkkqkngsstkpscpycccAVGSKDRSLSVW-LTclKRPLVVIHELFDKSIMDISWTLNGLG 260
Cdd:COG2319 202 GHTGAVRSVAFSPdgKLL-------------------ASGSADGTVRLWdLA--TGKLLRTLTGHSGSVRSVAFSPDGRL 260
|
250 260
....*....|....*....|....*
gi 1907116672 261 ILVCSMDGSVAFLDFSQDELGDPLS 285
Cdd:COG2319 261 LASGSADGTVRLWDLATGELLRTLT 285
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
41-290 |
6.74e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 102.80 E-value: 6.74e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 41 CVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWr 120
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWD-LETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLW- 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 121 tlDWQLETSITKPFDECGGtthVLRLSWSPDGHYLVSAHAMNnsgpTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifk 200
Cdd:cd00200 79 --DLETGECVRTLTGHTSY---VSSVAFSPDGRILSSSSRDK----TIKVWDVETGKCLTTLRGHTDWVNSVAFSP---- 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 201 kkqkNGSSTkpscpycccAVGSKDRSLSVW-LTCLKrpLVVIHELFDKSIMDISWTLNGLGILVCSMDGSVAFLDFSQDE 279
Cdd:cd00200 146 ----DGTFV---------ASSSQDGTIKLWdLRTGK--CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK 210
|
250
....*....|.
gi 1907116672 280 LGDPLSEEEKS 290
Cdd:cd00200 211 CLGTLRGHENG 221
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1-158 |
9.77e-23 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 99.33 E-value: 9.77e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 1 MYLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVK 80
Cdd:cd00200 148 TFVASSSQDGTIKLW-------------------DLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWD-LS 207
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907116672 81 FPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSA 158
Cdd:cd00200 208 TGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS------GHTNSVTSLAWSPDGKRLASG 279
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2-121 |
6.71e-22 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 99.22 E-value: 6.71e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 2 YLASGGDDKLIMVWKRATYIGPSTVFGSSG-----------------------KLANVEQWRCVSILRSHSGDVMDVAWS 58
Cdd:COG2319 260 LLASGSADGTVRLWDLATGELLRTLTGHSGgvnsvafspdgkllasgsddgtvRLWDLATGKLLRTLTGHTGAVRSVAFS 339
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907116672 59 PHDAWLASCSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRT 121
Cdd:COG2319 340 PDGKTLASGSDDGTVRLWD-LATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
3-120 |
3.00e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 89.32 E-value: 3.00e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 3 LASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNAVKFp 82
Cdd:cd00200 192 LLSSSSDGTIKLW-------------------DLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTG- 251
|
90 100 110
....*....|....*....|....*....|....*...
gi 1907116672 83 EILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWR 120
Cdd:cd00200 252 ECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
33-285 |
5.75e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 81.11 E-value: 5.75e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 33 LANVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNAVKfPEILATLRGHSGLVKGLTWDPVGKYIASQAD 112
Cdd:COG2319 20 LLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAA-GALLATLLGHTAAVLSVAFSPDGRLLASASA 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 113 DRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSAHAmnnsGPTAQIIEREGWKTNMDFVGHRKAVTVV 192
Cdd:COG2319 99 DGTVRLWDLATGLLLRTLT------GHTGAVRSVAFSPDGKTLASGSA----DGTVRLWDLATGKLLRTLTGHSGAVTSV 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 193 KFNPkifkkkqkNGSSTkpscpycccAVGSKDRSLSVWLTCLKRPLVVI--HelfDKSIMDISWTLNGLGILVCSMDGSV 270
Cdd:COG2319 169 AFSP--------DGKLL---------ASGSDDGTVRLWDLATGKLLRTLtgH---TGAVRSVAFSPDGKLLASGSADGTV 228
|
250
....*....|....*
gi 1907116672 271 AFLDFSQDELGDPLS 285
Cdd:COG2319 229 RLWDLATGKLLRTLT 243
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
38-77 |
2.14e-09 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 53.47 E-value: 2.14e-09
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1907116672 38 QWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWN 77
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
39-77 |
3.61e-09 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 53.12 E-value: 3.61e-09
10 20 30
....*....|....*....|....*....|....*....
gi 1907116672 39 WRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWN 77
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
83-119 |
4.63e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 44.23 E-value: 4.63e-06
10 20 30
....*....|....*....|....*....|....*..
gi 1907116672 83 EILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVW 119
Cdd:smart00320 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
401-630 |
6.05e-06 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 50.07 E-value: 6.05e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 401 PLSSSLAgtmLSSPSGQQLLPLDSSTPSFGASKPCTEPVAATSARPTG------------ESVSKDSMNATSTPAASS-- 466
Cdd:pfam03546 113 TLTTSPA---QVKPLGKNSQVRPASTVGKGPSGKGANPAPPGKAGSAAplvqvgkkeedsESSSEESDSEGEAPPAATqa 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 467 -PSVLTTPSKIEPMKAFDSRFTERSKATPGApslTSViptAVERLKEQnlvkelrSRELESSSDSDEKVHLAKPSSLSKR 545
Cdd:pfam03546 190 kPSGKILQVRPASGPAKGAAPAPPQKAGPVA---TQV---KAERSKED-------SESSEESSDSEEEAPAAATPAQAKP 256
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 546 KLELEvETVEKKKKGRP--RKDSRLLPMSLSVQSPAALSTEKEAMCLSAPALAlklpiPGPQRAFTLQVSSDPSMYIEVE 623
Cdd:pfam03546 257 ALKTP-QTKASPRKGTPitPTSAKVPPVRVGTPAPWKAGTVTSPACASSPAVA-----RGAQRPEEDSSSSEESESEEET 330
|
....*..
gi 1907116672 624 NEVTTVG 630
Cdd:pfam03546 331 APAAAVG 337
|
|
| PTZ00420 |
PTZ00420 |
coronin; Provisional |
45-119 |
2.81e-05 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 48.02 E-value: 2.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 45 LRSHSGDVMDVAWSP-HDAWLASCSVDNTVVIW---------NAVKFPeiLATLRGHSGLVKGLTWDPVGKYI-ASQADD 113
Cdd:PTZ00420 70 LKGHTSSILDLQFNPcFSEILASGSEDLTIRVWeiphndesvKEIKDP--QCILKGHKKKISIIDWNPMNYYImCSSGFD 147
|
....*.
gi 1907116672 114 RSLKVW 119
Cdd:PTZ00420 148 SFVNIW 153
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
83-119 |
3.41e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 41.56 E-value: 3.41e-05
10 20 30
....*....|....*....|....*....|....*..
gi 1907116672 83 EILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVW 119
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
47-163 |
9.48e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 46.57 E-value: 9.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907116672 47 SHSGDVMD--VAWSPHDAWLASCS-VDNTVVIW----NAVKFPEILATlrGHSGLVKGLTWDPVGKYIASQADDRSLKVw 119
Cdd:COG4946 338 TNTPGVRErlPAWSPDGKSIAYFSdASGEYELYiapaDGSGEPKQLTL--GDLGRVFNPVWSPDGKKIAFTDNRGRLWV- 414
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1907116672 120 rtLDwqLETSITKPFDECGGTTHVLRLSWSPDGHYLVSAHAMNN 163
Cdd:COG4946 415 --VD--LASGKVRKVDTDGYGDGISDLAWSPDSKWLAYSKPGPN 454
|
|
|
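The per-hit statistics lines and the paired alignment rows in the listing above follow a regular layout, so they can be read programmatically. The sketch below is a minimal illustration in Python and is not part of the CD-Search output. The helper names `parse_stats`, `model_coverage`, and `percent_identity` are hypothetical, and the regular expression assumes the exact field order shown in this listing (Pssm-ID, an optional bracketed flag, Cd Length, Bit Score, E-value). The example values are copied from the Hira (pfam07569) and smart00320 (interval 38-77) hits.

```python
import re

# Assumed layout of a statistics line, as it appears in this listing:
# "Pssm-ID: <int> [optional flag] Cd Length: <int> Bit Score: <float> E-value: <float>"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Parse one statistics line into a dict (hypothetical helper)."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the aligned Cdd positions."""
    return (cd_to - cd_from + 1) / cd_length


def percent_identity(query, subject):
    """Percent identity over aligned columns.

    Columns where either row has a gap ('-') are skipped. In this listing,
    lowercase query residues sit opposite gaps, so they are skipped as well.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    pairs = [
        (q.upper(), s.upper())
        for q, s in zip(query, subject)
        if q != "-" and s != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(q == s for q, s in pairs)
    return 100.0 * matches / len(pairs)


# Hira hit: statistics line and aligned model positions 17-221 of 221.
hira = parse_stats(
    "Pssm-ID: 462211 Cd Length: 221 Bit Score: 201.69 E-value: 3.70e-59"
)
print(hira)
print(round(model_coverage(17, 221, hira["cd_length"]), 3))  # 0.928

# smart00320 hit, query interval 38-77 against model positions 1-40.
print(percent_identity(
    "QWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWN",
    "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD",
))  # 37.5
```

Percent identity is computed here only over columns where both rows carry a residue. That is one common convention. Dividing by the full alignment length or by the model length would give lower values for gapped hits.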