| Name | Accession | Description | Interval | E-value |
| Hira | pfam07569 | TUP1-like enhancer of split; The Hira proteins are found in a range of eukaryotes and are ... | 734-916 | 3.82e-59 |
|
TUP1-like enhancer of split; The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain pfam00400.
Pssm-ID: 462211 Cd Length: 221 Bit Score: 201.69 E-value: 3.82e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 734 LHCTGPYVMALTAAATLSVWDVHRQVVVVKEESLHSILSGS---------DMTVSQILLTQHGIPVMNLSDGKAYCFNPS 804
Cdd:pfam07569 17 LECSGSYLLAVTSVGLLYVWDIKKQKALLPPVSLAPLLDSSsrysdkltrAPTITSASLTSNGVPIVTLSNGDGYLYDKS 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 805 LSTWNLVSDKQDSLA-QCADFRNSLPSQDAMLCSGPLAIIQGRTSN--SGRQAARLFSVPHVV----------QQETTLA 871
Cdd:pfam07569 97 LETWLRISDSWWALGsQYWDSTGSSRSSSQSSAAGILSFLERKTNEelLRKGRGRLLQRLAKTllmkegfenfETVVTLA 176
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 568994636 872 YLENQVAAALTLQSSHEYRHWLLLYARYLVNEGFEYRLREICKDL 916
Cdd:pfam07569 177 HLENRLAAALLLGSPDEYRHWLLMYAKRLAEEGLKGRLRELCKEL 221
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 25-308 | 4.97e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 141.59 E-value: 4.97e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 25 HLACVNCVRWSNSGMYLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLAS 104
Cdd:COG2319 161 HSGAVTSVAFSPDGKLLASGSDDGTVRLW-------------------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 105 CSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRL 184
Cdd:COG2319 222 GSADGTVRLWD-LATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLT------GHSGGVNSV 294
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 185 SWSPDGHYLVSAhamnNSGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifkkkqkNGSSTkpscpycccAVGSKDRS 264
Cdd:COG2319 295 AFSPDGKLLASG----SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSP--------DGKTL---------ASGSDDGT 353
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 568994636 265 LSVWLTCLKRPLVVIHElFDKSIMDISWTLNGLGILVCSMDGSV 308
Cdd:COG2319 354 VRLWDLATGELLRTLTG-HTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 25-308 | 2.56e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 136.31 E-value: 2.56e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 25 HLACVNCVRWSNSGMYLASGGDDKLIMVWKRAT------------------YIGPSTVFGSSG-----KLANVEQWRCVS 81
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETgellrtlkghtgpvrdvaASADGTYLASGSsdktiRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 82 ILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNAVKFpEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLD 161
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG-KCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRT 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 162 WQLETSItkpfdecggTTH---VLRLSWSPDGHYLVSAhamnNSGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNP--KI 236
Cdd:cd00200 167 GKCVATL---------TGHtgeVNSVAFSPDGEKLLSS----SSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPdgYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568994636 237 FkkkqkngsstkpscpycccAVGSKDRSLSVWLTCLKRPLVVIHElFDKSIMDISWTLNGLGILVCSMDGSV 308
Cdd:cd00200 234 L-------------------ASGSEDGTIRVWDLRTGECVQTLSG-HTNSVTSLAWSPDGKRLASGSADGTI 285
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 25-323 | 7.94e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 137.74 E-value: 7.94e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 25 HLACVNCVRWSNSGMYLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLAS 104
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSADGTVRLW-------------------DLATGKLLRTLTGHSGAVTSVAFSPDGKLLAS 179
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 105 CSVDNTVVIWNAVKfPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRL 184
Cdd:COG2319 180 GSDDGTVRLWDLAT-GKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT------GHSGSVRSV 252
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 185 SWSPDGHYLVSAHAmnnsGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifkkkqkNGSStkpscpyccCAVGSKDRS 264
Cdd:COG2319 253 AFSPDGRLLASGSA----DGTVRLWDLATGELLRTLTGHSGGVNSVAFSP--------DGKL---------LASGSDDGT 311
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 568994636 265 LSVWlTCLKRPLVVIHELFDKSIMDISWTLNGLGILVCSMDGSVAFLDFSQDELGDPLS 323
Cdd:COG2319 312 VRLW-DLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT 369
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 25-323 | 1.96e-33 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 133.88 E-value: 1.96e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 25 HLACVNCVRWSNSGMYLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLAS 104
Cdd:COG2319 77 HTAAVLSVAFSPDGRLLASASADGTVRLW-------------------DLATGLLLRTLTGHTGAVRSVAFSPDGKTLAS 137
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 105 CSVDNTVVIWNAVKfPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRL 184
Cdd:COG2319 138 GSADGTVRLWDLAT-GKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT------GHTGAVRSV 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 185 SWSPDGHYLVSAHAmnnsGPTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifkkkqkNGSStkpscpyccCAVGSKDRS 264
Cdd:COG2319 211 AFSPDGKLLASGSA----DGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP--------DGRL---------LASGSADGT 269
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 568994636 265 LSVWLTCLKRPLVVIHELFDkSIMDISWTLNGLGILVCSMDGSVAFLDFSQDELGDPLS 323
Cdd:COG2319 270 VRLWDLATGELLRTLTGHSG-GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 327
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 12-196 | 1.59e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 116.28 E-value: 1.59e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 12 DENIPKMLCQMDNHLACVNCVRWSNSGMYLASGGDDKLIMVWKRATY-----------------IGPSTVFGSSG----- 69
Cdd:cd00200 79 DLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGkclttlrghtdwvnsvaFSPDGTFVASSsqdgt 158
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 70 -KLANVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPVGKYIASQ 148
Cdd:cd00200 159 iKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWD-LSTGKCLGTLRGHENGVNSVAFSPDGYLLASG 237
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 568994636 149 ADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSA 196
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTLS------GHTNSVTSLAWSPDGKRLASG 279
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 25-159 | 4.47e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 111.93 E-value: 4.47e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 25 HLACVNCVRWSNSGMYLASGGDDKLIMVWKRATYIGPSTVFGSSG-----------------------KLANVEQWRCVS 81
Cdd:COG2319 245 HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGgvnsvafspdgkllasgsddgtvRLWDLATGKLLR 324
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568994636 82 ILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRT 159
Cdd:COG2319 325 TLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD-LATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 62-323 | 6.28e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.46 E-value: 6.28e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 62 STVFGSSGKLANVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPV 141
Cdd:COG2319 53 AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWD-LATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 142 GKYIASQADDRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSAHamnnSGPTAQIIEREGWKTNMDFV 221
Cdd:COG2319 132 GKTLASGSADGTVRLWDLATGKLLRTLT------GHSGAVTSVAFSPDGKLLASGS----DDGTVRLWDLATGKLLRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 222 GHRKAVTVVKFNP--KIFkkkqkngsstkpscpycccAVGSKDRSLSVW-LTclKRPLVVIHELFDKSIMDISWTLNGLG 298
Cdd:COG2319 202 GHTGAVRSVAFSPdgKLL-------------------ASGSADGTVRLWdLA--TGKLLRTLTGHSGSVRSVAFSPDGRL 260
|
250 260
....*....|....*....|....*
gi 568994636 299 ILVCSMDGSVAFLDFSQDELGDPLS 323
Cdd:COG2319 261 LASGSADGTVRLWDLATGELLRTLT 285
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 25-158 | 3.95e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 103.57 E-value: 3.95e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 25 HLACVNCVRWSNSGMYLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWSPHDAWLAS 104
Cdd:cd00200 176 HTGEVNSVAFSPDGEKLLSSSSDGTIKLW-------------------DLSTGKCLGTLRGHENGVNSVAFSPDGYLLAS 236
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 568994636 105 CSVDNTVVIWNAVKFpEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWR 158
Cdd:cd00200 237 GSEDGTIRVWDLRTG-ECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 79-328 | 6.27e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 103.18 E-value: 6.27e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 79 CVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNaVKFPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWr 158
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWD-LETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLW- 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 159 tlDWQLETSITKPFDECGGtthVLRLSWSPDGHYLVSAHAMNnsgpTAQIIEREGWKTNMDFVGHRKAVTVVKFNPkifk 238
Cdd:cd00200 79 --DLETGECVRTLTGHTSY---VSSVAFSPDGRILSSSSRDK----TIKVWDVETGKCLTTLRGHTDWVNSVAFSP---- 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 239 kkqkNGSSTkpscpycccAVGSKDRSLSVW-LTCLKrpLVVIHELFDKSIMDISWTLNGLGILVCSMDGSVAFLDFSQDE 317
Cdd:cd00200 146 ----DGTFV---------ASSSQDGTIKLWdLRTGK--CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK 210
|
250
....*....|.
gi 568994636 318 LGDPLSEEEKS 328
Cdd:cd00200 211 CLGTLRGHENG 221
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 71-323 | 2.86e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.27 E-value: 2.86e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 71 LANVEQWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWNAVKfPEILATLRGHSGLVKGLTWDPVGKYIASQAD 150
Cdd:COG2319 20 LLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAA-GALLATLLGHTAAVLSVAFSPDGRLLASASA 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 151 DRSLKVWRTLDWQLETSITkpfdecGGTTHVLRLSWSPDGHYLVSAHAmnnsGPTAQIIEREGWKTNMDFVGHRKAVTVV 230
Cdd:COG2319 99 DGTVRLWDLATGLLLRTLT------GHTGAVRSVAFSPDGKTLASGSA----DGTVRLWDLATGKLLRTLTGHSGAVTSV 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 231 KFNPkifkkkqkNGSSTkpscpycccAVGSKDRSLSVWLTCLKRPLVVI--HelfDKSIMDISWTLNGLGILVCSMDGSV 308
Cdd:COG2319 169 AFSP--------DGKLL---------ASGSDDGTVRLWDLATGKLLRTLtgH---TGAVRSVAFSPDGKLLASGSADGTV 228
|
250
....*....|....*
gi 568994636 309 AFLDFSQDELGDPLS 323
Cdd:COG2319 229 RLWDLATGKLLRTLT 243
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 16-116 | 9.31e-14 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 74.18 E-value: 9.31e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 16 PKMLCQMDNHLACVNCVRWSNSGMYLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAW 95
Cdd:COG2319 320 GKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLW-------------------DLATGELLRTLTGHTGAVTSVAF 380
|
90 100
....*....|....*....|.
gi 568994636 96 SPHDAWLASCSVDNTVVIWNA 116
Cdd:COG2319 381 SPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 17-115 | 1.50e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 72.37 E-value: 1.50e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 17 KMLCQMDNHLACVNCVRWSNSGMYLASGGDDKLIMVWkratyigpstvfgssgklaNVEQWRCVSILRSHSGDVMDVAWS 96
Cdd:cd00200 210 KCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVW-------------------DLRTGECVQTLSGHTNSVTSLAWS 270
|
90
....*....|....*....
gi 568994636 97 PHDAWLASCSVDNTVVIWN 115
Cdd:cd00200 271 PDGKRLASGSADGTIRIWD 289
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 76-115 | 2.34e-09 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 53.47 E-value: 2.34e-09
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568994636 76 QWRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWN 115
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 77-115 | 3.83e-09 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 52.73 E-value: 3.83e-09
10 20 30
....*....|....*....|....*....|....*....
gi 568994636 77 WRCVSILRSHSGDVMDVAWSPHDAWLASCSVDNTVVIWN 115
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 17-54 | 2.76e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.00 E-value: 2.76e-06
10 20 30
....*....|....*....|....*....|....*...
gi 568994636 17 KMLCQMDNHLACVNCVRWSNSGMYLASGGDDKLIMVWK 54
Cdd:smart00320 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| Treacle | pfam03546 | Treacher Collins syndrome protein Treacle; | 439-668 | 3.37e-06 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 50.84 E-value: 3.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 439 PLSSSLAgtmLSSPSGQQLLPLDSSTPSFGASKPCTEPVAATSARPTG------------ESVSKDSMNATSTPAASS-- 504
Cdd:pfam03546 113 TLTTSPA---QVKPLGKNSQVRPASTVGKGPSGKGANPAPPGKAGSAAplvqvgkkeedsESSSEESDSEGEAPPAATqa 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 505 -PSVLTTPSKIEPMKAFDSRFTERSKATPGApslTSViptAVERLKEQnlvkelrSRELESSSDSDEKVHLAKPSSLSKR 583
Cdd:pfam03546 190 kPSGKILQVRPASGPAKGAAPAPPQKAGPVA---TQV---KAERSKED-------SESSEESSDSEEEAPAAATPAQAKP 256
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 584 KLELEvETVEKKKKGRP--RKDSRLLPMSLSVQSPAALSTEKEAMCLSAPALAlklpiPGPQRAFTLQVSSDPSMYIEVE 661
Cdd:pfam03546 257 ALKTP-QTKASPRKGTPitPTSAKVPPVRVGTPAPWKAGTVTSPACASSPAVA-----RGAQRPEEDSSSSEESESEEET 330
|
....*..
gi 568994636 662 NEVTTVG 668
Cdd:pfam03546 331 APAAAVG 337
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 121-157 | 4.96e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 44.23 E-value: 4.96e-06
10 20 30
....*....|....*....|....*....|....*..
gi 568994636 121 EILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVW 157
Cdd:smart00320 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 17-53 | 2.49e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 41.95 E-value: 2.49e-05
10 20 30
....*....|....*....|....*....|....*..
gi 568994636 17 KMLCQMDNHLACVNCVRWSNSGMYLASGGDDKLIMVW 53
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| PTZ00420 | PTZ00420 | coronin; Provisional | 83-157 | 2.96e-05 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 48.02 E-value: 2.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 83 LRSHSGDVMDVAWSP-HDAWLASCSVDNTVVIW---------NAVKFPeiLATLRGHSGLVKGLTWDPVGKYI-ASQADD 151
Cdd:PTZ00420 70 LKGHTSSILDLQFNPcFSEILASGSEDLTIRVWeiphndesvKEIKDP--QCILKGHKKKISIIDWNPMNYYImCSSGFD 147
|
....*.
gi 568994636 152 RSLKVW 157
Cdd:PTZ00420 148 SFVNIW 153
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 121-157 | 3.73e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 41.56 E-value: 3.73e-05
10 20 30
....*....|....*....|....*....|....*..
gi 568994636 121 EILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVW 157
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| COG4946 | COG4946 | Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... | 85-201 | 9.96e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 46.57 E-value: 9.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568994636 85 SHSGDVMD--VAWSPHDAWLASCS-VDNTVVIW----NAVKFPEILATlrGHSGLVKGLTWDPVGKYIASQADDRSLKVw 157
Cdd:COG4946 338 TNTPGVRErlPAWSPDGKSIAYFSdASGEYELYiapaDGSGEPKQLTL--GDLGRVFNPVWSPDGKKIAFTDNRGRLWV- 414
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 568994636 158 rtLDwqLETSITKPFDECGGTTHVLRLSWSPDGHYLVSAHAMNN 201
Cdd:COG4946 415 --VD--LASGKVRKVDTDGYGDGISDLAWSPDSKWLAYSKPGPN 454
|
|
|
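Each hit above carries a statistics line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, and each alignment block gives the first and last model (`Cdd:`) positions. The sketch below is one possible way to read those values and estimate how much of the domain model a hit covers. It is illustrative only: the function names `parse_stats` and `model_coverage` are hypothetical, and the regular expression assumes the line layout seen in this listing rather than any official output specification.

```python
import re

# Pattern for the per-hit statistics line as it appears in this listing.
# Assumption: field order and labels are fixed; "[Multi-domain]" is optional.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the fields of one statistics line as a dict (hypothetical helper)."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["multi"] is not None,
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(cd_start, cd_end, cd_length):
    """Fraction of the domain model spanned by the aligned model positions."""
    return (cd_end - cd_start + 1) / cd_length


# Hira hit: statistics line copied from the listing; the model positions
# 17 and 221 are the first and last Cdd coordinates of its alignment.
hira = parse_stats(
    "Pssm-ID: 462211 Cd Length: 221 Bit Score: 201.69 E-value: 3.82e-59"
)
hira_coverage = model_coverage(17, 221, hira["cd_length"])
```

Coverage computed this way counts the span between the first and last aligned model positions, gaps included, so it is an upper bound on the fraction of model residues actually aligned.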