|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
263-336 |
5.21e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 5.21e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1333701020 263 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 336
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
404-465 |
9.86e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.86e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1333701020 404 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 465
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
924-1240 |
8.25e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 8.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 924 PKPSTTPTSPRPQAQPSP--SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 995
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 996 PNMPQQRQDQHHQSAMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1074
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1075 PTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNATL 1135
Cdd:PHA03247 2831 PTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQPE 2906
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1136 HPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfpa 1215
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA--- 2983
|
330 340
....*....|....*....|....*
gi 1333701020 1216 aqqtvftihPSHVQPAYTNPPHMAH 1240
Cdd:PHA03247 2984 ---------PSREAPASSTPPLTGH 2999
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
925-1097 |
8.61e-06 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 50.42 E-value: 8.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 925 KPSTTPTSPRPQAQPSPSMVGHQ--------------QPTPVYTQPvcfaPNMMYPVPVSPGVQPLYPIPMTPMPVNQAK 990
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamraQAKKPAQQP----APAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 991 TYRAVPNMPQQRQDQHHQ-SAMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQ 1066
Cdd:pfam09770 245 QPQQQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAArvgYP 317
|
170 180 190
....*....|....*....|....*....|.
gi 1333701020 1067 GNARMMAPPTHAQPGLVSSSATQYGAHEQTH 1097
Cdd:pfam09770 318 QNPQPGVQPAPAHQAHRQQGSFGRQAPIITH 348
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
932-1045 |
6.92e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.11 E-value: 6.92e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 932 SPRPQAQPSPS-MVGHQQPTPVYTQpvcfAPNMMYP-VPVSPGVQPLYPIPMTPMPVNQAKTYravPNMPQQRQDQHHQS 1009
Cdd:TIGR01628 379 QPRMRQLPMGSpMGGAMGQPPYYGQ----GPQQQFNgQPLGWPRMSMMPTPMGPGGPLRPNGL---APMNAVRAPSRNAQ 451
|
90 100 110
....*....|....*....|....*....|....*.
gi 1333701020 1010 AMMHPASAagPPIVATPPAYSTQyvaySPQQFPNQP 1045
Cdd:TIGR01628 452 NAAQKPPM--QPVMYPPNYQSLP----LSQDLPQPQ 481
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
905-920 |
5.45e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.45e-03
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
267-334 |
9.26e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.07 E-value: 9.26e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1333701020 267 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 334
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
263-336 |
5.21e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 5.21e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1333701020 263 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 336
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
404-465 |
9.86e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.86e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1333701020 404 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 465
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
924-1240 |
8.25e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 8.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 924 PKPSTTPTSPRPQAQPSP--SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 995
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 996 PNMPQQRQDQHHQSAMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1074
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1075 PTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNATL 1135
Cdd:PHA03247 2831 PTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQPE 2906
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1136 HPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfpa 1215
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA--- 2983
|
330 340
....*....|....*....|....*
gi 1333701020 1216 aqqtvftihPSHVQPAYTNPPHMAH 1240
Cdd:PHA03247 2984 ---------PSREAPASSTPPLTGH 2999
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
925-1097 |
8.61e-06 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 50.42 E-value: 8.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 925 KPSTTPTSPRPQAQPSPSMVGHQ--------------QPTPVYTQPvcfaPNMMYPVPVSPGVQPLYPIPMTPMPVNQAK 990
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamraQAKKPAQQP----APAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 991 TYRAVPNMPQQRQDQHHQ-SAMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQ 1066
Cdd:pfam09770 245 QPQQQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAArvgYP 317
|
170 180 190
....*....|....*....|....*....|.
gi 1333701020 1067 GNARMMAPPTHAQPGLVSSSATQYGAHEQTH 1097
Cdd:pfam09770 318 QNPQPGVQPAPAHQAHRQQGSFGRQAPIITH 348
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
918-1287 |
2.18e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.17 E-value: 2.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 918 PRSFSQPKPSTTPTSPRPQAQPSPSmvghqQPTPVYTQPVCFAPNmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPN 997
Cdd:PHA03247 2593 PQSARPRAPVDDRGDPRGPAPPSPL-----PPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPERPRDDPAPGRVSRP 2664
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 998 MPQQRQDQhhqsammhPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTH 1077
Cdd:PHA03247 2665 RRARRLGR--------AAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALP 2736
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1078 AQP-------GLVSSSATQYGAHEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTPHPQPSATPTG 1150
Cdd:PHA03247 2737 AAPappavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA 2816
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1151 ---QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqnSFPAAQQTVFTIHPSH 1227
Cdd:PHA03247 2817 alpPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP--ARPPVRRLARPAVSRS 2894
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1228 VQPAYTNPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSAlQPIPVS 1287
Cdd:PHA03247 2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT-DPAGAG 2953
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
925-1285 |
2.43e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.17 E-value: 2.43e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 925 KPSTTPTSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQd 1004
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1005 qhhQSAMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTHAQP---- 1080
Cdd:PHA03247 2667 ---ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPappa 2743
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1081 ---GLVSSSATQYGAHEQTHAMYACPKLPYNKETSPSFyfAISTGSLAQQYAHPNATLHPHTPHPQPSATPtGQQQSQHG 1157
Cdd:PHA03247 2744 vpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAALPP 2820
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1158 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQNSF--PAAQQTV--FTI 1223
Cdd:PHA03247 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLarPAVSRSTesFAL 2900
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1333701020 1224 HPSHVQPAYTNPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIP 1285
Cdd:PHA03247 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
1013-1267 |
4.46e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 48.11 E-value: 4.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1013 HPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPTHAQPGLVSSSA 1087
Cdd:pfam09770 106 QPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPAPAPQPAAQPA 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1088 TQYGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNATLHPHTPHPQPSATPTGQQQSQHGGSH 1160
Cdd:pfam09770 185 SLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHP 257
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1161 PAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqNSFPAAQQTVFTIHPSHVQPAytnPPHMAH 1240
Cdd:pfam09770 258 GQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAHQAH 333
|
250 260
....*....|....*....|....*..
gi 1333701020 1241 vPQAHVQSGMVPSHpTAHAPMMLMTTQ 1267
Cdd:pfam09770 334 -RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
932-1045 |
6.92e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.11 E-value: 6.92e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 932 SPRPQAQPSPS-MVGHQQPTPVYTQpvcfAPNMMYP-VPVSPGVQPLYPIPMTPMPVNQAKTYravPNMPQQRQDQHHQS 1009
Cdd:TIGR01628 379 QPRMRQLPMGSpMGGAMGQPPYYGQ----GPQQQFNgQPLGWPRMSMMPTPMGPGGPLRPNGL---APMNAVRAPSRNAQ 451
|
90 100 110
....*....|....*....|....*....|....*.
gi 1333701020 1010 AMMHPASAagPPIVATPPAYSTQyvaySPQQFPNQP 1045
Cdd:TIGR01628 452 NAAQKPPM--QPVMYPPNYQSLP----LSQDLPQPQ 481
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
922-1210 |
9.34e-05 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 47.00 E-value: 9.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 922 SQPKPSTTPTSPRPQAQPSPsmvGHQQPTPVYT-QPVCFAPNMMYPVPVSPGVQPLYPiPMTPMPVNQAKTYRAVPNMPQ 1000
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPLQQ-PVQPQQPYYAPAAEQPAQQPY 420
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1001 QRQDQHHQSAMMHPASAAGPPIVATPpaystqyvaysPQQFPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPTHAQP 1080
Cdd:PRK10263 421 YAPAPEQPAQQPYYAPAPEQPVAGNA-----------WQAEEQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQQP 481
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1081 GLVsssatqygahEQTHAMYACPKLPYNKETSPSFYFaisTGSLAQQYAHPNATLHP-HTPHPQPSATPTGQQQSQHGGS 1159
Cdd:PRK10263 482 QPV----------EQQPVVEPEPVVEETKPARPPLYY---FEEVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPS 548
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1333701020 1160 HPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1210
Cdd:PRK10263 549 VAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
923-1163 |
2.13e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.80 E-value: 2.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 923 QPKPSTTPTSPRPQAQPSPSMVGHQQPTPVYTQPVCFA-PNMMYPVPVSP--------GVQPLYPIPMTPMPVNQAKTYR 993
Cdd:pfam09770 106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyEKYKEPEPIPDlqvdaslwGVAPKKAAAPAPAPQPAAQPAS 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 994 AVPNMPQQRQDQHHQSAMM---HPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvysPVIQGNAR 1070
Cdd:pfam09770 186 LPAPSRKMMSLEEVEAAMRaqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP----QQHPGQGH 261
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1071 MMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPKLPynketSPSfyfaistgslaQQYAHPN------ATLHPHTPHPQP 1144
Cdd:pfam09770 262 PVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPV-----QPT-----------QILQNPNrlsaarVGYPQNPQPGVQ 325
|
250
....*....|....*....
gi 1333701020 1145 SATPTGQQQSQHGGSHPAP 1163
Cdd:pfam09770 326 PAPAHQAHRQQGSFGRQAP 344
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
907-1260 |
2.44e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.44 E-value: 2.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 907 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPTPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 986
Cdd:PHA03378 583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 987 NQAKTYRAVPNMPQQRQDQHHqsAMMHPASAAgPPIVATPPAYSTQYvaySPQQFPNQPlvqhvphyqSQHPHvyspviq 1066
Cdd:PHA03378 661 PYKPTWTQIGHIPYQPSPTGA--NTMLPIQWA-PGTMQPPPRAPTPM---RPPAAPPGR---------AQRPA------- 718
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1067 gNARMMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPKLPYNKETspsfyfaistgslaqQYAHPNATLHPHTPHPQPSA 1146
Cdd:PHA03378 719 -AATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG---------------RARPPAAAPGAPTPQPPPQA 782
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1147 TPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPTPP 1198
Cdd:PHA03378 783 PPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPSPG 859
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1333701020 1199 SMTPASNTQSPQNSFPAAQqtvftihPSHV--QPAYTNPPHMAHVPQAHVQ-----SGMVPSHPTAHAP 1260
Cdd:PHA03378 860 SGTSDKIVQAPVFYPPVLQ-------PIQVmrQLGSVRAAAASTVTQAPTEytgerRGVGPMHPTDIPP 921
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
905-1061 |
2.58e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.18 E-value: 2.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 905 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ--PLYPIPMT 982
Cdd:TIGR01628 367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPlrPNGLAPMN 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 983 PMpvnqaktyRAVPNMPQQRQDQHHQSAMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 1056
Cdd:TIGR01628 442 AV--------RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513
|
....*
gi 1333701020 1057 HPHVY 1061
Cdd:TIGR01628 514 FPLVE 518
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
866-1236 |
1.70e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.83 E-value: 1.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 866 NTEHKRGPEVTSQGVQTSSPGCKQEKDDKEEKKDAAEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 945
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 946 HQQPTPVYTQPVCFAPNMMYPV-------------------PVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQh 1006
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHtliqqtptlhpqrlpsphpPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTG- 285
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1007 hQSAMMHPASAAGPPIVAT------PPAYSTQyVAYSPQQFPNQPLVQhvPHYQSQHPHVYSPVIQGNARM---MAPPTH 1077
Cdd:pfam03154 286 -PSHMQHPVPPQPFPLTPQssqsqvPPGPSPA-APGQSQQRIHTPPSQ--SQLQSQQPPREQPLPPAPLSMphiKPPPTT 361
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1078 AQPGLVSSSATQYGAHEQTHAMYacpKLPYNKETSPSFYFAISTGSLAQQYAHPNA-TLHPHT-PHPQPSATPTGQQQSQ 1155
Cdd:pfam03154 362 PIPQLPNPQSHKHPPHLSGPSPF---QMNSNLPPPPALKPLSSLSTHHPPSAHPPPlQLMPQSqQLPPPPAQPPVLTQSQ 438
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1156 hggSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNP 1235
Cdd:pfam03154 439 ---SLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLP 515
|
.
gi 1333701020 1236 P 1236
Cdd:pfam03154 516 P 516
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1137-1290 |
2.02e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.67 E-value: 2.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1137 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNT 1206
Cdd:PRK07764 610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAA 689
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1207 QSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQS 1279
Cdd:PRK07764 690 PAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAA 769
|
170
....*....|.
gi 1333701020 1280 ALQPIPVSTTA 1290
Cdd:PRK07764 770 PAAAPPPSPPS 780
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
930-1293 |
3.89e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 3.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 930 PTSPRPQAQPSPSmVGHQQPTPVYTqpvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQdqhhQS 1009
Cdd:PHA03247 2551 PPPPLPPAAPPAA-PDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPA----PP 2614
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1010 AMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTHAQPGLVSSSATQ 1089
Cdd:PHA03247 2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1090 YGAHEQTHAMyacPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTP----HPQPSATPTGQQQSQHGGSHPAPSP 1165
Cdd:PHA03247 2695 LTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAPAP 2771
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 1166 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQAH 1245
Cdd:PHA03247 2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1333701020 1246 VQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFP 1293
Cdd:PHA03247 2852 LGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
905-920 |
5.45e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.45e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
970-1082 |
5.71e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.22 E-value: 5.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1333701020 970 SPGVQPLYPiPMTPMPVNQAKTYRAVPNMPQQRQDQHHQSAMMHPASAAGPPIVATPPAYST--QYVAYSPQ-QFPNQPL 1046
Cdd:PRK10263 746 TPIVEPVQQ-PQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQpqQPVAPQPQyQQPQQPV 824
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1333701020 1047 VQHVPHYQSQHPHVYSP---------VIQGNARMMAPPTHAQPGL 1082
Cdd:PRK10263 825 APQPQYQQPQQPVAPQPqdtllhpllMRNGDSRPLHKPTTPLPSL 869
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
267-334 |
9.26e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.07 E-value: 9.26e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1333701020 267 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 334
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
|