|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.07e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.07e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
8.50e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.50e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
762-1039 |
8.17e-07 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 8.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 913 APPAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPH-------PQPSATPTGQQQSQHGGSHPAP 985
Cdd:PHA03247 2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPArppvrrlARPAVSRSTESFALPPDQPERP 2908
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412605 986 SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQS-PQSSFPAAQ 1039
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGePSGAVPQPW 2963
|
|
| PRK12323 super family |
cl46901 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.88e-05 |
|
DNA polymerase III subunit gamma/tau; The actual alignment was detected with superfamily member PRK12323:
Pssm-ID: 481241 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 9.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
743-758 |
5.57e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains. :
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.57e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.07e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.07e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
8.50e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.50e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
762-1039 |
8.17e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 8.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 913 APPAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPH-------PQPSATPTGQQQSQHGGSHPAP 985
Cdd:PHA03247 2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPArppvrrlARPAVSRSTESFALPPDQPERP 2908
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412605 986 SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQS-PQSSFPAAQ 1039
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGePSGAVPQPW 2963
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
740-988 |
5.80e-06 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 50.80 E-value: 5.80e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 740 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 809
Cdd:pfam09770 98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 810 GVQPLYPIPMTPMPVNQAKTYRagKV--------------PNMPQQRQDQHHQstmmhpasaagpPIVATPPAYSTQYVA 875
Cdd:pfam09770 170 AAAPAPAPQPAAQPASLPAPSR--KMmsleeveaamraqaKKPAQQPAPAPAQ------------PPAAPPAQQAQQQQQ 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 876 YSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQgnARMMAPPahAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNA 955
Cdd:pfam09770 236 FPPQIQQQQQPQQQPQQPQQHPGQGHPVTIL--QRPQSPQ--PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSA 311
|
250 260 270
....*....|....*....|....*....|....
gi 1720412605 956 ALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 988
Cdd:pfam09770 312 ARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
734-901 |
3.90e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.88 E-value: 3.90e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 814 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 893
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412605 894 QSQ------HPHVY 901
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.88e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 9.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
92-159 |
3.89e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 3.89e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
468-617 |
4.71e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 40.90 E-value: 4.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412605 615 SPV 617
Cdd:pfam12004 354 SPV 356
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
743-758 |
5.57e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.57e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
693-823 |
6.38e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412605 772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.07e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.07e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
8.50e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.50e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
762-1039 |
8.17e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 8.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 913 APPAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPH-------PQPSATPTGQQQSQHGGSHPAP 985
Cdd:PHA03247 2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPArppvrrlARPAVSRSTESFALPPDQPERP 2908
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412605 986 SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQS-PQSSFPAAQ 1039
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGePSGAVPQPW 2963
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
740-988 |
5.80e-06 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 50.80 E-value: 5.80e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 740 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 809
Cdd:pfam09770 98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 810 GVQPLYPIPMTPMPVNQAKTYRagKV--------------PNMPQQRQDQHHQstmmhpasaagpPIVATPPAYSTQYVA 875
Cdd:pfam09770 170 AAAPAPAPQPAAQPASLPAPSR--KMmsleeveaamraqaKKPAQQPAPAPAQ------------PPAAPPAQQAQQQQQ 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 876 YSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQgnARMMAPPahAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNA 955
Cdd:pfam09770 236 FPPQIQQQQQPQQQPQQPQQHPGQGHPVTIL--QRPQSPQ--PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSA 311
|
250 260 270
....*....|....*....|....*....|....
gi 1720412605 956 ALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 988
Cdd:pfam09770 312 ARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
756-1112 |
9.02e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.98 E-value: 9.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 756 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 835
Cdd:PRK07764 400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 836 PNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAySTQYVAYSPQQFPNqpLVQHVPHYQ-------SQHPHVYSpvIQGN 908
Cdd:PRK07764 472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA-GADDAATLRERWPE--ILAAVPKRSrktwailLPEATVLG--VRGD 546
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 909 -----------ARMMAPPAHAQPgLVSSSAAQFGAHEQTHAmYVSTGSLAQQYAHPNAAL----HPHTPHPQPSATPTGQ 973
Cdd:PRK07764 547 tlvlgfstgglARRFASPGNAEV-LVTALAEELGGDWQVEA-VVGPAPGAAGGEGPPAPAssgpPEEAARPAAPAAPAAP 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 974 QQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVF 1043
Cdd:PRK07764 625 AAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPA 704
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412605 1044 TIHPSHVQPAYTTPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSALQPIPVSTTA 1112
Cdd:PRK07764 705 PAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAPAAAPPPSPPS 780
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
734-901 |
3.90e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.88 E-value: 3.90e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 814 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 893
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412605 894 QSQ------HPHVY 901
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
843-1089 |
7.48e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 46.95 E-value: 7.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 843 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPP 915
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 916 AHAQPGLVSSSAAQFG----------------AHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHG 979
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 980 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQPAyttPPH 1059
Cdd:pfam09770 255 QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAH 330
|
250 260 270
....*....|....*....|....*....|
gi 1720412605 1060 MAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1089
Cdd:pfam09770 331 QAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
704-1115 |
9.53e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.68 E-value: 9.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 784 HQQPAPVYTQPvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQ-----QRQDQHHQSTMMHPASAA 858
Cdd:pfam03154 207 PPQGSPATSQP---------PNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQppppsQVSPQPLPQPSLHGQMPP 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 859 GPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMA--PPAHAQPGLVSSSAAQFGAHEQT 936
Cdd:pfam03154 278 MPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhtPPSQSQLQSQQPPREQPLPPAPL 350
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 937 HAMYVSTGSLAQQYAHPNAALHPHTPHpqpsatPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLA 1016
Cdd:pfam03154 351 SMPHIKPPPTTPIPQLPNPQSHKHPPH------LSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQL 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 1017 PTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHPTAHAPMMLMTTQPPggpq 1095
Cdd:pfam03154 425 PPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGPPTSTSSAMPGIQPP---- 495
|
410 420
....*....|....*....|
gi 1720412605 1096 AALAQSALQPIPVSTTAHFP 1115
Cdd:pfam03154 496 SSASVSSSGPVPAAVSCPLP 515
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.88e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 9.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
756-1121 |
1.54e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 1.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 756 PRSFSQPKPSTTPTSPRPQAQPSPSmvghqQPAPVYTQPVCFAPNmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 835
Cdd:PHA03247 2593 PQSARPRAPVDDRGDPRGPAPPSPL-----PPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPERPRDDPAPGRVSRP 2664
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 836 PNMPQQRQDQHHQSTMMHPASAAGPPIVATppaystqyVAYSPQQFPNQPLVQHVPHYQSqhPHVYSPVIQGNARMMAPP 915
Cdd:PHA03247 2665 RRARRLGRAAQASSPPQRPRRRAARPTVGS--------LTSLADPPPPPPTPEPAPHALV--SATPLPPGPAAARQASPA 2734
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 916 AHAQPglvSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQA 995
Cdd:PHA03247 2735 LPAAP---APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAV 2811
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 996 AQALHLASPQQQSAiyhaGLAPTPPSMTPASNTQSPQ-----------------------SSFPAAQQTVFTIHPSHVQP 1052
Cdd:PHA03247 2812 LAPAAALPPAASPA----GPLPPPTSAQPTAPPPPPGppppslplggsvapggdvrrrppSRSPAAKPAAPARPPVRRLA 2887
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720412605 1053 AYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFPYMTHPS 1121
Cdd:PHA03247 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
711-1109 |
6.52e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 6.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 711 PEVTSQGVQTSSPACKQEK---DDREEKKDTTEQvRKSTLNPNAKEFNPRSFSQPKPS------TTPTSPRPQAQPSPSM 781
Cdd:PHA03247 2561 PAAPDRSVPPPRPAPRPSEpavTSRARRPDAPPQ-SARPRAPVDDRGDPRGPAPPSPLppdthaPDPPPPSPSPAANEPD 2639
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 782 VGHQQPAPVYTQPVCFA--PNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMpqqrQDQHHQSTMMHPASAAG 859
Cdd:PHA03247 2640 PHPPPTVPPPERPRDDPapGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL----ADPPPPPPTPEPAPHAL 2715
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 860 PPIVATPPAystqyVAYSPQQFPNQPLVQHVPhyqsqhPHVYSPVIQGN-ARMMAPPAHAQPGLVSSSAAQFGAHEQT-- 936
Cdd:PHA03247 2716 VSATPLPPG-----PAAARQASPALPAAPAPP------AVPAGPATPGGpARPARPPTTAGPPAPAPPAAPAAGPPRRlt 2784
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 937 -HAMYVSTGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSqhGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGL 1015
Cdd:PHA03247 2785 rPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA--GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV 2862
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 1016 APTPPSMTPASNTQSPqsSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQ 1095
Cdd:PHA03247 2863 RRRPPSRSPAAKPAAP--ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
410
....*....|....
gi 1720412605 1096 AALAQSAlQPIPVS 1109
Cdd:PHA03247 2941 PPLAPTT-DPAGAG 2953
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
746-1058 |
7.17e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 7.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 746 TLNPNAKEFNPRSFSQPKPsttPTSPRPQAQPSPSMVGHQQPAPVYTQPVcfaPNMMYPVPVSPgvqPLYPIPMTPMPVN 825
Cdd:pfam03154 229 TLIQQTPTLHPQRLPSPHP---PLQPMTQPPPPSQVSPQPLPQPSLHGQM---PPMPHSLQTGP---SHMQHPVPPQPFP 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 826 QAKTYRAGKVPNMPQQRQDQHHQSTMMHPASAAGPPivatppaystqyvaysPQQFPNQplvQHVPHYQSQHPHVYSPVI 905
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQ----------------SQQPPRE---QPLPPAPLSMPHIKPPPT 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 906 QGNARMMAPPAHAQPGLVSS-SAAQFGAHEQTHAMYVSTGSLAQQY---AHPNA-ALHPHT-PHPQPSATPTGQQQSQhg 979
Cdd:pfam03154 361 TPIPQLPNPQSHKHPPHLSGpSPFQMNSNLPPPPALKPLSSLSTHHppsAHPPPlQLMPQSqQLPPPPAQPPVLTQSQ-- 438
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720412605 980 gSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPP 1058
Cdd:pfam03154 439 -SLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPP 516
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
398-1034 |
7.59e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 7.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 398 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSpkaqrhprnhrvsagrgSMSSG 477
Cdd:PHA03247 2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML-----------------TWIRG 2539
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 478 LEFVSHN------PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSPSGPvlaspqAGIIPAEAV 551
Cdd:PHA03247 2540 LEELASDdagdppPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDD------RGDPRGPAP 2613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 552 SMPVPAASPTPASPASNRaltpsieakdsrlqdqrqnSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQIDDLKKf 631
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSP-------------------SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA- 2673
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 632 kndfrLQPSSTSESMDQLLSKNREG---EKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSISPSMLSNAEhk 708
Cdd:PHA03247 2674 -----AQASSPPQRPRRRAARPTVGsltSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-- 2746
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 709 rGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPA 788
Cdd:PHA03247 2747 -GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 789 PvytqpvcfapnmmyPVPVSPGVQPLYPiPMTPMPVNQAKTYRAGKVPNMPQQRQdqhhqstmmhPASAAGPPIVATPPA 868
Cdd:PHA03247 2826 G--------------PLPPPTSAQPTAP-PPPPGPPPPSLPLGGSVAPGGDVRRR----------PPSRSPAAKPAAPAR 2880
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 869 YSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvyspviqgnarmmaPPAHAQPglvsssaaqfgaheqthamyvSTGSLAQ 948
Cdd:PHA03247 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQ---------------PQAPPPP---------------------QPQPQPP 2924
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 949 QYAHPNAALHPHtPHPQPSATPTGQQQSQHGGSHPAPSPvqhhqhqaaqALHLASPQQQSAIYHAGLAPTPPSMTPASNT 1028
Cdd:PHA03247 2925 PPPQPQPPPPPP-PRPQPPLAPTTDPAGAGEPSGAVPQP----------WLGALVPGRVAVPRFRVPQPAPSREAPASST 2993
|
....*.
gi 1720412605 1029 QSPQSS 1034
Cdd:PHA03247 2994 PPLTGH 2999
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
779-1005 |
9.53e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.54 E-value: 9.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 779 PSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPgVQPLYPIPMTPMPVNQaktyragkvPNMPQQRQdqhhqstmmhPASAA 858
Cdd:PRK10263 309 PLLNGAPITEPVAVAAAATTATQSWAAPVEP-VTQTPPVASVDVPPAQ---------PTVAWQPV----------PGPQT 368
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 859 GPPIVATPPAystqyvAYSPQQFPNQPLVQHVPHYQSQHPHvyspviqgnarmmAPPAHAQPGLVSSSAAQFGAHEQTHA 938
Cdd:PRK10263 369 GEPVIAPAPE------GYPQQSQYAQPAVQYNEPLQQPVQP-------------QQPYYAPAAEQPAQQPYYAPAPEQPA 429
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720412605 939 MYvstGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQ 1005
Cdd:PRK10263 430 QQ---PYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE 493
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
760-1032 |
1.14e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.15 E-value: 1.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 760 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipmtPMPVNQAKTYRAGKVPNM 838
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL------QQPVQPQQPYYAPAAEQP 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 839 PQQRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgNARMMAPPAHA 918
Cdd:PRK10263 416 AQQPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-EPLYQQPQPVE 485
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 919 QPGLVSSSAAQFGAHEQTHAMYVSTgSLAQQYAHPNAALHP-HTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQ 997
Cdd:PRK10263 486 QQPVVEPEPVVEETKPARPPLYYFE-EVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPL 564
|
250 260 270
....*....|....*....|....*....|....*...
gi 1720412605 998 ALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1032
Cdd:PRK10263 565 ASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
901-1079 |
2.33e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.38 E-value: 2.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 901 YSPVIQGNArMMAPPAHAQPGLVSSS--AAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHP----QPSATPTGQQ 974
Cdd:PRK10263 307 YDPLLNGAP-ITEPVAVAAAATTATQswAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPviapAPEGYPQQSQ 385
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 975 QSQHGGSHPAP--------SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIH 1046
Cdd:PRK10263 386 YAQPAVQYNEPlqqpvqpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
|
170 180 190
....*....|....*....|....*....|...
gi 1720412605 1047 PSHVQPAYTTPPHMAhvPQAHVQSGMVPSHPTA 1079
Cdd:PRK10263 466 QTYQQPAAQEPLYQQ--PQPVEQQPVVEPEPVV 496
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
92-159 |
3.89e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 3.89e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412605 92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
468-617 |
4.71e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 40.90 E-value: 4.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412605 615 SPV 617
Cdd:pfam12004 354 SPV 356
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
743-758 |
5.57e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 5.57e-03
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
436-942 |
6.08e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.08 E-value: 6.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 436 PAPVSTMPKRMSSEGPPRMSPKAQ--RHPRNHRVSAGRGSMSSGLEFVSHNPPSEAAAP-PVARTSPAGGTWSSVVSGVP 512
Cdd:PHA03247 2573 PAPRPSEPAVTSRARRPDAPPQSArpRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPsPAANEPDPHPPPTVPPPERP 2652
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 513 RLSPKTHRPRSPRQSSIGNSPSGPVlASPQAGIIPA--EAVSMPVPAASPTPASPASNRALTPSIEAKDSRL--QDQRQN 588
Cdd:PHA03247 2653 RDDPAPGRVSRPRRARRLGRAAQAS-SPPQRPRRRAarPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPgpAAARQA 2731
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 589 SPAGSKENV-KASETSPSFSKADNKGMSPVVSEHrkqiddlkkfkndfrlQPSSTSESMDQLLSKNREGEKSRDLIKDKT 667
Cdd:PHA03247 2732 SPALPAAPApPAVPAGPATPGGPARPARPPTTAG----------------PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 668 EASAKDSFIDSSSSSSNCTSGSSKTNSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACkqekddreekkdtteqvrkSTL 747
Cdd:PHA03247 2796 ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-------------------GSV 2856
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 748 NPNAkEFNPRSFSQPKPSTTPTSPRPQAQpspsmvghQQPAPvytqpvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQA 827
Cdd:PHA03247 2857 APGG-DVRRRPPSRSPAAKPAAPARPPVR--------RLARP--------------AVSRSTESFALPPDQPERPPQPQA 2913
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 828 KTyRAGKVPNMPQQRQDQHHQSTMMHPASAAgPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQG 907
Cdd:PHA03247 2914 PP-PPQPQPQPPPPPQPQPPPPPPPRPQPPL-APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREA 2988
|
490 500 510
....*....|....*....|....*....|....*
gi 1720412605 908 NARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYVS 942
Cdd:PHA03247 2989 PASSTPPLTGHSLSRVSSWASSLALHEETDPPPVS 3023
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
693-823 |
6.38e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412605 693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412605 772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|