|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
1-74 |
6.04e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.04e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 1 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 74
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
141-202 |
8.46e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.46e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 141 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 202
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
675-993 |
2.05e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 2.05e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 675 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 748
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 749 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 827
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 828 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 889
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 890 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 969
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
|
330 340
....*....|....*....|....
gi 1720412614 970 qqtvftihPSHVQPAYTTPPHMAH 993
Cdd:PHA03247 2984 --------PSREAPASSTPPLTGH 2999
|
|
| PRK12323 super family |
cl46901 |
DNA polymerase III subunit gamma/tau; |
312-519 |
9.82e-05 |
|
DNA polymerase III subunit gamma/tau; The actual alignment was detected with superfamily member PRK12323:
Pssm-ID: 481241 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 9.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 312 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 387
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 388 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 459
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 460 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 519
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
656-671 |
4.78e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains. :
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 4.78e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
1-74 |
6.04e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.04e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 1 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 74
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
141-202 |
8.46e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.46e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 141 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 202
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
675-993 |
2.05e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 2.05e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 675 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 748
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 749 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 827
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 828 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 889
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 890 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 969
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
|
330 340
....*....|....*....|....
gi 1720412614 970 qqtvftihPSHVQPAYTTPPHMAH 993
Cdd:PHA03247 2984 --------PSREAPASSTPPLTGH 2999
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
647-814 |
2.75e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 48.26 E-value: 2.75e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 647 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 726
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 727 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 806
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412614 807 QSQ------HPHVY 814
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
756-1020 |
6.60e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.34 E-value: 6.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 756 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 830
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 831 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 903
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 904 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 983
Cdd:pfam09770 248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
|
250 260 270
....*....|....*....|....*....|....*..
gi 1720412614 984 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1020
Cdd:pfam09770 327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
312-519 |
9.82e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 9.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 312 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 387
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 388 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 459
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 460 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 519
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
381-530 |
3.65e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 41.28 E-value: 3.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 381 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 447
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 448 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 527
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412614 528 SPV 530
Cdd:pfam12004 354 SPV 356
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
5-72 |
3.87e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 3.87e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 5 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 72
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
656-671 |
4.78e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 4.78e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
606-736 |
6.24e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 606 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 684
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412614 685 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 736
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
1-74 |
6.04e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.04e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 1 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 74
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
141-202 |
8.46e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.46e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 141 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 202
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
675-993 |
2.05e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 2.05e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 675 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 748
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 749 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 827
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 828 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 889
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 890 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 969
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
|
330 340
....*....|....*....|....
gi 1720412614 970 qqtvftihPSHVQPAYTTPPHMAH 993
Cdd:PHA03247 2984 --------PSREAPASSTPPLTGH 2999
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
676-1065 |
9.51e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 9.51e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 676 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQr 755
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 756 qdqhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQP-- 833
Cdd:PHA03247 2667 -----ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPap 2741
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 834 -----GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSfyFAISTGSLAQQYAHPNAALHPHTPHPQPSATPtGQQQSQ 908
Cdd:PHA03247 2742 pavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 909 HGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSFPAAQQtvftihP 978
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAV------S 2892
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 979 SHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA---QSALQPIPVSTTAHFPYMTHPSGEA 1055
Cdd:PHA03247 2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApttDPAGAGEPSGAVPQPWLGALVPGRV 2972
|
410
....*....|
gi 1720412614 1056 CVCRGRRGTP 1065
Cdd:PHA03247 2973 AVPRFRVPQP 2982
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
670-1038 |
2.36e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 2.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 670 RSFSQPKPSTTPTSP-------RPQAQPSPSM----VGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVN 738
Cdd:PHA03247 2566 RSVPPPRPAPRPSEPavtsrarRPDAPPQSARprapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 739 QAKTYRAGKVPNMPQQRQDQHhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVI 818
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG 2724
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 819 QGNARMMAPPAHAQP-------GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSFyfAISTGSLAQQYAHPNAALHPH 891
Cdd:PHA03247 2725 PAAARQASPALPAAPappavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPW 2802
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 892 TPHPQPSATPtGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQS 961
Cdd:PHA03247 2803 DPADPPAAVL-APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARP 2881
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 962 PQSSF--PAAQQTV--FTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPI 1037
Cdd:PHA03247 2882 PVRRLarPAVSRSTesFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961
|
.
gi 1720412614 1038 P 1038
Cdd:PHA03247 2962 P 2962
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
647-814 |
2.75e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 48.26 E-value: 2.75e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 647 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 726
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 727 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 806
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412614 807 QSQ------HPHVY 814
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
756-1020 |
6.60e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.34 E-value: 6.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 756 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 830
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 831 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 903
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 904 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 983
Cdd:pfam09770 248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
|
250 260 270
....*....|....*....|....*....|....*..
gi 1720412614 984 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1020
Cdd:pfam09770 327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
312-519 |
9.82e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 9.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 312 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 387
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 388 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 459
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 460 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 519
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
617-1046 |
1.74e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.91 E-value: 1.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 617 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 696
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 697 HQQPAPVYTQPvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQ-----QRQDQHHQSTMMHPASAA 771
Cdd:pfam03154 207 PPQGSPATSQP---------PNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQppppsQVSPQPLPQPSLHGQMPP 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 772 GPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAppahAQPGLVSSSAAQFGAHEQTHA 851
Cdd:pfam03154 278 MPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI----HTPPSQSQLQSQQPPREQPLP 346
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 852 MYACPkLPYNKetsPSFYFAISTGSLAQQYAHPnaalhPHTPHPQPSATPTgqqqsqhggSHPAPSPVQHHQHQAAQALH 931
Cdd:pfam03154 347 PAPLS-MPHIK---PPPTTPIPQLPNPQSHKHP-----PHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPP 408
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 932 LASPQQQSAIYHAGLAPTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHPTA 1010
Cdd:pfam03154 409 SAHPPPLQLMPQSQQLPPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGPPT 483
|
410 420 430
....*....|....*....|....*....|....*.
gi 1720412614 1011 HAPMMLMTTQPPggpqAALAQSALQPIPVSTTAHFP 1046
Cdd:pfam03154 484 STSSAMPGIQPP----SSASVSSSGPVPAAVSCPLP 515
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
658-1013 |
2.27e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.44 E-value: 2.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 658 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 737
Cdd:PHA03378 583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 738 NQAKTYraGKVPNMPQQRQDQHHqSTMMHPASAagPPIVATPPAYSTQYvaySPQQFPNQPlvqhvphyqSQHPHvyspv 817
Cdd:PHA03378 661 PYKPTW--TQIGHIPYQPSPTGA-NTMLPIQWA--PGTMQPPPRAPTPM---RPPAAPPGR---------AQRPA----- 718
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 818 iqgNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYACPKLPynketspsfyfAISTGSLAQQYAHPNAAlhphTPHPQP 897
Cdd:PHA03378 719 ---AATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP-----------AAAPGRARPPAAAPGAP----TPQPPP 780
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 898 SATPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPT 949
Cdd:PHA03378 781 QAPPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPS 857
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720412614 950 PPSMTPASNTQSPQSSFPAAQqtvftihPSHV--QPAYTTPPHMAHVPQAHVQ-----SGMVPSHPTAHAP 1013
Cdd:PHA03378 858 PGSGTSDKIVQAPVFYPPVLQ-------PIQVmrQLGSVRAAAASTVTQAPTEytgerRGVGPMHPTDIPP 921
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
673-963 |
4.24e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 44.69 E-value: 4.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 673 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipmtPMPVNQAKTYRAGKVPNM 751
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL------QQPVQPQQPYYAPAAEQP 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 752 PQQRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPAHA 831
Cdd:PRK10263 416 AQQPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQ 479
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 832 QPGLVsssaaqfgahEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNAALhpHTPHPQPSATPTGQQQSQHGG 911
Cdd:PRK10263 480 QPQPV----------EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAW--YQPIPEPVKEPEPIKSSLKAP 547
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412614 912 SHPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 963
Cdd:PRK10263 548 SVAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
311-919 |
5.20e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 5.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 311 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSpkaqrhprnhrvsagrgSMSSG 390
Cdd:PHA03247 2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML-----------------TWIRG 2539
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 391 LEFVSHN------PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPA 461
Cdd:PHA03247 2540 LEELASDdagdppPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPP 2619
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 462 EAVSMPVPAASPTPAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQI 538
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLA 2699
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 539 DDLKKFKNDFRLQPSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNA 618
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAG 2778
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 619 EHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG-- 696
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlg 2853
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 697 --------------HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTyRAGKVPNMPQQRQDQHH 760
Cdd:PHA03247 2854 gsvapggdvrrrppSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPP 2932
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 761 QSTMMHPASAAgPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSA 840
Cdd:PHA03247 2933 PPPPPRPQPPL-APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWA 3008
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 841 AQFGAHEQTHAMYACPK----LPYNKETSPSFYFAISTGSLAQQYA---HPNAALHPHTPHPQPSATPTGQQQSQHggSH 913
Cdd:PHA03247 3009 SSLALHEETDPPPVSLKqtlwPPDDTEDSDADSLFDSDSERSDLEAldpLPPEPHDPFAHEPDPATPEAGARESPS--SQ 3086
|
....*.
gi 1720412614 914 PAPSPV 919
Cdd:PHA03247 3087 FGPPPL 3092
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
653-919 |
6.53e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.87 E-value: 6.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 653 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 722
Cdd:pfam09770 98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 723 GVQPLYPIPMTPMPVNQAKTYRagKV--------------PNMPQQRQDQHHQstmmhpasaagpPIVATPPAYSTQYVA 788
Cdd:pfam09770 170 AAAPAPAPQPAAQPASLPAPSR--KMmsleeveaamraqaKKPAQQPAPAPAQ------------PPAAPPAQQAQQQQQ 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 789 YSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQgnARMMAPPahAQPGLVSSSAAQFGAHEQTHAMYACPklpynketspsf 868
Cdd:pfam09770 236 FPPQIQQQQQPQQQPQQPQQHPGQGHPVTIL--QRPQSPQ--PDPAQPSIQPQAQQFHQQPPPVPVQP------------ 299
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412614 869 yfaistgslAQQYAHPN---AALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 919
Cdd:pfam09770 300 ---------TQILQNPNrlsAARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
890-1043 |
1.12e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.05 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 890 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNT 959
Cdd:PRK07764 610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAA 689
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 960 QSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQS 1032
Cdd:PRK07764 690 PAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAA 769
|
170
....*....|.
gi 1720412614 1033 ALQPIPVSTTA 1043
Cdd:PRK07764 770 PAAAPPPSPPS 780
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
670-1046 |
2.29e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 2.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 670 RSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFA--PNMMY---------------PVPVSP---------- 722
Cdd:PHA03247 2487 RFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPvhPRMLTwirgleelasddagdPPPPLPpaappaapdr 2566
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 723 GVQPLYPIPMTPMPVNQAKTYR--------AGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQF 794
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRpdappqsaRPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 795 PNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMyacPKLPYNKETSPSFYFAIST 874
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP---PPTPEPAPHALVSATPLPP 2723
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 875 GSLAQQYAHPNAALHPHTP----HPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTP 950
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 951 PSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA 1030
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883
|
410
....*....|....*.
gi 1720412614 1031 QSALQPIPVSTTAHFP 1046
Cdd:PHA03247 2884 RRLARPAVSRSTESFA 2899
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
381-530 |
3.65e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 41.28 E-value: 3.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 381 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 447
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 448 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 527
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412614 528 SPV 530
Cdd:pfam12004 354 SPV 356
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
5-72 |
3.87e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 3.87e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412614 5 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 72
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
656-671 |
4.78e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 4.78e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
606-736 |
6.24e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412614 606 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 684
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412614 685 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 736
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|