|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
1-74 |
5.97e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 5.97e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 1 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 74
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
141-202 |
8.37e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.37e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 141 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 202
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
663-981 |
2.70e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 2.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 663 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 736
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 737 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 815
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 816 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 877
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 878 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 957
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
|
330 340
....*....|....*....|....
gi 1720412629 958 qqtvftihPSHVQPAYTTPPHMAH 981
Cdd:PHA03247 2984 --------PSREAPASSTPPLTGH 2999
|
|
| DUF3498 super family |
cl26404 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
369-518 |
4.43e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP. The actual alignment was detected with superfamily member pfam12004:
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 40.90 E-value: 4.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 369 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 435
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 436 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 515
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412629 516 SPV 518
Cdd:pfam12004 354 SPV 356
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
644-659 |
4.46e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains. :
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.67 E-value: 4.46e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
1-74 |
5.97e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 5.97e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 1 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 74
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
141-202 |
8.37e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.37e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 141 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 202
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
663-981 |
2.70e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 2.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 663 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 736
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 737 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 815
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 816 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 877
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 878 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 957
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
|
330 340
....*....|....*....|....
gi 1720412629 958 qqtvftihPSHVQPAYTTPPHMAH 981
Cdd:PHA03247 2984 --------PSREAPASSTPPLTGH 2999
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
635-802 |
3.17e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.88 E-value: 3.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 635 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 714
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 715 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 794
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412629 795 QSQ------HPHVY 802
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
744-1008 |
6.97e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 46.95 E-value: 6.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 744 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 818
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 819 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 891
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 892 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 971
Cdd:pfam09770 248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
|
250 260 270
....*....|....*....|....*....|....*..
gi 1720412629 972 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1008
Cdd:pfam09770 327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
5-72 |
3.83e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 3.83e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 5 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 72
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
369-518 |
4.43e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 40.90 E-value: 4.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 369 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 435
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 436 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 515
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412629 516 SPV 518
Cdd:pfam12004 354 SPV 356
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
644-659 |
4.46e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.67 E-value: 4.46e-03
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
300-482 |
6.62e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 40.63 E-value: 6.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 300 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 375
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 376 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQS---------------SIGNSPSGPVLA 440
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPAdddpppweelppefaSPAPAQPDAAPA 519
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1720412629 441 SPQAGIIPAEAVSMP---------VPAASPTPASPASNRALTPSIEAKDSR 482
Cdd:PRK12323 520 GWVAESIPDPATADPddafetlapAPAAAPAPRAAAATEPVVAPRPPRASA 570
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
594-724 |
6.93e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 594 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 672
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412629 673 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 724
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
1-74 |
5.97e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 5.97e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 1 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 74
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
141-202 |
8.37e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.37e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 141 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 202
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
663-981 |
2.70e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 2.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 663 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 736
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 737 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 815
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 816 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 877
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 878 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 957
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
|
330 340
....*....|....*....|....
gi 1720412629 958 qqtvftihPSHVQPAYTTPPHMAH 981
Cdd:PHA03247 2984 --------PSREAPASSTPPLTGH 2999
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
664-1053 |
1.17e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 1.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 664 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQr 743
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 744 qdqhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQP-- 821
Cdd:PHA03247 2667 -----ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPap 2741
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 822 -----GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSfyFAISTGSLAQQYAHPNAALHPHTPHPQPSATPtGQQQSQ 896
Cdd:PHA03247 2742 pavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 897 HGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSFPAAQQtvftihP 966
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAV------S 2892
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 967 SHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA---QSALQPIPVSTTAHFPYMTHPSGEA 1043
Cdd:PHA03247 2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApttDPAGAGEPSGAVPQPWLGALVPGRV 2972
|
410
....*....|
gi 1720412629 1044 CVCRGRRGTP 1053
Cdd:PHA03247 2973 AVPRFRVPQP 2982
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
635-802 |
3.17e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.88 E-value: 3.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 635 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 714
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 715 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 794
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412629 795 QSQ------HPHVY 802
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
658-1026 |
3.32e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 3.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 658 RSFSQPKPSTTPTSP-------RPQAQPSPSM----VGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVN 726
Cdd:PHA03247 2566 RSVPPPRPAPRPSEPavtsrarRPDAPPQSARprapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 727 QAKTYRAGKVPNMPQQRQDQHhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVI 806
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG 2724
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 807 QGNARMMAPPAHAQP-------GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSFyfAISTGSLAQQYAHPNAALHPH 879
Cdd:PHA03247 2725 PAAARQASPALPAAPappavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPW 2802
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 880 TPHPQPSATPtGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQS 949
Cdd:PHA03247 2803 DPADPPAAVL-APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARP 2881
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 950 PQSSF--PAAQQTV--FTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPI 1025
Cdd:PHA03247 2882 PVRRLarPAVSRSTesFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961
|
.
gi 1720412629 1026 P 1026
Cdd:PHA03247 2962 P 2962
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
744-1008 |
6.97e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 46.95 E-value: 6.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 744 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 818
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 819 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 891
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 892 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 971
Cdd:pfam09770 248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
|
250 260 270
....*....|....*....|....*....|....*..
gi 1720412629 972 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1008
Cdd:pfam09770 327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
605-1034 |
3.51e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.76 E-value: 3.51e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 605 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 684
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 685 HQQPAPVYTQPvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQ-----QRQDQHHQSTMMHPASAA 759
Cdd:pfam03154 207 PPQGSPATSQP---------PNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQppppsQVSPQPLPQPSLHGQMPP 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 760 GPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAppahAQPGLVSSSAAQFGAHEQTHA 839
Cdd:pfam03154 278 MPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI----HTPPSQSQLQSQQPPREQPLP 346
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 840 MYACPkLPYNKetsPSFYFAISTGSLAQQYAHPnaalhPHTPHPQPSATPTgqqqsqhggSHPAPSPVQHHQHQAAQALH 919
Cdd:pfam03154 347 PAPLS-MPHIK---PPPTTPIPQLPNPQSHKHP-----PHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHHPP 408
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 920 LASPQQQSAIYHAGLAPTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHPTA 998
Cdd:pfam03154 409 SAHPPPLQLMPQSQQLPPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGPPT 483
|
410 420 430
....*....|....*....|....*....|....*.
gi 1720412629 999 HAPMMLMTTQPPggpqAALAQSALQPIPVSTTAHFP 1034
Cdd:pfam03154 484 STSSAMPGIQPP----SSASVSSSGPVPAAVSCPLP 515
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
646-1001 |
4.09e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 44.67 E-value: 4.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 646 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 725
Cdd:PHA03378 583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 726 NQAKTYraGKVPNMPQQRQDQHHqSTMMHPASAagPPIVATPPAYSTQyvAYSPQQFPnqplvqhVPHYQSQHPHVYSPV 805
Cdd:PHA03378 661 PYKPTW--TQIGHIPYQPSPTGA-NTMLPIQWA--PGTMQPPPRAPTP--MRPPAAPP-------GRAQRPAAATGRARP 726
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 806 IQGNARMMAPPAHAqPGLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPsfyfaistgslaqqyahpnaalhphTPHPQP 885
Cdd:PHA03378 727 PAAAPGRARPPAAA-PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAP-------------------------TPQPPP 780
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 886 SATPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPT 937
Cdd:PHA03378 781 QAPPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPS 857
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720412629 938 PPSMTPASNTQSPQSSFPAAQqtvftihPSHV--QPAYTTPPHMAHVPQAHVQ-----SGMVPSHPTAHAP 1001
Cdd:PHA03378 858 PGSGTSDKIVQAPVFYPPVLQ-------PIQVmrQLGSVRAAAASTVTQAPTEytgerRGVGPMHPTDIPP 921
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
661-951 |
5.35e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 44.31 E-value: 5.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 661 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipmtPMPVNQAKTYRAGKVPNM 739
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL------QQPVQPQQPYYAPAAEQP 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 740 PQQRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPAHA 819
Cdd:PRK10263 416 AQQPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQ 479
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 820 QPGLVsssaaqfgahEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNAALhpHTPHPQPSATPTGQQQSQHGG 899
Cdd:PRK10263 480 QPQPV----------EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAW--YQPIPEPVKEPEPIKSSLKAP 547
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412629 900 SHPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 951
Cdd:PRK10263 548 SVAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
299-907 |
7.08e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 7.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 299 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSpkaqrhprnhrvsagrgSMSSG 378
Cdd:PHA03247 2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML-----------------TWIRG 2539
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 379 LEFVSHN------PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPA 449
Cdd:PHA03247 2540 LEELASDdagdppPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPP 2619
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 450 EAVSMPVPAASPTPAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQI 526
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLA 2699
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 527 DDLKKFKNDFRLQPSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNA 606
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAG 2778
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 607 EHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG-- 684
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlg 2853
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 685 --------------HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTyRAGKVPNMPQQRQDQHH 748
Cdd:PHA03247 2854 gsvapggdvrrrppSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPP 2932
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 749 QSTMMHPASAAgPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSA 828
Cdd:PHA03247 2933 PPPPPRPQPPL-APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWA 3008
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 829 AQFGAHEQTHAMYACPK----LPYNKETSPSFYFAISTGSLAQQYA---HPNAALHPHTPHPQPSATPTGQQQSQHggSH 901
Cdd:PHA03247 3009 SSLALHEETDPPPVSLKqtlwPPDDTEDSDADSLFDSDSERSDLEAldpLPPEPHDPFAHEPDPATPEAGARESPS--SQ 3086
|
....*.
gi 1720412629 902 PAPSPV 907
Cdd:PHA03247 3087 FGPPPL 3092
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
641-907 |
7.52e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.87 E-value: 7.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 641 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 710
Cdd:pfam09770 98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 711 GVQPLYPIPMTPMPVNQAKTYRagKV--------------PNMPQQRQDQHHQstmmhpasaagpPIVATPPAYSTQYVA 776
Cdd:pfam09770 170 AAAPAPAPQPAAQPASLPAPSR--KMmsleeveaamraqaKKPAQQPAPAPAQ------------PPAAPPAQQAQQQQQ 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 777 YSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQgnARMMAPPahAQPGLVSSSAAQFGAHEQTHAMYACPklpynketspsf 856
Cdd:pfam09770 236 FPPQIQQQQQPQQQPQQPQQHPGQGHPVTIL--QRPQSPQ--PDPAQPSIQPQAQQFHQQPPPVPVQP------------ 299
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412629 857 yfaistgslAQQYAHPN---AALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 907
Cdd:pfam09770 300 ---------TQILQNPNrlsAARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
878-1031 |
1.30e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.05 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 878 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNT 947
Cdd:PRK07764 610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAA 689
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 948 QSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQS 1020
Cdd:PRK07764 690 PAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAA 769
|
170
....*....|.
gi 1720412629 1021 ALQPIPVSTTA 1031
Cdd:PRK07764 770 PAAAPPPSPPS 780
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
669-1034 |
2.87e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 2.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 669 PTSPRPQAQPSPsmvghqqpAPVYTQPvcfaPNMMYPVPVSPGVQPLYPIPMTPMPVNQaktyraGKVPNMPQQRQDQHH 748
Cdd:PHA03247 2551 PPPPLPPAAPPA--------APDRSVP----PPRPAPRPSEPAVTSRARRPDAPPQSAR------PRAPVDDRGDPRGPA 2612
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 749 QSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQPGLVSSSA 828
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV 2692
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 829 AQFGAHEQTHAMyacPKLPYNKETSPSFYFAISTGSLAQQYAHPNAALHPHTP----HPQPSATPTGQQQSQHGGSHPAP 904
Cdd:PHA03247 2693 GSLTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAP 2769
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 905 SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQ 984
Cdd:PHA03247 2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 1720412629 985 AHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFP 1034
Cdd:PHA03247 2850 LPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
5-72 |
3.83e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 3.83e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412629 5 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 72
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
369-518 |
4.43e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 40.90 E-value: 4.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 369 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 435
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 436 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 515
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412629 516 SPV 518
Cdd:pfam12004 354 SPV 356
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
644-659 |
4.46e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.67 E-value: 4.46e-03
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
300-482 |
6.62e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 40.63 E-value: 6.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 300 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 375
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 376 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQS---------------SIGNSPSGPVLA 440
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPAdddpppweelppefaSPAPAQPDAAPA 519
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1720412629 441 SPQAGIIPAEAVSMP---------VPAASPTPASPASNRALTPSIEAKDSR 482
Cdd:PRK12323 520 GWVAESIPDPATADPddafetlapAPAAAPAPRAAAATEPVVAPRPPRASA 570
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
594-724 |
6.93e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412629 594 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 672
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412629 673 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 724
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|