|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
262-335 |
5.24e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 5.24e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 262 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 335
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
403-464 |
9.92e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.92e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 403 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 464
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
923-1231 |
5.76e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 5.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 923 PKPSTTPTSPRPQAQPSP------SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 996
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 997 PNMPQQRQDQHHQSAMMHPASAAGPPIAA-TPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1075
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRlTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1076 PTHAQPGLVSSSATQYGAHEQTHAMYVSTGSLA------QQYAHPNATLHP---HTPHPQPSATPTGQQQSQHGGSHPAP 1146
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPpvrRLARPAVSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1147 SPVQHHQHQAAQALHLASPQ---QQSAIYHAGLAPTP---PSMTPASNTQSPQNS------FPAAQQTVFTIHPSHVQPA 1214
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQpppPPPPRPQPPLAPTTdpaGAGEPSGAVPQPWLGalvpgrVAVPRFRVPQPAPSREAPA 2990
|
330
....*....|....*...
gi 1777350987 1215 -YTNPPHMAHVPQCASEA 1231
Cdd:PHA03247 2991 sSTPPLTGHSLSRVSSWA 3008
|
|
| PRK12323 super family |
cl46901 |
DNA polymerase III subunit gamma/tau; |
562-753 |
1.50e-03 |
|
DNA polymerase III subunit gamma/tau; The actual alignment was detected with superfamily member PRK12323:
Pssm-ID: 481241 [Multi-domain] Cd Length: 700 Bit Score: 42.94 E-value: 1.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 562 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 637
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 638 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 716
Cdd:PRK12323 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180 190
....*....|....*....|....*....|....*..
gi 1777350987 717 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 753
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
904-919 |
6.86e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains. :
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 6.86e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
262-335 |
5.24e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 5.24e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 262 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 335
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
403-464 |
9.92e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.92e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 403 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 464
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
923-1231 |
5.76e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 5.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 923 PKPSTTPTSPRPQAQPSP------SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 996
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 997 PNMPQQRQDQHHQSAMMHPASAAGPPIAA-TPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1075
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRlTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1076 PTHAQPGLVSSSATQYGAHEQTHAMYVSTGSLA------QQYAHPNATLHP---HTPHPQPSATPTGQQQSQHGGSHPAP 1146
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPpvrRLARPAVSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1147 SPVQHHQHQAAQALHLASPQ---QQSAIYHAGLAPTP---PSMTPASNTQSPQNS------FPAAQQTVFTIHPSHVQPA 1214
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQpppPPPPRPQPPLAPTTdpaGAGEPSGAVPQPWLGalvpgrVAVPRFRVPQPAPSREAPA 2990
|
330
....*....|....*...
gi 1777350987 1215 -YTNPPHMAHVPQCASEA 1231
Cdd:PHA03247 2991 sSTPPLTGHSLSRVSSWA 3008
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
902-1149 |
1.25e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 49.65 E-value: 1.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 902 QVRKSTLNPNAKefnprsfSQPKPSTTPTSPRPQAQPSPSMvGHQQPTPV------YTQPvcfapnmmYPVP---VSP-- 970
Cdd:pfam09770 99 QVRFNRQQPAAR-------AAQSSAQPPASSLPQYQYASQQ-SQQPSKPVrtgyekYKEP--------EPIPdlqVDAsl 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 971 -GVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQRQDQHHQ-SAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLV 1048
Cdd:pfam09770 163 wGVAPKKAAAPAPAPQPAAQPASLPAPSRKMMSLEEVEAAmRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQ 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1049 QHVPHYQSQHPhvysPVIQGNARMMA-----PPTHAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQYAHPNATLHPHTP 1123
Cdd:pfam09770 243 QQQPQQQPQQP----QQHPGQGHPVTilqrpQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQ 318
|
250 260
....*....|....*....|....*..
gi 1777350987 1124 HPQPSATPT-GQQQSQHGGSHPAPSPV 1149
Cdd:pfam09770 319 NPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
904-1062 |
1.03e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 46.72 E-value: 1.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 904 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLypiPMTPM 983
Cdd:TIGR01628 367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLR---PNGLA 438
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 984 PVNQaktyrAGKVPNMPQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 1057
Cdd:TIGR01628 439 PMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513
|
....*
gi 1777350987 1058 HPHVY 1062
Cdd:TIGR01628 514 FPLVE 518
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
562-753 |
1.50e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.94 E-value: 1.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 562 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 637
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 638 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 716
Cdd:PRK12323 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180 190
....*....|....*....|....*....|....*..
gi 1777350987 717 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 753
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
904-919 |
6.86e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 6.86e-03
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
266-333 |
9.32e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.07 E-value: 9.32e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 266 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 333
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
262-335 |
5.24e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 5.24e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 262 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 335
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
403-464 |
9.92e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.92e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 403 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 464
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
923-1231 |
5.76e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 5.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 923 PKPSTTPTSPRPQAQPSP------SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 996
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 997 PNMPQQRQDQHHQSAMMHPASAAGPPIAA-TPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1075
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRlTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1076 PTHAQPGLVSSSATQYGAHEQTHAMYVSTGSLA------QQYAHPNATLHP---HTPHPQPSATPTGQQQSQHGGSHPAP 1146
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPpvrRLARPAVSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1147 SPVQHHQHQAAQALHLASPQ---QQSAIYHAGLAPTP---PSMTPASNTQSPQNS------FPAAQQTVFTIHPSHVQPA 1214
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQpppPPPPRPQPPLAPTTdpaGAGEPSGAVPQPWLGalvpgrVAVPRFRVPQPAPSREAPA 2990
|
330
....*....|....*...
gi 1777350987 1215 -YTNPPHMAHVPQCASEA 1231
Cdd:PHA03247 2991 sSTPPLTGHSLSRVSSWA 3008
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
902-1149 |
1.25e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 49.65 E-value: 1.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 902 QVRKSTLNPNAKefnprsfSQPKPSTTPTSPRPQAQPSPSMvGHQQPTPV------YTQPvcfapnmmYPVP---VSP-- 970
Cdd:pfam09770 99 QVRFNRQQPAAR-------AAQSSAQPPASSLPQYQYASQQ-SQQPSKPVrtgyekYKEP--------EPIPdlqVDAsl 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 971 -GVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQRQDQHHQ-SAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLV 1048
Cdd:pfam09770 163 wGVAPKKAAAPAPAPQPAAQPASLPAPSRKMMSLEEVEAAmRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQ 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1049 QHVPHYQSQHPhvysPVIQGNARMMA-----PPTHAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQYAHPNATLHPHTP 1123
Cdd:pfam09770 243 QQQPQQQPQQP----QQHPGQGHPVTilqrpQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQ 318
|
250 260
....*....|....*....|....*..
gi 1777350987 1124 HPQPSATPT-GQQQSQHGGSHPAPSPV 1149
Cdd:pfam09770 319 NPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
904-1062 |
1.03e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 46.72 E-value: 1.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 904 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLypiPMTPM 983
Cdd:TIGR01628 367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLR---PNGLA 438
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 984 PVNQaktyrAGKVPNMPQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 1057
Cdd:TIGR01628 439 PMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513
|
....*
gi 1777350987 1058 HPHVY 1062
Cdd:TIGR01628 514 FPLVE 518
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
917-1312 |
3.97e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 3.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 917 PRSFSQPKPSTTPTSPRPQAQPSPSmvghqQPTPVYTQPVCFAPNmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKv 996
Cdd:PHA03247 2593 PQSARPRAPVDDRGDPRGPAPPSPL-----PPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPERPRDDPAPGRVSR- 2663
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 997 pnmpqqrqdQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPP 1076
Cdd:PHA03247 2664 ---------PRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA 2734
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1077 THAQPGLVSSSATQYGAHEQTHAMYVSTGSLAQQYAHPNAtlhPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQA 1156
Cdd:PHA03247 2735 LPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA---PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAV 2811
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1157 AQALHLASPQQQSaiyhAGLAPTPPSMTPASnTQSPQNSFPAAQQTVFTIHP----SHVQPAYTNPPHMAHVPQCASEAL 1232
Cdd:PHA03247 2812 LAPAAALPPAASP----AGPLPPPTSAQPTA-PPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRL 2886
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1233 ARCGLEMRLSWIYLSEGYLAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFPYMTHPSVQAHH 1312
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGA 2966
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
1014-1226 |
1.05e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.49 E-value: 1.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1014 HPASAAGPPIAATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPPTHAQPGLVSS 1086
Cdd:pfam09770 106 QPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPAPAPQPAAQPA 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1087 SATQYG----------------AHEQTHAMYVSTGSLAQQYAHPNATLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQ 1150
Cdd:pfam09770 185 SLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVT 264
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1777350987 1151 HHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqNSFPAAQQTVFTIHPSHVQPAytnPPHMAHVPQ 1226
Cdd:pfam09770 265 ILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAHQAHRQQ 336
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
921-1193 |
1.43e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.15 E-value: 1.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 921 SQPKPSTTPTSPRPQAQPSPsmvGHQQPTPVYT-QPVCFAPNMMYPVPVSPGVQPLYPiPMTPMPVNQAktyragkvPNM 999
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPLQQ-PVQPQQPYYA--------PAA 412
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1000 PQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgNARMMAPPTHA 1079
Cdd:PRK10263 413 EQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-EPLYQQPQPVE 485
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1080 QPGLVSSSATQYGAHEQTHAMYVSTgSLAQQYAHPNATLHP-HTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQ 1158
Cdd:PRK10263 486 QQPVVEPEPVVEETKPARPPLYYFE-EVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPL 564
|
250 260 270
....*....|....*....|....*....|....*...
gi 1777350987 1159 ALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1193
Cdd:PRK10263 565 ASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
562-753 |
1.50e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.94 E-value: 1.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 562 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 637
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 638 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 716
Cdd:PRK12323 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180 190
....*....|....*....|....*....|....*..
gi 1777350987 717 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 753
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
917-1298 |
2.21e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.28 E-value: 2.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 917 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPTPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 996
Cdd:PRK07764 400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 997 PNMPQQRQDQhhQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQpLVQHVPHYQ-------SQHPHVYSpvIQGN 1069
Cdd:PRK07764 472 AAPEPTAAPA--PAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPE-ILAAVPKRSrktwailLPEATVLG--VRGD 546
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1070 ARMMApptHAQPGLVSSSATQYGA-------HEQTHA-----MYVSTGSLAQQYAHPNA----TLHPHTPHPQPSATPTG 1133
Cdd:PRK07764 547 TLVLG---FSTGGLARRFASPGNAevlvtalAEELGGdwqveAVVGPAPGAAGGEGPPApassGPPEEAARPAAPAAPAA 623
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1134 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQP 1213
Cdd:PRK07764 624 PAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1214 AYTNPPHMAHVPQCASEALARCGlemrlswiylSEGYLAHVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSALQP 1291
Cdd:PRK07764 704 APAATPPAGQADDPAAQPPQAAQ----------GASAPSPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAPAAA 773
|
....*..
gi 1777350987 1292 IPVSTTA 1298
Cdd:PRK07764 774 PPPSPPS 780
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
940-1166 |
3.67e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.99 E-value: 3.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 940 PSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPgVQPLYPIPMTPMPVNQaktyragkvPNMPQQRQdqhhqsammhPASAA 1019
Cdd:PRK10263 309 PLLNGAPITEPVAVAAAATTATQSWAAPVEP-VTQTPPVASVDVPPAQ---------PTVAWQPV----------PGPQT 368
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350987 1020 GPPIAATPPAystqyvAYSPQQFPNQPLVQHVPHYQSQHPHvyspviqgnarmmAPPTHAQPGLVSSSATQYGAHEQTHA 1099
Cdd:PRK10263 369 GEPVIAPAPE------GYPQQSQYAQPAVQYNEPLQQPVQP-------------QQPYYAPAAEQPAQQPYYAPAPEQPA 429
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1777350987 1100 MYvstGSLAQQYAHPNATLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQ 1166
Cdd:PRK10263 430 QQ---PYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE 493
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
904-919 |
6.86e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 6.86e-03
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
266-333 |
9.32e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.07 E-value: 9.32e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350987 266 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 333
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
|