Conserved domains on  [gi|1734272048|ref|NP_001359503|]

ataxin-2 isoform 4 [Homo sapiens]



List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
108-181 4.59e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.



Pssm-ID: 464173  Cd Length: 78  Bit Score: 88.38  E-value: 4.59e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1734272048  108 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 181
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
249-310 8.69e-16

LsmAD domain; This domain is found associated with Lsm domain.



Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 8.69e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1734272048  249 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 310
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PHA03247 super family cl33720
large tegument protein UL36; Provisional
769-1087 3.46e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 3.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  769 PKPSTTPTSPRPQAQPSPsmVG--------HQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 840
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  841 KVPNMPQQRQDQHHQSAMMHPASAAGPPIA-ATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 919
Cdd:PHA03247  2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRrLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  920 APPTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNA 980
Cdd:PHA03247  2829 PPPTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQ 2904
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  981 TLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSf 1060
Cdd:PHA03247  2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA- 2983
                          330       340
                   ....*....|....*....|....*..
gi 1734272048 1061 paaqqtvftihPSHVQPAYTNPPHMAH 1087
Cdd:PHA03247  2984 -----------PSREAPASSTPPLTGH 2999
PRK12323 super family cl46901
DNA polymerase III subunit gamma/tau;
408-599 9.36e-04

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK12323:

Pssm-ID: 481241 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 9.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  408 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 483
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  484 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 562
Cdd:PRK12323   447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1734272048  563 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 599
Cdd:PRK12323   527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
750-765 7.34e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.



Pssm-ID: 429316  Cd Length: 17  Bit Score: 34.89  E-value: 7.34e-03
                           10
                   ....*....|....*.
gi 1734272048  750 RKSTLNPNAKEFNPRS 765
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
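
The per-hit summary lines above (Pssm-ID, Cd Length, Bit Score, E-value) follow a regular pattern, so they can be read into structured records. The following is a minimal sketch in Python, assuming the lines look exactly like the ones in this listing. The function name `parse_summary` and the dictionary keys are illustrative choices and are not part of any NCBI tool.

```python
import re

# Assumed layout, copied from the summary lines in this listing:
#   Pssm-ID: <int> [Multi-domain]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
# The "[Multi-domain]" flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    example = "Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 8.69e-16"
    print(parse_summary(example))
```

Lines that are not summary lines, such as the alignment rows, return `None`, so the function can be applied to every line of the listing and the non-matches filtered out.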
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
769-1087 3.46e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 3.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  769 PKPSTTPTSPRPQAQPSPsmVG--------HQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 840
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  841 KVPNMPQQRQDQHHQSAMMHPASAAGPPIA-ATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 919
Cdd:PHA03247  2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRrLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  920 APPTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNA 980
Cdd:PHA03247  2829 PPPTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQ 2904
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  981 TLHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSf 1060
Cdd:PHA03247  2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA- 2983
                          330       340
                   ....*....|....*....|....*..
gi 1734272048 1061 paaqqtvftihPSHVQPAYTNPPHMAH 1087
Cdd:PHA03247  2984 -----------PSREAPASSTPPLTGH 2999
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
860-1114 4.34e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.72  E-value: 4.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  860 HPASAAGPPIAATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPTHAQPGLVSSSA 934
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPAPAPQPAAQPA 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  935 TQYGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNATLHPHTPHPQPSATPTGQQQSQHGGSH 1007
Cdd:pfam09770  185 SLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHP 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048 1008 PAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqNSFPAAQQTVFTIHPSHVQPAytnPPHMAH 1087
Cdd:pfam09770  258 GQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAHQAH 333
                          250       260
                   ....*....|....*....|....*..
gi 1734272048 1088 vPQAHVQSGMVPSHpTAHAPMMLMTTQ 1114
Cdd:pfam09770  334 -RQQGSFGRQAPII-THPQQLAQLSEE 358
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
750-908 1.20e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are variously expressed in testis, expressed in platelets, broadly expressed, or of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 46.34  E-value: 1.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  750 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLypiPMTPM 829
Cdd:TIGR01628  367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLR---PNGLA 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  830 PVNQaktyrAGKVPNMPQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 903
Cdd:TIGR01628  439 PMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513

                   ....*
gi 1734272048  904 HPHVY 908
Cdd:TIGR01628  514 FPLVE 518
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
408-599 9.36e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 9.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  408 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 483
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  484 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 562
Cdd:PRK12323   447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1734272048  563 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 599
Cdd:PRK12323   527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
112-179 8.16e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.07  E-value: 8.16e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1734272048  112 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 179
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
764-1132 1.96e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 1.96e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  764 RSFSQPKPSTTPTSP-------RPQAQPSPSM----VGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVN 832
Cdd:PHA03247  2566 RSVPPPRPAPRPSEPavtsrarRPDAPPQSARprapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  833 QAKTYRAGKVPNMPQQRQDQHhQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVI 912
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG 2724
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  913 QGNARMMAPPTHAQP-------GLVSSSATQYGAHEQTHAMYACPKLPYNKETSPSFyfAISTGSLAQQYAHPNATLHPH 985
Cdd:PHA03247  2725 PAAARQASPALPAAPappavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPW 2802
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  986 TPHPQPSATPtGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQS 1055
Cdd:PHA03247  2803 DPADPPAAVL-APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARP 2881
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048 1056 PQNSF--PAAQQTV--FTIHPSHVQPAYTNPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPI 1131
Cdd:PHA03247  2882 PVRRLarPAVSRSTesFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961

                   .
gi 1734272048 1132 P 1132
Cdd:PHA03247  2962 P 2962
PHA03247 PHA03247
large tegument protein UL36; Provisional
763-1134 2.96e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 2.96e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  763 PRSFSQPKPSTTPTSPRPQAQPSPSmvghqQPTPVYTQPVCFAPNmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKv 842
Cdd:PHA03247  2593 PQSARPRAPVDDRGDPRGPAPPSPL-----PPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPERPRDDPAPGRVSR- 2663
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  843 pnmpqqrqdQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPP 922
Cdd:PHA03247  2664 ---------PRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA 2734
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  923 THAQP-------GLVSSSATQYGAHEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTPHPQPSATP 995
Cdd:PHA03247  2735 LPAAPappavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAP 2814
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  996 TG---QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqnSFPAAQQTVFTIHP 1072
Cdd:PHA03247  2815 AAalpPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP--ARPPVRRLARPAVS 2892
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1734272048 1073 SHVQPAYTNPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSAlQPIPVS 1134
Cdd:PHA03247  2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT-DPAGAG 2953
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
748-1010 2.99e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.03  E-value: 2.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  748 QVRKSTLNPNAKefnprsfSQPKPSTTPTSPRPQAQPSPSMvGHQQPTPV------YTQPvcfapnmmYPVP---VSP-- 816
Cdd:pfam09770   99 QVRFNRQQPAAR-------AAQSSAQPPASSLPQYQYASQQ-SQQPSKPVrtgyekYKEP--------EPIPdlqVDAsl 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  817 -GVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQRQDQHHQ-SAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLV 894
Cdd:pfam09770  163 wGVAPKKAAAPAPAPQPAAQPASLPAPSRKMMSLEEVEAAmRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQ 242
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  895 QHVPHYQSQHPhvysPVIQGNARMMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPKLPynketSPSfyfaistgslaQQ 974
Cdd:pfam09770  243 QQQPQQQPQQP----QQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPV-----QPT-----------QI 302
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1734272048  975 YAHPN------ATLHPHTPHPQPSATPTGQQQSQHGGSHPAP 1010
Cdd:pfam09770  303 LQNPNrlsaarVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAP 344
PRK10263 PRK10263
DNA translocase FtsK; Provisional
767-1057 4.16e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 4.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  767 SQPKPSTTPTSPRPQAQPSPsmvGHQQPTPVYT-QPVCFAPNMMYPVPVSPGVQPLYPiPMTPMPVNQAktyragkvPNM 845
Cdd:PRK10263   345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPLQQ-PVQPQQPYYA--------PAA 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  846 PQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPTHA 925
Cdd:PRK10263   413 EQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQ 479
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  926 QPGLVsssatqygahEQTHAMYACPKLPYNKETSPSFYFaisTGSLAQQYAHPNATLHP-HTPHPQPSATPTGQQQSQHG 1004
Cdd:PRK10263   480 QPQPV----------EQQPVVEPEPVVEETKPARPPLYY---FEEVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKA 546
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1734272048 1005 GSHPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1057
Cdd:PRK10263   547 PSVAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
PHA03378 PHA03378
EBNA-3B; Provisional
752-1107 8.71e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.52  E-value: 8.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  752 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPTPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 831
Cdd:PHA03378   583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  832 NQAKTYraGKVPNMPQQRQDQHHqsAMMHPASAAgPPIAATPPAYSTQyvAYSPQQFPnqplvqhVPHYQSQHPHVYSPV 911
Cdd:PHA03378   661 PYKPTW--TQIGHIPYQPSPTGA--NTMLPIQWA-PGTMQPPPRAPTP--MRPPAAPP-------GRAQRPAAATGRARP 726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  912 IQGNARMMAPPTHAqPGLVSSSATQYGAHEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQyaHPNATlhpHTPHPQP 991
Cdd:PHA03378   727 PAAAPGRARPPAAA-PGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQ--RPRGA---PTPQPPP 800
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  992 SATPTGQQQS--QHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYH---AGLAPTPPSMTPASNTQSPQNSFPAAQQT 1066
Cdd:PHA03378   801 QAGPTSMQLMprAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERqaaAGPTPSPGSGTSDKIVQAPVFYPPVLQPI 880
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1734272048 1067 VFTIHPSHVQPAYTNPPHMAHVPQAHVQSGMVPSHPTAHAP 1107
Cdd:PHA03378   881 QVMRQLGSVRAAAASTVTQAPTEYTGERRGVGPMHPTDIPP 921
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
408-599 9.36e-04

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 9.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  408 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 483
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  484 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 562
Cdd:PRK12323   447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1734272048  563 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 599
Cdd:PRK12323   527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
984-1137 1.27e-03

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  984 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNT 1053
Cdd:PRK07764   610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAA 689
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048 1054 QSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQS 1126
Cdd:PRK07764   690 PAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAA 769
                          170
                   ....*....|.
gi 1734272048 1127 ALQPIPVSTTA 1137
Cdd:PRK07764   770 PAAAPPPSPPS 780
PHA03247 PHA03247
large tegument protein UL36; Provisional
775-1140 1.95e-03

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  775 PTSPRPQAQPSPSmVGHQQPTPVYTqpvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQaktyraGKVPNMPQQRQDQHH 854
Cdd:PHA03247  2551 PPPPLPPAAPPAA-PDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSAR------PRAPVDDRGDPRGPA 2612
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  855 QSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTHAQPGLVSSSA 934
Cdd:PHA03247  2613 PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV 2692
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  935 TQYGAHEQTHAMyacPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTP----HPQPSATPTGQQQSQHGGSHPAP 1010
Cdd:PHA03247  2693 GSLTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAP 2769
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048 1011 SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQ 1090
Cdd:PHA03247  2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1734272048 1091 AHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFP 1140
Cdd:PHA03247  2850 LPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
PHA03247 PHA03247
large tegument protein UL36; Provisional
407-1013 3.73e-03

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 3.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  407 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSpkAQRHPRNHRVSAGRGSISSG 486
Cdd:PHA03247  2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML--TWIRGLEELASDDAGDPPPP 2554
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  487 LEFVSHNP-PSEAATPPVARTSPSGgtwssvvsgvPRLSPKTHRPRSPRQNSIGNTP---SGPVLASPQAGIIPTEAVAM 562
Cdd:PHA03247  2555 LPPAAPPAaPDRSVPPPRPAPRPSE----------PAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPPDTHAP 2624
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  563 PIPAASPTPAS---PASNRAVTPSSEAKDSRLQDQRQNSPAGNKENIKPNETSPSFSKAENKGISPVVSEHRKQIDDLKK 639
Cdd:PHA03247  2625 DPPPPSPSPAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP 2704
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  640 FKNDfRLQPSSTSESMDQLLNKNREGEKSRDLIKDKIEPSAKDSFIENSSSNCTSGSSKPN--SPSISPSILSNTEHKRG 717
Cdd:PHA03247  2705 PPTP-EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAGPPRRL 2783
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  718 PEVTSQGVQTSSPACKQEKDDKEEKKDAAEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG------- 790
Cdd:PHA03247  2784 TRPAVASLSESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvap 2858
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  791 ----HQQPTPVYTQPVCFAPNMMY-------PVPVSPGVQPLYPIPMTPMPVNQAKTyRAGKVPNMPQQRQDQHHQSAMM 859
Cdd:PHA03247  2859 ggdvRRRPPSRSPAAKPAAPARPPvrrlarpAVSRSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPPPPPPP 2937
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  860 HPASAAgPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPTHAQPGLVSSSATQYGA 939
Cdd:PHA03247  2938 RPQPPL-APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWASSLAL 3013
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1734272048  940 HEQTHAMYACPK----LPYNKETSPSFYFAISTGSLAQQYA---HPNATLHPHTPHPQPSATPTGQQQSQHggSHPAPSP 1012
Cdd:PHA03247  3014 HEETDPPPVSLKqtlwPPDDTEDSDADSLFDSDSERSDLEAldpLPPEPHDPFAHEPDPATPEAGARESPS--SQFGPPP 3091

                   .
gi 1734272048 1013 V 1013
Cdd:PHA03247  3092 L 3092
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in a wide range of eukaryotic proteins as an important binding site for pfam00658. Strikingly, this motif appears to occur solely outside of globular domains.
750-765 7.34e-03

Pssm-ID: 429316  Cd Length: 17  Bit Score: 34.89  E-value: 7.34e-03
                           10
                   ....*....|....*.
gi 1734272048  750 RKSTLNPNAKEFNPRS 765
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
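The PAM2 hit above is short enough to check by hand, so it makes a convenient example of how the two aligned rows can be compared. The following is a minimal sketch, not part of the CD-Search output: the function name `percent_identity` is hypothetical, and treating lowercase and uppercase residues as equivalent is an assumption made for illustration.

```python
def percent_identity(query: str, subject: str) -> float:
    """Return the fraction of aligned columns holding identical residues.

    Case is ignored (an assumption for this sketch). A column with a gap
    ('-') in either row counts toward the length but never as a match.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("aligned rows must not be empty")
    matches = sum(
        1
        for q, s in zip(query.upper(), subject.upper())
        if q == s and q != "-"
    )
    return matches / len(query)


# Rows copied from the PAM2 alignment block above.
query_row = "RKSTLNPNAKEFNPRS"  # gi 1734272048, residues 750-765
model_row = "SKSKLNPNAKEFVPSF"  # Cdd:pfam07145, positions 1-16

identity = percent_identity(query_row, model_row)
print(f"{identity:.4f}")
```

For these two rows, 11 of the 16 columns are identical. Percent identity is only a rough companion to the bit score and E-value that CD-Search reports, which come from scoring the query against a position-specific scoring matrix.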
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins also exist in archaea and bacteria, where they form heptameric and hexameric ring structures similar to those found in eukaryotes.
112-179 8.16e-03

Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.07  E-value: 8.16e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1734272048  112 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 179
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
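Every hit listed above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and all reported hits fall below the 0.01 E-value threshold. The sketch below shows one way such lines could be parsed and re-filtered at a stricter cutoff. It assumes the text layout seen on this page; the names `HitSummary` and `parse_summary` are hypothetical, and this is not an NCBI API.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern for the summary line as it appears in this page's text.
_SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<len>\d+)\s+"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s+"
    r"E-value:\s*(?P<e>\S+)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Parse one summary line; return None if the line does not match."""
    m = _SUMMARY.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m["id"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["len"]),
        bit_score=float(m["bits"]),
        e_value=float(m["e"]),
    )


# Two summary lines copied from the hit list above.
lines = [
    "Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.52  E-value: 8.71e-04",
    "Pssm-ID: 429316  Cd Length: 17  Bit Score: 34.89  E-value: 7.34e-03",
]
hits = [h for h in map(parse_summary, lines) if h is not None]

# Keep only hits below a stricter, illustrative cutoff of 1e-03.
strict = [h for h in hits if h.e_value < 1e-03]
for h in strict:
    print(h.pssm_id, h.bit_score, h.e_value)
```

With the two sample lines, only the PHA03378 hit (Pssm-ID 223065) survives the stricter cutoff; the PAM2 hit (Pssm-ID 429316) passes the page's 0.01 threshold but not 1e-03.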

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.