NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720412640|ref|XP_030110119|]
View 

ataxin-2 isoform X28 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 5.89e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


:

Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 5.89e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 8.26e-16

LsmAD domain; This domain is found associated with Lsm domain.


:

Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 8.26e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PAT1 super family cl37801
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
763-951 1.50e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


The actual alignment was detected with superfamily member pfam09770:

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 55.81  E-value: 1.50e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770  169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMM 910
Cdd:pfam09770  248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNP 320
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1720412640  911 APPAHAQPGlvsssaaqfgahEQTHAMYVSTGSLAQQYAHP 951
Cdd:pfam09770  321 QPGVQPAPA------------HQAHRQQGSFGRQAPIITHP 349
PHA03247 super family cl33720
large tegument protein UL36; Provisional
485-940 3.81e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 3.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  485 PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPAEAVSMPVPAASPT 561
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  562 PAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQIDDLKKFKNDFRLQ 638
Cdd:PHA03247  2633 PAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  639 PSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNAEHKRGPEVTSQGV 718
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAGPPRRLTRPAVASL 2791
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  719 QTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG--------------- 783
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvapggdvrrrp 2866
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  784 -HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTY-RAVPNMPQQRQDQHHQSTMMHPASAAgPP 859
Cdd:PHA03247  2867 pSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPPPpQPQPQPPPPPQPQPPPPPPPRPQPPL-AP 2945
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  860 IVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYV 939
Cdd:PHA03247  2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPV 3022

                   .
gi 1720412640  940 S 940
Cdd:PHA03247  3023 S 3023
PRK10263 super family cl35903
DNA translocase FtsK; Provisional
899-1066 2.41e-03

DNA translocase FtsK; Provisional


The actual alignment was detected with superfamily member PRK10263:

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  899 YSPVIQGNArMMAPPAHAQPGLVSSS--AAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHP----QPSATPTGQQ 972
Cdd:PRK10263   307 YDPLLNGAP-ITEPVAVAAAATTATQswAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPviapAPEGYPQQSQ 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  973 QSQHGGSHPAP--------SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIH 1044
Cdd:PRK10263   386 YAQPAVQYNEPlqqpvqpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
                          170       180
                   ....*....|....*....|..
gi 1720412640 1045 PSHVQPAYTTPPHMAhvPQYKP 1066
Cdd:PRK10263   466 QTYQQPAAQEPLYQQ--PQPVE 485
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 5.89e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 5.89e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 8.26e-16

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 8.26e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
763-951 1.50e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 55.81  E-value: 1.50e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770  169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMM 910
Cdd:pfam09770  248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNP 320
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1720412640  911 APPAHAQPGlvsssaaqfgahEQTHAMYVSTGSLAQQYAHP 951
Cdd:pfam09770  321 QPGVQPAPA------------HQAHRQQGSFGRQAPIITHP 349
PHA03247 PHA03247
large tegument protein UL36; Provisional
762-1060 2.50e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.50e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  762 PKPSTTPTSPRPQAQPSP--SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 833
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  834 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 912
Cdd:PHA03247  2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  913 PAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLA------QQYAHPNAALHPH---------------------------T 959
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPPvrrlarpavsrstesfalppdqperppQ 2910
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  960 PHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaaqqt 1039
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA------- 2983
                          330       340
                   ....*....|....*....|.
gi 1720412640 1040 vftihPSHVQPAYTTPPHMAH 1060
Cdd:PHA03247  2984 -----PSREAPASSTPPLTGH 2999
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
734-899 7.79e-06

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 7.79e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628  362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  814 LypiPMTPMPVNQAktyRAVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQS 893
Cdd:TIGR01628  433 R---PNGLAPMNAV---RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506
                          170
                   ....*....|..
gi 1720412640  894 Q------HPHVY 899
Cdd:TIGR01628  507 QvlgerlFPLVE 518
PHA03247 PHA03247
large tegument protein UL36; Provisional
485-940 3.81e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 3.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  485 PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPAEAVSMPVPAASPT 561
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  562 PAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQIDDLKKFKNDFRLQ 638
Cdd:PHA03247  2633 PAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  639 PSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNAEHKRGPEVTSQGV 718
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAGPPRRLTRPAVASL 2791
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  719 QTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG--------------- 783
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvapggdvrrrp 2866
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  784 -HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTY-RAVPNMPQQRQDQHHQSTMMHPASAAgPP 859
Cdd:PHA03247  2867 pSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPPPpQPQPQPPPPPQPQPPPPPPPRPQPPL-AP 2945
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  860 IVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYV 939
Cdd:PHA03247  2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPV 3022

                   .
gi 1720412640  940 S 940
Cdd:PHA03247  3023 S 3023
PRK10263 PRK10263
DNA translocase FtsK; Provisional
899-1066 2.41e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  899 YSPVIQGNArMMAPPAHAQPGLVSSS--AAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHP----QPSATPTGQQ 972
Cdd:PRK10263   307 YDPLLNGAP-ITEPVAVAAAATTATQswAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPviapAPEGYPQQSQ 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  973 QSQHGGSHPAP--------SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIH 1044
Cdd:PRK10263   386 YAQPAVQYNEPlqqpvqpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
                          170       180
                   ....*....|....*....|..
gi 1720412640 1045 PSHVQPAYTTPPHMAhvPQYKP 1066
Cdd:PRK10263   466 QTYQQPAAQEPLYQQ--PQPVE 485
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
92-159 3.78e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.84  E-value: 3.78e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640   92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
DUF3498 pfam12004
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ...
468-617 3.87e-03

Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.


Pssm-ID: 463427 [Multi-domain]  Cd Length: 511  Bit Score: 41.28  E-value: 3.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004  196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004  274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353

                   ...
gi 1720412640  615 SPV 617
Cdd:pfam12004  354 SPV 356
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 5.89e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 5.89e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 8.26e-16

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 8.26e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
763-951 1.50e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 55.81  E-value: 1.50e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770  169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMM 910
Cdd:pfam09770  248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNP 320
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1720412640  911 APPAHAQPGlvsssaaqfgahEQTHAMYVSTGSLAQQYAHP 951
Cdd:pfam09770  321 QPGVQPAPA------------HQAHRQQGSFGRQAPIITHP 349
PHA03247 PHA03247
large tegument protein UL36; Provisional
762-1060 2.50e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.50e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  762 PKPSTTPTSPRPQAQPSP--SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 833
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  834 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 912
Cdd:PHA03247  2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  913 PAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLA------QQYAHPNAALHPH---------------------------T 959
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPPvrrlarpavsrstesfalppdqperppQ 2910
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  960 PHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaaqqt 1039
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA------- 2983
                          330       340
                   ....*....|....*....|.
gi 1720412640 1040 vftihPSHVQPAYTTPPHMAH 1060
Cdd:PHA03247  2984 -----PSREAPASSTPPLTGH 2999
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
740-985 6.41e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 50.42  E-value: 6.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  740 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 809
Cdd:pfam09770   98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  810 GVQPLYPIPMTPMPVNQAKTYRAVPNMPQ---QRQDQHHQSTMMHPASAAGPPivATPPAYSTQYVAYSPQQFPNQPLVQ 886
Cdd:pfam09770  170 AAAPAPAPQPAAQPASLPAPSRKMMSLEEveaAMRAQAKKPAQQPAPAPAQPP--AAPPAQQAQQQQQFPPQIQQQQQPQ 247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  887 HVPHYQSQHPHVYSPVIQGNaRMMAPPAhaQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHPQPSA 966
Cdd:pfam09770  248 QQPQQPQQHPGQGHPVTILQ-RPQSPQP--DPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQPGV 324
                          250       260
                   ....*....|....*....|....*
gi 1720412640  967 TPTG------QQQSQHGGSHPAPSP 985
Cdd:pfam09770  325 QPAPahqahrQQGSFGRQAPIITHP 349
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
734-899 7.79e-06

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 7.79e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628  362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  814 LypiPMTPMPVNQAktyRAVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQS 893
Cdd:TIGR01628  433 R---PNGLAPMNAV---RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506
                          170
                   ....*....|..
gi 1720412640  894 Q------HPHVY 899
Cdd:TIGR01628  507 QvlgerlFPLVE 518
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
756-1063 2.38e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 2.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  756 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPN 835
Cdd:PRK07764   400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  836 MPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQpLVQHVPHYQ-------SQHPHVYSpvIQGN-- 906
Cdd:PRK07764   472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPE-ILAAVPKRSrktwailLPEATVLG--VRGDtl 548
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  907 ---------ARMMAPPAHAQpGLVSSSAAQFGAHEQTHAmYVSTGSLAQQYAHPNAAL----HPHTPHPQPSATPTGQQQ 973
Cdd:PRK07764   549 vlgfstgglARRFASPGNAE-VLVTALAEELGGDWQVEA-VVGPAPGAAGGEGPPAPAssgpPEEAARPAAPAAPAAPAA 626
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  974 SQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYT 1053
Cdd:PRK07764   627 PAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA 706
                          330
                   ....*....|
gi 1720412640 1054 TPPHMAHVPQ 1063
Cdd:PRK07764   707 ATPPAGQADD 716
PHA03247 PHA03247
large tegument protein UL36; Provisional
485-940 3.81e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 3.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  485 PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPAEAVSMPVPAASPT 561
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  562 PAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQIDDLKKFKNDFRLQ 638
Cdd:PHA03247  2633 PAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  639 PSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNAEHKRGPEVTSQGV 718
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAGPPRRLTRPAVASL 2791
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  719 QTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG--------------- 783
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvapggdvrrrp 2866
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  784 -HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTY-RAVPNMPQQRQDQHHQSTMMHPASAAgPP 859
Cdd:PHA03247  2867 pSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPPPpQPQPQPPPPPQPQPPPPPPPRPQPPL-AP 2945
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  860 IVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYV 939
Cdd:PHA03247  2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPV 3022

                   .
gi 1720412640  940 S 940
Cdd:PHA03247  3023 S 3023
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
704-1056 5.88e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 5.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154  127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  784 HQQPAPVYTQPVCFAPNMMYPV-------------------PVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQh 844
Cdd:pfam03154  207 PPQGSPATSQPPNQTQSTAAPHtliqqtptlhpqrlpsphpPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTG- 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  845 hQSTMMHPASAAGPPIVAT------PPAYSTQYVAYSPQ-----------QFPNQPLVQHVPHYQSQHPHVYSPVIQGNA 907
Cdd:pfam03154  286 -PSHMQHPVPPQPFPLTPQssqsqvPPGPSPAAPGQSQQrihtppsqsqlQSQQPPREQPLPPAPLSMPHIKPPPTTPIP 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  908 RMMAPPAHAQPGLVSS-SAAQFGAHEQTHAMYVSTGSLAQQY---AHPNA-ALHPHT-PHPQPSATPTGQQQSQhggSHP 981
Cdd:pfam03154  365 QLPNPQSHKHPPHLSGpSPFQMNSNLPPPPALKPLSSLSTHHppsAHPPPlQLMPQSqQLPPPPAQPPVLTQSQ---SLP 441
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720412640  982 APSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPP 1056
Cdd:pfam03154  442 PPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPP 516
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
399-606 5.93e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.18  E-value: 5.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323   447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
PHA03247 PHA03247
large tegument protein UL36; Provisional
756-1036 6.75e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 6.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  756 PRSFSQPKPSTTPTSPRPQAQPSPSmvghqQPAPVYTQPVCFAPNmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPN 835
Cdd:PHA03247  2593 PQSARPRAPVDDRGDPRGPAPPSPL-----PPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPERPRDDPAPGRVSRP 2664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  836 MPQQRQDQhhqstmmhPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAH 915
Cdd:PHA03247  2665 RRARRLGR--------AAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALP 2736
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  916 AQPGLVSSSAAqfgaheqthamyvstgslAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHGGSHPAPsPVQHHQHQAAQ 995
Cdd:PHA03247  2737 AAPAPPAVPAG------------------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP-AVASLSESRES 2797
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1720412640  996 ALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAA 1036
Cdd:PHA03247  2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA 2838
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
841-1063 8.56e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.57  E-value: 8.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  841 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPP 913
Cdd:pfam09770   96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPA 174
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  914 AHAQPGLVSSSAAQFG----------------AHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHG 977
Cdd:pfam09770  175 PAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQ 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  978 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQ----SSFPAAQQTVFTIHPSHVQPAYT 1053
Cdd:pfam09770  255 QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNrlsaARVGYPQNPQPGVQPAPAHQAHR 334
                          250
                   ....*....|
gi 1720412640 1054 TPPHMAHVPQ 1063
Cdd:pfam09770  335 QQGSFGRQAP 344
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1070 1.50e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 1.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  747 LNPNAKEFNPRSFSQPKPSTTPTSPRPQAQ--PSPSMVG---------------------------------------HQ 785
Cdd:PHA03247  2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRddPAPGRVSrprrarrlgraaqassppqrprrraarptvgsltsladpPP 2703
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  786 QPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTyRAVPNMPQQRQDQHHQSTMMHPAS----AAGPPIV 861
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG-PATPGGPARPARPPTTAGPPAPAPpaapAAGPPRR 2782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  862 ATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYVST 941
Cdd:PHA03247  2783 LTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  942 GSLAQQYAHPNAALHPHTPH-------PQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLA 1014
Cdd:PHA03247  2860 GDVRRRPPSRSPAAKPAAPArppvrrlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412640 1015 PTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQYKPTTNS 1070
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
PRK10263 PRK10263
DNA translocase FtsK; Provisional
760-1030 2.77e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.08  E-value: 2.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  760 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipMTPMPVNQAKTYRAVPNMPQ 838
Cdd:PRK10263   345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL----QQPVQPQQPYYAPAAEQPAQ 417
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  839 QRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgNARMMAPPAHAQP 918
Cdd:PRK10263   418 QPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-EPLYQQPQPVEQQ 487
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  919 GLVSSSAAQFGAHEQTHAMYVSTgSLAQQYAHPNAALHP-HTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQAL 997
Cdd:PRK10263   488 PVVEPEPVVEETKPARPPLYYFE-EVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLAS 566
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1720412640  998 HLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1030
Cdd:PRK10263   567 GV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
PRK10263 PRK10263
DNA translocase FtsK; Provisional
779-1003 4.38e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 4.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  779 PSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPgVQPLYPIPMTPMPVNQaktyravPNMPQQRQdqhhqstmmhPASAAGP 858
Cdd:PRK10263   309 PLLNGAPITEPVAVAAAATTATQSWAAPVEP-VTQTPPVASVDVPPAQ-------PTVAWQPV----------PGPQTGE 370
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  859 PIVATPPAystqyvAYSPQQFPNQPLVQHVPHYQSQHPHvyspviqgnarmmAPPAHAQPGLVSSSAAQFGAHEQTHAMY 938
Cdd:PRK10263   371 PVIAPAPE------GYPQQSQYAQPAVQYNEPLQQPVQP-------------QQPYYAPAAEQPAQQPYYAPAPEQPAQQ 431
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720412640  939 vstGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQ 1003
Cdd:PRK10263   432 ---PYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE 493
PRK10263 PRK10263
DNA translocase FtsK; Provisional
899-1066 2.41e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  899 YSPVIQGNArMMAPPAHAQPGLVSSS--AAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHP----QPSATPTGQQ 972
Cdd:PRK10263   307 YDPLLNGAP-ITEPVAVAAAATTATQswAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPviapAPEGYPQQSQ 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  973 QSQHGGSHPAP--------SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIH 1044
Cdd:PRK10263   386 YAQPAVQYNEPlqqpvqpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
                          170       180
                   ....*....|....*....|..
gi 1720412640 1045 PSHVQPAYTTPPHMAhvPQYKP 1066
Cdd:PRK10263   466 QTYQQPAAQEPLYQQ--PQPVE 485
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
92-159 3.78e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.84  E-value: 3.78e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412640   92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
DUF3498 pfam12004
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ...
468-617 3.87e-03

Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.


Pssm-ID: 463427 [Multi-domain]  Cd Length: 511  Bit Score: 41.28  E-value: 3.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004  196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004  274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353

                   ...
gi 1720412640  615 SPV 617
Cdd:pfam12004  354 SPV 356
PRK10263 PRK10263
DNA translocase FtsK; Provisional
693-823 5.39e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 5.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263   741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412640  772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263   821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
743-758 5.58e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.


Pssm-ID: 429316  Cd Length: 17  Bit Score: 35.28  E-value: 5.58e-03
                           10
                   ....*....|....*.
gi 1720412640  743 RKSTLNPNAKEFNPRS 758
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
735-845 7.97e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.14  E-value: 7.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  735 KKDTTEQVrkSTLNPNAKEFN---PRSFSQPKPSTTPtSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSpgv 811
Cdd:PRK14971   380 KPVFTQPA--AAPQPSAAAAAspsPSQSSAAAQPSAP-QSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFK--- 453
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1720412640  812 qPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQHH 845
Cdd:PRK14971   454 -EEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIK 486
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
443-780 8.55e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.15  E-value: 8.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  443 PKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSMSSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVP--RLSPKTHR 520
Cdd:PHA03307    73 PGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT-------PPGPSSPDPPPPTPPPASPPPSPAPDLSemLRPVGSPG 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  521 PRSPRQSSIGNSPSGPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSrlqDQRQNSPAGSKenvkas 600
Cdd:PHA03307   146 PPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPR---PPRRSSPISAS------ 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  601 etspsfskadnkGMSPVVSEHRKQIDDLkkfkndfRLQPSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSS 680
Cdd:PHA03307   217 ------------ASSPAPAPGRSAADDA-------GASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNG 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412640  681 SSSNCTSGSSKTNSPSISPSMLSNAEHKRGPEVTSQGVQTSSPackQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFS 760
Cdd:PHA03307   278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSS---SRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS 354
                          330       340
                   ....*....|....*....|
gi 1720412640  761 QPKPSTTPTSPRPQAQPSPS 780
Cdd:PHA03307   355 RPPPPADPSSPRKRPRPSRA 374
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH