Conserved domains on  [gi|1777350993|ref|XP_031506619|]
ataxin-2 isoform X11 [Papio anubis]

List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
262-335 5.21e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 88.38  E-value: 5.21e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350993  262 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 335
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
403-464 9.85e-16

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 9.85e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350993  403 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 464
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PHA03247 super family cl33720
large tegument protein UL36; Provisional
923-1239 9.83e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 9.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  923 PKPSTTPTSPRPQAQPSP--SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 994
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  995 PNMPQQRQDQHHQSAMMHPAS-AAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1073
Cdd:PHA03247  2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1074 PTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNATL 1134
Cdd:PHA03247  2831 PTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQPE 2906
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1135 HPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfpa 1214
Cdd:PHA03247  2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA--- 2983
                          330       340
                   ....*....|....*....|....*
gi 1777350993 1215 aqqtvftihPSHVQPAYTNPPHMAH 1239
Cdd:PHA03247  2984 ---------PSREAPASSTPPLTGH 2999
PRK12323 super family cl46901
DNA polymerase III subunit gamma/tau;
562-753 1.60e-03

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK12323:

Pssm-ID: 481241 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 1.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  562 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 637
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  638 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 716
Cdd:PRK12323   447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1777350993  717 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 753
Cdd:PRK12323   527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
904-919 8.79e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.


Pssm-ID: 429316  Cd Length: 17  Bit Score: 34.89  E-value: 8.79e-03
                           10
                   ....*....|....*.
gi 1777350993  904 RKSTLNPNAKEFNPRS 919
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
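
Each hit in the listing above carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) with a regular layout, so it can be read into a structured record. The following is a minimal Python sketch, assuming only the line layout seen on this page. The helper names `parse_stats` and `model_coverage` are hypothetical and not part of any NCBI tool. The coverage figure is simply the aligned span of the domain model divided by its length, taken from the `Cdd:` row coordinates.

```python
import re

# Layout assumed from the statistics lines on this page, e.g.
# "Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 9.85e-16"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def model_coverage(cd_start, cd_end, cd_length):
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    return (cd_end - cd_start + 1) / cd_length


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 464173  Cd Length: 78  Bit Score: 88.38  E-value: 5.21e-21"
    )
    print(stats)
    # The SM-ATX alignment runs over model positions 1 to 78.
    print(model_coverage(1, 78, stats["cd_length"]))
```

For the PAM2 hit, the alignment spans model positions 1 to 16 of a 17-position model, so the same calculation gives 16/17.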
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
923-1239 9.83e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 9.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  923 PKPSTTPTSPRPQAQPSP--SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 994
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  995 PNMPQQRQDQHHQSAMMHPAS-AAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1073
Cdd:PHA03247  2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1074 PTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNATL 1134
Cdd:PHA03247  2831 PTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQPE 2906
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1135 HPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfpa 1214
Cdd:PHA03247  2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA--- 2983
                          330       340
                   ....*....|....*....|....*
gi 1777350993 1215 aqqtvftihPSHVQPAYTNPPHMAH 1239
Cdd:PHA03247  2984 ---------PSREAPASSTPPLTGH 2999
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
924-1096 1.43e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.65  E-value: 1.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  924 KPSTTPTSPRPQAQPSPSMVGHQ--------------QPTPVYTQPvcfaPNMMYPVPVSPGVQPLYPIPMTPMPVNQAK 989
Cdd:pfam09770  169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamraQAKKPAQQP----APAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  990 TYRAVPNMPQQRQDQHHQ-SAMMHPASAAGPPiaaTPPAYSTQyvaySPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQ 1065
Cdd:pfam09770  245 QPQQQPQQPQQHPGQGHPvTILQRPQSPQPDP---AQPSIQPQ----AQQFHQQPPPVPVQPTQILQNPNRLSAArvgYP 317
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1777350993 1066 GNARMMAPPTHAQPGLVSSSATQYGAHEQTH 1096
Cdd:pfam09770  318 QNPQPGVQPAPAHQAHRQQGSFGRQAPIITH 348
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
931-1044 5.92e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are respectively expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.49  E-value: 5.92e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  931 SPRPQAQPSPS-MVGHQQPTPVYTQpvcfAPNMMYP-VPVSPGVQPLYPIPMTPMPVNQAKTYravPNMPQQRQDQHHQS 1008
Cdd:TIGR01628  379 QPRMRQLPMGSpMGGAMGQPPYYGQ----GPQQQFNgQPLGWPRMSMMPTPMGPGGPLRPNGL---APMNAVRAPSRNAQ 451
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1777350993 1009 AMMHPASAagPPIAATPPAYSTQyvaySPQQFPNQP 1044
Cdd:TIGR01628  452 NAAQKPPM--QPVMYPPNYQSLP----LSQDLPQPQ 481
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
562-753 1.60e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 1.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  562 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 637
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  638 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 716
Cdd:PRK12323   447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1777350993  717 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 753
Cdd:PRK12323   527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
266-333 9.25e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.07  E-value: 9.25e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350993  266 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 333
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
924-1286 1.09e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  924 KPSTTPTSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQd 1003
Cdd:PHA03247  2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1004 qhhQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTHAQP---- 1079
Cdd:PHA03247  2667 ---ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPappa 2743
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1080 ---GLVSSSATQYGAHEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTPHPQPSATPTG---QQQS 1153
Cdd:PHA03247  2744 vpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAalpPAAS 2823
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1154 QHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqnSFPAAQQTVFTIHPSHVQPAYTN 1233
Cdd:PHA03247  2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP--ARPPVRRLARPAVSRSTESFALP 2901
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1777350993 1234 PPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSAlQPIPVS 1286
Cdd:PHA03247  2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT-DPAGAG 2953
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
1012-1266 5.02e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.72  E-value: 5.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1012 HPASAAGPPIAATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPTHAQPGLVSSSA 1086
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPAPAPQPAAQPA 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1087 TQYGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNATLHPHTPHPQPSATPTGQQQSQHGGSH 1159
Cdd:pfam09770  185 SLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHP 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1160 PAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqNSFPAAQQTVFTIHPSHVQPAytnPPHMAH 1239
Cdd:pfam09770  258 GQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAHQAH 333
                          250       260
                   ....*....|....*....|....*..
gi 1777350993 1240 vPQAHVQSGMVPSHpTAHAPMMLMTTQ 1266
Cdd:pfam09770  334 -RQQGSFGRQAPII-THPQQLAQLSEE 358
PRK10263 PRK10263
DNA translocase FtsK; Provisional
921-1209 1.41e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.62  E-value: 1.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  921 SQPKPSTTPTSPRPQAQPSPsmvGHQQPTPVYT-QPVCFAPNMMYPVPVSPGVQPLYPiPMTPMPVNQAKTYRAVPNMPQ 999
Cdd:PRK10263   345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPLQQ-PVQPQQPYYAPAAEQPAQQPY 420
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1000 QRQDQHHQSAMMHPASAAGPPIAATPpaystqyvaysPQQFPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPTHAQP 1079
Cdd:PRK10263   421 YAPAPEQPAQQPYYAPAPEQPVAGNA-----------WQAEEQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQQP 481
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1080 GLVsssatqygahEQTHAMYACPKLPYNKETSPSFYFaisTGSLAQQYAHPNATLHP-HTPHPQPSATPTGQQQSQHGGS 1158
Cdd:PRK10263   482 QPV----------EQQPVVEPEPVVEETKPARPPLYY---FEEVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPS 548
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1777350993 1159 HPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1209
Cdd:PRK10263   549 VAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
922-1162 2.37e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.41  E-value: 2.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  922 QPKPSTTPTSPRPQAQPSPSMVGHQQPTPVYTQPVCFA-PNMMYPVPVSP--------GVQPLYPIPMTPMPVNQAKTYR 992
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyEKYKEPEPIPDlqvdaslwGVAPKKAAAPAPAPQPAAQPAS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  993 AVPNMPQQRQDQHHQSAMM---HPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvysPVIQGNAR 1069
Cdd:pfam09770  186 LPAPSRKMMSLEEVEAAMRaqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP----QQHPGQGH 261
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1070 MMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPKLPynketSPSfyfaistgslaQQYAHPN------ATLHPHTPHPQP 1143
Cdd:pfam09770  262 PVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPV-----QPT-----------QILQNPNrlsaarVGYPQNPQPGVQ 325
                          250
                   ....*....|....*....
gi 1777350993 1144 SATPTGQQQSQHGGSHPAP 1162
Cdd:pfam09770  326 PAPAHQAHRQQGSFGRQAP 344
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
904-1060 5.95e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are respectively expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 44.03  E-value: 5.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  904 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ--PLYPIPMT 981
Cdd:TIGR01628  367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPlrPNGLAPMN 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  982 PMpvnqaktyRAVPNMPQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 1055
Cdd:TIGR01628  442 AV--------RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513

                   ....*
gi 1777350993 1056 HPHVY 1060
Cdd:TIGR01628  514 FPLVE 518
PHA03378 PHA03378
EBNA-3B; Provisional
906-1259 7.22e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 7.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  906 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPTPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 985
Cdd:PHA03378   583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  986 NQAKTYRAVPNMPQQRQDQHHqsAMMHPASAAgPPIAATPPAYSTQyvAYSPQQFPnqplvqhVPHYQSQHPHVYSPVIQ 1065
Cdd:PHA03378   661 PYKPTWTQIGHIPYQPSPTGA--NTMLPIQWA-PGTMQPPPRAPTP--MRPPAAPP-------GRAQRPAAATGRARPPA 728
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1066 GNARMMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPKlpynketspsfyfaistgslaqqyahPNATLHPHTPHPQPSA 1145
Cdd:PHA03378   729 AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARP--------------------------PAAAPGAPTPQPPPQA 782
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1146 TPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPTPP 1197
Cdd:PHA03378   783 PPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPSPG 859
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1777350993 1198 SMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQAHVQSGMVPSHPTAHAP 1259
Cdd:PHA03378   860 SGTSDKIVQAPVFYPPVLQPIQVMRQLGSVRAAAASTVTQAPTEYTGERRGVGPMHPTDIPP 921
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
917-1289 1.93e-03


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 1.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  917 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPTPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPN 996
Cdd:PRK07764   400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  997 MPQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQpLVQHVPHYQ-------SQHPHVYSpvIQGNAR 1069
Cdd:PRK07764   472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPE-ILAAVPKRSrktwailLPEATVLG--VRGDTL 548
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1070 MMApptHAQPGLVSSSATQYGA-------HEQTHAMYAcpklpYNKETSPSFYFAISTGSLAQQYAHPNatlhPHTPHPQ 1142
Cdd:PRK07764   549 VLG---FSTGGLARRFASPGNAevlvtalAEELGGDWQ-----VEAVVGPAPGAAGGEGPPAPASSGPP----EEAARPA 616
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1143 PSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSF 1212
Cdd:PRK07764   617 APAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGA 696
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1213 PAAQQTVFTIHPSHVQPAYTNPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSALQPIPV 1285
Cdd:PRK07764   697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAPAAAPPP 776

                   ....
gi 1777350993 1286 STTA 1289
Cdd:PRK07764   777 SPPS 780
PHA03247 PHA03247
large tegument protein UL36; Provisional
929-1292 2.45e-03


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 2.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  929 PTSPRPQAQPSPSmVGHQQPTPVYTqpvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQdqhhQS 1008
Cdd:PHA03247  2551 PPPPLPPAAPPAA-PDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPA----PP 2614
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1009 AMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTHAQPGLVSSSATQ 1088
Cdd:PHA03247  2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1089 YGAHEQTHAMyacPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTP----HPQPSATPTGQQQSQHGGSHPAPSP 1164
Cdd:PHA03247  2695 LTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAPAP 2771
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1165 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQAH 1244
Cdd:PHA03247  2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1777350993 1245 VQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFP 1292
Cdd:PHA03247  2852 LGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
904-919 8.79e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in a wide range of eukaryotic proteins as an important binding site for pfam00658. Strikingly, this motif appears to occur solely outside of globular domains.


Pssm-ID: 429316  Cd Length: 17  Bit Score: 34.89  E-value: 8.79e-03
                           10
                   ....*....|....*.
gi 1777350993  904 RKSTLNPNAKEFNPRS 919
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
266-333 9.25e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins also exist in archaea and other prokaryotes, where they form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.07  E-value: 9.25e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1777350993  266 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 333
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
865-1235 9.54e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 9.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  865 NTEHKRGPEVTSQGVQTSSPACKQEKDDKEEKKDAAEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 944
Cdd:pfam03154  127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993  945 HQQPTPVYTQPVCFAPNMMYPV-------------------PVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQh 1005
Cdd:pfam03154  207 PPQGSPATSQPPNQTQSTAAPHtliqqtptlhpqrlpsphpPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTG- 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1006 hQSAMMHPASAAGPPIAAT------PPAYSTQyVAYSPQQFPNQPLVQhvPHYQSQHPHVYSPVIQGNARM---MAPPTH 1076
Cdd:pfam03154  286 -PSHMQHPVPPQPFPLTPQssqsqvPPGPSPA-APGQSQQRIHTPPSQ--SQLQSQQPPREQPLPPAPLSMphiKPPPTT 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1077 AQPGLVSSSATQYGAHEQTHAMYacpKLPYNKETSPSFYFAISTGSLAQQYAHPNA-TLHPHT-PHPQPSATPTGQQQSQ 1154
Cdd:pfam03154  362 PIPQLPNPQSHKHPPHLSGPSPF---QMNSNLPPPPALKPLSSLSTHHPPSAHPPPlQLMPQSqQLPPPPAQPPVLTQSQ 438
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1777350993 1155 hggSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNP 1234
Cdd:pfam03154  439 ---SLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLP 515

                   .
gi 1777350993 1235 P 1235
Cdd:pfam03154  516 P 516
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
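
Each hit in the list above carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value), and the search was run with an E-value threshold of 0.01. The following is a minimal sketch of how those summary lines could be parsed and checked against that threshold. The names `SUMMARY_RE`, `parse_summary`, and `passes_threshold` are hypothetical helpers written for this illustration and are not part of any NCBI tool. Treating the threshold as inclusive is also an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this
# report; the "[Multi-domain]" flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is <= the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    lines = [
        "Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 1.60e-03",
        "Pssm-ID: 429316  Cd Length: 17  Bit Score: 34.89  E-value: 8.79e-03",
    ]
    for line in lines:
        hit = parse_summary(line)
        print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```

The two sample lines are copied from the PRK12323 and PAM2 hits in this report. Both E-values fall below 0.01, which is consistent with their appearing in the hit list.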

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.