|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
163-224 |
6.59e-26 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 101.14 E-value: 6.59e-26
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1811007933 163 DRMILLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
147-224 |
1.32e-13 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 66.55 E-value: 1.32e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 147 IDLHEFLINTLKNNSRDRMILLKMEQEIIDFIgDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINKT 224
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFV-KSTKESVELPPMNSYERKIVHELAEKYGLESESFGEGpkRRVVISKK 79
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
165-223 |
3.66e-12 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 61.74 E-value: 3.66e-12
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1811007933 165 MILLKMEQEIIDFIGDDNNHYKkFPQMSSYQRMLVHRVAAYFGLDHNV--DQTGKSVIINK 223
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
393-770 |
4.93e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 4.93e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 393 PPLQSTPLASGVAASSPGcvsypengmgGQVAPSStsyilLPLEAATGIPPgsillnPHTGQPFVNPDGTPAIYNPPSSQ 472
Cdd:PHA03247 2592 PPQSARPRAPVDDRGDPR----------GPAPPSP-----LPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPE 2650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 473 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQ-------PQMAGPLVTQGLQASPQSVPFPavsfPPQHLLPMSPTPQFPM 545
Cdd:PHA03247 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQrprrraaRPTVGSLTSLADPPPPPPTPEP----APHALVSATPLPPGPA 2726
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 546 RDDVATQFGQMTLSRQSSGETPEPPAGPVYPPSlmPQPTQQPAyviASTGQQLPAGGFSGSGPPISQQVLQPP----PSP 621
Cdd:PHA03247 2727 AARQASPALPAAPAPPAVPAGPATPGGPARPAR--PPTTAGPP---APAPPAAPAAGPPRRLTRPAVASLSESreslPSP 2801
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 622 QGFVQQPPPAQMPVYYYPSGQYPTSTTQQyrPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQqgfqglvgVQQSPPSQgv 701
Cdd:PHA03247 2802 WDPADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD--------VRRRPPSR-- 2869
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007933 702 lsSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPPTP 770
Cdd:PHA03247 2870 --SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
448-771 |
6.06e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.15 E-value: 6.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 448 LNPHTGQPFVNPDGTPAIYNPPSSQQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQpqmagpLVTQGLQASPQSVPFPAV 527
Cdd:pfam03154 174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHT------LIQQTPTLHPQRLPSPHP 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 528 sfPPQHLLPMSPTPQFPMrddvatqfgQMTLSRQSSGETPEPPAGPVYPPSLMPQPT-QQPAYVIASTGQ-QLPAGGFSG 605
Cdd:pfam03154 248 --PLQPMTQPPPPSQVSP---------QPLPQPSLHGQMPPMPHSLQTGPSHMQHPVpPQPFPLTPQSSQsQVPPGPSPA 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 606 SGPPISQQVLQPPPSPQGFVQQP------PPAQMPVYYY------PSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQ 673
Cdd:pfam03154 317 APGQSQQRIHTPPSQSQLQSQQPpreqplPPAPLSMPHIkpppttPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 674 AGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAgPGP 753
Cdd:pfam03154 397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGP-PPI 475
|
330
....*....|....*...
gi 1811007933 754 LPATGVPVYCSVTPPTPQ 771
Cdd:pfam03154 476 TPPSGPPTSTSSAMPGIQ 493
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
556-683 |
7.07e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 42.87 E-value: 7.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 556 MTLSRQSSGETPEPPAGPVypPSLMPQPTQQPAYVIASTGQQLP--AGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQM 633
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM--GSPMGGAMGQPPYYGQGPQQQFNgqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1811007933 634 PVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQ 683
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
163-224 |
6.59e-26 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 101.14 E-value: 6.59e-26
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1811007933 163 DRMILLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
147-224 |
1.32e-13 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 66.55 E-value: 1.32e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 147 IDLHEFLINTLKNNSRDRMILLKMEQEIIDFIgDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINKT 224
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFV-KSTKESVELPPMNSYERKIVHELAEKYGLESESFGEGpkRRVVISKK 79
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
165-223 |
3.66e-12 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 61.74 E-value: 3.66e-12
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1811007933 165 MILLKMEQEIIDFIGDDNNHYKkFPQMSSYQRMLVHRVAAYFGLDHNV--DQTGKSVIINK 223
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| R3H |
cd02325 |
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ... |
167-223 |
1.30e-11 |
|
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100064 Cd Length: 59 Bit Score: 60.32 E-value: 1.30e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007933 167 LLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINK 223
Cdd:cd02325 1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
393-770 |
4.93e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 4.93e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 393 PPLQSTPLASGVAASSPGcvsypengmgGQVAPSStsyilLPLEAATGIPPgsillnPHTGQPFVNPDGTPAIYNPPSSQ 472
Cdd:PHA03247 2592 PPQSARPRAPVDDRGDPR----------GPAPPSP-----LPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPE 2650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 473 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQ-------PQMAGPLVTQGLQASPQSVPFPavsfPPQHLLPMSPTPQFPM 545
Cdd:PHA03247 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQrprrraaRPTVGSLTSLADPPPPPPTPEP----APHALVSATPLPPGPA 2726
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 546 RDDVATQFGQMTLSRQSSGETPEPPAGPVYPPSlmPQPTQQPAyviASTGQQLPAGGFSGSGPPISQQVLQPP----PSP 621
Cdd:PHA03247 2727 AARQASPALPAAPAPPAVPAGPATPGGPARPAR--PPTTAGPP---APAPPAAPAAGPPRRLTRPAVASLSESreslPSP 2801
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 622 QGFVQQPPPAQMPVYYYPSGQYPTSTTQQyrPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQqgfqglvgVQQSPPSQgv 701
Cdd:PHA03247 2802 WDPADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD--------VRRRPPSR-- 2869
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007933 702 lsSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPPTP 770
Cdd:PHA03247 2870 --SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
393-634 |
5.36e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 5.36e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 393 PPLQSTPLASGVAASSPGCVSYPENGMGGQVAPSSTSYILLPLEAATGIPPGSILLNPHTGQPFVNPDGTPAiYNPPSSQ 472
Cdd:PHA03247 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP-AAPAAGP 2779
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 473 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPLVTQGLQASPQSVPFPAVSFPPQHLLPMSPTPQFPMRDDVATq 552
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP- 2858
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 553 fGQMTLSRQSSGETPEPPAGPVYPPS---LMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPP 629
Cdd:PHA03247 2859 -GGDVRRRPPSRSPAAKPAAPARPPVrrlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
|
....*
gi 1811007933 630 PAQMP 634
Cdd:PHA03247 2938 RPQPP 2942
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
394-768 |
5.59e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 5.59e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 394 PLQSTPLASGVAASSPGCVSYPENGMGGQVAPSSTSYILLPLEAATGIPPGSI--LLNPHTGQPFVNPDGTPAIynpPSS 471
Cdd:PHA03247 2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLtsLADPPPPPPTPEPAPHALV---SAT 2719
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 472 QQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPlvtqglqASPQSVPFPAVSFPPQhlLPMSPTPQFPMRDDVAT 551
Cdd:PHA03247 2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP-------ARPPTTAGPPAPAPPA--APAAGPPRRLTRPAVAS 2790
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 552 qfGQMTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPA 631
Cdd:PHA03247 2791 --LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS 2868
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 632 QMPVYYYPSGQYP-------TSTTQQYRPMASVQFSAQRGQQmPQTAQQAGYQPVLSGQQgfqglvgVQQSPPSQGVLSS 704
Cdd:PHA03247 2869 RSPAAKPAAPARPpvrrlarPAVSRSTESFALPPDQPERPPQ-PQAPPPPQPQPQPPPPP-------QPQPPPPPPPRPQ 2940
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1811007933 705 PQGApvqsvmvsyPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPP 768
Cdd:PHA03247 2941 PPLA---------PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
448-771 |
6.06e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.15 E-value: 6.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 448 LNPHTGQPFVNPDGTPAIYNPPSSQQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQpqmagpLVTQGLQASPQSVPFPAV 527
Cdd:pfam03154 174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHT------LIQQTPTLHPQRLPSPHP 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 528 sfPPQHLLPMSPTPQFPMrddvatqfgQMTLSRQSSGETPEPPAGPVYPPSLMPQPT-QQPAYVIASTGQ-QLPAGGFSG 605
Cdd:pfam03154 248 --PLQPMTQPPPPSQVSP---------QPLPQPSLHGQMPPMPHSLQTGPSHMQHPVpPQPFPLTPQSSQsQVPPGPSPA 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 606 SGPPISQQVLQPPPSPQGFVQQP------PPAQMPVYYY------PSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQ 673
Cdd:pfam03154 317 APGQSQQRIHTPPSQSQLQSQQPpreqplPPAPLSMPHIkpppttPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 674 AGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAgPGP 753
Cdd:pfam03154 397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGP-PPI 475
|
330
....*....|....*...
gi 1811007933 754 LPATGVPVYCSVTPPTPQ 771
Cdd:pfam03154 476 TPPSGPPTSTSSAMPGIQ 493
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
563-743 |
9.81e-06 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 49.31 E-value: 9.81e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 563 SGETPEPPAGPVYPPSLMPQPTQQP--AYVIASTGQQ---------LPAGGFSGSGPPISQQVL--QPPPSPQgfVQQPP 629
Cdd:PRK10263 295 SGNRATQPEYDEYDPLLNGAPITEPvaVAAAATTATQswaapvepvTQTPPVASVDVPPAQPTVawQPVPGPQ--TGEPV 372
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 630 PAQMPVYYYPSGQYPTSTTQQYRPMAS--VQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQG 707
Cdd:PRK10263 373 IAPAPEGYPQQSQYAQPAVQYNEPLQQpvQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQ 452
|
170 180 190
....*....|....*....|....*....|....*.
gi 1811007933 708 APVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPV 743
Cdd:PRK10263 453 QSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQP 488
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
450-755 |
7.65e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.30 E-value: 7.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 450 PHTGQPFVNPDGTPAIYNPPSSQQP------------------LRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPLV 511
Cdd:pfam03154 199 PTPSAPSVPPQGSPATSQPPNQTQStaaphtliqqtptlhpqrLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPM 278
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 512 TQGLQASPQSV-------PFPAVSFPPQHLLPMSPTPQFPmrddVATQFGQMTLSRQSSGETPEPPAGPVYPPSLM---- 580
Cdd:pfam03154 279 PHSLQTGPSHMqhpvppqPFPLTPQSSQSQVPPGPSPAAP----GQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLsmph 354
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 581 --------------PQPTQQPAYVIASTGQQLPA-----------GGFSGSGPPISQqvlqPPP---SPQGFVQQPPPAQ 632
Cdd:pfam03154 355 ikpppttpipqlpnPQSHKHPPHLSGPSPFQMNSnlppppalkplSSLSTHHPPSAH----PPPlqlMPQSQQLPPPPAQ 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 633 MPVYyypsgqyptSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGAPVQS 712
Cdd:pfam03154 431 PPVL---------TQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVS 501
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 1811007933 713 VMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLP 755
Cdd:pfam03154 502 SSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEP 544
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
519-782 |
1.78e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 1.78e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 519 PQSVPFPAVSFPPQHLLPMSPTPQfpmRDDVATQFGQMTLSRQSSgeTPEPPAGPVYPPSlMPQPTQQPAYVIASTGQQL 598
Cdd:PHA03247 2627 PPPSPSPAANEPDPHPPPTVPPPE---RPRDDPAPGRVSRPRRAR--RLGRAAQASSPPQ-RPRRRAARPTVGSLTSLAD 2700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 599 PaggfsgsgPPisqqvlqPPPSPQgfvqQPPPAQMPVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQP 678
Cdd:PHA03247 2701 P--------PP-------PPPTPE----PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 679 VLSGQQgfqglvgvQQSPPSQGVLSSPQGAP---VQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPnqAGPGPLP 755
Cdd:PHA03247 2762 TTAGPP--------APAPPAAPAAGPPRRLTrpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPP 2831
|
250 260
....*....|....*....|....*..
gi 1811007933 756 ATGVPVYCSVTPPTPQNSLRLLGPHCP 782
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
556-760 |
2.32e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.64 E-value: 2.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 556 MTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAyviaSTGQ------------QLPAG--GFSGSGPPISQQVLQPPPSP 621
Cdd:pfam09770 108 AARAAQSSAQPPASSLPQYQYASQQSQQPSKPV----RTGYekykepepipdlQVDASlwGVAPKKAAAPAPAPQPAAQP 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 622 QGFVQQPPP--------AQMpvyyypSGQYPTSTTQQYRPMAS--VQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVG 691
Cdd:pfam09770 184 ASLPAPSRKmmsleeveAAM------RAQAKKPAQQPAPAPAQppAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHP 257
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007933 692 VQQSPPSQgvLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVP 760
Cdd:pfam09770 258 GQGHPVTI--LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQPGV 324
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
562-686 |
5.93e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 43.23 E-value: 5.93e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 562 SSGETPEPPAGP---VYPPSLMPQPTQQPAYVIASTGQQLPAGgfSGSGPPISQQVLQPPPS-PQGFVQQPPPAQMPVYY 637
Cdd:PRK14971 364 QKGDDASGGRGPkqhIKPVFTQPAAAPQPSAAAAASPSPSQSS--AAAQPSAPQSATQPAGTpPTVSVDPPAAVPVNPPS 441
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1811007933 638 YPSGQYPTSTTQQYRPMA-----SVQFSAQRGQQMPQTAQQAGYQPVLSGQQGF 686
Cdd:PRK14971 442 TAPQAVRPAQFKEEKKIPvskvsSLGPSTLRPIQEKAEQATGNIKEAPTGTQKE 495
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
556-683 |
7.07e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 42.87 E-value: 7.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 556 MTLSRQSSGETPEPPAGPVypPSLMPQPTQQPAYVIASTGQQLP--AGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQM 633
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM--GSPMGGAMGQPPYYGQGPQQQFNgqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1811007933 634 PVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQ 683
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
558-699 |
7.60e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.10 E-value: 7.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 558 LSRQSSGETPEPPAGPVYPPSLMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQ------QVLQPPPSPQGFVQQPPPA 631
Cdd:pfam09770 203 MRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPgqghpvTILQRPQSPQPDPAQPSIQ 282
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1811007933 632 QMPVYYY----PSGQYPTSTTQQ-YRPMASVQFSAQRGQQMPQTAQQAGYQPvlsgQQGFQGLVGVQQSPPSQ 699
Cdd:pfam09770 283 PQAQQFHqqppPVPVQPTQILQNpNRLSAARVGYPQNPQPGVQPAPAHQAHR----QQGSFGRQAPIITHPQQ 351
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
568-768 |
1.27e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.38 E-value: 1.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 568 EPPAGPVYPPSLMP--QPTQQPAyviastgqqlpaggfsgsgPPISQQVLQPPPSPQGFVQQPPPAQMPVYYYPSGQYPT 645
Cdd:PRK10263 738 DGPHEPLFTPIVEPvqQPQQPVA-------------------PQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 798
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 646 STTQQYrpmasvqfsaQRGQQmPQTAQQagyqpvlsgqqgfqglvgvQQSPPSQGVLSSPQgapvqsvmvsyptmssYQV 725
Cdd:PRK10263 799 APQPQY----------QQPQQ-PVAPQP-------------------QYQQPQQPVAPQPQ----------------YQQ 832
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1811007933 726 PMTQGSQGlPQQSYQQPVMLPN-QAGPGPLPATGVPVYCSVTPP 768
Cdd:PRK10263 833 PQQPVAPQ-PQDTLLHPLLMRNgDSRPLHKPTTPLPSLDLLTPP 875
|
|
| R3H_Smubp-2_like |
cd02641 |
R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and ... |
174-221 |
3.18e-03 |
|
R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Pssm-ID: 100070 Cd Length: 60 Bit Score: 36.56 E-value: 3.18e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 1811007933 174 IIDFIGDDNNHYKKFP-QMSSYQRMLVHRVAAYFGLDHNVDQTGKSVII 221
Cdd:cd02641 8 VKAFMKDPKATELEFPpTLSSHDRLLVHELAEELGLRHESTGEGSDRVI 56
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
520-783 |
3.31e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 41.21 E-value: 3.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 520 QSVPFPAVSFPPQHLLP-MSPTPQFPMRDDVA--------TQFGQMTLSRQSSGETPEPPA--GPVYPPSLMPQPTQQPA 588
Cdd:PHA03378 639 QPITFNVLVFPTPHQPPqVEITPYKPTWTQIGhipyqpspTGANTMLPIQWAPGTMQPPPRapTPMRPPAAPPGRAQRPA 718
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 589 YviASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQMPVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMP 668
Cdd:PHA03378 719 A--ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTP 796
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 669 QTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQ 748
Cdd:PHA03378 797 QPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPVFYPPV 876
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1811007933 749 AGPGPLPATG---VPVYCSVTPPTPQN---SLRLLGPHCPS 783
Cdd:PHA03378 877 LQPIQVMRQLgsvRAAAASTVTQAPTEytgERRGVGPMHPT 917
|
|
| R3H_unknown_2 |
cd06006 |
R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain ... |
169-209 |
7.07e-03 |
|
R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100076 Cd Length: 59 Bit Score: 35.42 E-value: 7.07e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1811007933 169 KMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLD 209
Cdd:cd06006 3 QIESTLRKFINDKSKRSLRFPPMRSPQRAFIHELAKDYGLY 43
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
559-708 |
8.26e-03 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 39.28 E-value: 8.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007933 559 SRQSSGETP-EPPAGP--VYPPSLMPQPTQQPAYVIASTGQQ---LPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQ 632
Cdd:PRK10927 91 SRQPGVRAPtEPSAGGevKTPEQLTPEQRQLLEQMQADMRQQptqLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQ 170
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1811007933 633 MPVYYYPSGQYPTSTTQQyrpmasvQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGA 708
Cdd:PRK10927 171 QSRTTEQSWQQQTRTSQA-------APVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVTRAADAPKPT 239
|
|
|