|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
163-224 |
6.88e-26 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 101.14 E-value: 6.88e-26
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1811007923 163 DRMILLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
147-224 |
1.98e-13 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 66.17 E-value: 1.98e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 147 IDLHEFLINTLKNNSRDRMILLKMEQEIIDFIgDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINKT 224
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFV-KSTKESVELPPMNSYERKIVHELAEKYGLESESFGEGpkRRVVISKK 79
|
|
| SUZ |
pfam12752 |
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ... |
245-300 |
2.53e-13 |
|
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.
Pssm-ID: 463689 [Multi-domain] Cd Length: 56 Bit Score: 65.04 E-value: 2.53e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 1811007923 245 ESQKRFILKRDNSSIDKEDTQQNRMHPFRDDRRSKSIEEREEEYQRVRERIFAHDS 300
Cdd:pfam12752 1 PPPKMKILRRPSSGSSSSSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
165-223 |
5.93e-12 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 61.35 E-value: 5.93e-12
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1811007923 165 MILLKMEQEIIDFIGDDNNHYKkFPQMSSYQRMLVHRVAAYFGLDHNV--DQTGKSVIINK 223
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
428-805 |
4.02e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 4.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 428 PPLQSTPLASGVAASSPGcvsypengmgGQVAPSStsyilLPLEAATGIPPgsillnPHTGQPFVNPDGTPAIYNPPSSQ 507
Cdd:PHA03247 2592 PPQSARPRAPVDDRGDPR----------GPAPPSP-----LPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPE 2650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 508 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQ-------PQMAGPLVTQGLQASPQSVPFPavsfPPQHLLPMSPTPQFPM 580
Cdd:PHA03247 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQrprrraaRPTVGSLTSLADPPPPPPTPEP----APHALVSATPLPPGPA 2726
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 581 RDDVATQFGQMTLSRQSSGETPEPPAGPVYPPSlmPQPTQQPAyviASTGQQLPAGGFSGSGPPISQQVLQPP----PSP 656
Cdd:PHA03247 2727 AARQASPALPAAPAPPAVPAGPATPGGPARPAR--PPTTAGPP---APAPPAAPAAGPPRRLTRPAVASLSESreslPSP 2801
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 657 QGFVQQPPPAQMPVYYYPSGQYPTSTTQQyrPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQqgfqglvgVQQSPPSQgv 736
Cdd:PHA03247 2802 WDPADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD--------VRRRPPSR-- 2869
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007923 737 lsSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPPTP 805
Cdd:PHA03247 2870 --SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
485-806 |
2.56e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.31 E-value: 2.56e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 485 PHTGQPFVNPDGTPAIYNPPSSQQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPLvtqglQASPQSVPfpavsf 564
Cdd:pfam03154 199 PTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPS-----QVSPQPLP------ 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 565 PPQHLLPMSPTPQ------FPMRDDVATQ-FGQMTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAyviastgqqlpagg 637
Cdd:pfam03154 268 QPSLHGQMPPMPHslqtgpSHMQHPVPPQpFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQ-------------- 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 638 fSGSGPPISQQVLQPPPSPQGFVQQPPPAQMPvyyypsgQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSG 717
Cdd:pfam03154 334 -LQSQQPPREQPLPPAPLSMPHIKPPPTTPIP-------QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTH 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 718 QQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAgPGPLPATGVPVY 797
Cdd:pfam03154 406 HPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGP-PPITPPSGPPTS 484
|
....*....
gi 1811007923 798 CSVTPPTPQ 806
Cdd:pfam03154 485 TSSAMPGIQ 493
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
591-718 |
7.45e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 42.87 E-value: 7.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 591 MTLSRQSSGETPEPPAGPVypPSLMPQPTQQPAYVIASTGQQLP--AGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQM 668
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM--GSPMGGAMGQPPYYGQGPQQQFNgqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1811007923 669 PVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQ 718
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| R3H_encore_like |
cd02642 |
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ... |
163-224 |
6.88e-26 |
|
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100071 Cd Length: 63 Bit Score: 101.14 E-value: 6.88e-26
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1811007923 163 DRMILLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642 1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
|
|
| R3H |
smart00393 |
Putative single-stranded nucleic acids-binding domain; |
147-224 |
1.98e-13 |
|
Putative single-stranded nucleic acids-binding domain;
Pssm-ID: 214647 Cd Length: 79 Bit Score: 66.17 E-value: 1.98e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 147 IDLHEFLINTLKNNSRDRMILLKMEQEIIDFIgDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINKT 224
Cdd:smart00393 1 ADFLPVTLDALSYRPRRREELIELELEIARFV-KSTKESVELPPMNSYERKIVHELAEKYGLESESFGEGpkRRVVISKK 79
|
|
| SUZ |
pfam12752 |
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ... |
245-300 |
2.53e-13 |
|
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.
Pssm-ID: 463689 [Multi-domain] Cd Length: 56 Bit Score: 65.04 E-value: 2.53e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 1811007923 245 ESQKRFILKRDNSSIDKEDTQQNRMHPFRDDRRSKSIEEREEEYQRVRERIFAHDS 300
Cdd:pfam12752 1 PPPKMKILRRPSSGSSSSSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
|
|
| R3H |
pfam01424 |
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ... |
165-223 |
5.93e-12 |
|
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.
Pssm-ID: 460206 Cd Length: 60 Bit Score: 61.35 E-value: 5.93e-12
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1811007923 165 MILLKMEQEIIDFIGDDNNHYKkFPQMSSYQRMLVHRVAAYFGLDHNV--DQTGKSVIINK 223
Cdd:pfam01424 1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
|
|
| R3H |
cd02325 |
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ... |
167-223 |
1.36e-11 |
|
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100064 Cd Length: 59 Bit Score: 60.32 E-value: 1.36e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007923 167 LLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINK 223
Cdd:cd02325 1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
428-805 |
4.02e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 4.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 428 PPLQSTPLASGVAASSPGcvsypengmgGQVAPSStsyilLPLEAATGIPPgsillnPHTGQPFVNPDGTPAIYNPPSSQ 507
Cdd:PHA03247 2592 PPQSARPRAPVDDRGDPR----------GPAPPSP-----LPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPE 2650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 508 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQ-------PQMAGPLVTQGLQASPQSVPFPavsfPPQHLLPMSPTPQFPM 580
Cdd:PHA03247 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQrprrraaRPTVGSLTSLADPPPPPPTPEP----APHALVSATPLPPGPA 2726
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 581 RDDVATQFGQMTLSRQSSGETPEPPAGPVYPPSlmPQPTQQPAyviASTGQQLPAGGFSGSGPPISQQVLQPP----PSP 656
Cdd:PHA03247 2727 AARQASPALPAAPAPPAVPAGPATPGGPARPAR--PPTTAGPP---APAPPAAPAAGPPRRLTRPAVASLSESreslPSP 2801
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 657 QGFVQQPPPAQMPVYYYPSGQYPTSTTQQyrPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQqgfqglvgVQQSPPSQgv 736
Cdd:PHA03247 2802 WDPADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD--------VRRRPPSR-- 2869
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007923 737 lsSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPPTP 805
Cdd:PHA03247 2870 --SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
428-669 |
4.96e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 4.96e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 428 PPLQSTPLASGVAASSPGCVSYPENGMGGQVAPSSTSYILLPLEAATGIPPGSILLNPHTGQPFVNPDGTPAiYNPPSSQ 507
Cdd:PHA03247 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP-AAPAAGP 2779
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 508 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPLVTQGLQASPQSVPFPAVSFPPQHLLPMSPTPQFPMRDDVATq 587
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP- 2858
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 588 fGQMTLSRQSSGETPEPPAGPVYPPS---LMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPP 664
Cdd:PHA03247 2859 -GGDVRRRPPSRSPAAKPAAPARPPVrrlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
|
....*
gi 1811007923 665 PAQMP 669
Cdd:PHA03247 2938 RPQPP 2942
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
485-806 |
2.56e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.31 E-value: 2.56e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 485 PHTGQPFVNPDGTPAIYNPPSSQQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPLvtqglQASPQSVPfpavsf 564
Cdd:pfam03154 199 PTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPS-----QVSPQPLP------ 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 565 PPQHLLPMSPTPQ------FPMRDDVATQ-FGQMTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAyviastgqqlpagg 637
Cdd:pfam03154 268 QPSLHGQMPPMPHslqtgpSHMQHPVPPQpFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQ-------------- 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 638 fSGSGPPISQQVLQPPPSPQGFVQQPPPAQMPvyyypsgQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSG 717
Cdd:pfam03154 334 -LQSQQPPREQPLPPAPLSMPHIKPPPTTPIP-------QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTH 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 718 QQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAgPGPLPATGVPVY 797
Cdd:pfam03154 406 HPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGP-PPITPPSGPPTS 484
|
....*....
gi 1811007923 798 CSVTPPTPQ 806
Cdd:pfam03154 485 TSSAMPGIQ 493
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
429-803 |
5.97e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 5.97e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 429 PLQSTPLASGVAASSPGCVSYPENGMGGQVAPSSTSYILLPLEAATGIPPGSI--LLNPHTGQPFVNPDGTPAIynpPSS 506
Cdd:PHA03247 2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLtsLADPPPPPPTPEPAPHALV---SAT 2719
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 507 QQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPlvtqglqASPQSVPFPAVSFPPQhlLPMSPTPQFPMRDDVAT 586
Cdd:PHA03247 2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP-------ARPPTTAGPPAPAPPA--APAAGPPRRLTRPAVAS 2790
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 587 qfGQMTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPA 666
Cdd:PHA03247 2791 --LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS 2868
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 667 QMPVYYYPSGQYP-------TSTTQQYRPMASVQFSAQRGQQmPQTAQQAGYQPVLSGQQgfqglvgVQQSPPSQGVLSS 739
Cdd:PHA03247 2869 RSPAAKPAAPARPpvrrlarPAVSRSTESFALPPDQPERPPQ-PQAPPPPQPQPQPPPPP-------QPQPPPPPPPRPQ 2940
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1811007923 740 PQGApvqsvmvsyPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPP 803
Cdd:PHA03247 2941 PPLA---------PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
598-778 |
8.68e-06 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 49.70 E-value: 8.68e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 598 SGETPEPPAGPVYPPSLMPQPTQQP--AYVIASTGQQ---------LPAGGFSGSGPPISQQVL--QPPPSPQgfVQQPP 664
Cdd:PRK10263 295 SGNRATQPEYDEYDPLLNGAPITEPvaVAAAATTATQswaapvepvTQTPPVASVDVPPAQPTVawQPVPGPQ--TGEPV 372
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 665 PAQMPVYYYPSGQYPTSTTQQYRPMAS--VQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQG 742
Cdd:PRK10263 373 IAPAPEGYPQQSQYAQPAVQYNEPLQQpvQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQ 452
|
170 180 190
....*....|....*....|....*....|....*.
gi 1811007923 743 APVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPV 778
Cdd:PRK10263 453 QSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQP 488
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
554-817 |
1.35e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 1.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 554 PQSVPFPAVSFPPQHLLPMSPTPQfpmRDDVATQFGQMTLSRQSSgeTPEPPAGPVYPPSlMPQPTQQPAYVIASTGQQL 633
Cdd:PHA03247 2627 PPPSPSPAANEPDPHPPPTVPPPE---RPRDDPAPGRVSRPRRAR--RLGRAAQASSPPQ-RPRRRAARPTVGSLTSLAD 2700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 634 PaggfsgsgPPisqqvlqPPPSPQgfvqQPPPAQMPVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQP 713
Cdd:PHA03247 2701 P--------PP-------PPPTPE----PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 714 VLSGQQgfqglvgvQQSPPSQGVLSSPQGAP---VQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPnqAGPGPLP 790
Cdd:PHA03247 2762 TTAGPP--------APAPPAAPAAGPPRRLTrpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPP 2831
|
250 260
....*....|....*....|....*..
gi 1811007923 791 ATGVPVYCSVTPPTPQNSLRLLGPHCP 817
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
591-795 |
2.37e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.03 E-value: 2.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 591 MTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAyviaSTGQ------------QLPAG--GFSGSGPPISQQVLQPPPSP 656
Cdd:pfam09770 108 AARAAQSSAQPPASSLPQYQYASQQSQQPSKPV----RTGYekykepepipdlQVDASlwGVAPKKAAAPAPAPQPAAQP 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 657 QGFVQQPPP--------AQMpvyyypSGQYPTSTTQQYRPMAS--VQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVG 726
Cdd:pfam09770 184 ASLPAPSRKmmsleeveAAM------RAQAKKPAQQPAPAPAQppAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHP 257
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007923 727 VQQSPPSQgvLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVP 795
Cdd:pfam09770 258 GQGHPVTI--LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQPGV 324
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
597-721 |
6.04e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 43.23 E-value: 6.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 597 SSGETPEPPAGP---VYPPSLMPQPTQQPAYVIASTGQQLPAGgfSGSGPPISQQVLQPPPS-PQGFVQQPPPAQMPVYY 672
Cdd:PRK14971 364 QKGDDASGGRGPkqhIKPVFTQPAAAPQPSAAAAASPSPSQSS--AAAQPSAPQSATQPAGTpPTVSVDPPAAVPVNPPS 441
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1811007923 673 YPSGQYPTSTTQQYRPMA-----SVQFSAQRGQQMPQTAQQAGYQPVLSGQQGF 721
Cdd:PRK14971 442 TAPQAVRPAQFKEEKKIPvskvsSLGPSTLRPIQEKAEQATGNIKEAPTGTQKE 495
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
593-746 |
7.30e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.10 E-value: 7.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 593 LSRQSSGETPEPPAGPVYPPSLMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQ------QVLQPPPSPqgfvqQPPPA 666
Cdd:pfam09770 203 MRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPgqghpvTILQRPQSP-----QPDPA 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 667 QmpvyyyPSGQYPTSTTQQYRPMASVQfSAQRGQQ--MPQTAQQAGYQPVLSGQQGFQGLVGvQQSPPSQGVLSSPQGAP 744
Cdd:pfam09770 278 Q------PSIQPQAQQFHQQPPPVPVQ-PTQILQNpnRLSAARVGYPQNPQPGVQPAPAHQA-HRQQGSFGRQAPIITHP 349
|
..
gi 1811007923 745 VQ 746
Cdd:pfam09770 350 QQ 351
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
591-718 |
7.45e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 42.87 E-value: 7.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 591 MTLSRQSSGETPEPPAGPVypPSLMPQPTQQPAYVIASTGQQLP--AGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQM 668
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM--GSPMGGAMGQPPYYGQGPQQQFNgqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1811007923 669 PVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQ 718
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
603-803 |
1.09e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.76 E-value: 1.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 603 EPPAGPVYPPSLMP--QPTQQPAyviastgqqlpaggfsgsgPPISQQVLQPPPSPQGFVQQPPPAQMPVYYYPSGQYPT 680
Cdd:PRK10263 738 DGPHEPLFTPIVEPvqQPQQPVA-------------------PQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 798
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 681 STTQQYrpmasvqfsaQRGQQmPQTAQQagyqpvlsgqqgfqglvgvQQSPPSQGVLSSPQgapvqsvmvsyptmssYQV 760
Cdd:PRK10263 799 APQPQY----------QQPQQ-PVAPQP-------------------QYQQPQQPVAPQPQ----------------YQQ 832
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1811007923 761 PMTQGSQGlPQQSYQQPVMLPN-QAGPGPLPATGVPVYCSVTPP 803
Cdd:PRK10263 833 PQQPVAPQ-PQDTLLHPLLMRNgDSRPLHKPTTPLPSLDLLTPP 875
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
555-818 |
2.51e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 41.59 E-value: 2.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 555 QSVPFPAVSFPPQHLLP-MSPTPQFPMRDDVA--------TQFGQMTLSRQSSGETPEPPA--GPVYPPSLMPQPTQQPA 623
Cdd:PHA03378 639 QPITFNVLVFPTPHQPPqVEITPYKPTWTQIGhipyqpspTGANTMLPIQWAPGTMQPPPRapTPMRPPAAPPGRAQRPA 718
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 624 YviASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQMPVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMP 703
Cdd:PHA03378 719 A--ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTP 796
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 704 QTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQ 783
Cdd:PHA03378 797 QPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPVFYPPV 876
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1811007923 784 AGPGPLPATG---VPVYCSVTPPTPQN---SLRLLGPHCPS 818
Cdd:PHA03378 877 LQPIQVMRQLgsvRAAAASTVTQAPTEytgERRGVGPMHPT 917
|
|
| R3H_Smubp-2_like |
cd02641 |
R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and ... |
174-221 |
4.68e-03 |
|
R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Pssm-ID: 100070 Cd Length: 60 Bit Score: 36.18 E-value: 4.68e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 1811007923 174 IIDFIGDDNNHYKKFP-QMSSYQRMLVHRVAAYFGLDHNVDQTGKSVII 221
Cdd:cd02641 8 VKAFMKDPKATELEFPpTLSSHDRLLVHELAEELGLRHESTGEGSDRVI 56
|
|
| R3H_unknown_2 |
cd06006 |
R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain ... |
169-209 |
8.73e-03 |
|
R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Pssm-ID: 100076 Cd Length: 59 Bit Score: 35.42 E-value: 8.73e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1811007923 169 KMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLD 209
Cdd:cd06006 3 QIESTLRKFINDKSKRSLRFPPMRSPQRAFIHELAKDYGLY 43
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
594-743 |
9.16e-03 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 39.28 E-value: 9.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 594 SRQSSGETP-EPPAGP--VYPPSLMPQPTQQPAYVIASTGQQ---LPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQ 667
Cdd:PRK10927 91 SRQPGVRAPtEPSAGGevKTPEQLTPEQRQLLEQMQADMRQQptqLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQ 170
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1811007923 668 MPVYYYPSGQYPTSTTQQyrpmasvQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGA 743
Cdd:PRK10927 171 QSRTTEQSWQQQTRTSQA-------APVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVTRAADAPKPT 239
|
|
|