Conserved domains on  [gi|1811007923|ref|XP_032315170|]
cAMP-regulated phosphoprotein 21 isoform X1 [Camelus ferus]

Protein Classification

R3H and SUZ domain-containing protein (domain architecture ID 10119061)

R3H and SUZ domain-containing protein may bind single-stranded nucleic acids through its R3H domain and RNA through its SUZ domain, similar to Zea mays DBF1-interactor protein 1, a potential regulator of DBF1 activity in stress responses.

List of domain hits

Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
163-224 6.88e-26

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100071  Cd Length: 63  Bit Score: 101.14  E-value: 6.88e-26
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1811007923 163 DRMILLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
SUZ pfam12752
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ...
245-300 2.53e-13

SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C. elegans protein Szy-20, where it has been shown to bind RNAs and allow their localization to the centrosome. Warning: the domain has a compositionally biased character.


Pssm-ID: 463689 [Multi-domain]  Cd Length: 56  Bit Score: 65.04  E-value: 2.53e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1811007923 245 ESQKRFILKRDNSSIDKEDTQQNRMHPFRDDRRSKSIEEREEEYQRVRERIFAHDS 300
Cdd:pfam12752   1 PPPKMKILRRPSSGSSSSSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
PHA03247 super family cl33720
large tegument protein UL36; Provisional
428-805 4.02e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 4.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  428 PPLQSTPLASGVAASSPGcvsypengmgGQVAPSStsyilLPLEAATGIPPgsillnPHTGQPFVNPDGTPAIYNPPSSQ 507
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPR----------GPAPPSP-----LPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPE 2650
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  508 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQ-------PQMAGPLVTQGLQASPQSVPFPavsfPPQHLLPMSPTPQFPM 580
Cdd:PHA03247  2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQrprrraaRPTVGSLTSLADPPPPPPTPEP----APHALVSATPLPPGPA 2726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  581 RDDVATQFGQMTLSRQSSGETPEPPAGPVYPPSlmPQPTQQPAyviASTGQQLPAGGFSGSGPPISQQVLQPP----PSP 656
Cdd:PHA03247  2727 AARQASPALPAAPAPPAVPAGPATPGGPARPAR--PPTTAGPP---APAPPAAPAAGPPRRLTRPAVASLSESreslPSP 2801
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  657 QGFVQQPPPAQMPVYYYPSGQYPTSTTQQyrPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQqgfqglvgVQQSPPSQgv 736
Cdd:PHA03247  2802 WDPADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD--------VRRRPPSR-- 2869
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007923  737 lsSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPPTP 805
Cdd:PHA03247  2870 --SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
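The statistics lines and alignment rows in the hits above follow a regular layout, so they can be read programmatically. The following is a minimal sketch, not part of the CD-Search output: the function names `parse_stats` and `alignment_identity` are hypothetical, the regular expression assumes the "Pssm-ID / Cd Length / Bit Score / E-value" layout shown in this report, and identity is counted only over columns where neither row has a gap.

```python
import re

# Aligned rows copied from the cd02642 hit above (query on top, CDD row below).
QUERY = "DRMILLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT"
CDD = "DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT"

# Assumed layout of a statistics line in this report (hypothetical pattern).
STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_stats(line):
    """Parse one 'Pssm-ID: ...' statistics line into a dict, or return None."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group(1)),
        "multi_domain": match.group(2) is not None,
        "cd_length": int(match.group(3)),
        "bit_score": float(match.group(4)),
        "e_value": float(match.group(5)),
    }


def alignment_identity(query, subject):
    """Return (identical, aligned) counted over columns without a gap in either row.

    Lowercase letters (unaligned residues in the report) are compared
    case-insensitively; columns containing '-' are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" or s == "-":
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 100071  Cd Length: 63  Bit Score: 101.14  E-value: 6.88e-26"
    )
    identical, aligned = alignment_identity(QUERY, CDD)
    print(stats)
    print(f"{identical}/{aligned} identical columns")
```

The same two functions can be applied to any other hit in the list by pasting its statistics line and its pair of aligned rows; hits with long gapped regions (such as PHA03247) will report fewer aligned columns than the interval length.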
 
R3H smart00393
Putative single-stranded nucleic acids-binding domain;
147-224 1.98e-13

Putative single-stranded nucleic acids-binding domain;


Pssm-ID: 214647  Cd Length: 79  Bit Score: 66.17  E-value: 1.98e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  147 IDLHEFLINTLKNNSRDRMILLKMEQEIIDFIgDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINKT 224
Cdd:smart00393   1 ADFLPVTLDALSYRPRRREELIELELEIARFV-KSTKESVELPPMNSYERKIVHELAEKYGLESESFGEGpkRRVVISKK 79
R3H pfam01424
R3H domain; The name of the R3H domain comes from the characteristic spacing of the most ...
165-223 5.93e-12

R3H domain; The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.


Pssm-ID: 460206  Cd Length: 60  Bit Score: 61.35  E-value: 5.93e-12
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1811007923 165 MILLKMEQEIIDFIGDDNNHYKkFPQMSSYQRMLVHRVAAYFGLDHNV--DQTGKSVIINK 223
Cdd:pfam01424   1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
485-806 2.56e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.31  E-value: 2.56e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 485 PHTGQPFVNPDGTPAIYNPPSSQQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPLvtqglQASPQSVPfpavsf 564
Cdd:pfam03154 199 PTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPS-----QVSPQPLP------ 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 565 PPQHLLPMSPTPQ------FPMRDDVATQ-FGQMTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAyviastgqqlpagg 637
Cdd:pfam03154 268 QPSLHGQMPPMPHslqtgpSHMQHPVPPQpFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQ-------------- 333
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 638 fSGSGPPISQQVLQPPPSPQGFVQQPPPAQMPvyyypsgQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSG 717
Cdd:pfam03154 334 -LQSQQPPREQPLPPAPLSMPHIKPPPTTPIP-------QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTH 405
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 718 QQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAgPGPLPATGVPVY 797
Cdd:pfam03154 406 HPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGP-PPITPPSGPPTS 484

                  ....*....
gi 1811007923 798 CSVTPPTPQ 806
Cdd:pfam03154 485 TSSAMPGIQ 493
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
591-718 7.45e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are expressed in testis, in platelets, broadly, and in an unknown tissue range, respectively.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.87  E-value: 7.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 591 MTLSRQSSGETPEPPAGPVypPSLMPQPTQQPAYVIASTGQQLP--AGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQM 668
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM--GSPMGGAMGQPPYYGQGPQQQFNgqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1811007923 669 PVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQ 718
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496
 
R3H cd02325
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ...
167-223 1.36e-11

R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100064  Cd Length: 59  Bit Score: 60.32  E-value: 1.36e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007923 167 LLKMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINK 223
Cdd:cd02325     1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
PHA03247 PHA03247
large tegument protein UL36; Provisional
428-669 4.96e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 4.96e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  428 PPLQSTPLASGVAASSPGCVSYPENGMGGQVAPSSTSYILLPLEAATGIPPGSILLNPHTGQPFVNPDGTPAiYNPPSSQ 507
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP-AAPAAGP 2779
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  508 QPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPLVTQGLQASPQSVPFPAVSFPPQHLLPMSPTPQFPMRDDVATq 587
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP- 2858
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  588 fGQMTLSRQSSGETPEPPAGPVYPPS---LMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPP 664
Cdd:PHA03247  2859 -GGDVRRRPPSRSPAAKPAAPARPPVrrlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937

                   ....*
gi 1811007923  665 PAQMP 669
Cdd:PHA03247  2938 RPQPP 2942
PHA03247 PHA03247
large tegument protein UL36; Provisional
429-803 5.97e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 5.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  429 PLQSTPLASGVAASSPGCVSYPENGMGGQVAPSSTSYILLPLEAATGIPPGSI--LLNPHTGQPFVNPDGTPAIynpPSS 506
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLtsLADPPPPPPTPEPAPHALV---SAT 2719
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  507 QQPLRSAVVGQSQQQPQQQPSPQPQQQVQPPQPQMAGPlvtqglqASPQSVPFPAVSFPPQhlLPMSPTPQFPMRDDVAT 586
Cdd:PHA03247  2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP-------ARPPTTAGPPAPAPPA--APAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  587 qfGQMTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPA 666
Cdd:PHA03247  2791 --LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS 2868
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  667 QMPVYYYPSGQYP-------TSTTQQYRPMASVQFSAQRGQQmPQTAQQAGYQPVLSGQQgfqglvgVQQSPPSQGVLSS 739
Cdd:PHA03247  2869 RSPAAKPAAPARPpvrrlarPAVSRSTESFALPPDQPERPPQ-PQAPPPPQPQPQPPPPP-------QPQPPPPPPPRPQ 2940
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1811007923  740 PQGApvqsvmvsyPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVPVYCSVTPP 803
Cdd:PHA03247  2941 PPLA---------PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
PRK10263 PRK10263
DNA translocase FtsK; Provisional
598-778 8.68e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 49.70  E-value: 8.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  598 SGETPEPPAGPVYPPSLMPQPTQQP--AYVIASTGQQ---------LPAGGFSGSGPPISQQVL--QPPPSPQgfVQQPP 664
Cdd:PRK10263   295 SGNRATQPEYDEYDPLLNGAPITEPvaVAAAATTATQswaapvepvTQTPPVASVDVPPAQPTVawQPVPGPQ--TGEPV 372
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  665 PAQMPVYYYPSGQYPTSTTQQYRPMAS--VQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQG 742
Cdd:PRK10263   373 IAPAPEGYPQQSQYAQPAVQYNEPLQQpvQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQ 452
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1811007923  743 APVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPV 778
Cdd:PRK10263   453 QSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQP 488
PHA03247 PHA03247
large tegument protein UL36; Provisional
554-817 1.35e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 1.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  554 PQSVPFPAVSFPPQHLLPMSPTPQfpmRDDVATQFGQMTLSRQSSgeTPEPPAGPVYPPSlMPQPTQQPAYVIASTGQQL 633
Cdd:PHA03247  2627 PPPSPSPAANEPDPHPPPTVPPPE---RPRDDPAPGRVSRPRRAR--RLGRAAQASSPPQ-RPRRRAARPTVGSLTSLAD 2700
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  634 PaggfsgsgPPisqqvlqPPPSPQgfvqQPPPAQMPVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQP 713
Cdd:PHA03247  2701 P--------PP-------PPPTPE----PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  714 VLSGQQgfqglvgvQQSPPSQGVLSSPQGAP---VQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPnqAGPGPLP 790
Cdd:PHA03247  2762 TTAGPP--------APAPPAAPAAGPPRRLTrpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPP 2831
                          250       260
                   ....*....|....*....|....*..
gi 1811007923  791 ATGVPVYCSVTPPTPQNSLRLLGPHCP 817
Cdd:PHA03247  2832 TSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
591-795 2.37e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.03  E-value: 2.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 591 MTLSRQSSGETPEPPAGPVYPPSLMPQPTQQPAyviaSTGQ------------QLPAG--GFSGSGPPISQQVLQPPPSP 656
Cdd:pfam09770 108 AARAAQSSAQPPASSLPQYQYASQQSQQPSKPV----RTGYekykepepipdlQVDASlwGVAPKKAAAPAPAPQPAAQP 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 657 QGFVQQPPP--------AQMpvyyypSGQYPTSTTQQYRPMAS--VQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVG 726
Cdd:pfam09770 184 ASLPAPSRKmmsleeveAAM------RAQAKKPAQQPAPAPAQppAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHP 257
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1811007923 727 VQQSPPSQgvLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQAGPGPLPATGVP 795
Cdd:pfam09770 258 GQGHPVTI--LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQPGV 324
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
597-721 6.04e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.23  E-value: 6.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 597 SSGETPEPPAGP---VYPPSLMPQPTQQPAYVIASTGQQLPAGgfSGSGPPISQQVLQPPPS-PQGFVQQPPPAQMPVYY 672
Cdd:PRK14971  364 QKGDDASGGRGPkqhIKPVFTQPAAAPQPSAAAAASPSPSQSS--AAAQPSAPQSATQPAGTpPTVSVDPPAAVPVNPPS 441
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1811007923 673 YPSGQYPTSTTQQYRPMA-----SVQFSAQRGQQMPQTAQQAGYQPVLSGQQGF 721
Cdd:PRK14971  442 TAPQAVRPAQFKEEKKIPvskvsSLGPSTLRPIQEKAEQATGNIKEAPTGTQKE 495
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
593-746 7.30e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.10  E-value: 7.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 593 LSRQSSGETPEPPAGPVYPPSLMPQPTQQPAYVIASTGQQLPAGGFSGSGPPISQ------QVLQPPPSPqgfvqQPPPA 666
Cdd:pfam09770 203 MRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPgqghpvTILQRPQSP-----QPDPA 277
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 667 QmpvyyyPSGQYPTSTTQQYRPMASVQfSAQRGQQ--MPQTAQQAGYQPVLSGQQGFQGLVGvQQSPPSQGVLSSPQGAP 744
Cdd:pfam09770 278 Q------PSIQPQAQQFHQQPPPVPVQ-PTQILQNpnRLSAARVGYPQNPQPGVQPAPAHQA-HRQQGSFGRQAPIITHP 349

                  ..
gi 1811007923 745 VQ 746
Cdd:pfam09770 350 QQ 351
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
591-718 7.45e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one expressed in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.87  E-value: 7.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 591 MTLSRQSSGETPEPPAGPVypPSLMPQPTQQPAYVIASTGQQLP--AGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQM 668
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM--GSPMGGAMGQPPYYGQGPQQQFNgqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP 446
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1811007923 669 PVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMPQTAQQAGYQPVLSGQ 718
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496
PRK10263 PRK10263
DNA translocase FtsK; Provisional
603-803 1.09e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.76  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  603 EPPAGPVYPPSLMP--QPTQQPAyviastgqqlpaggfsgsgPPISQQVLQPPPSPQGFVQQPPPAQMPVYYYPSGQYPT 680
Cdd:PRK10263   738 DGPHEPLFTPIVEPvqQPQQPVA-------------------PQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 798
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923  681 STTQQYrpmasvqfsaQRGQQmPQTAQQagyqpvlsgqqgfqglvgvQQSPPSQGVLSSPQgapvqsvmvsyptmssYQV 760
Cdd:PRK10263   799 APQPQY----------QQPQQ-PVAPQP-------------------QYQQPQQPVAPQPQ----------------YQQ 832
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1811007923  761 PMTQGSQGlPQQSYQQPVMLPN-QAGPGPLPATGVPVYCSVTPP 803
Cdd:PRK10263   833 PQQPVAPQ-PQDTLLHPLLMRNgDSRPLHKPTTPLPSLDLLTPP 875
PHA03378 PHA03378
EBNA-3B; Provisional
555-818 2.51e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.59  E-value: 2.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 555 QSVPFPAVSFPPQHLLP-MSPTPQFPMRDDVA--------TQFGQMTLSRQSSGETPEPPA--GPVYPPSLMPQPTQQPA 623
Cdd:PHA03378  639 QPITFNVLVFPTPHQPPqVEITPYKPTWTQIGhipyqpspTGANTMLPIQWAPGTMQPPPRapTPMRPPAAPPGRAQRPA 718
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 624 YviASTGQQLPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQMPVYYYPSGQYPTSTTQQYRPMASVQFSAQRGQQMP 703
Cdd:PHA03378  719 A--ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTP 796
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 704 QTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGAPVQSVMVSYPTMSSYQVPMTQGSQGLPQQSYQQPVMLPNQ 783
Cdd:PHA03378  797 QPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPVFYPPV 876
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 1811007923 784 AGPGPLPATG---VPVYCSVTPPTPQN---SLRLLGPHCPS 818
Cdd:PHA03378  877 LQPIQVMRQLgsvRAAAASTVTQAPTEytgERRGVGPMHPT 917
R3H_Smubp-2_like cd02641
R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and ...
174-221 4.68e-03

R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.


Pssm-ID: 100070  Cd Length: 60  Bit Score: 36.18  E-value: 4.68e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1811007923 174 IIDFIGDDNNHYKKFP-QMSSYQRMLVHRVAAYFGLDHNVDQTGKSVII 221
Cdd:cd02641     8 VKAFMKDPKATELEFPpTLSSHDRLLVHELAEELGLRHESTGEGSDRVI 56
R3H_unknown_2 cd06006
R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain ...
169-209 8.73e-03

R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100076  Cd Length: 59  Bit Score: 35.42  E-value: 8.73e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1811007923 169 KMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLD 209
Cdd:cd06006     3 QIESTLRKFINDKSKRSLRFPPMRSPQRAFIHELAKDYGLY 43
PRK10927 PRK10927
cell division protein FtsN;
594-743 9.16e-03

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 39.28  E-value: 9.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1811007923 594 SRQSSGETP-EPPAGP--VYPPSLMPQPTQQPAYVIASTGQQ---LPAGGFSGSGPPISQQVLQPPPSPQGFVQQPPPAQ 667
Cdd:PRK10927   91 SRQPGVRAPtEPSAGGevKTPEQLTPEQRQLLEQMQADMRQQptqLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQ 170
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1811007923 668 MPVYYYPSGQYPTSTTQQyrpmasvQFSAQRGQQMPQTAQQAGYQPVLSGQQGFQGLVGVQQSPPSQGVLSSPQGA 743
Cdd:PRK10927  171 QSRTTEQSWQQQTRTSQA-------APVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVTRAADAPKPT 239
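The alignment blocks above pair a query row (gi 1811007923) with a subject row (Cdd:...). A minimal sketch of how identity over the aligned columns could be counted from one such pair is given here. It assumes that dashes mark gaps and that lowercase residues mark unaligned positions in this display; the function name `aligned_identity` is hypothetical and not part of any NCBI tool.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: '-' marks a gap and lowercase letters mark unaligned
    residues, so columns containing either are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")

    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Skip gap columns and columns flagged as unaligned (lowercase).
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the cd06006 (R3H_unknown_2) hit, query interval 169-209.
query_row = "KMEQEIIDFIGDDNNHYKKFPQMSSYQRMLVHRVAAYFGLD"
subject_row = "QIESTLRKFINDKSKRSLRFPPMRSPQRAFIHELAKDYGLY"
identical, aligned = aligned_identity(query_row, subject_row)
print(f"{identical}/{aligned} aligned columns identical")
```

For hits that span several blocks, the residue strings of each block would need to be concatenated per row before calling the function.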
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.