| Name | Accession | Description | Interval | E-value |
| PRK10263 super family | cl35903 | DNA translocase FtsK; Provisional | 152-341 | 1.35e-07 |
|
DNA translocase FtsK; Provisional. The actual alignment was detected with superfamily member PRK10263:
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.48 E-value: 1.35e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 152 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 231
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 232 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 311
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 568914203 312 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 341
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| zf-C2H2_jaz | pfam12171 | Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... | 510-534 | 3.47e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 41.00 E-value: 3.47e-05
|
| GIY-YIG_SF super family | cl15257 | GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ... | 479-546 | 2.42e-03 |
|
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions. The actual alignment was detected with superfamily member cd10442:
Pssm-ID: 472790 Cd Length: 92 Bit Score: 37.73 E-value: 2.42e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 479 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 546
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
| ZnF_U1 | smart00451 | U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... | 626-659 | 2.86e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 35.69 E-value: 2.86e-03
10 20 30
....*....|....*....|....*....|....
gi 568914203 626 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 659
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
|
| Name | Accession | Description | Interval | E-value |
| PRK10263 | PRK10263 | DNA translocase FtsK; Provisional | 152-341 | 1.35e-07 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.48 E-value: 1.35e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 152 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 231
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 232 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 311
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 568914203 312 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 341
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PRK07764 | PRK07764 | DNA polymerase III subunits gamma and tau; Validated | 141-352 | 2.10e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 51.14 E-value: 2.10e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 141 ASSEESTEKGPTGQPQARVQPQ--TQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQ 218
Cdd:PRK07764 595 AGGEGPPAPASSGPPEEAARPAapAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 219 QDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQpqplWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQ----DQPQTWPQ 294
Cdd:PRK07764 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA----ATPPAGQADDPAAQPPQAAQGASAPSPAADDPvplpPEPDDPPD 750
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 568914203 295 GSVPPPEQASGPACATEPQLSSHAAEA-GSDPDKALPEPVSAQSSEDRsREASAGGLDL 352
Cdd:PRK07764 751 PAGAPAQPPPPPAPAPAAAPAAAPPPSpPSEEEEMAEDDAPSMDDEDR-RDAEEVAMEL 808
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 141-347 | 1.49e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 1.49e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 141 ASSEESTEKGPTgqPQARVQPQTQMTAPKQTQTPDRLP---EPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSP--EHL 215
Cdd:PHA03247 2789 ASLSESRESLPS--PWDPADPPAAVLAPAAALPPAASPagpLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRP 2866
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 216 APQQDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPLwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQtwpqg 295
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP-ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ----- 2940
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 568914203 296 svPPPEQASGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 347
Cdd:PHA03247 2941 --PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| zf-C2H2_jaz | pfam12171 | Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... | 510-534 | 3.47e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 41.00 E-value: 3.47e-05
|
| PRK14949 | PRK14949 | DNA polymerase III subunits gamma and tau; Provisional | 142-289 | 3.50e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 47.41 E-value: 3.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 142 SSEESTEKGPTGQPQARVQPQTQmTAPKQTQTPDRLPEPPEVQMLPR--IQPQALQIQTQPKLLRQAQTQTSPEHLAPQQ 219
Cdd:PRK14949 639 SSADRKPKTPPSRAPPASLSKPA-SSPDASQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDRPPWEEAPE 717
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 220 DQVEPQVPSQPPwqlqpRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQP 289
Cdd:PRK14949 718 VASANDGPNNAA-----EGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTE 782
|
|
| PRK10927 | PRK10927 | cell division protein FtsN; | 145-303 | 3.91e-05 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 46.21 E-value: 3.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 145 ESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQ 221
Cdd:PRK10927 101 EPSAGGEVKTPEQLTPEQRQLLEQMQAdmrQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQ 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 222 VEPQVPSQPPWQLQPRetdPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPT--QAQSQEQTSEKTQDQPQTWPQGSVPP 299
Cdd:PRK10927 181 QQTRTSQAAPVQAQPR---QSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVtrAADAPKPTAEKKDERRWMVQCGSFRG 257
|
....
gi 568914203 300 PEQA 303
Cdd:PRK10927 258 AEQA 261
|
|
| PRK10263 | PRK10263 | DNA translocase FtsK; Provisional | 180-300 | 1.01e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.85 E-value: 1.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 180 PPEVQMLPRIQPQAlqiQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAqtqpqplwqAQS 259
Cdd:PRK10263 740 PHEPLFTPIVEPVQ---QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY---------QQP 807
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 568914203 260 QKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPP 300
Cdd:PRK10263 808 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 159-317 | 1.16e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.41 E-value: 1.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 159 VQPQTQMTAPKQTQTPdrlPEPPEVQMLPRIQPQALQIQTQPKllrQAQTQTSPEHLAPQQDQvePQVPSQPPWQLQpre 238
Cdd:pfam09770 199 VEAAMRAQAKKPAQQP---APAPAQPPAAPPAQQAQQQQQFPP---QIQQQQQPQQQPQQPQQ--HPGQGHPVTILQ--- 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 239 tDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQvPTQA----QSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATEPQL 314
Cdd:pfam09770 268 -RPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQ-PTQIlqnpNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI 345
|
...
gi 568914203 315 SSH 317
Cdd:pfam09770 346 ITH 348
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 149-294 | 1.60e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.03 E-value: 1.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 149 KGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQ----PKLLRQAQTQTSPEHLAPQQDQVEP 224
Cdd:pfam09770 208 KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGqghpVTILQRPQSPQPDPAQPSIQPQAQQ 287
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568914203 225 QVPSQPPWQLQPRETDP-PNQAQAQTQPQPlwqaqsqkQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQ 294
Cdd:pfam09770 288 FHQQPPPVPVQPTQILQnPNRLSAARVGYP--------QNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQ 350
|
|
| PRK10263 | PRK10263 | DNA translocase FtsK; Provisional | 154-239 | 1.90e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.08 E-value: 1.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 154 QPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrQAQTQTSPEHLAPQQDQVEPQVPSQPpwq 233
Cdd:PRK10263 767 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQPVAP--- 839
|
....*.
gi 568914203 234 lQPRET 239
Cdd:PRK10263 840 -QPQDT 844
|
|
| PRK07994 | PRK07994 | DNA polymerase III subunits gamma and tau; Validated | 151-283 | 1.95e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 44.86 E-value: 1.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 151 PTGQPQARVQPQTQMTAPKQTQTPdrlPEPPEVQMLPRIQPQALQIQTQpklLRQAQTQTSPEHLAPQQDQVEPQVPSQP 230
Cdd:PRK07994 383 ATAAPTAAVAPPQAPAVPPPPASA---PQQAPAVPLPETTSQLLAARQQ---LQRAQGATKAKKSEPAAASRARPVNSAL 456
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 568914203 231 PW--QLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSE 283
Cdd:PRK07994 457 ERlaSVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPE 511
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 151-280 | 3.98e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.87 E-value: 3.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 151 PTGQPQARVQPQTQmtAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQP 230
Cdd:pfam09770 222 PAAPPAQQAQQQQQ--FPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP 299
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568914203 231 PWQLQ-------PRETDPPNQAQAQTQPQPLWQAQSQ----KQAQTQAHPQVPTQAQSQEQ 280
Cdd:pfam09770 300 TQILQnpnrlsaARVGYPQNPQPGVQPAPAHQAHRQQgsfgRQAPIITHPQQLAQLSEEEK 360
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 93-348 | 5.47e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 5.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 93 GPDSMLSEPQVPEPEPFETLEPPAKRCRRVRIKGIDHHnwlfaylwiFASSEESTEKGPTGQPQARVQPQTQMTAPKQTQ 172
Cdd:PHA03247 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES---------FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 173 TPDRLPEPPEVQMLPRIQPQAlQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPpwqlQPRETDPPNQAQAQTQPQ 252
Cdd:PHA03247 2931 PPPPPPPRPQPPLAPTTDPAG-AGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE----APASSTPPLTGHSLSRVS 3005
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 253 PlWQAQSQKQAQTQAHP-------QVPTQAQSQEQTSEKTQDqPQTWPQGSVPPPEQASGPACATEPQLSSHAAEAGSDP 325
Cdd:PHA03247 3006 S-WASSLALHEETDPPPvslkqtlWPPDDTEDSDADSLFDSD-SERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESP 3083
|
250 260
....*....|....*....|....
gi 568914203 326 DKAL-PEPVSAQSSEDRSREASAG 348
Cdd:PHA03247 3084 SSQFgPPPLSANAALSRRYVRSTG 3107
|
|
| PRK10263 | PRK10263 | DNA translocase FtsK; Provisional | 151-242 | 9.05e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.76 E-value: 9.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 151 PTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrQAQTQTSPEHLAPQQDQVEPQVP--S 228
Cdd:PRK10263 751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQPvaP 826
|
90
....*....|....
gi 568914203 229 QPPWQlQPRETDPP 242
Cdd:PRK10263 827 QPQYQ-QPQQPVAP 839
|
|
| GIY-YIG_PLEs | cd10442 | Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ... | 479-546 | 2.42e-03 |
|
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains the catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, RT domain and endonuclease (EN) domain, joined by a linker region of variable length. Both domains are functionally active.
Pssm-ID: 198389 Cd Length: 92 Bit Score: 37.73 E-value: 2.42e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 479 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 546
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
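The CCHH motif quoted in the GIY-YIG_PLEs description, CX(2-7)CX(33-39)HX(3-5)H, can be written as a regular expression. The following is a minimal sketch, not part of the CDD output: the function name `find_cchh` and the toy sequence are hypothetical, and the pattern is a literal transcription of the motif as stated in the description (X taken to mean any single residue).

```python
import re

# CX(2-7)CX(33-39)HX(3-5)H, transcribed literally; "." stands for any residue X.
# The lookahead lets the scan test every start position, so overlapping
# candidates are not skipped (at most one match is reported per start).
CCHH = re.compile(r"(?=(C.{2,7}C.{33,39}H.{3,5}H))")


def find_cchh(seq):
    """Return 1-based inclusive (start, end) spans of CCHH motif matches."""
    seq = seq.upper()
    return [(m.start(1) + 1, m.end(1)) for m in CCHH.finditer(seq)]


# Synthetic sequence (not a real protein): C, 2 X, C, 33 X, H, 3 X, H,
# flanked by two alanines on each side.
toy = "AA" + "C" + "GG" + "C" + "A" * 33 + "H" + "GGG" + "H" + "AA"
print(find_cchh(toy))  # [(3, 44)]
```

A pattern match only indicates that the spacing of the four residues is compatible with the motif; it says nothing about whether a CD-Search hit would be reported for that region.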
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 11-312 | 2.43e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 2.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 11 PQATRQSLLGPPPVGVPINPSqlnhsgrnTQKQARTPSSTTpnrkDSSSQTVPLEDREDPtEGSEEATELQMDTCEDQDS 90
Cdd:PHA03247 2561 PAAPDRSVPPPRPAPRPSEPA--------VTSRARRPDAPP----QSARPRAPVDDRGDP-RGPAPPSPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 91 LVGPDSMLSEPQVPEPEPFETLEPP--------AKRCRRVRIKGidhhnwlfaylwiFASSEESTEKGPTgqPQARVQPQ 162
Cdd:PHA03247 2628 PPSPSPAANEPDPHPPPTVPPPERPrddpapgrVSRPRRARRLG-------------RAAQASSPPQRPR--RRAARPTV 2692
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 163 TQMTAPKQTQTPDRLPEPPevqmlPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQ-----VEPQVPSQPPWQLQPR 237
Cdd:PHA03247 2693 GSLTSLADPPPPPPTPEPA-----PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPP 2767
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568914203 238 ETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATEP 312
Cdd:PHA03247 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
|
|
| ZnF_U1 | smart00451 | U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... | 626-659 | 2.86e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 35.69 E-value: 2.86e-03
10 20 30
....*....|....*....|....*....|....
gi 568914203 626 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 659
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 153-418 | 4.22e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 4.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 153 GQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQpklLRQAQTQTSPEHL-APQQDQVEPQVPsqPP 231
Cdd:pfam03154 319 GQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQ---LPNPQSHKHPPHLsGPSPFQMNSNLP--PP 393
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 232 WQLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEqtsektqdqpqtwPQGSVPPPEQASGPAcATE 311
Cdd:pfam03154 394 PALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLP-------------PPAASHPPTSGLHQV-PSQ 459
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 312 PQLSSHAAEAGSDPDKALPE--PVSAQSSEDRSREASAGGLDLGecekragemlGMWGAGSSLKVTILQSSNSRAFNTTP 389
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSgpPTSTSSAMPGIQPPSSASVSSS----------GPVPAAVSCPLPPVQIKEEALDEAEE 529
|
250 260 270
....*....|....*....|....*....|.
gi 568914203 390 LTSGPRPGDSTSATPAIASTPS--KQSLQFF 418
Cdd:pfam03154 530 PESPPPPPRSPSPEPTVVNTPShaSQSARFY 560
|
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 148-333 | 6.58e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 40.06 E-value: 6.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 148 EKGPTGQPQARVQPQtqmtAPKQTQTPDRlPEPPEVQMLPRI--------QPQALQIQTQPKLLRQAQTQTSPEHLAPQQ 219
Cdd:PTZ00449 566 EHKPSKIPTLSKKPE----FPKDPKHPKD-PEEPKKPKRPRSaqrptrpkSPKLPELLDIPKSPKRPESPKSPKRPPPPQ 640
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 220 DQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVP- 298
Cdd:PTZ00449 641 RPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRp 720
|
170 180 190
....*....|....*....|....*....|....*.
gi 568914203 299 -PPEQASGPACATEPQlsshaaeagSDPDKALPEPV 333
Cdd:PTZ00449 721 lPPKLPRDEEFPFEPI---------GDPDAEQPDDI 747
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 146-302 | 8.80e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 39.63 E-value: 8.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 146 STEKGPTGQPQARVQPQTQMTAPKQTQTPDR-------LPEP-PEVQMLPRI----------QPQALQIQTQPKLLRQ-- 205
Cdd:pfam09770 111 AAQSSAQPPASSLPQYQYASQQSQQPSKPVRtgyekykEPEPiPDLQVDASLwgvapkkaaaPAPAPQPAAQPASLPAps 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 206 ----------AQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPlwQAQSQKQAQTQAHP-QVPTQ 274
Cdd:pfam09770 191 rkmmsleeveAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQ--QPQQPQQHPGQGHPvTILQR 268
|
170 180
....*....|....*....|....*...
gi 568914203 275 AQSQEQTSEKTQDQPQTWPQGSVPPPEQ 302
Cdd:pfam09770 269 PQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
|
|
| PRK10927 | PRK10927 | cell division protein FtsN; | 156-311 | 9.04e-03 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 38.89 E-value: 9.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 156 QARVQPQTQMTAPKQTQTPDRLpEPPEVQMLPRIQPQALQIQTQpkLLRQAQTQTSPEHLAP--QQDQVEPQVPSQPPWQ 233
Cdd:PRK10927 93 QPGVRAPTEPSAGGEVKTPEQL-TPEQRQLLEQMQADMRQQPTQ--LVEVPWNEQTPEQRQQtlQRQRQAQQLAEQQRLA 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568914203 234 LQPRETDPPNQAQAQTQPqplwqaQSQKQAQTQAHPQVPTQAQSQE--QTSEKTQDQPQtwPQGSVPPPEQASGPACATE 311
Cdd:PRK10927 170 QQSRTTEQSWQQQTRTSQ------AAPVQAQPRQSKPASTQQPYQDllQTPAHTTAQSK--PQQAAPVTRAADAPKPTAE 241
|
|
|