|
Name |
Accession |
Description |
Interval |
E-value |
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
130-406 |
1.53e-12 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 71.20 E-value: 1.53e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 130 DISKRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALE 209
Cdd:TIGR04523 374 KLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKE 453
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 210 ESRAQAEQALSEKESQIETLVTSLLKMK-DWAAVLGEADDGNLDLDMKSglENTAALDNQPKgALKKLIyaAKLNASLKA 288
Cdd:TIGR04523 454 LIIKNLDNTRESLETQLKVLSRSINKIKqNLEQKQKELKSKEKELKKLN--EEKKELEEKVK-DLTKKI--SSLKEKIEK 528
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 289 LEGERNQVYTQLS----EVDQVKEDLT------------EHIKSLESKQASLQSEKTEFEsesQKLQQKLKVITELYQEN 352
Cdd:TIGR04523 529 LESEKKEKESKISdledELNKDDFELKkenlekeideknKEIEELKQTQKSLKKKQEEKQ---ELIDQKEKEKKDLIKEI 605
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*.
gi 259155338 353 EMKLHRKLTVEENYRLEKEE--KLSKVDEKISHATEELEtcrQRAKDLEEELERTI 406
Cdd:TIGR04523 606 EEKEKKISSLEKELEKAKKEneKLSSIIKNIKSKKNKLK---QEVKQIKETIKEIR 658
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
126-402 |
2.31e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 70.86 E-value: 2.31e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 126 ELMADISKRIQSLEDESKSLKSQVAEAKTTfrifeinEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQK 205
Cdd:TIGR02168 691 EKIAELEKALAELRKELEELEEELEQLRKE-------LEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEI 763
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 206 VALEESRAQAEQALSEKESQIETLVTSLLKMKDWAAVLGEADDgnlDLDMKSGLENTAALDNQPKgalkkliyaaklnas 285
Cdd:TIGR02168 764 EELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD---ELRAELTLLNEEAANLRER--------------- 825
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 286 LKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITE---LYQENEMKLHRKLTV 362
Cdd:TIGR02168 826 LESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEalaLLRSELEELSEELRE 905
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 259155338 363 EENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEEL 402
Cdd:TIGR02168 906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERL 945
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
52-391 |
4.74e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 66.62 E-value: 4.74e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 52 RSIRSRFYVGREKKLALELSAL-IEEKCKLLDkvsivQKEATYENLEQSKSKLEDEILLLEEKLEEERAKHSEQDELMAD 130
Cdd:TIGR02168 211 KAERYKELKAELRELELALLVLrLEELREELE-----ELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE 285
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 131 ISKRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEE 210
Cdd:TIGR02168 286 LQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 211 SRAQAEQALSEKESQIETLVTSLLKMKDWAAVLgEADDGNLDLDMKSGLENTAALDNQPKGALKKLiyaakLNASLKALE 290
Cdd:TIGR02168 366 ELEELESRLEELEEQLETLRSKVAQLELQIASL-NNEIERLEARLERLEDRRERLQQEIEELLKKL-----EEAELKELQ 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 291 GERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELyQENEMKLHRKLTVEENYRLEK 370
Cdd:TIGR02168 440 AELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERL-QENLEGFSEGVKALLKNQSGL 518
|
330 340
....*....|....*....|.
gi 259155338 371 EEKLSKVDEKIShATEELETC 391
Cdd:TIGR02168 519 SGILGVLSELIS-VDEGYEAA 538
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
62-401 |
1.84e-10 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 64.70 E-value: 1.84e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 62 REKKLALELSALIEEKCKLLDKVSIVQKEATYENLEQSKSK--LEDEILLLEEKLEEERAKHSEQDE-LMADISKRIQSL 138
Cdd:TIGR02169 205 REREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQlaSLEEELEKLTEEISELEKRLEEIEqLLEELNKKIKDL 284
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 139 -EDESKSLKSQVAEakttfriFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQ 217
Cdd:TIGR02169 285 gEEEQLRVKEKIGE-------LEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTE 357
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 218 ALSEKESQIETLVtsllkmkdwaAVLGEADdgnldldmksglENTAALDNQPKGALKKLiyaAKLNASLKALEGERNQVY 297
Cdd:TIGR02169 358 EYAELKEELEDLR----------AELEEVD------------KEFAETRDELKDYREKL---EKLKREINELKRELDRLQ 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 298 TQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKLHRkltveenyrleKEEKLSKV 377
Cdd:TIGR02169 413 EELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYD-----------LKEEYDRV 481
|
330 340
....*....|....*....|....
gi 259155338 378 DEKISHATEELETCRQRAKDLEEE 401
Cdd:TIGR02169 482 EKELSKLQRELAEAEAQARASEER 505
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
164-406 |
2.09e-10 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 63.24 E-value: 2.09e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 164 ERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDWAAVL 243
Cdd:COG4942 30 EQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAEL 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 244 geaddgnLDLDMKSGLENTAAL---DNQPKGALKKLIYAAKLNASLKAlegernqvytQLSEVDQVKEDLTEHIKSLESK 320
Cdd:COG4942 110 -------LRALYRLGRQPPLALllsPEDFLDAVRRLQYLKYLAPARRE----------QAEELRADLAELAALRAELEAE 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 321 QASLQSEKTEFESESQKLQQKLKvitelyqenemklhrkltveenyrlEKEEKLSKVDEKISHATEELETCRQRAKDLEE 400
Cdd:COG4942 173 RAELEALLAELEEERAALEALKA-------------------------ERQKLLARLEKELAELAAELAELQQEAEELEA 227
|
....*.
gi 259155338 401 ELERTI 406
Cdd:COG4942 228 LIARLE 233
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
64-347 |
3.24e-10 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 63.50 E-value: 3.24e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 64 KKLALELSALiEEKCKLLDKVSiVQKEATYENLEQSKSKledeillleekleeeraKHSEQDELMADISKR---IQSLED 140
Cdd:TIGR04523 387 KNLESQINDL-ESKIQNQEKLN-QQKDEQIKKLQQEKEL-----------------LEKEIERLKETIIKNnseIKDLTN 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 141 ESKSLKSQVAEAKTTFRIFEINEERLKGAIKdalNENSQLQESQKQLLQETEMMK----------EQVNDLDKQKVALEE 210
Cdd:TIGR04523 448 QDSVKELIIKNLDNTRESLETQLKVLSRSIN---KIKQNLEQKQKELKSKEKELKklneekkeleEKVKDLTKKISSLKE 524
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 211 SRAQAEQALSEKESQIETLVTSLLKMKD---WAAVLGEADDGNLDLDmKSGLENTAALDNQPKgaLKKLIyaaklnaslK 287
Cdd:TIGR04523 525 KIEKLESEKKEKESKISDLEDELNKDDFelkKENLEKEIDEKNKEIE-ELKQTQKSLKKKQEE--KQELI---------D 592
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 288 ALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITE 347
Cdd:TIGR04523 593 QKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKE 652
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
118-448 |
3.73e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.54 E-value: 3.73e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 118 RAKHSEQDELMADIS-KRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAI---KDALNENSQLQESQKQLLQETem 193
Cdd:TIGR02168 216 KELKAELRELELALLvLRLEELREELEELQEELKEAEEELEELTAELQELEEKLeelRLEVSELEEEIEELQKELYAL-- 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 194 mKEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKdwaavlgeaddgnldldmksglENTAALDNQPKGAL 273
Cdd:TIGR02168 294 -ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELA----------------------EELAELEEKLEELK 350
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 274 KKLiyaAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENE 353
Cdd:TIGR02168 351 EEL---ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 354 MKLhrkltvEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLE---EELERTIHSYQGQVISHEKKAHdnwlAARTL 430
Cdd:TIGR02168 428 KKL------EEAELKELQAELEELEEELEELQEELERLEEALEELReelEEAEQALDAAERELAQLQARLD----SLERL 497
|
330 340
....*....|....*....|.
gi 259155338 431 ERNLNDLR---KENAHNRQKL 448
Cdd:TIGR02168 498 QENLEGFSegvKALLKNQSGL 518
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
59-460 |
4.80e-10 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 63.12 E-value: 4.80e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 59 YVGREKKLALELSALIEEKCKLLDKVSIVQKEATYENLEQSKSKLEDEILLLEEKLEEERAKHSEQDelMADISKRIQSL 138
Cdd:TIGR04523 209 KIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKE--LEQNNKKIKEL 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 139 EDESKSLKSQVAEAKTtfrifEINEERLKGAIKDALNENSQLQESQKQLLQETEM---MKEQVNDLDKQKVALEESRAQA 215
Cdd:TIGR04523 287 EKQLNQLKSEISDLNN-----QKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIisqLNEQISQLKKELTNSESENSEK 361
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 216 EQALSEKESQIETLvtslLKMKDwaavlgEADDGNLDLDM-KSGLENTaaLDNQPKgalkkliYAAKLNASLKALEGERN 294
Cdd:TIGR04523 362 QRELEEKQNEIEKL----KKENQ------SYKQEIKNLESqINDLESK--IQNQEK-------LNQQKDEQIKKLQQEKE 422
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 295 QVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKLHRK--------------- 359
Cdd:TIGR04523 423 LLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKqkelkskekelkkln 502
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 360 ------------LTVEENYRLEKEEKLS----KVDEKISHATEELETCRQRAKdlEEELERTIHSYQgQVISHEKKAHDN 423
Cdd:TIGR04523 503 eekkeleekvkdLTKKISSLKEKIEKLEsekkEKESKISDLEDELNKDDFELK--KENLEKEIDEKN-KEIEELKQTQKS 579
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 259155338 424 WLAART--------LERNLNDLRKENAHNRQKLTETEFKFELLEK 460
Cdd:TIGR04523 580 LKKKQEekqelidqKEKEKKDLIKEIEEKEKKISSLEKELEKAKK 624
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
164-353 |
2.08e-09 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 60.23 E-value: 2.08e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 164 ERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTsllKMKDWAAVL 243
Cdd:COG3883 19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERRE---ELGERARAL 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 244 GEADDGNLDLDMKSGLENTA-ALDNQpkGALKKLiyAAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQA 322
Cdd:COG3883 96 YRSGGSVSYLDVLLGSESFSdFLDRL--SALSKI--ADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171
|
170 180 190
....*....|....*....|....*....|.
gi 259155338 323 SLQSEKTEFESESQKLQQKLKVITELYQENE 353
Cdd:COG3883 172 ELEAQQAEQEALLAQLSAEEAAAEAQLAELE 202
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
135-440 |
5.90e-08 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 56.27 E-value: 5.90e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 135 IQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVND----LDKQKVALEE 210
Cdd:pfam05483 445 LQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDmtleLKKHQEDIIN 524
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 211 SRAQAEQALSekesQIETLVTSLLKMKDwaavlgEADDGNLDLDMKsGLENTAALDNQPKGALKKLIYAAKLNASLKALE 290
Cdd:pfam05483 525 CKKQEERMLK----QIENLEEKEMNLRD------ELESVREEFIQK-GDEVKCKLDKSEENARSIEYEVLKKEKQMKILE 593
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 291 GERNQVYTQLSEVDQVKEDLTEHIKSL------ESKQAS---LQSEKTEFESESQKlqQKLKVITELYQEnemKLHRKLT 361
Cdd:pfam05483 594 NKCNNLKKQIENKNKNIEELHQENKALkkkgsaENKQLNayeIKVNKLELELASAK--QKFEEIIDNYQK---EIEDKKI 668
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 362 VEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTIHSY-------QGQVISHEKKAHDNWLAARTLERNL 434
Cdd:pfam05483 669 SEEKLLEEVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYdkiieerDSELGLYKNKEQEQSSAKAALEIEL 748
|
....*.
gi 259155338 435 NDLRKE 440
Cdd:pfam05483 749 SNIKAE 754
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
64-404 |
8.21e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 56.22 E-value: 8.21e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 64 KKLALELSALIEEKCKLLDKV-----SIVQKEATYENLEQSKSKLEDEILLLEEKLEEERAKHSEQDELMADISKRIQSL 138
Cdd:TIGR02168 701 AELRKELEELEEELEQLRKELeelsrQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEA 780
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 139 EDESKSLKSQVAEAKTTFrifEINEERLkgaikDALN-ENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQ 217
Cdd:TIGR02168 781 EAEIEELEAQIEQLKEEL---KALREAL-----DELRaELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 218 ALSEKESQIETLVTSLlkmkdwaavlgeaddgnldldmksglentaaldnqpkgalkkliyaAKLNASLKALEGERNQVY 297
Cdd:TIGR02168 853 DIESLAAEIEELEELI----------------------------------------------EELESELEALLNERASLE 886
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 298 TQLSEVDQVKEDLTEHIKSLESKqaslqseKTEFESESQKLQQKLKVITELYQENEMKLHRKL-TVEENYRLEKEEklsk 376
Cdd:TIGR02168 887 EALALLRSELEELSEELRELESK-------RSELRRELEELREKLAQLELRLEGLEVRIDNLQeRLSEEYSLTLEE---- 955
|
330 340
....*....|....*....|....*...
gi 259155338 377 VDEKISHATEELETCRQRAKDLEEELER 404
Cdd:TIGR02168 956 AEALENKIEDDEEEARRRLKRLENKIKE 983
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
63-456 |
1.34e-07 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 55.53 E-value: 1.34e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 63 EKKLALELSALIEEKCKLLDKVSIVQKEATYENLEQSKSKLEDEILLLEEKLEEERA-KHSEQDELMADISKRIQSLEDE 141
Cdd:PTZ00121 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAkKKAEEDKKKADELKKAAAAKKK 1419
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 142 SKSLKSQVAEAKTTFRIFEINEERLKG------AIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQA 215
Cdd:PTZ00121 1420 ADEAKKKAEEKKKADEAKKKAEEAKKAdeakkkAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1499
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 216 EQALSEKESQIETLVTSLLKMKDWAAVLGEADDGNLDLDMKSGLENTAALDNQPKGALKKLIYAAKLNASLKAlEGERNQ 295
Cdd:PTZ00121 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKA-EEDKNM 1578
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 296 VYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKvitelYQENEMKLHRKLTVEENYRLEKEEKLS 375
Cdd:PTZ00121 1579 ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK-----KAEEEKKKVEQLKKKEAEEKKKAEELK 1653
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 376 KVDEKISHATEEL----ETCRQRAKDL--EEELERTIHSYQGQVISHEKKAHDNWLAARTLERNLNDLRKENAHNRQKLT 449
Cdd:PTZ00121 1654 KAEEENKIKAAEEakkaEEDKKKAEEAkkAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE 1733
|
....*..
gi 259155338 450 ETEFKFE 456
Cdd:PTZ00121 1734 EAKKEAE 1740
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
52-347 |
1.77e-07 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 55.06 E-value: 1.77e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 52 RSIRSRFYVGREKKLALELSALI--------EEKCKLLDKVSIVQKE-ATYENLEQSKSKLEDEILLLeekleeeraKHS 122
Cdd:TIGR01612 662 KSELSKIYEDDIDALYNELSSIVkenaidntEDKAKLDDLKSKIDKEyDKIQNMETATVELHLSNIEN---------KKN 732
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 123 EQDELMADISKRIQSleDESKSLKSQVAEakttfriFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNdLD 202
Cdd:TIGR01612 733 ELLDIIVEIKKHIHG--EINKDLNKILED-------FKNKEKELSNKINDYAKEKDELNKYKSKISEIKNHYNDQIN-ID 802
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 203 KQKValEESRAQAEQA------LSEKESQIETLVTSLLKMKDwaAVLGEAD-----DGNLDLDMKSGLENTAALDNQPKG 271
Cdd:TIGR01612 803 NIKD--EDAKQNYDKSkeyiktISIKEDEIFKIINEMKFMKD--DFLNKVDkfinfENNCKEKIDSEHEQFAELTNKIKA 878
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 272 AL--KKL-IYAAKLNAS-------LKALEGERNQVYTqLSEVD---QVKEDLTEHIKSLESKQASLQsektefesesQKL 338
Cdd:TIGR01612 879 EIsdDKLnDYEKKFNDSkslineiNKSIEEEYQNINT-LKKVDeyiKICENTKESIEKFHNKQNILK----------EIL 947
|
....*....
gi 259155338 339 QQKLKVITE 347
Cdd:TIGR01612 948 NKNIDTIKE 956
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
130-403 |
2.02e-07 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 54.66 E-value: 2.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 130 DISKRIQSLEDESKSLKSQVAEAKTtfrifeineerlkgAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALE 209
Cdd:PRK02224 311 AVEARREELEDRDEELRDRLEECRV--------------AAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAR 376
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 210 ESRAQAEQALSEKESQIETLVTSLlkmKDWAAVLGEADDGNLDLdmksgLENTAALDNQpkgalkkliyAAKLNASLKAL 289
Cdd:PRK02224 377 EAVEDRREEIEELEEEIEELRERF---GDAPVDLGNAEDFLEEL-----REERDELRER----------EAELEATLRTA 438
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 290 EGERNQVYT---------------------QLSEVDQVKEDLTEHIKSLESKQASLQS------EKTEFESESQKLQQKL 342
Cdd:PRK02224 439 RERVEEAEAlleagkcpecgqpvegsphveTIEEDRERVEELEAELEDLEEEVEEVEErleraeDLVEAEDRIERLEERR 518
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 259155338 343 KVITELYQENEMKLHRKLTVEENYRLEKEEKLSKVDEKISHAT---EELETCRQRAKDLEEELE 403
Cdd:PRK02224 519 EDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAeaeEEAEEAREEVAELNSKLA 582
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
195-410 |
4.60e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 53.77 E-value: 4.60e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 195 KEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLkmkDWAAVLGEADDgnlDLDMKSGLENTAALDNQpkgaLK 274
Cdd:COG4913 609 RAKLAALEAELAELEEELAEAEERLEALEAELDALQERRE---ALQRLAEYSWD---EIDVASAEREIAELEAE----LE 678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 275 KLIyaaKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEktefesesqklQQKLKVITELYQENEM 354
Cdd:COG4913 679 RLD---ASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEE-----------LDELQDRLEAAEDLAR 744
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 259155338 355 KLHRKLTVEENYRLEKEEKLSKVDEKIshaTEELETCRQRAKDLEEELERTIHSYQ 410
Cdd:COG4913 745 LELRALLEERFAAALGDAVERELRENL---EERIDALRARLNRAEEELERAMRAFN 797
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
118-420 |
1.20e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.38 E-value: 1.20e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 118 RAKHSEQDELMAD--------ISKRIQSLEDESKSLKSQVAE-----AKTTFRIFEINEErlKGAIKDALNENSqlqesq 184
Cdd:TIGR02169 207 REKAERYQALLKEkreyegyeLLKEKEALERQKEAIERQLASleeelEKLTEEISELEKR--LEEIEQLLEELN------ 278
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 185 KQLLQETEmmkEQVNDLDKQKVALEESRAQAEQALSEKESQIETLvtsllkmkdwaavlgEADDGNLDLDMKSGLENTAA 264
Cdd:TIGR02169 279 KKIKDLGE---EEQLRVKEKIGELEAEIASLERSIAEKERELEDA---------------EERLAKLEAEIDKLLAEIEE 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 265 LDNQPKGALKKLiyaAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKV 344
Cdd:TIGR02169 341 LEREIEEERKRR---DKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQR 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 345 ITELYQENEMKLHR---KLTVEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEE---LERTIHSYQGQVISHEK 418
Cdd:TIGR02169 418 LSEELADLNAAIAGieaKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEydrVEKELSKLQRELAEAEA 497
|
..
gi 259155338 419 KA 420
Cdd:TIGR02169 498 QA 499
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
123-403 |
1.48e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 51.99 E-value: 1.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 123 EQDELMADISKRIQSLEDESKSLKSQVAEAKTTFRifEINEerlkgaIKDALNENSQLQESQKQLLQETEMMKEQVNDLD 202
Cdd:PRK03918 173 EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLR--EINE------ISSELPELREELEKLEKEVKELEELKEEIEELE 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 203 KQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDWAAVLGEaddgnldldMKSGLENTAALDNQPKGALKKLiyaAKL 282
Cdd:PRK03918 245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKE---------LKEKAEEYIKLSEFYEEYLDEL---REI 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 283 NASLKALEGERNQVYTQLSEVDQVK---EDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKLHRK 359
Cdd:PRK03918 313 EKRLSRLEEEINGIEERIKELEEKEerlEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKE 392
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 259155338 360 LTVEENYRLEKEEKLSKVDEKIShateELETCRQRAKDLEEELE 403
Cdd:PRK03918 393 LEELEKAKEEIEEEISKITARIG----ELKKEIKELKKAIEELK 432
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
118-446 |
1.61e-06 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 51.76 E-value: 1.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 118 RAKHSEQDELMADISKRIQSLedeskslKSQVAEAKTTFRIFEINEERLKgaikdalNENSQLQES-QKQLLQETEMMKE 196
Cdd:pfam12128 617 REKQAAAEEQLVQANGELEKA-------SREETFARTALKNARLDLRRLF-------DEKQSEKDKkNKALAERKDSANE 682
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 197 QVNDLDKQKVALEEsraQAEQALSEKESQIETLVTSllKMKDWAAVLGEADD--GNLDLDMKSGLENTAA----LDNQPK 270
Cdd:pfam12128 683 RLNSLEAQLKQLDK---KHQAWLEEQKEQKREARTE--KQAYWQVVEGALDAqlALLKAAIAARRSGAKAelkaLETWYK 757
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 271 GALKKL----IYAAKLNASLKALEG-------------------------ERNQVYTQLSEVDQVKEDLTEHiksLESKQ 321
Cdd:pfam12128 758 RDLASLgvdpDVIAKLKREIRTLERkieriavrrqevlryfdwyqetwlqRRPRLATQLSNIERAISELQQQ---LARLI 834
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 322 ASLQSEKTEFESESQKLQQKLKVITElyqenemkLHRKLTVEENY--RLEKEEKLSKVDEKISHATEELETCRQRAKDLE 399
Cdd:pfam12128 835 ADTKLRRAKLEMERKASEKQQVRLSE--------NLRGLRCEMSKlaTLKEDANSEQAQGSIGERLAQLEDLKLKRDYLS 906
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 259155338 400 EELERTIHSYQGQVISHEKKAHD-NWLAARTLERNLNDLRKENAHNRQ 446
Cdd:pfam12128 907 ESVKKYVEHFKNVIADHSGSGLAeTWESLREEDHYQNDKGIRLLDYRK 954
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
62-464 |
1.95e-06 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 51.26 E-value: 1.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 62 REKKLALELSALIEEKCKLLDKVSIvqkeatyenLEQSKSKLEDEILLLEEKLEEERAKHSEQDELMADiskriqsLEDE 141
Cdd:pfam05483 238 KEKQVSLLLIQITEKENKMKDLTFL---------LEESRDKANQLEEKTKLQDENLKELIEKKDHLTKE-------LEDI 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 142 SKSLKSQVAEAKTTfrifeinEERLKGAIKDALnensQLQESQKQLLQETEMMKEQ----VNDLDKQKVALEESRAQAEQ 217
Cdd:pfam05483 302 KMSLQRSMSTQKAL-------EEDLQIATKTIC----QLTEEKEAQMEELNKAKAAhsfvVTEFEATTCSLEELLRTEQQ 370
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 218 ALSEKESQIETLVTSLLK------------------MKDWAAVLGEadDGNLdLDMKSGLENTAaldNQPKGALKKLIYA 279
Cdd:pfam05483 371 RLEKNEDQLKIITMELQKksseleemtkfknnkeveLEELKKILAE--DEKL-LDEKKQFEKIA---EELKGKEQELIFL 444
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 280 aklnasLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITElyQENEMKLHRK 359
Cdd:pfam05483 445 ------LQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQ--EASDMTLELK 516
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 360 LTVEENYRLEKEE-----KLSKVDEKISHATEELETCRQRAKDLEEELERTIHSYQGQVISHEKKAHDNWLAARTLERNL 434
Cdd:pfam05483 517 KHQEDIINCKKQEermlkQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKC 596
|
410 420 430
....*....|....*....|....*....|
gi 259155338 435 NDLRKENAHNRQKLTETEFKFELLEKDPYA 464
Cdd:pfam05483 597 NNLKKQIENKNKNIEELHQENKALKKKGSA 626
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
279-452 |
2.23e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 51.48 E-value: 2.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 279 AAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITEL---YQENEMK 355
Cdd:COG1196 241 LEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERrreLEERLEE 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 356 LHRKLTVEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTihsyQGQVISHEKKAHDNWLAARTLERNLN 435
Cdd:COG1196 321 LEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEA----EAELAEAEEELEELAEELLEALRAAA 396
|
170
....*....|....*..
gi 259155338 436 DLRKENAHNRQKLTETE 452
Cdd:COG1196 397 ELAAQLEELEEAEEALL 413
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
61-461 |
2.68e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 50.92 E-value: 2.68e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 61 GREKKLALELSALIEEKCKLLDkvsivQKEATYENLEQSKSKLEDEILLLEEKLEEERAKHS--EQDELMADISKRIQSL 138
Cdd:COG4717 63 GRKPELNLKELKELEEELKEAE-----EKEEEYAELQEELEELEEELEELEAELEELREELEklEKLLQLLPLYQELEAL 137
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 139 EDESKSLKSQVAEAKTtfRIFEINEerLKGAIKDALNENSQLQESQKQLLQET-EMMKEQVNDLDKQKVALEESRAQAEQ 217
Cdd:COG4717 138 EAELAELPERLEELEE--RLEELRE--LEEELEELEAELAELQEELEELLEQLsLATEEELQDLAEELEELQQRLAELEE 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 218 ALSEKESQIETL---VTSLLKMKDWAAVLGEADDGNLDLDMKSGLENTAALDNQPKGALKKLIYAAKLNASLKALEGERN 294
Cdd:COG4717 214 ELEEAQEELEELeeeLEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLL 293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 295 QVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQK--LQQKLKVITELYQENE--MKLHRKLTVEEnYRLEK 370
Cdd:COG4717 294 AREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPeeLLELLDRIEELQELLReaEELEEELQLEE-LEQEI 372
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 371 EEKLSKVDekishaTEELETCRQRAKDLEE--ELERTIHSYQGQVISHEKKAHDNWLAA---------RTLERNLNDLRK 439
Cdd:COG4717 373 AALLAEAG------VEDEEELRAALEQAEEyqELKEELEELEEQLEELLGELEELLEALdeeeleeelEELEEELEELEE 446
|
410 420
....*....|....*....|..
gi 259155338 440 ENAHNRQKLTETEFKFELLEKD 461
Cdd:COG4717 447 ELEELREELAELEAELEQLEED 468
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
62-342 |
3.21e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 50.71 E-value: 3.21e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 62 REKKLALELSAL----IEEKCKLLDKvSIVQKEATYENLEQSKSKLEDEILLLEEKLEEERAKHSEQDELMADISKRIQS 137
Cdd:COG1196 221 ELKELEAELLLLklreLEAELEELEA-ELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELAR 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 138 LEDESKSLKSQVAEAkttfrifEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQ 217
Cdd:COG1196 300 LEQDIARLEERRREL-------EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEA 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 218 ALSEKESQIETLVTSLLKMKDwAAVLGEADDGNLDLDMKSGLENTAALDNQpkgalkkliyAAKLNASLKALEGERNQVY 297
Cdd:COG1196 373 ELAEAEEELEELAEELLEALR-AAAELAAQLEELEEAEEALLERLERLEEE----------LEELEEALAELEEEEEEEE 441
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 259155338 298 TQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKL 342
Cdd:COG1196 442 EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEEL 486
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
287-404 |
4.15e-06 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 50.24 E-value: 4.15e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 287 KALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFEsesQKLQQKLKVITELyqENEMKLHRKltvEENY 366
Cdd:COG2433 388 KELPEEEPEAEREKEHEERELTEEEEEIRRLEEQVERLEAEVEELE---AELEEKDERIERL--ERELSEARS---EERR 459
|
90 100 110
....*....|....*....|....*....|....*...
gi 259155338 367 RLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELER 404
Cdd:COG2433 460 EIRKDREISRLDREIERLERELEEERERIEELKRKLER 497
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
52-336 |
4.76e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 50.32 E-value: 4.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 52 RSIRSRFYVGREKKLALELSALIEEKCKLLDKVSIVQKEATYENLEQSKSKLEDEILLLEEKLEEERAKHSEQDELMADI 131
Cdd:COG1196 225 LEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDI 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 132 S---KRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVAL 208
Cdd:COG1196 305 ArleERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEEL 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 209 EESRAQAEQALSEKESQIETLVTSLLKMKDWAAVLGEADDGNLDLDmksgLENTAALDNQPKGALKKLIYAAKLNASLKA 288
Cdd:COG1196 385 AEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEAL----AELEEEEEEEEEALEEAAEEEAELEEEEEA 460
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 259155338 289 LEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQ 336
Cdd:COG1196 461 LLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLE 508
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
161-461 |
1.09e-05 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 48.86 E-value: 1.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 161 INEERLKGaikdaLNENSQLQESQKQLLQEtemmkeqvnDLDKQKVALEESRAQAEQALSEKESQIETLVTsllKMKDWA 240
Cdd:PHA02562 171 LNKDKIRE-----LNQQIQTLDMKIDHIQQ---------QIKTYNKNIEEQRKKNGENIARKQNKYDELVE---EAKTIK 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 241 AVLGEADDGNLDLDMKSGlentaaldnQPKGALKKL-IYAAKLNASLKALEGERNqVYTQLSEVDQVKEDLTEHIKSLES 319
Cdd:PHA02562 234 AEIEELTDELLNLVMDIE---------DPSAALNKLnTAAAKIKSKIEQFQKVIK-MYEKGGVCPTCTQQISEGPDRITK 303
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 320 KQASLQSEKTEFESESQKlQQKLKVITELYQENEMKLHrkltveenyrlEKEEKLSKVDEKIShateeleTCRQRAKDLE 399
Cdd:PHA02562 304 IKDKLKELQHSLEKLDTA-IDELEEIMDEFNEQSKKLL-----------ELKNKISTNKQSLI-------TLVDKAKKVK 364
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 259155338 400 EELERTihsyQGQVISHEKkahdnwlAARTLERNLNDLRKENAHNRQKLTETEFKFELLeKD 461
Cdd:PHA02562 365 AAIEEL----QAEFVDNAE-------ELAKLQDELDKIVKTKSELVKEKYHRGIVTDLL-KD 414
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
118-460 |
1.40e-05 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 48.63 E-value: 1.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 118 RAKHSEQDELMADISKRiqSLEDESKSLKSQVAEAKTtfRIFEI------NEERLKGAIKDALNENSQLQESQKQLLQET 191
Cdd:pfam01576 195 RLKKEEKGRQELEKAKR--KLEGESTDLQEQIAELQA--QIAELraqlakKEEELQAALARLEEETAQKNNALKKIRELE 270
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 192 EMMKEQVNDLDKQKVAleesRAQAEQALSEKESQIETLVTSLLKMKDWAAV---LGEADDGNLDLDMKSGLENTAALDNQ 268
Cdd:pfam01576 271 AQISELQEDLESERAA----RNKAEKQRRDLGEELEALKTELEDTLDTTAAqqeLRSKREQEVTELKKALEEETRSHEAQ 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 269 PKGALKKLIYA---------------AKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQS------- 326
Cdd:pfam01576 347 LQEMRQKHTQAleelteqleqakrnkANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQArlseser 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 327 EKTEFESESQKLQQKLKVITELYQENEMKLHrkltveenyRLEKEekLSKVDEKIsHATEEL--ETCRQ------RAKDL 398
Cdd:pfam01576 427 QRAELAEKLSKLQSELESVSSLLNEAEGKNI---------KLSKD--VSSLESQL-QDTQELlqEETRQklnlstRLRQL 494
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 259155338 399 EEE-----------------LERTIHSYQGQVISHEKKAHDNWLAARTLERNLNDLRKENAHNRQKLTETEFKFELLEK 460
Cdd:pfam01576 495 EDErnslqeqleeeeeakrnVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEK 573
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
133-461 |
1.62e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 48.48 E-value: 1.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 133 KRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDalnENSQLQESQKQLLQetemMKEQVNDLDKQKVALEESR 212
Cdd:TIGR04523 54 KELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKK---NKDKINKLNSDLSK----INSEIKNDKEQKNKLEVEL 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 213 AQAEQALSEKESQIETLVTSLLKMKDWAAVLGEADDgnlDLD-MKSGLENTaaldnqpKGALKKLIyaAKLNASLKALEG 291
Cdd:TIGR04523 127 NKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYN---DLKkQKEELENE-------LNLLEKEK--LNIQKNIDKIKN 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 292 ERNQVYTQLSevdqVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKLhrKLTVEENYRL--- 368
Cdd:TIGR04523 195 KLLKLELLLS----NLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQL--NQLKDEQNKIkkq 268
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 369 --EKEEKLSKVDEKISHATEELETCR--------QRAKDLEEELERTIHSYQGQVISHEKKAHDNWLAARTLERNLNDLR 438
Cdd:TIGR04523 269 lsEKQKELEQNNKKIKELEKQLNQLKseisdlnnQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLK 348
|
330 340 350
....*....|....*....|....*....|
gi 259155338 439 KE-------NAHNRQKLTETEFKFELLEKD 461
Cdd:TIGR04523 349 KEltnseseNSEKQRELEEKQNEIEKLKKE 378
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
133-450 |
1.83e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 48.43 E-value: 1.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 133 KRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENS---------------------------------- 178
Cdd:pfam02463 153 ERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQelklkeqakkaleyyqlkekleleeeyllyldyl 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 179 QLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDWAAVLGEADDGNLDLDMKSG 258
Cdd:pfam02463 233 KLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDD 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 259 LENTAALDNQPKGALKKLIYAAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKL 338
Cdd:pfam02463 313 EEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKL 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 339 QQKLKVITELYQENEMKLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTIHSYQGQVISHEK 418
Cdd:pfam02463 393 KEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSED 472
|
330 340 350
....*....|....*....|....*....|..
gi 259155338 419 KAHDNWLAARTLERNLNDLRKENAHNRQKLTE 450
Cdd:pfam02463 473 LLKETQLVKLQEQLELLLSRQKLEERSQKESK 504
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
122-493 |
1.98e-05 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 48.15 E-value: 1.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 122 SEQDELMADISKRIQSLEDESKslksqvaeAKTTFRIFEINEERLKGA-IKDALNENSQLQESQKQLLQETEMMKEQVND 200
Cdd:COG5022 806 LGSRKEYRSYLACIIKLQKTIK--------REKKLRETEEVEFSLKAEvLIQKFGRSLKAKKRFSLLKKETIYLQSAQRV 877
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 201 LDKQKVALE-----ESRAQAEQALSEKESQIETLV----TSLLK--------MKDWAAVLGEADDGNlDLDMKSGLENTA 263
Cdd:COG5022 878 ELAERQLQElkidvKSISSLKLVNLELESEIIELKkslsSDLIEnlefktelIARLKKLLNNIDLEE-GPSIEYVKLPEL 956
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 264 ALDNQPKGALKKLiyAAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLqSEKTEFESESQKLQQKLK 343
Cdd:COG5022 957 NKLHEVESKLKET--SEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQEST-KQLKELPVEVAELQSASK 1033
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 344 VI----TELYQENEM-KLHRKLTVEENYrLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTIHSYQGQVISHEK 418
Cdd:COG5022 1034 IIssesTELSILKPLqKLKGLLLLENNQ-LQARYKALKLRRENSLLDDKQLYQLESTENLLKTINVKDLEVTNRNLVKPA 1112
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 259155338 419 KAHDNWLAARTlerNLNDLRKENAHNRQKLTETEFKFELLEKDPYALDVPNTAFGREHSPYGPSPLGRPPSETRA 493
Cdd:COG5022 1113 NVLQFIVAQMI---KLNLLQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQ 1184
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
74-460 |
1.98e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 48.09 E-value: 1.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 74 IEEKCKLLDKvSIVQKEATYENLE--------------------------------------------------QSKSKL 103
Cdd:TIGR04523 38 LEKKLKTIKN-ELKNKEKELKNLDknlnkdeekinnsnnkikileqqikdlndklkknkdkinklnsdlskinsEIKNDK 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 104 EDEILLLEEKLEEERAKhSEQDELMADISKRIQSLEDESKS-------LKSQVAEAKTTFRIFEINEERLKGAIKDALNE 176
Cdd:TIGR04523 117 EQKNKLEVELNKLEKQK-KENKKNIDKFLTEIKKKEKELEKlnnkyndLKKQKEELENELNLLEKEKLNIQKNIDKIKNK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 177 NSQLQ----------ESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDwaavlgEA 246
Cdd:TIGR04523 196 LLKLElllsnlkkkiQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKK------QL 269
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 247 DDGNLDLDMKSG-LENTAALDNQPKGALKKL------IYAAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLES 319
Cdd:TIGR04523 270 SEKQKELEQNNKkIKELEKQLNQLKSEISDLnnqkeqDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKK 349
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 320 KQASLQSEKTEFESESQKLQQKLKVITElyqENEMKLhrkltvEENYRLEKEeklskvdekishaTEELETCRQRAKDLE 399
Cdd:TIGR04523 350 ELTNSESENSEKQRELEEKQNEIEKLKK---ENQSYK------QEIKNLESQ-------------INDLESKIQNQEKLN 407
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 259155338 400 EELERTIHSYQGQVISHEKKaHDNWLAARTLERN-LNDLRKENAHNRQKLTETEFKFELLEK 460
Cdd:TIGR04523 408 QQKDEQIKKLQQEKELLEKE-IERLKETIIKNNSeIKDLTNQDSVKELIIKNLDNTRESLET 468
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
133-440 |
2.02e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 48.19 E-value: 2.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 133 KRIQSLEDESKSLKSQVAEAKTTFrifeinEERLKGAIKDALNENSQLQESQKqllqETEMMKEQVNDLDKQKVALEESR 212
Cdd:pfam15921 317 RQLSDLESTVSQLRSELREAKRMY------EDKIEELEKQLVLANSELTEART----ERDQFSQESGNLDDQLQKLLADL 386
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 213 AQAEQALSEKESQIETLvtsllkmkdWAAVLG----------EADDGNLDLDMKSGLenTAALDNQPKGALKKLIYAAK- 281
Cdd:pfam15921 387 HKREKELSLEKEQNKRL---------WDRDTGnsitidhlrrELDDRNMEVQRLEAL--LKAMKSECQGQMERQMAAIQg 455
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 282 LNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKLhrklt 361
Cdd:pfam15921 456 KNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKL----- 530
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 362 vEENYRLEKEEklskvdEKISHATEELETCRQRAKDLE---EELERTIHSYQGQVISHEKKAHDNWLAARTLERNLNDLR 438
Cdd:pfam15921 531 -QELQHLKNEG------DHLRNVQTECEALKLQMAEKDkviEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRR 603
|
..
gi 259155338 439 KE 440
Cdd:pfam15921 604 LE 605
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
128-351 |
2.06e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 48.09 E-value: 2.06e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 128 MADISKRIQSLEDESKSLKSQVAEAKTTFRIFEINeerlkgaikdalNENSQLQESQKQLLQETEMMKEQVNDLDKQKVA 207
Cdd:COG3206 170 REEARKALEFLEEQLPELRKELEEAEAALEEFRQK------------NGLVDLSEEAKLLLQQLSELESQLAEARAELAE 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 208 LEESRAQAEQALSEKESQIETLVTSllkmkdwaAVLGEADDGNLDLDMKSGLENTAALDNQPKgalkkliyaaklnasLK 287
Cdd:COG3206 238 AEARLAALRAQLGSGPDALPELLQS--------PVIQQLRAQLAELEAELAELSARYTPNHPD---------------VI 294
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 259155338 288 ALEGERNQVYTQL-SEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQE 351
Cdd:COG3206 295 ALRAQIAALRAQLqQEAQRILASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLERE 359
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
61-454 |
2.47e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.75 E-value: 2.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 61 GREKKLALELSALiEEKCKLLDKVSIVQkeatyENLEQSKSKLEDEILLLEEKLEEERAKHSEqdELMADISK---RIQS 137
Cdd:PRK03918 345 KKLKELEKRLEEL-EERHELYEEAKAKK-----EELERLKKRLTGLTPEKLEKELEELEKAKE--EIEEEISKitaRIGE 416
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 138 LEDESKSLKSQVAEAKTTFRIF-----EINEE-------RLKGAIKDALNENSQLQESQKQL---LQETEMMKEQVNDLD 202
Cdd:PRK03918 417 LKKEIKELKKAIEELKKAKGKCpvcgrELTEEhrkelleEYTAELKRIEKELKEIEEKERKLrkeLRELEKVLKKESELI 496
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 203 KQKVALEESRAQAE-------QALSEKESQIETLVTSLLKMKdwaavlGEADDGNLDLDMKSGLEntaaldnqpkgalKK 275
Cdd:PRK03918 497 KLKELAEQLKELEEklkkynlEELEKKAEEYEKLKEKLIKLK------GEIKSLKKELEKLEELK-------------KK 557
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 276 LiyaAKLNASLKALEGERNQVYTQLSE-----VDQVKEDLTE-------------HIKSLESKQASLQSEKTEFESESQK 337
Cdd:PRK03918 558 L---AELEKKLDELEEELAELLKELEElgfesVEELEERLKElepfyneylelkdAEKELEREEKELKKLEEELDKAFEE 634
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 338 LQQKLKVITELyqENEMKLHRKLTVEENYRlEKEEKLSKVDEKISHATEELEtcrqRAKDLEEELERTIHSYQGQVISHE 417
Cdd:PRK03918 635 LAETEKRLEEL--RKELEELEKKYSEEEYE-ELREEYLELSRELAGLRAELE----ELEKRREEIKKTLEKLKEELEERE 707
|
410 420 430
....*....|....*....|....*....|....*..
gi 259155338 418 KKAHDNWLAARTLERnLNDLRKENAHNRQKLTETEFK 454
Cdd:PRK03918 708 KAKKELEKLEKALER-VEELREKVKKYKALLKERALS 743
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
131-235 |
2.73e-05 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 47.47 E-value: 2.73e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 131 ISKRIQSLEDESKSLKSQV---AEAKTTFRIFEINEERLK---GAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQ 204
Cdd:PRK12704 29 AEAKIKEAEEEAKRILEEAkkeAEAIKKEALLEAKEEIHKlrnEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108
|
90 100 110
....*....|....*....|....*....|....
gi 259155338 205 KVALEESR---AQAEQALSEKESQIETLVTSLLK 235
Cdd:PRK12704 109 EEELEKKEkelEQKQQELEKKEEELEELIEEQLQ 142
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
64-461 |
3.95e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.37 E-value: 3.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 64 KKLALELSALIEEKCKLLDKVSIVQKEAtyENLEqskskledeillleekleeerakhseqdELMADISKRIQSLEDESK 143
Cdd:PRK03918 289 KEKAEEYIKLSEFYEEYLDELREIEKRL--SRLE----------------------------EEINGIEERIKELEEKEE 338
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 144 SLKSQVAEAKTTFRIFEINEERLKgAIKDALNENSQLQESQKQLLQET-EMMKEQVNDLDKQKVALEESRAQAEQALSEK 222
Cdd:PRK03918 339 RLEELKKKLKELEKRLEELEERHE-LYEEAKAKKEELERLKKRLTGLTpEKLEKELEELEKAKEEIEEEISKITARIGEL 417
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 223 ESQIETLVTSLLKMKDWAAV-------LGEADDGNL----DLDMKSGLENTAALDNQPKgalkkliyaaKLNASLKALEG 291
Cdd:PRK03918 418 KKEIKELKKAIEELKKAKGKcpvcgreLTEEHRKELleeyTAELKRIEKELKEIEEKER----------KLRKELRELEK 487
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 292 ERNQVYTQLSEvdqvkEDLTEHIKSLESKQASLQSEK-----TEFE---SESQKLQQKLKVI-TELYQENEMKlhRKLTV 362
Cdd:PRK03918 488 VLKKESELIKL-----KELAEQLKELEEKLKKYNLEElekkaEEYEklkEKLIKLKGEIKSLkKELEKLEELK--KKLAE 560
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 363 EENYRLEKEEKLSKVDEKIshateeletcRQRAKDLEEELERTIHSYqgqvisheKKAHDNWLAARTLERNLNDLRKENA 442
Cdd:PRK03918 561 LEKKLDELEEELAELLKEL----------EELGFESVEELEERLKEL--------EPFYNEYLELKDAEKELEREEKELK 622
|
410
....*....|....*....
gi 259155338 443 HNRQKLTETEFKFELLEKD 461
Cdd:PRK03918 623 KLEEELDKAFEELAETEKR 641
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
55-439 |
6.60e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 46.50 E-value: 6.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 55 RSRFYVGREKKLA---------LELSALIEEKCKLLDKVSIVQKEATYENLEQSKSKLEDE----ILLLEEKLEEERAKH 121
Cdd:pfam02463 164 GSRLKRKKKEALKklieetenlAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEyllyLDYLKLNEERIDLLQ 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 122 SEQDELMADISKRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQ---LLQETEMMKEQV 198
Cdd:pfam02463 244 ELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRkvdDEEKLKESEKEK 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 199 NDLDKQKVALEESRAQAEQALSEKESQIETlvtsLLKMKDWAAVLGEADDGNLDLDMKSGLENTAALDNQPKGALKKLIY 278
Cdd:pfam02463 324 KKAEKELKKEKEEIEELEKELKELEIKREA----EEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELEL 399
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 279 AAK-LNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQ------------QKLKVI 345
Cdd:pfam02463 400 KSEeEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLkdelelkksedlLKETQL 479
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 346 TELYQENEMKLHRKLTVEENYRLEKEEKLSKVDEK--ISHATEELETCRQRAKDLEEELERTIHSYQGQVISHEKKAHDN 423
Cdd:pfam02463 480 VKLQEQLELLLSRQKLEERSQKESKARSGLKVLLAliKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADE 559
|
410
....*....|....*.
gi 259155338 424 WLAARTLERNLNDLRK 439
Cdd:pfam02463 560 VEERQKLVRALTELPL 575
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
64-343 |
7.93e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 46.21 E-value: 7.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 64 KKLALELSALIEEKCKLLDKVS-----IVQKEATYENLEQSKSklEDEILLLEEKLEEERAKHSEQDELMADISKRIQS- 137
Cdd:TIGR02169 747 SSLEQEIENVKSELKELEARIEeleedLHKLEEALNDLEARLS--HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRl 824
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 138 ------LEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEES 211
Cdd:TIGR02169 825 tlekeyLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERK 904
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 212 RAQAEQALSEKESQIETLVTSLlkmkdwAAVLGEADDgnLDLDMKSGLENTAALDNQPKGALKKLiyaaKLNASLKALEG 291
Cdd:TIGR02169 905 IEELEAQIEKKRKRLSELKAKL------EALEEELSE--IEDPKGEDEEIPEEELSLEDVQAELQ----RVEEEIRALEP 972
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 259155338 292 ERNQVytqLSEVDQVKEDLTEhiksLESKQASLQSEKTEFESESQKLQQKLK 343
Cdd:TIGR02169 973 VNMLA---IQEYEEVLKRLDE----LKEKRAKLEEERKAILERIEEYEKKKR 1017
|
|
| RNase_Y_N |
pfam12072 |
RNase Y N-terminal region; |
133-230 |
8.71e-05 |
|
RNase Y N-terminal region;
Pssm-ID: 463456 [Multi-domain] Cd Length: 201 Bit Score: 44.11 E-value: 8.71e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 133 KRIQSLEDESKSLKS---QVAEAKTTFRIFEINEE--RLKGAIKDALNE-NSQLQESQKQLLQETEMMKEQVNDLDKQKV 206
Cdd:pfam12072 27 AKIGSAEELAKRIIEeakKEAETKKKEALLEAKEEihKLRAEAERELKErRNELQRQERRLLQKEETLDRKDESLEKKEE 106
|
90 100
....*....|....*....|....*..
gi 259155338 207 AL---EESRAQAEQALSEKESQIETLV 230
Cdd:pfam12072 107 SLekkEKELEAQQQQLEEKEEELEELI 133
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
125-412 |
9.38e-05 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 46.05 E-value: 9.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 125 DELMADIS------KRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQV 198
Cdd:PRK01156 176 DMLRAEISnidyleEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYE 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 199 NDLDKQkvaleESRAQAEQALSEKESQIETLVTSLLKMKDWAA---VLGEADDGNLDLDMKSGLENTAALDNQPKGALKK 275
Cdd:PRK01156 256 SEIKTA-----ESDLSMELEKNNYYKELEERHMKIINDPVYKNrnyINDYFKYKNDIENKKQILSNIDAEINKYHAIIKK 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 276 LiyaaklnaslKALEGERNQVYTQLSEvdqvKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMK 355
Cdd:PRK01156 331 L----------SVLQKDYNDYIKKKSR----YDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEI 396
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 356 LHRKLTVEENYRLEKEEKLSKVDE---KISHATEELETCRQRakdlEEELERTIHSYQGQ 412
Cdd:PRK01156 397 LKIQEIDPDAIKKELNEINVKLQDissKVSSLNQRIRALREN----LDELSRNMEMLNGQ 452
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
286-466 |
1.25e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.53 E-value: 1.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 286 LKALEGERNQVYTQLSEVDQvkedLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVItELYQENEmKLHRKLTvEEN 365
Cdd:COG4717 73 LKELEEELKEAEEKEEEYAE----LQEELEELEEELEELEAELEELREELEKLEKLLQLL-PLYQELE-ALEAELA-ELP 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 366 YRLEK-EEKLskvdEKISHATEELETCRQRAKDLEEELERTIHSYQgqvISHEKKAHDNWLAARTLERNLNDLRKENAHN 444
Cdd:COG4717 146 ERLEElEERL----EELRELEEELEELEAELAELQEELEELLEQLS---LATEEELQDLAEELEELQQRLAELEEELEEA 218
|
170 180
....*....|....*....|..
gi 259155338 445 RQKLTETEFKFELLEKDPYALD 466
Cdd:COG4717 219 QEELEELEEELEQLENELEAAA 240
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
63-461 |
1.78e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 45.13 E-value: 1.78e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 63 EKKLALELSALIEEKCKLLDkvsIVQKEATYENLEQSKSKLEDEILLLEEKLEEERAKHSEQDELMADISKRIQSLEDES 142
Cdd:PTZ00121 1416 AKKKADEAKKKAEEKKKADE---AKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA 1492
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 143 KSLKSQVAEAKTTFRIFEINEERLKGAIKDALNENSQLQESQKQllqETEMMKEQVNDLDKQKVALEESRAQAEQALSEK 222
Cdd:PTZ00121 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKA---DEAKKAEEKKKADELKKAEELKKAEEKKKAEEA 1569
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 223 ESQIETLVTSLLKmkdwAAVLGEADDGNLDLDMKSGLENTAALDNQPKGALKKLIYAAKLnaslKALEGERNQVytqlse 302
Cdd:PTZ00121 1570 KKAEEDKNMALRK----AEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEEL----KKAEEEKKKV------ 1635
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 303 vDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKlhRKltVEENYRLEKEEKlsKVDEKIS 382
Cdd:PTZ00121 1636 -EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE--KK--AAEALKKEAEEA--KKAEELK 1708
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 383 HATEELETCRQRAKDLEEelERTIHSYQGQVISHEKKAHDNWLAARTLERN----LNDLRKENAHNRQKLTETEFKFELL 458
Cdd:PTZ00121 1709 KKEAEEKKKAEELKKAEE--ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKkiahLKKEEEKKAEEIRKEKEAVIEEELD 1786
|
...
gi 259155338 459 EKD 461
Cdd:PTZ00121 1787 EED 1789
|
|
| PRK14473 |
PRK14473 |
F0F1 ATP synthase subunit B; Provisional |
132-231 |
2.31e-04 |
|
F0F1 ATP synthase subunit B; Provisional
Pssm-ID: 172948 [Multi-domain] Cd Length: 164 Bit Score: 42.60 E-value: 2.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 132 SKRIQSLEDESKSLKSQVAEAKTTFrifeinEERLKGAIKDALNENSQLQESQKQllQETEMMKEQVNDLDKQKvalEES 211
Cdd:PRK14473 41 TRRIEESLRDAEKVREQLANAKRDY------EAELAKARQEAAKIVAQAQERARA--QEAEIIAQARREAEKIK---EEA 109
|
90 100
....*....|....*....|....
gi 259155338 212 RAQAEQ----ALSEKESQIETLVT 231
Cdd:PRK14473 110 RAQAEQerqrMLSELKSQIADLVT 133
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
118-454 |
2.63e-04 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 44.43 E-value: 2.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 118 RAKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKTTFrifEINEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQ 197
Cdd:pfam10174 351 RLRLEEKESFLNKKTKQLQDLTEEKSTLAGEIRDLKDML---DVKERKINVLQKKIENLQEQLRDKDKQLAGLKERVKSL 427
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 198 VNDLDKQKVALeesrAQAEQALSEKESQIETLVTSllKMKDWAAVLGEADDGNLDL-DMKSGL--------ENTAALDNQ 268
Cdd:pfam10174 428 QTDSSNTDTAL----TTLEEALSEKERIIERLKEQ--REREDRERLEELESLKKENkDLKEKVsalqpeltEKESSLIDL 501
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 269 PKGALKKLIYAAKLNASLKALEGE-----------RNQVYT--QLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESES 335
Cdd:pfam10174 502 KEHASSLASSGLKKDSKLKSLEIAveqkkeecsklENQLKKahNAEEAVRTNPEINDRIRLLEQEVARYKEESGKAQAEV 581
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 336 QKLQQKLKvitelyqenEMklhrkltveENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTihsyQGQVIS 415
Cdd:pfam10174 582 ERLLGILR---------EV---------ENEKNDKDKKIAELESLTLRQMKEQNKKVANIKHGQQEMKKK----GAQLLE 639
|
330 340 350
....*....|....*....|....*....|....*....
gi 259155338 416 HEKKAHDNwLAARTLERNLNDLRKENAHNRQKLTETEFK 454
Cdd:pfam10174 640 EARRREDN-LADNSQQLQLEELMGALEKTRQELDATKAR 677
|
|
| RecN |
COG0497 |
DNA repair ATPase RecN [Replication, recombination and repair]; |
169-461 |
2.69e-04 |
|
DNA repair ATPase RecN [Replication, recombination and repair];
Pssm-ID: 440263 [Multi-domain] Cd Length: 555 Bit Score: 44.30 E-value: 2.69e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 169 AIKDALNENSQLQESQKQLLQETEMMKEQVNDLDkqKVALEEsraQAEQALSEKE---SQIETLVTSLLKMkdwAAVLGE 245
Cdd:COG0497 166 AWRALKKELEELRADEAERARELDLLRFQLEELE--AAALQP---GEEEELEEERrrlSNAEKLREALQEA---LEALSG 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 246 ADDGNLDLdmksglentaaLDNqpkgALKKLIYAAKLNASLKALEGERNQVYTQLSEVdqvkedltehiksleskQASLQ 325
Cdd:COG0497 238 GEGGALDL-----------LGQ----ALRALERLAEYDPSLAELAERLESALIELEEA-----------------ASELR 285
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 326 SEKTEFESESQKLQQKLKVITELYQenemkLHRK--LTVEE--NYRLEKEEKLskvdEKISHATEELETCRQRAKDLEEE 401
Cdd:COG0497 286 RYLDSLEFDPERLEEVEERLALLRR-----LARKygVTVEEllAYAEELRAEL----AELENSDERLEELEAELAEAEAE 356
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 259155338 402 LERtihsyQGQVISHEKKAhdnwlAARTLE----RNLNDLRKENAhnrqkltetEFKFELLEKD 461
Cdd:COG0497 357 LLE-----AAEKLSAARKK-----AAKKLEkavtAELADLGMPNA---------RFEVEVTPLE 401
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
280-427 |
2.75e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 42.99 E-value: 2.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 280 AKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFES---------ESQKLQQKLKvitelYQ 350
Cdd:COG1579 27 KELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEqlgnvrnnkEYEALQKEIE-----SL 101
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 259155338 351 ENEMKLHRKLTVEENYRLE-KEEKLSKVDEKISHATEELETCRQRAKDLEEELERTIHSYQGQVISHEKKAHDNWLAA 427
Cdd:COG1579 102 KRRISDLEDEILELMERIEeLEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAAKIPPELLAL 179
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
125-409 |
2.76e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 44.66 E-value: 2.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 125 DELMADISKRiQSLEDESKSLKSQVAEAKTTFRIFE---------INEERLKGAIKDALNENSQL----QESQKQLLQET 191
Cdd:TIGR01612 1486 NELKEHIDKS-KGCKDEADKNAKAIEKNKELFEQYKkdvtellnkYSALAIKNKFAKTKKDSEIIikeiKDAHKKFILEA 1564
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 192 EMMKEQVNDLDKQKVALEESRA---QAEQALSEKESQIETLVTSLLKMKDwaavlgeaddgnldldmksglentaaldnq 268
Cdd:TIGR01612 1565 EKSEQKIKEIKKEKFRIEDDAAkndKSNKAAIDIQLSLENFENKFLKISD------------------------------ 1614
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 269 pkgaLKKliyaaKLNASLKALEGERNQVY--------TQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFEsesqKLQQ 340
Cdd:TIGR01612 1615 ----IKK-----KINDCLKETESIEKKISsfsidsqdTELKENGDNLNSLQEFLESLKDQKKNIEDKKKELD----ELDS 1681
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 259155338 341 KLKVItelyqENEMKLHRKltveeNYRLEKEEklsKVDEKISHATEELETCRQRakdLEEELERTIHSY 409
Cdd:TIGR01612 1682 EIEKI-----EIDVDQHKK-----NYEIGIIE---KIKEIAIANKEEIESIKEL---IEPTIENLISSF 1734
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
129-422 |
2.81e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 44.33 E-value: 2.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 129 ADISKRIQSLEDESKSLKSQvaeaKTTFRIFEINEERLKGAIKDALNENSQLQESQK------QLLQET----------- 191
Cdd:pfam05483 99 AELKQKENKLQENRKIIEAQ----RKAIQELQFENEKVSLKLEEEIQENKDLIKENNatrhlcNLLKETcarsaektkky 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 192 EMMKEQV--------NDLDKQKVALEESRAQAEQA-------------------------LSEKESQIETLVTSLL---- 234
Cdd:pfam05483 175 EYEREETrqvymdlnNNIEKMILAFEELRVQAENArlemhfklkedhekiqhleeeykkeINDKEKQVSLLLIQITeken 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 235 KMKDWAAVLGEADDGNLDLDMKSGL--ENTAALdNQPKGALKKLIYAAKLN-----ASLKALEgERNQVYTQlsEVDQVK 307
Cdd:pfam05483 255 KMKDLTFLLEESRDKANQLEEKTKLqdENLKEL-IEKKDHLTKELEDIKMSlqrsmSTQKALE-EDLQIATK--TICQLT 330
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 308 EDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKLhRKLTVEENYRLEKEEKLSKVDEKISHATEE 387
Cdd:pfam05483 331 EEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQL-KIITMELQKKSSELEEMTKFKNNKEVELEE 409
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 259155338 388 LETC----------RQRAKDLEEELERTIHSYQGQVISHEKKAHD 422
Cdd:pfam05483 410 LKKIlaedeklldeKKQFEKIAEELKGKEQELIFLLQAREKEIHD 454
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
128-459 |
3.71e-04 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 43.67 E-value: 3.71e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 128 MADISKRIQSLEDESKSLKsqVAEAKTtfrifEIN--EERLKGA---IKDALNENSQLQESQKQLLQETEMMKEQVNDLD 202
Cdd:PRK04778 81 LPDIEEQLFEAEELNDKFR--FRKAKH-----EINeiESLLDLIeedIEQILEELQELLESEEKNREEVEQLKDLYRELR 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 203 KQkvaLEESRAQAEQALSEKESQIETLVTSLLKMKDwaavlgEADDGN--------LDLDMKsglenTAALDNQ----PK 270
Cdd:PRK04778 154 KS---LLANRFSFGPALDELEKQLENLEEEFSQFVE------LTESGDyveareilDQLEEE-----LAALEQImeeiPE 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 271 GaLKKLiyAAKLNASLKALEgernQVYTQLSE---------VDQVKEDLTEHIKSLESKQASLqsEKTEFESESQKLQQK 341
Cdd:PRK04778 220 L-LKEL--QTELPDQLQELK----AGYRELVEegyhldhldIEKEIQDLKEQIDENLALLEEL--DLDEAEEKNEEIQER 290
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 342 lkvITELYQ--ENEMKLHRKLtveenyrlekEEKLSKVDEKISHATEEletcrqrAKDLEEELERTIHSYQgqvISHEKK 419
Cdd:PRK04778 291 ---IDQLYDilEREVKARKYV----------EKNSDTLPDFLEHAKEQ-------NKELKEEIDRVKQSYT---LNESEL 347
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 259155338 420 AHdnwlaARTLERNLNDLRKENAHNRQKLTETEFKFELLE 459
Cdd:PRK04778 348 ES-----VRQLEKQLESLEKQYDEITERIAEQEIAYSELQ 382
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
123-461 |
4.50e-04 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 43.66 E-value: 4.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 123 EQDELMADISKRIQSLEDESkslkSQVAEAKTTFrifeinEERL--KGAIKDALNEnsQLQESQKQLLQETEMMKEQVND 200
Cdd:pfam10174 412 DKDKQLAGLKERVKSLQTDS----SNTDTALTTL------EEALseKERIIERLKE--QREREDRERLEELESLKKENKD 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 201 LDKQKVALEESRAQAEQALSEKESQIETLVTSLLKmKDwaavlgeADDGNLDLDMKSGLENTAALDNQPKGALKKLIYA- 279
Cdd:pfam10174 480 LKEKVSALQPELTEKESSLIDLKEHASSLASSGLK-KD-------SKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAVr 551
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 280 --AKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESES--QKLQQKLKVITELYQENEMK 355
Cdd:pfam10174 552 tnPEINDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENEKNDKDKKIAELESLTlrQMKEQNKKVANIKHGQQEMK 631
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 356 LHRKLTVEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTihsyqgQVISHEKKAHdnwlaartlernLN 435
Cdd:pfam10174 632 KKGAQLLEEARRREDNLADNSQQLQLEELMGALEKTRQELDATKARLSST------QQSLAEKDGH------------LT 693
|
330 340 350
....*....|....*....|....*....|..
gi 259155338 436 DLRKEnahNRQKLTET-EFKFELL-----EKD 461
Cdd:pfam10174 694 NLRAE---RRKQLEEIlEMKQEALlaaisEKD 722
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
135-353 |
4.68e-04 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 43.75 E-value: 4.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 135 IQSLEDESKSLKSQVAEAKTTFRIFEiNEERLKGAIKDALNENSQLQesqKQLLQETEMMKEQVNDLDKQKVALEESRAQ 214
Cdd:PRK11281 41 VQAQLDALNKQKLLEAEDKLVQQDLE-QTLALLDKIDRQKEETEQLK---QQLAQAPAKLRQAQAELEALKDDNDEETRE 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 215 AEQALS--EKESQIETLVTSLlkmKDWAAVLGEAddgNLDL------------DMKSGLENTAALDNQPKG--ALKKLIY 278
Cdd:PRK11281 117 TLSTLSlrQLESRLAQTLDQL---QNAQNDLAEY---NSQLvslqtqperaqaALYANSQRLQQIRNLLKGgkVGGKALR 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 279 A---AKLNASLKALEGE----RN--QVYTQLSEVDQVKEDL-TEHIKSLESKQASLQ----------SEKTEFESESQKL 338
Cdd:PRK11281 191 PsqrVLLQAEQALLNAQndlqRKslEGNTQLQDLLQKQRDYlTARIQRLEHQLQLLQeainskrltlSEKTVQEAQSQDE 270
|
250
....*....|....*
gi 259155338 339 QQKLKVITELYQENE 353
Cdd:PRK11281 271 AARIQANPLVAQELE 285
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
126-321 |
5.06e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.75 E-value: 5.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 126 ELMADISKRIQSLEdESKSLKSQVAEAKTTFRIFEINEERL---KGAIKDALNEN--SQLQESQKQLLQETEMMKEQVND 200
Cdd:COG4913 242 EALEDAREQIELLE-PIRELAERYAAARERLAELEYLRAALrlwFAQRRLELLEAelEELRAELARLEAELERLEARLDA 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 201 LDKQKVALEESRAQA--------EQALSEKESQIETLVTSLLKMKDWAAVLGEAddgnLDLDMKSGLENTAALDNQpkga 272
Cdd:COG4913 321 LREELDELEAQIRGNggdrleqlEREIERLERELEERERRRARLEALLAALGLP----LPASAEEFAALRAEAAAL---- 392
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 259155338 273 lkkliyAAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQ 321
Cdd:COG4913 393 ------LEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRK 435
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
130-404 |
5.55e-04 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 43.49 E-value: 5.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 130 DISKRIQSLEDESKSLKSQVAEAKTTFRifeineerlkgAIKDALNENSQLQESQK-----QLLQETEMMkEQVNDLDKQ 204
Cdd:PRK02224 409 NAEDFLEELREERDELREREAELEATLR-----------TARERVEEAEALLEAGKcpecgQPVEGSPHV-ETIEEDRER 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 205 KVALEESRAQAEQALSEKESQIETLvTSLLKMKDWAAVLGEADDGNLDL--DMKSGLENTA-ALDNQPKGALKKLIYAAK 281
Cdd:PRK02224 477 VEELEAELEDLEEEVEEVEERLERA-EDLVEAEDRIERLEERREDLEELiaERRETIEEKReRAEELRERAAELEAEAEE 555
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 282 LNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLEsKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKL----H 357
Cdd:PRK02224 556 KREAAAEAEEEAEEAREEVAELNSKLAELKERIESLE-RIRTLLAAIADAEDEIERLREKREALAELNDERRERLaekrE 634
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 259155338 358 RKLTVEENY---RLEK--------EEKLSKVDEKISHATE-----------------ELETCRQRAKDLEEELER 404
Cdd:PRK02224 635 RKRELEAEFdeaRIEEaredkeraEEYLEQVEEKLDELREerddlqaeigaveneleELEELRERREALENRVEA 709
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
273-419 |
6.50e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 42.22 E-value: 6.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 273 LKKLIYAAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITEL---- 348
Cdd:COG1579 6 LRALLDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgnv 85
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 259155338 349 -----YQ--ENEM-KLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTIHSYQGQVISHEKK 419
Cdd:COG1579 86 rnnkeYEalQKEIeSLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAE 164
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
122-450 |
6.55e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 43.50 E-value: 6.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 122 SEQDELMADISKRIQSLEDESKSLKSQVAEAkttfrifeinEERLKGAIKDALNENSQLQESQKQLLQETEMMKEQVNDL 201
Cdd:TIGR00606 691 AELQEFISDLQSKLRLAPDKLKSTESELKKK----------EKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDI 760
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 202 DKQKVALEESRAQAEQALSEKESQiETLVTSLLKMKdwaavlgeaddgNLDLDMKSGLENTAALDNQPKGALKKLIYAaK 281
Cdd:TIGR00606 761 QRLKNDIEEQETLLGTIMPEEESA-KVCLTDVTIME------------RFQMELKDVERKIAQQAAKLQGSDLDRTVQ-Q 826
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 282 LNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQ-------KLKVITELYQENEM 354
Cdd:TIGR00606 827 VNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQfeeqlveLSTEVQSLIREIKD 906
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 355 KLHRKLTVEENyrleKEEKLSKVDEKISHATEELETCRQRAKDLEEELERtIHSYQGQVISHEKKAHDNWLAARTLERN- 433
Cdd:TIGR00606 907 AKEQDSPLETF----LEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKN-IHGYMKDIENKIQDGKDDYLKQKETELNt 981
|
330
....*....|....*..
gi 259155338 434 LNDLRKENAHNRQKLTE 450
Cdd:TIGR00606 982 VNAQLEECEKHQEKINE 998
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
179-460 |
7.57e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 43.11 E-value: 7.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 179 QLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDWAAVlgEADDGNLDLDMKSG 258
Cdd:TIGR00606 574 QLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDE--ESDLERLKEEIEKS 651
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 259 LENTAALdnqpkgALKKLIYAAKLNASLkalegERNQVYTQLSEVD-QVKEDLTEHIKSLESKQASLQSEKTEFESESQK 337
Cdd:TIGR00606 652 SKQRAML------AGATAVYSQFITQLT-----DENQSCCPVCQRVfQTEAELQEFISDLQSKLRLAPDKLKSTESELKK 720
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 338 LQQKLKVITELY--QENEMKLHRKLTVEENYRLEK-EEKLSKVDEKISHATEELETC---RQRAKDLEEELErTIHSYQG 411
Cdd:TIGR00606 721 KEKRRDEMLGLApgRQSIIDLKEKEIPELRNKLQKvNRDIQRLKNDIEEQETLLGTImpeEESAKVCLTDVT-IMERFQM 799
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 259155338 412 QVISHEKKAHDnwLAART----LERNLNDLRKENAHNRQKLTETEFKFELLEK 460
Cdd:TIGR00606 800 ELKDVERKIAQ--QAAKLqgsdLDRTVQQVNQEKQEKQHELDTVVSKIELNRK 850
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
292-383 |
8.26e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.84 E-value: 8.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 292 ERNQVYTQLSE-VDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMKLHR--KLTVEEnyrl 368
Cdd:PRK12704 79 ERRNELQKLEKrLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERisGLTAEE---- 154
|
90
....*....|....*
gi 259155338 369 EKEEKLSKVDEKISH 383
Cdd:PRK12704 155 AKEILLEKVEEEARH 169
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
118-300 |
1.31e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 1.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 118 RAKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKttfRIFEINEERLKgaIKDALNENSQLQESQKQLLQ---ETEMM 194
Cdd:COG4913 616 EAELAELEEELAEAEERLEALEAELDALQERREALQ---RLAEYSWDEID--VASAEREIAELEAELERLDAssdDLAAL 690
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 195 KEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSL--LKMKDWAAVLGEADDGNLDLDMKSGLENTAALDNQpkga 272
Cdd:COG4913 691 EEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELdeLQDRLEAAEDLARLELRALLEERFAAALGDAVERE---- 766
|
170 180
....*....|....*....|....*...
gi 259155338 273 lkkliYAAKLNASLKALEGERNQVYTQL 300
Cdd:COG4913 767 -----LRENLEERIDALRARLNRAEEEL 789
|
|
| Spc7 |
smart00787 |
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ... |
281-404 |
1.73e-03 |
|
Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.
Pssm-ID: 197874 [Multi-domain] Cd Length: 312 Bit Score: 41.16 E-value: 1.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 281 KLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKV-ITELYQENEMKLhRK 359
Cdd:smart00787 148 GLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEkLKKLLQEIMIKV-KK 226
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 259155338 360 LTveenyrlEKEEKLSKVDEKISHATEELETCRQRAKDLEEELER 404
Cdd:smart00787 227 LE-------ELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQ 264
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
69-418 |
2.30e-03 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 41.48 E-value: 2.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 69 ELSALIEEKCKLLDKVSIVQKEATyENLEQSKSKLEDEILlleekleeeraKHSEQ-DELMADISKRiQSLEDESKSLKS 147
Cdd:COG5185 254 KLEKLVEQNTDLRLEKLGENAESS-KRLNENANNLIKQFE-----------NTKEKiAEYTKSIDIK-KATESLEEQLAA 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 148 qvAEAKTTFrifeineERLKgaikdaLNENSQLQESQKQLLQETEMMKEQVNDLDKQK--VALEESRAQAEQALSEKESQ 225
Cdd:COG5185 321 --AEAEQEL-------EESK------RETETGIQNLTAEIEQGQESLTENLEAIKEEIenIVGEVELSKSSEELDSFKDT 385
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 226 IETLVTSLL-KMKDWAAVLGEADDgNLDLDMKSGLENTAALDNQPKGALKKLIYAAKLnasLKALEGERNQVYTQLSevD 304
Cdd:COG5185 386 IESTKESLDeIPQNQRGYAQEILA-TLEDTLKAADRQIEELQRQIEQATSSNEEVSKL---LNELISELNKVMREAD--E 459
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 305 QVKEDLTEHIKSLESkqaSLQSEKTEFESESQKLQQKLKVITELYQENEMKLHRKLTVEENYRLEKEEKL--SKVDEKIS 382
Cdd:COG5185 460 ESQSRLEEAYDEINR---SVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQLEGVRSKLDQVAESLkdFMRARGYA 536
|
330 340 350
....*....|....*....|....*....|....*.
gi 259155338 383 HATEElETCRQRAKDLEEELERTIhSYQGQVISHEK 418
Cdd:COG5185 537 HILAL-ENLIPASELIQASNAKTD-GQAANLRTAVI 570
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
70-404 |
2.49e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 41.26 E-value: 2.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 70 LSALIEEKCKLLDKV--SIVQKEATYENLEQSKSKLEDEILLLEEKLeeeRAKHSEQDELMADISKRIQSLED------- 140
Cdd:pfam15921 466 LTAQLESTKEMLRKVveELTAKKMTLESSERTVSDLTASLQEKERAI---EATNAEITKLRSRVDLKLQELQHlknegdh 542
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 141 ------ESKSLKSQVAEAKTTFRIFEINEE----------RLKGAIkdaLNENSQLQES---QKQLLQETEMMKEQ---- 197
Cdd:pfam15921 543 lrnvqtECEALKLQMAEKDKVIEILRQQIEnmtqlvgqhgRTAGAM---QVEKAQLEKEindRRLELQEFKILKDKkdak 619
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 198 -------VNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDWAAVLGEaDDGNLDLDMKSGLENTAALDNQPK 270
Cdd:pfam15921 620 irelearVSDLELEKVKLVNAGSERLRAVKDIKQERDQLLNEVKTSRNELNSLSE-DYEVLKRNFRNKSEEMETTTNKLK 698
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 271 GALKKLIYA-AKLNASLKALEGE-----------RNQVYTQLSEVDQvkedLTEHIKSLESKQASLQSEKTEFESESQKL 338
Cdd:pfam15921 699 MQLKSAQSElEQTRNTLKSMEGSdghamkvamgmQKQITAKRGQIDA----LQSKIQFLEEAMTNANKEKHFLKEEKNKL 774
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 259155338 339 QQKLKVITElyQENEMKLHRKLTVEENYRLekEEKLSKVDEKISHATEELETCRQRAKDLEEELER 404
Cdd:pfam15921 775 SQELSTVAT--EKNKMAGELEVLRSQERRL--KEKVANMEVALDKASLQFAECQDIIQRQEQESVR 836
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
69-450 |
4.61e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 40.54 E-value: 4.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 69 ELSALIEEKCKLLDKVSIVQKEatYENLEQSKSKLEDEILLLEEKLEEERAKHSEQDELMADISKRIQSLEDESKSLKSQ 148
Cdd:pfam01576 6 EMQAKEEELQKVKERQQKAESE--LKELEKKHQQLCEEKNALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESR 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 149 VAEAKTTFRIFEINEERLKGAIKDaLNENSQLQESQKQLLQ---------------ETEMMKEQVNDLDKQKVALEESRA 213
Cdd:pfam01576 84 LEEEEERSQQLQNEKKKMQQHIQD-LEEQLDEEEAARQKLQlekvtteakikkleeDILLLEDQNSKLSKERKLLEERIS 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 214 QAEQALSEKESQIETLvtSLLKMKDWAavlgeaddgnldldMKSGLENTAALDNQPKGALKKLiyaaklnasLKALEGER 293
Cdd:pfam01576 163 EFTSNLAEEEEKAKSL--SKLKNKHEA--------------MISDLEERLKKEEKGRQELEKA---------KRKLEGES 217
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 294 NQVYTQLSEVDQVKEDLTehiKSLESKQASLQSEKTEFESESQKLQQKLKVITELyQENEMKLHRKLTVEENYRlEKEEK 373
Cdd:pfam01576 218 TDLQEQIAELQAQIAELR---AQLAKKEEELQAALARLEEETAQKNNALKKIREL-EAQISELQEDLESERAAR-NKAEK 292
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 374 LSKvdekisHATEELETCRqraKDLEEELERT-----IHSYQGQVISHEKKAHDNwlAARTLERNLNDLRKENAHNRQKL 448
Cdd:pfam01576 293 QRR------DLGEELEALK---TELEDTLDTTaaqqeLRSKREQEVTELKKALEE--ETRSHEAQLQEMRQKHTQALEEL 361
|
..
gi 259155338 449 TE 450
Cdd:pfam01576 362 TE 363
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
280-410 |
4.74e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 39.89 E-value: 4.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 280 AKLNASLKALEGERNQVYTQL----SEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQENEMK 355
Cdd:COG4372 48 EQLREELEQAREELEQLEEELeqarSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDL 127
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 259155338 356 lhrkltveENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTIHSYQ 410
Cdd:COG4372 128 --------EQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ 174
|
|
| HMMR_N |
pfam15905 |
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ... |
166-412 |
5.05e-03 |
|
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.
Pssm-ID: 464932 [Multi-domain] Cd Length: 329 Bit Score: 39.79 E-value: 5.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 166 LKGAIKDALNENSQLQESQKQ---LLQETEMMKEQVNDLDKQKVALEesrAQAEQALSEKESQIETlVTSLLKmkdwaaV 242
Cdd:pfam15905 61 LKKKSQKNLKESKDQKELEKEiraLVQERGEQDKRLQALEEELEKVE---AKLNAAVREKTSLSAS-VASLEK------Q 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 243 LGEADDGNLDLDMKSGLENTAALDNQPKGALKKLiyAAKLNASLKALEGERNQVYTQLSEVDQVKEDLTEHIKSLESKQA 322
Cdd:pfam15905 131 LLELTRVNELLKAKFSEDGTQKKMSSLSMELMKL--RNKLEAKMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEEKLV 208
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 323 SLQSEKTEFESESQKLqqkLKVITELYQENE--MKLHRKLTVEENYRLEKEEKL----SKVDEKISHATEELETCRQRAK 396
Cdd:pfam15905 209 STEKEKIEEKSETEKL---LEYITELSCVSEqvEKYKLDIAQLEELLKEKNDEIeslkQSLEEKEQELSKQIKDLNEKCK 285
|
250
....*....|....*.
gi 259155338 397 DLEEELERTIHSYQGQ 412
Cdd:pfam15905 286 LLESEKEELLREYEEK 301
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
308-419 |
6.79e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 39.76 E-value: 6.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 308 EDLTEHIksLESKQASLQSEKTEFESESQKLQQKLKviTELYQENEMKLhRKLTVEENYRLEKEEKLSKVDEKISHATEE 387
Cdd:PRK12704 37 EEEAKRI--LEEAKKEAEAIKKEALLEAKEEIHKLR--NEFEKELRERR-NELQKLEKRLLQKEENLDRKLELLEKREEE 111
|
90 100 110
....*....|....*....|....*....|..
gi 259155338 388 LETCRQRAKDLEEELERTIHSYQGQVISHEKK 419
Cdd:PRK12704 112 LEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
|
|
| ATG16 |
pfam08614 |
Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for ... |
279-443 |
6.83e-03 |
|
Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway.
Pssm-ID: 462536 [Multi-domain] Cd Length: 176 Bit Score: 38.37 E-value: 6.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 279 AAKLNASLKALEGERNQVYTQLSEVD-QVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITElyqenemKLH 357
Cdd:pfam08614 16 TALLEAENAKLQSEPESVLPSTSSSKlSKASPQSASIQSLEQLLAQLREELAELYRSRGELAQRLVDLNE-------ELQ 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 358 rkltveenyrlEKEEKLSKVDEKISHATEELETCRQRAKDLEEEL---ERTIHSYQGQVISHekkahdnWLAARTLERNL 434
Cdd:pfam08614 89 -----------ELEKKLREDERRLAALEAERAQLEEKLKDREEELrekRKLNQDLQDELVAL-------QLQLNMAEEKL 150
|
....*....
gi 259155338 435 NDLRKENAH 443
Cdd:pfam08614 151 RKLEKENRE 159
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
122-364 |
6.86e-03 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 39.61 E-value: 6.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 122 SEQDELMADiskRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALNEN----SQLQESQKQLLQETEMMKEQ 197
Cdd:PHA02562 166 SEMDKLNKD---KIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKqnkyDELVEEAKTIKAEIEELTDE 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 198 VNDLDKQKVALEESRAQAEQALSEKESQIETLvTSLLKM--------------KDWAAVLGEADDGNLDLDMKSGLENTA 263
Cdd:PHA02562 243 LLNLVMDIEDPSAALNKLNTAAAKIKSKIEQF-QKVIKMyekggvcptctqqiSEGPDRITKIKDKLKELQHSLEKLDTA 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 264 ALDNQpkgalKKLIYAAKLNASLKALEGERNQVYTQLS-EVDQvkedltehIKSLESKQASLQSEKTEFESESQKLQQKL 342
Cdd:PHA02562 322 IDELE-----EIMDEFNEQSKKLLELKNKISTNKQSLItLVDK--------AKKVKAAIEELQAEFVDNAEELAKLQDEL 388
|
250 260
....*....|....*....|..
gi 259155338 343 KVITELYQENEMKLHRKLTVEE 364
Cdd:PHA02562 389 DKIVKTKSELVKEKYHRGIVTD 410
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
174-452 |
7.15e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 39.95 E-value: 7.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 174 LNENSQLQESQKQLLQETEMMKEQVNDLDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDWAAVLGEADDGNLDL 253
Cdd:TIGR00618 534 EQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACE 613
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 254 DMKSGLENTAALDNQPKGALKKLIyAAKLNASLKALEGErnqvytqlsEVDQVKEDLTEHIKSLESKQASLQSEKTEFES 333
Cdd:TIGR00618 614 QHALLRKLQPEQDLQDVRLHLQQC-SQELALKLTALHAL---------QLTLTQERVREHALSIRVLPKELLASRQLALQ 683
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 334 ESQKLQQKLKVITELYQENEMKLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELERTIhsyQGQV 413
Cdd:TIGR00618 684 KMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVL---KART 760
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 259155338 414 ISHEKKAHDNWLAART------LERNLNDLRKENAHNRQKLTETE 452
Cdd:TIGR00618 761 EAHFNNNEEVTAALQTgaelshLAAEIQFFNRLREEDTHLLKTLE 805
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
123-399 |
7.28e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 39.12 E-value: 7.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 123 EQDELMADISKRIQSLEDESKSLKSQVAEAKttfrifEINEERlkgaikDALNEN-SQLQESQKQLLQETEMMKEQVNDL 201
Cdd:COG1340 30 KRDELNEELKELAEKRDELNAQVKELREEAQ------ELREKR------DELNEKvKELKEERDELNEKLNELREELDEL 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 202 DKQKVALEESR---AQAEQALSEKESQIETLVTSL-------LKMKDWAAVLGEADdgnldlDMKSGLENTAALDNQPKG 271
Cdd:COG1340 98 RKELAELNKAGgsiDKLRKEIERLEWRQQTEVLSPeeekelvEKIKELEKELEKAK------KALEKNEKLKELRAELKE 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 272 ALKKliyAAKLNASLKALEGERNQVYTQLSE----VDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLKVITE 347
Cdd:COG1340 172 LRKE---AEEIHKKIKELAEEAQELHEEMIElykeADELRKEADELHKEIVEAQEKADELHEEIIELQKELRELRKELKK 248
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 259155338 348 LY-QENEMKLHRKLTVEENYRLEKEEKLSKvDEKIShaTEELETCrQRAKDLE 399
Cdd:COG1340 249 LRkKQRALKREKEKEELEEKAEEIFEKLKK-GEKLT--TEELKLL-QKSGLLE 297
|
|
| PTZ00440 |
PTZ00440 |
reticulocyte binding protein 2-like protein; Provisional |
121-451 |
8.43e-03 |
|
reticulocyte binding protein 2-like protein; Provisional
Pssm-ID: 240419 [Multi-domain] Cd Length: 2722 Bit Score: 39.81 E-value: 8.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 121 HSEQDELMADISKRIQSLEDESKSLKSQVAEAKTTFRIFEINEERLKGAIKDALnENSQLQESQKQLLQETEMMKEQVND 200
Cdd:PTZ00440 438 NLEIIEIKKKYDEKINELKKSINQLKTLISIMKSFYDLIISEKDSMDSKEKKES-SDSNYQEKVDELLQIINSIKEKNNI 516
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 201 LDKQKVALEESRAQAEQALSEKESQIETLVTSLLKMKDWAavlgeaDDGNLDLDMKSGLENtaaldnqpkgalkKLIYAa 280
Cdd:PTZ00440 517 VNNNFKNIEDYYITIEGLKNEIEGLIELIKYYLQSIETLI------KDEKLKRSMKNDIKN-------------KIKYI- 576
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 281 klnaslkalegerNQVYTQLSEVDQVKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKLK-VITELYQENEMKLHRK 359
Cdd:PTZ00440 577 -------------EENVDHIKDIISLNDEIDNIIQQIEELINEALFNKEKFINEKNDLQEKVKyILNKFYKGDLQELLDE 643
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 360 LT--VEENYRLEKEeklskvdekiSHATEELETCRQRAKDLEEELERTIHSYQGQVISHEKKAHDNWLAAR--TLERNLN 435
Cdd:PTZ00440 644 LShfLDDHKYLYHE----------AKSKEDLQTLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKenIIKKQLN 713
|
330
....*....|....*.
gi 259155338 436 DLRKENAHNRQKLTET 451
Cdd:PTZ00440 714 NIEQDISNSLNQYTIK 729
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
301-404 |
8.51e-03 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 39.53 E-value: 8.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 301 SEVDQVKEDLTE----HIkslESKQASLQSEKT-EFESESQKLQQKLKVITELYQENEMKLHRKLTVEENyRLEK----- 370
Cdd:PRK05771 16 SYKDEVLEALHElgvvHI---EDLKEELSNERLrKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVK-SLEElikdv 91
|
90 100 110
....*....|....*....|....*....|....
gi 259155338 371 EEKLSKVDEKISHATEELETCRQRAKDLEEELER 404
Cdd:PRK05771 92 EEELEKIEKEIKELEEEISELENEIKELEQEIER 125
|
|
| TPR_MLP1_2 |
pfam07926 |
TPR/MLP1/MLP2-like protein; The sequences featured in this family are similar to a region of ... |
314-404 |
8.78e-03 |
|
TPR/MLP1/MLP2-like protein; The sequences featured in this family are similar to a region of human TPR protein and to yeast myosin-like proteins 1 (MLP1) and 2 (MLP2). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores. TPR is thought to be a component of nuclear pore complex- attached intra-nuclear filaments, and is implicated in nuclear protein import. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerization of kinase domains or by targeting these kinases to the nuclear pore complex. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity.
Pssm-ID: 462316 [Multi-domain] Cd Length: 129 Bit Score: 37.23 E-value: 8.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 259155338 314 IKSLESKQASLQSEKTEFESESQKLQQKLKVITELYQ------ENEMKLH----RKLTVEENYRLEKEEKLSKVDEKISH 383
Cdd:pfam07926 3 LSSLQSEIKRLKEEAADAEAQLQKLQEDLEKQAEIAReaqqnyERELVLHaediKALQALREELNELKAEIAELKAEAES 82
|
90 100
....*....|....*....|.
gi 259155338 384 ATEELETCRQRAKDLEEELER 404
Cdd:pfam07926 83 AKAELEESEESWEEQKKELEK 103
|
|
|