|
Name |
Accession |
Description |
Interval |
E-value |
| CASP_C |
pfam08172 |
CASP C terminal; This domain is the C-terminal region of the CASP family of proteins. It is a ... |
506-726 |
5.64e-68 |
|
CASP C terminal; This domain is the C-terminal region of the CASP family of proteins. It is a Golgi membrane protein which is thought to have a role in vesicle transport.
Pssm-ID: 462392 [Multi-domain] Cd Length: 247 Bit Score: 224.47 E-value: 5.64e-68
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 506 ELQIHLTEATAKAVEQKELIARLEQDLSTIQSIQR--PDAEGASEQGLEKIPEPIKEATAlfyGPSMSSSGTLPEGQV-- 581
Cdd:pfam08172 1 TLQEELSSLNAELEEQQELNAKLENDLLKVQDEASnaFSFNDASSAGSGVSRYPPSGGRR---SPTSSIISGFEPSESss 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 582 ---DSLLSIISSQRERFRTRNQELEAESRMAQHTIQALQSELDSLRADNIKLFEKIKFLQSYPGRGIGSDDTEL------ 652
Cdd:pfam08172 78 ssdSSILPIVTSQRDRFRQRNAELEEELRKQFETISSLRQEIASLQKDNLKLYEKTRYLQSYNRGGGGGTKSSSstsssa 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 653 ----------------RYSSQYEERLDPFSSFSKRERQRKYLGLSPWDKATLGMGRLILSNKMARTISFFYTLFLHCLVF 716
Cdd:pfam08172 158 saygnnpnpsdvealdKYRKAYEESLNPFAAFRGRESERAYKRLSPLERLVLSLTRLVLANRTSRNLFFFYCLALHLLVF 237
|
250
....*....|
gi 601984296 717 LVLYKLAWSE 726
Cdd:pfam08172 238 FTLYYVSNSS 247
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
192-460 |
4.31e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.17 E-value: 4.31e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 192 DVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIA--LEKEQKLQNDFA 269
Cdd:TIGR02168 681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEerIAQLSKELTELE 760
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 270 EKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDE----------ETTAKADEIEMIMTDLERANQRA 339
Cdd:TIGR02168 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDElraeltllneEAANLRERLESLERRIAATERRL 840
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 340 EVAQREAETLREQLSSANHSLqlaSQIQKAPDVAIEVLTrsSLEVELAAKEREIAQLVEDVQRLQASLTKLREnsasQIS 419
Cdd:TIGR02168 841 EDLEEQIEELSEDIESLAAEI---EELEELIEELESELE--ALLNERASLEEALALLRSELEELSEELRELES----KRS 911
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 601984296 420 QLEQQLNAKNSTLKQLEEKLKG-QADYEEVKKELNTLKSMEF 460
Cdd:TIGR02168 912 ELRRELEELREKLAQLELRLEGlEVRIDNLQERLSEEYSLTL 953
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
208-540 |
3.37e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 67.00 E-value: 3.37e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 208 ETENQ--KLRETLEEYNKEFAEVKNQEVTIKALKEKIREYeQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSK 285
Cdd:TIGR02168 176 ETERKleRTRENLDRLEDILNELERQLKSLERQAEKAERY-KELKAELRELELALLVLRLEELREELEELQEELKEAEEE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 286 LEEAEHKLQTLQTALEKTRTELFDLKTKYDEETT---AKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQL 362
Cdd:TIGR02168 255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKelyALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 363 ASQIQKAPDVAIEVLTRSSleVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQ 442
Cdd:TIGR02168 335 LAEELAELEEKLEELKEEL--ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERL 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 443 ADYEEVKKELNTLKSMEFAPSEGAGTQDSTKPLEVLLLEknrsLQSENATLRISNSDLSGRCAELQIHLTEATAKAVEQK 522
Cdd:TIGR02168 413 EDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEE----LQEELERLEEALEELREELEEAEQALDAAERELAQLQ 488
|
330
....*....|....*...
gi 601984296 523 ELIARLEQDLSTIQSIQR 540
Cdd:TIGR02168 489 ARLDSLERLQENLEGFSE 506
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
196-447 |
3.60e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 66.88 E-value: 3.60e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 196 QLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREyEQTLKSQAETIALEKEQKLQNDFAEKERKL 275
Cdd:COG1196 226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRL-ELEELELELEEAQAEEYELLAELARLEQDI 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 276 ---QETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMimtdLERANQRAEVAQREAETLREQ 352
Cdd:COG1196 305 arlEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAE----LAEAEEALLEAEAELAEAEEE 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 353 LSSANHSLQ--LASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNS 430
Cdd:COG1196 381 LEELAEELLeaLRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEA 460
|
250
....*....|....*..
gi 601984296 431 TLKQLEEKLKGQADYEE 447
Cdd:COG1196 461 LLELLAELLEEAALLEA 477
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
197-537 |
8.23e-10 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 62.39 E-value: 8.23e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 197 LEIKVQRLHDIETEnqklRETLEEYnkefaevknqevtiKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQ 276
Cdd:TIGR02169 193 IDEKRQQLERLRRE----REKAERY--------------QALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 277 ETQMSTTSKLEEAEHKLQTLQTALEKtrtelfdLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSsa 356
Cdd:TIGR02169 255 KLTEEISELEKRLEEIEQLLEELNKK-------IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLA-- 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 357 nhslQLASQIQKApdvaieVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRensaSQISQLEQQLNAKNSTLKQLE 436
Cdd:TIGR02169 326 ----KLEAEIDKL------LAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR----AELEEVDKEFAETRDELKDYR 391
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 437 EKLkgqadyEEVKKELNTLKsmefapsegaGTQDStkplevlLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATA 516
Cdd:TIGR02169 392 EKL------EKLKREINELK----------RELDR-------LQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448
|
330 340
....*....|....*....|.
gi 601984296 517 KAVEQKELIARLEQDLSTIQS 537
Cdd:TIGR02169 449 EIKKQEWKLEQLAADLSKYEQ 469
|
|
| HOOK |
pfam05622 |
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ... |
143-440 |
3.31e-09 |
|
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.
Pssm-ID: 461694 [Multi-domain] Cd Length: 528 Bit Score: 60.09 E-value: 3.31e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 143 EDLRKQVAPLL---KSFQGEIDALSKRSKEAEAafltvykrlIDVPDPVPA---LDVGQQLEIKVQRLHDIETENQKLRE 216
Cdd:pfam05622 17 HELDQQVSLLQeekNSLQQENKKLQERLDQLES---------GDDSGTPGGkkyLLLQKQLEQLQEENFRLETARDDYRI 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 217 TLEEYNKEFAE--VKNQEVT-----IKALKEKIREYEQTLK--SQAETIALEKEQKLQ--NDFAEKERKLQETQ---MST 282
Cdd:pfam05622 88 KCEELEKEVLElqHRNEELTslaeeAQALKDEMDILRESSDkvKKLEATVETYKKKLEdlGDLRRQVKLLEERNaeyMQR 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 283 TSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTaKADEIEMIMTDLEranQRAEVAQREAETL---REQLSSANHS 359
Cdd:pfam05622 168 TLQLEEELKKANALRGQLETYKRQVQELHGKLSEESK-KADKLEFEYKKLE---EKLEALQKEKERLiieRDTLRETNEE 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 360 LQLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRE----NSASQISQLEQQLNAKNSTLKQL 435
Cdd:pfam05622 244 LRCAQLQQAELSQADALLSPSSDPGDNLAAEIMPAEIREKLIRLQHENKMLRLgqegSYRERLTELQQLLEDANRRKNEL 323
|
....*
gi 601984296 436 EEKLK 440
Cdd:pfam05622 324 ETQNR 328
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
190-417 |
5.20e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 59.01 E-value: 5.20e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 190 ALDVGQQLEIKVQRLH-DIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIAL--EKEQKLQN 266
Cdd:COG4942 18 QADAAAEAEAELEQLQqEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAEleKEIAELRA 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 267 DFAEKERKLQET-----QMSTTSKLEEAEHKLQTLQTAlekTRTELFDLKTKYDEEttaKADEIEMIMTDLERANQRAEV 341
Cdd:COG4942 98 ELEAQKEELAELlralyRLGRQPPLALLLSPEDFLDAV---RRLQYLKYLAPARRE---QAEELRADLAELAALRAELEA 171
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984296 342 AQREAETLREQLSSANHSLQLASQIQKApdvaievlTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQ 417
Cdd:COG4942 172 ERAELEALLAELEEERAALEALKAERQK--------LLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
152-457 |
7.35e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 59.31 E-value: 7.35e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 152 LLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPALDvgQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVknq 231
Cdd:TIGR02169 689 ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE--QEEEKLKERLEELEEDLSSLEQEIENVKSELKEL--- 763
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 232 EVTIKALKEKIREYEQTLKSQAETIALEKEQKLQN--DFAEKERKLQETQmstTSKLEEAEHKLQTLQTALEKTRTELFD 309
Cdd:TIGR02169 764 EARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAelSKLEEEVSRIEAR---LREIEQKLNRLTLEKEYLEKEIQELQE 840
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 310 LKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSS-----ANHSLQLASQIQKAPDVAIEVLTRSSLEV 384
Cdd:TIGR02169 841 QRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDlkkerDELEAQLRELERKIEELEAQIEKKRKRLS 920
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984296 385 ELAAKEREIAQLVEDVQRLQASLTKLRENSASqISQLEQQLNAKNSTLKQLEE-KLKGQADYEEVKKELNTLKS 457
Cdd:TIGR02169 921 ELKAKLEALEEELSEIEDPKGEDEEIPEEELS-LEDVQAELQRVEEEIRALEPvNMLAIQEYEEVLKRLDELKE 993
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
104-401 |
1.07e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 58.79 E-value: 1.07e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 104 KLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPL---LKSFQGEIDALSKRSKEAEAAFLTVYKR 180
Cdd:COG1196 217 ELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELeaeLEELRLELEELELELEEAQAEEYELLAE 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 181 LIDVpdpvpaldvGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETI--AL 258
Cdd:COG1196 297 LARL---------EQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAeeAL 367
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 259 EKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEmimtdlERANQR 338
Cdd:COG1196 368 LEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEE------EEEEEE 441
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 601984296 339 AEVAQREAETLREQLSSANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQ 401
Cdd:COG1196 442 EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYE 504
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
212-436 |
1.70e-08 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 58.39 E-value: 1.70e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 212 QKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALekEQKLQNDFAEKERKLQETQM-STTSKLEEAE 290
Cdd:COG4913 231 VEHFDDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYL--RAALRLWFAQRRLELLEAELeELRAELARLE 308
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 291 HKLQTLQTALEKTRTELFDLKTKYDEettAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQiqkap 370
Cdd:COG4913 309 AELERLEARLDALREELDELEAQIRG---NGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAE----- 380
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984296 371 dvaievltrsslevELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLE 436
Cdd:COG4913 381 --------------EFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLE 432
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
108-626 |
2.17e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 58.02 E-value: 2.17e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 108 ELDATATVLANRQDESEQSRKRLIEQSREFKKNtpedlRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLidvpdp 187
Cdd:COG1196 285 EAQAEEYELLAELARLEQDIARLEERRRELEER-----LEELEEELAELEEELEELEEELEELEEELEEAEEEL------ 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 188 vpaLDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQND 267
Cdd:COG1196 354 ---EEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEAL 430
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 268 FAEKERKLQETQmsttsKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEmimtDLERANQRAEVAQREAE 347
Cdd:COG1196 431 AELEEEEEEEEE-----ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLE----ELAEAAARLLLLLEAEA 501
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 348 TLREQLSSANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKE--REIAQLVEDVQRLQASLTKLRENSASQISQLEQQL 425
Cdd:COG1196 502 DYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALaaALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDK 581
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 426 NAKNSTLKQLEEKLKGQADYEEVKKELNTLKSMEFAPSE-------GAGTQDSTKPLEVLLLEKNRSLQSENATLRISNS 498
Cdd:COG1196 582 IRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDtllgrtlVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGS 661
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 499 DLSGRCAELQIHLTEATAKAVEQKELIARLEQDLSTIQSIQRPDAEGASEQGLEKIPEPIKEATAlfygpsmsssgtlpE 578
Cdd:COG1196 662 LTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEAL--------------E 727
|
490 500 510 520
....*....|....*....|....*....|....*....|....*...
gi 601984296 579 GQVDSLLSIISSQRERFRTRNQELEAESRMAQHTIQALQSELDSLRAD 626
Cdd:COG1196 728 EQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLERE 775
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
167-455 |
3.75e-08 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 56.89 E-value: 3.75e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 167 SKEAEAAFLTVYKRLIDVPDPVPALDVGQQLEIKVQRLHDIETE--NQKLRETLEEYNKEFAEVKNQEVTIKALKEKIRE 244
Cdd:COG5185 221 LLEKAKEIINIEEALKGFQDPESELEDLAQTSDKLEKLVEQNTDlrLEKLGENAESSKRLNENANNLIKQFENTKEKIAE 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 245 YEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYD-EETTAKAD 323
Cdd:COG5185 301 YTKSIDIKKATESLEEQLAAAEAEQELEESKRETETGIQNLTAEIEQGQESLTENLEAIKEEIENIVGEVElSKSSEELD 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 324 -----------EIEMIMTDLERANQRAE--------VAQREAETLREQLSSANHSLQLASQIQKA-------PDVAIEVL 377
Cdd:COG5185 381 sfkdtiestkeSLDEIPQNQRGYAQEILatledtlkAADRQIEELQRQIEQATSSNEEVSKLLNEliselnkVMREADEE 460
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 378 TRSSLEVELAAKEREIAQ----LVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLK--GQADYEEVKKE 451
Cdd:COG5185 461 SQSRLEEAYDEINRSVRSkkedLNEELTQIESRVSTLKATLEKLRAKLERQLEGVRSKLDQVAESLKdfMRARGYAHILA 540
|
....
gi 601984296 452 LNTL 455
Cdd:COG5185 541 LENL 544
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
152-458 |
9.16e-08 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 55.84 E-value: 9.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 152 LLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVpdpvpalDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQ 231
Cdd:PRK03918 453 LLEEYTAELKRIEKELKEIEEKERKLRKELREL-------EKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAE 525
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 232 EvtIKALKEKIREYEQTLKSQAETiaLEKEQKLQNDFAEKERKLQETQmsttSKLEEAEHKLQTLQ-TALEKTRTELFDL 310
Cdd:PRK03918 526 E--YEKLKEKLIKLKGEIKSLKKE--LEKLEELKKKLAELEKKLDELE----EELAELLKELEELGfESVEELEERLKEL 597
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 311 KTKYDEETTAKADEIEmimtdLERANQRAEVAQREAETLREQLSSANHSLQLASQiqkapdvAIEVLTRSSLEVELAAKE 390
Cdd:PRK03918 598 EPFYNEYLELKDAEKE-----LEREEKELKKLEEELDKAFEELAETEKRLEELRK-------ELEELEKKYSEEEYEELR 665
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984296 391 REIAQLVEDVQRLQAsltklrensasQISQLEQQLNAKNSTLKQLEEKLKgqaDYEEVKKELNTLKSM 458
Cdd:PRK03918 666 EEYLELSRELAGLRA-----------ELEELEKRREEIKKTLEKLKEELE---EREKAKKELEKLEKA 719
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
226-465 |
1.35e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 54.38 E-value: 1.35e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 226 AEVKNQEVTIKALKEKIREYEQTLKSQAETI--ALEKEQKLQNDFAEKERKLQETQmsttSKLEEAEHKLQTLQTALEKT 303
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALKKEEkaLLKQLAALERRIAALARRIRALE----QELAALEAELAELEKEIAEL 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 304 RTELFDLKTKYDE-----ETTAKADEIEMIM--TDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVAiev 376
Cdd:COG4942 96 RAELEAQKEELAEllralYRLGRQPPLALLLspEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAE--- 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 377 ltRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRensaSQISQLEQQLNAKNSTLKQLEEKLKGQAdyEEVKKELNTLK 456
Cdd:COG4942 173 --RAELEALLAELEEERAALEALKAERQKLLARLE----KELAELAAELAELQQEAEELEALIARLE--AEAAAAAERTP 244
|
....*....
gi 601984296 457 SMEFAPSEG 465
Cdd:COG4942 245 AAGFAALKG 253
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
195-553 |
1.49e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 55.03 E-value: 1.49e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 195 QQLEIKVQRLHDIETENQKLRETLEEYNKEfaeVKNQEVTIKALKEKIreyeQTLKSQAETIALEKEQKLQNDFAE---- 270
Cdd:TIGR04523 246 TEISNTQTQLNQLKDEQNKIKKQLSEKQKE---LEQNNKKIKELEKQL----NQLKSEISDLNNQKEQDWNKELKSelkn 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 271 KERKLQETQmSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQraevaqrEAETLR 350
Cdd:TIGR04523 319 QEKKLEEIQ-NQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQ-------EIKNLE 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 351 EQLSSANHSLQLASQIQKAPDVAIEVltrssLEVELAAKEREIAQLVEDVQRLQASLTKLREnsasQISQLEQQLNAKNS 430
Cdd:TIGR04523 391 SQINDLESKIQNQEKLNQQKDEQIKK-----LQQEKELLEKEIERLKETIIKNNSEIKDLTN----QDSVKELIIKNLDN 461
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 431 TLKQLEEKLKgqadyeEVKKELNTLKSmefapsegaGTQDSTKPLEvlllEKNrslqSENATLRISNSDLSGRCAELQIH 510
Cdd:TIGR04523 462 TRESLETQLK------VLSRSINKIKQ---------NLEQKQKELK----SKE----KELKKLNEEKKELEEKVKDLTKK 518
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 601984296 511 LTEATAKAVEQKELIARLEQDLSTIQSIQRPDAEGASEQGLEK 553
Cdd:TIGR04523 519 ISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEK 561
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
195-368 |
3.72e-07 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 53.62 E-value: 3.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERK 274
Cdd:COG4717 74 KELEEELKEAEEKEEEYAELQEELEELEEELEELEAE---LEELREELEKLEKLLQLLPLYQELEALEAELAELPERLEE 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 275 LQETQMSttskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAK----ADEIEMIMTDLERANQRAEVAQREAETLR 350
Cdd:COG4717 151 LEERLEE----LRELEEELEELEAELAELQEELEELLEQLSLATEEElqdlAEELEELQQRLAELEEELEEAQEELEELE 226
|
170
....*....|....*...
gi 601984296 351 EQLSSANHSLQLASQIQK 368
Cdd:COG4717 227 EELEQLENELEAAALEER 244
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
195-457 |
4.10e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 53.49 E-value: 4.10e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQ----EVTIKALKEKIREYEQTLKSQAETIALEKeQKLQNDFAE 270
Cdd:TIGR04523 412 EQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQdsvkELIIKNLDNTRESLETQLKVLSRSINKIK-QNLEQKQKE 490
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 271 KERKLQETQMST--TSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAET 348
Cdd:TIGR04523 491 LKSKEKELKKLNeeKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEKEIDEKNKEI 570
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 349 lrEQLSSANHSLqLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLrensASQISQLEQQLNAK 428
Cdd:TIGR04523 571 --EELKQTQKSL-KKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKL----SSIIKNIKSKKNKL 643
|
250 260 270
....*....|....*....|....*....|
gi 601984296 429 NSTLKQLEEKLKG-QADYEEVKKELNTLKS 457
Cdd:TIGR04523 644 KQEVKQIKETIKEiRNKWPEIIKKIKESKT 673
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
103-617 |
8.97e-07 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 52.84 E-value: 8.97e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKqvAPLLKSFQGEIDALSKRSKEAEAAfltvyKRLI 182
Cdd:PTZ00121 1362 AEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK--ADELKKAAAAKKKADEAKKKAEEK-----KKAD 1434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRET--LEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETI---- 256
Cdd:PTZ00121 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAkkADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKkade 1514
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 257 ---ALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMImtDLE 333
Cdd:PTZ00121 1515 akkAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKA--EEA 1592
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 334 RANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLREN 413
Cdd:PTZ00121 1593 RIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 414 SASQISQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKElntlksmefapsegaGTQDSTKPLEVLLLEKNRSLQSENATl 493
Cdd:PTZ00121 1673 DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK---------------EAEEKKKAEELKKAEEENKIKAEEAK- 1736
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 494 risnsdlsgRCAELQIHLTEATAKAVEQKELIARLEQDLSTIQSIQRPDAEGASEQGLEKIPEPIKEATALFYGPSMSSS 573
Cdd:PTZ00121 1737 ---------KEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
|
490 500 510 520
....*....|....*....|....*....|....*....|....
gi 601984296 574 GTLPEGQVDSLLsIISSQRERFRTRNQELEAESRMAQHTIQALQ 617
Cdd:PTZ00121 1808 ANIIEGGKEGNL-VINDSKEMEDSAIKEVADSKNMQLEEADAFE 1850
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
283-499 |
1.42e-06 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 52.36 E-value: 1.42e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 283 TSKLEEAEHKLQTLQTALEKTRTELFDLKTKydEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQlsSANHSLQL 362
Cdd:TIGR01612 1073 KEILEEAEINITNFNEIKEKLKHYNFDDFGK--EENIKYADEINKIKDDIKNLDQKIDHHIKALEEIKKK--SENYIDEI 1148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 363 ASQIQKAPDVAIEVLTRSSLEvELAAKEREIA-------QLVEDVQRLQASLTKLRENSASqisqLEQQLNAKNSTLKQL 435
Cdd:TIGR01612 1149 KAQINDLEDVADKAISNDDPE-EIEKKIENIVtkidkkkNIYDEIKKLLNEIAEIEKDKTS----LEEVKGINLSYGKNL 1223
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984296 436 EEKLKGQADyEEVKKELNTLKSMEfAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSD 499
Cdd:TIGR01612 1224 GKLFLEKID-EEKKKSEHMIKAME-AYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDD 1285
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
195-355 |
2.07e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 51.45 E-value: 2.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERK 274
Cdd:COG4913 274 LEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGGDRLEQLEREIERLERE 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 275 LQETQmsttSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLS 354
Cdd:COG4913 354 LEERE----RRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIA 429
|
.
gi 601984296 355 S 355
Cdd:COG4913 430 S 430
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
157-532 |
2.09e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 51.31 E-value: 2.09e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 157 QGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYnkefaevKNQEVTIK 236
Cdd:COG4717 94 QEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEEL-------RELEEELE 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 237 ALKEKIREYEQTLKSQAETIALEKEQKLQnDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKtkyDE 316
Cdd:COG4717 167 ELEAELAELQEELEELLEQLSLATEEELQ-DLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAA---LE 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 317 ETTAKADEIEMIMT-----------DLERANQRAEVAQ-------REAETLREQLSSANHSLQLASQIQKAPDVAIEVLT 378
Cdd:COG4717 243 ERLKEARLLLLIAAallallglggsLLSLILTIAGVLFlvlgllaLLFLLLAREKASLGKEAEELQALPALEELEEEELE 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 379 R--SSLEVELAAKEREIAQLVEDVQRLQASLTKLRE-NSASQISQLEQQLN-----AKNSTLKQLEEKLKGQADYEEVKK 450
Cdd:COG4717 323 EllAALGLPPDLSPEELLELLDRIEELQELLREAEElEEELQLEELEQEIAallaeAGVEDEEELRAALEQAEEYQELKE 402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 451 ELNTLKSmEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLteataKAVEQKELIARLEQ 530
Cdd:COG4717 403 ELEELEE-QLEELLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAEL-----EQLEEDGELAELLQ 476
|
..
gi 601984296 531 DL 532
Cdd:COG4717 477 EL 478
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
236-438 |
2.49e-06 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 50.78 E-value: 2.49e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 236 KALKEKIREYEQTLKSQAETIALEKEQ-KLQNDFAEKERKL-----QETQMSTTSKLEEAE-HKLQTLQT---------- 298
Cdd:PHA02562 170 KLNKDKIRELNQQIQTLDMKIDHIQQQiKTYNKNIEEQRKKngeniARKQNKYDELVEEAKtIKAEIEELtdellnlvmd 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 299 ------ALEKTRTELFDLKTKYdeETTAKadEIEMI---------MTDLERANQRaevaqreAETLREQLSSANHSL-QL 362
Cdd:PHA02562 250 iedpsaALNKLNTAAAKIKSKI--EQFQK--VIKMYekggvcptcTQQISEGPDR-------ITKIKDKLKELQHSLeKL 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 363 ASQIQKAPDVAIEVLTRSS----LEVELAAKEREIAQLVEDVQRLQASLTKL---RENSASQISQLEQQLNAKNSTLKQL 435
Cdd:PHA02562 319 DTAIDELEEIMDEFNEQSKklleLKNKISTNKQSLITLVDKAKKVKAAIEELqaeFVDNAEELAKLQDELDKIVKTKSEL 398
|
...
gi 601984296 436 EEK 438
Cdd:PHA02562 399 VKE 401
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
259-639 |
2.49e-06 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 50.89 E-value: 2.49e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 259 EKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRtELFDLKTKYDEETTAKADEIEMIMTDLERANQR 338
Cdd:pfam05557 48 DRNQELQKRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKLN-EKESQLADAREVISCLKNELSELRRQIQRAELE 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 339 AEVAQREAETLREQ-----------------LSSANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKERE-IAQLVEDV 400
Cdd:pfam05557 127 LQSTNSELEELQERldllkakaseaeqlrqnLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELArIPELEKEL 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 401 QRLQASLTKLRENSASqISQLEQQLNAKNSTLKQLE----EKLKGQADYEEVKKELNTLKSMEFA-------PSEGAGTQ 469
Cdd:pfam05557 207 ERLREHNKHLNENIEN-KLLLKEEVEDLKRKLEREEkyreEAATLELEKEKLEQELQSWVKLAQDtglnlrsPEDLSRRI 285
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 470 DSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATAKAVEQKELIARLEQDLSTI--------QSIQRP 541
Cdd:pfam05557 286 EQLQQREIVLKEENSSLTSSARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLtkerdgyrAILESY 365
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 542 DAEGASEQGLEKIPEPIKEATALFygPSMSSSGTLPEGQVDSLLSIISSQRERFRT--------RNQELEAESRMAQHTI 613
Cdd:pfam05557 366 DKELTMSNYSPQLLERIEEAEDMT--QKMQAHNEEMEAQLSVAEEELGGYKQQAQTlerelqalRQQESLADPSYSKEEV 443
|
410 420
....*....|....*....|....*.
gi 601984296 614 QALQSELDSLRADNIKLFEKIKFLQS 639
Cdd:pfam05557 444 DSLRRKLETLELERQRLREQKNELEM 469
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
103-401 |
3.72e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 50.83 E-value: 3.72e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREF--KKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAfltvykr 180
Cdd:TIGR02168 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELesKLDELAEELAELEEKLEELKEELESLEAELEELEAE------- 366
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 181 lidvpdpvpaldvgqqleikvqrLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALEK 260
Cdd:TIGR02168 367 -----------------------LEELESRLEELEEQLETLRSKVAQLELQ---IASLNNEIERLEARLERLEDRRERLQ 420
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 261 EQKLQNDFAEKERKLQETQMSTTSK---LEEAEHKLQTLQTALEKTRTELFDLKTKYDeETTAKADEIEMIMTDLERANQ 337
Cdd:TIGR02168 421 QEIEELLKKLEEAELKELQAELEELeeeLEELQEELERLEEALEELREELEEAEQALD-AAERELAQLQARLDSLERLQE 499
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984296 338 RAEVAQREAETLREQLSSANHSLQLASQIqkapdvaIEVLTRSSLEVELAAKEREIAQLVEDVQ 401
Cdd:TIGR02168 500 NLEGFSEGVKALLKNQSGLSGILGVLSEL-------ISVDEGYEAAIEAALGGRLQAVVVENLN 556
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
202-442 |
5.48e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.91 E-value: 5.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 202 QRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQklqndfAEKERKLQETQmS 281
Cdd:COG4913 610 AKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREI------AELEAELERLD-A 682
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 282 TTSKLEEAEHKLQTLQTALEKTRTELFDLKTKY---DEETTAKADEIEMIMTDLERANQRAEVAQRE-AETLREQLSSAN 357
Cdd:COG4913 683 SSDDLAALEEQLEELEAELEELEEELDELKGEIgrlEKELEQAEEELDELQDRLEAAEDLARLELRAlLEERFAAALGDA 762
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 358 HSLQLASQIQKApdvaievltRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNA-KNSTLKQLE 436
Cdd:COG4913 763 VERELRENLEER---------IDALRARLNRAEEELERAMRAFNREWPAETADLDADLESLPEYLALLDRlEEDGLPEYE 833
|
....*.
gi 601984296 437 EKLKGQ 442
Cdd:COG4913 834 ERFKEL 839
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
318-533 |
5.53e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 49.38 E-value: 5.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 318 TTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQiqkapdvAIEVLTR--SSLEVELAAKEREIAQ 395
Cdd:COG4942 15 AAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALER-------RIAALARriRALEQELAALEAELAE 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 396 LVEDVQRLQASLTKLRENSASQISQLEQQ---------LNAKNST--------LKQLEEKLKGQAD-YEEVKKELNTLKS 457
Cdd:COG4942 88 LEKEIAELRAELEAQKEELAELLRALYRLgrqpplallLSPEDFLdavrrlqyLKYLAPARREQAEeLRADLAELAALRA 167
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984296 458 mefapsEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATAKAVEQKELIARLEQDLS 533
Cdd:COG4942 168 ------ELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAA 237
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
103-455 |
5.60e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 50.06 E-value: 5.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAA-FLTVYKRL 181
Cdd:PRK03918 455 EEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEeYEKLKEKL 534
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 182 IDVPDPVpaldvgQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEV-----TIKALKEKIREYEQtlksqaeti 256
Cdd:PRK03918 535 IKLKGEI------KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEelgfeSVEELEERLKELEP--------- 599
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 257 ALEKEQKLQNDFAEKERKLQEtQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEiemIMTDLERAN 336
Cdd:PRK03918 600 FYNEYLELKDAEKELEREEKE-LKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELRE---EYLELSREL 675
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 337 QRAEVAQREAETLREQLSSanhslqlasqiqkapdvAIEVLTRSSLEVELAAKERE-IAQLVEDVQRLQASLTK----LR 411
Cdd:PRK03918 676 AGLRAELEELEKRREEIKK-----------------TLEKLKEELEEREKAKKELEkLEKALERVEELREKVKKykalLK 738
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 601984296 412 ENSASQISQLEQQLNAK-------NSTLKQLEEKLKGQADYEEVKKELNTL 455
Cdd:PRK03918 739 ERALSKVGEIASEIFEEltegkysGVRVKAEENKVKLFVVYQGKERPLTFL 789
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
272-458 |
5.95e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 49.63 E-value: 5.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 272 ERKLQETQMSTT---SKLEEAEHKLQTLQTALE--KTRTELFDLktkyDEETTAKADEIEMIMTDLERANQRAEVAQREA 346
Cdd:COG3206 167 ELRREEARKALEfleEQLPELRKELEEAEAALEefRQKNGLVDL----SEEAKLLLQQLSELESQLAEARAELAEAEARL 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 347 ETLREQLSSanhSLQLASQIQKAPDVAIEVLTRSSLEVELA--------------AKEREIA----QLVEDVQRLQASLT 408
Cdd:COG3206 243 AALRAQLGS---GPDALPELLQSPVIQQLRAQLAELEAELAelsarytpnhpdviALRAQIAalraQLQQEAQRILASLE 319
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 601984296 409 KLRENSASQISQLEQQLNAKNSTLKQLEEKlkgQADYEEVKKELNTLKSM 458
Cdd:COG3206 320 AELEALQAREASLQAQLAQLEARLAELPEL---EAELRRLEREVEVAREL 366
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
186-440 |
6.01e-06 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 49.91 E-value: 6.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 186 DPVPALDVGQQLEiKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLK-SQAETIALEKeqkl 264
Cdd:PRK11281 34 DLPTEADVQAQLD-ALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRqAQAELEALKD---- 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 265 QNDFAEKER--KLQETQMSttSKLEEAEHKLQTLQTALEKTRTELFDLKTKydeettakadeiemimtdLERA-NQRAEV 341
Cdd:PRK11281 109 DNDEETRETlsTLSLRQLE--SRLAQTLDQLQNAQNDLAEYNSQLVSLQTQ------------------PERAqAALYAN 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 342 AQREAEtLREQLSS--ANHSLQLASQIQKapdvaievltrssLEVELAAKEREIA---QLVEDVQRLQASLTKLRENSAS 416
Cdd:PRK11281 169 SQRLQQ-IRNLLKGgkVGGKALRPSQRVL-------------LQAEQALLNAQNDlqrKSLEGNTQLQDLLQKQRDYLTA 234
|
250 260 270
....*....|....*....|....*....|.
gi 601984296 417 QISQLEQQL-------NAKNstLKQLEEKLK 440
Cdd:PRK11281 235 RIQRLEHQLqllqeaiNSKR--LTLSEKTVQ 263
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
203-511 |
8.72e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 49.35 E-value: 8.72e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 203 RLHDIETENQKLRETLEEYNKEFAE--VKNQEVTIKALKEKI---REYEQTLKSQAETIALEKEQK----------LQND 267
Cdd:pfam15921 246 QLEALKSESQNKIELLLQQHQDRIEqlISEHEVEITGLTEKAssaRSQANSIQSQLEIIQEQARNQnsmymrqlsdLEST 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 268 FAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTElfdlKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAE 347
Cdd:pfam15921 326 VSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE----RDQFSQESGNLDDQLQKLLADLHKREKELSLEKEQNK 401
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 348 TLREQlssanhslqlasqiQKAPDVAIEVLTRsslevELAAKEREiaqlvedVQRLQASLTKLRENSASQISQLEQQLNA 427
Cdd:pfam15921 402 RLWDR--------------DTGNSITIDHLRR-----ELDDRNME-------VQRLEALLKAMKSECQGQMERQMAAIQG 455
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 428 KNSTLKQ---LEEKLKGQADY-EEVKKELnTLKSMEFAPSEgagtqDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGR 503
Cdd:pfam15921 456 KNESLEKvssLTAQLESTKEMlRKVVEEL-TAKKMTLESSE-----RTVSDLTASLQEKERAIEATNAEITKLRSRVDLK 529
|
....*...
gi 601984296 504 CAELQiHL 511
Cdd:pfam15921 530 LQELQ-HL 536
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
317-537 |
9.20e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 9.20e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 317 ETTAKADEIEMIMTDLERANQRAEVAQREAETLR------EQLSSANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKE 390
Cdd:COG4913 222 DTFEAADALVEHFDDLERAHEALEDAREQIELLEpirelaERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELR 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 391 REIAQLVEDVQRLQASLTKLREnsasQISQLEQQL-NAKNSTLKQLEEKLKG-QADYEEVKKELNTLKsmefapsEGAGT 468
Cdd:COG4913 302 AELARLEAELERLEARLDALRE----ELDELEAQIrGNGGDRLEQLEREIERlERELEERERRRARLE-------ALLAA 370
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984296 469 QDSTKPLEVLLLEKNRSLQSENATlrisnsDLSGRCAELQIHLTEATAKAVEQKELIARLEQDLSTIQS 537
Cdd:COG4913 371 LGLPLPASAEEFAALRAEAAALLE------ALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433
|
|
| ClyA_Cry6Aa-like |
cd22656 |
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ... |
143-335 |
1.32e-05 |
|
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.
Pssm-ID: 439154 [Multi-domain] Cd Length: 309 Bit Score: 47.75 E-value: 1.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 143 EDLRKQVAPLLKSFQGEIDALSKRSKEAEAAfLTVYKRLIDvpdpvpalDVGQQLEIKVQRLHDI-ETENQKL-RETLEE 220
Cdd:cd22656 113 EEAKKTIKALLDDLLKEAKKYQDKAAKVVDK-LTDFENQTE--------KDQTALETLEKALKDLlTDEGGAIaRKEIKD 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 221 YNKEFAevKNQEVTIKALKEKIREYEQTLKSQAETIAleKEQKLQNDFaekerKLQETQM-STTSKLEEAEHKLQTLQTA 299
Cdd:cd22656 184 LQKELE--KLNEEYAAKLKAKIDELKALIADDEAKLA--AALRLIADL-----TAADTDLdNLLALIGPAIPALEKLQGA 254
|
170 180 190
....*....|....*....|....*....|....*.
gi 601984296 300 LEKTRTELFDLKTKYDEETTaKADEIEMIMTDLERA 335
Cdd:cd22656 255 WQAIATDLDSLKDLLEDDIS-KIPAAILAKLELEKA 289
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
211-532 |
1.58e-05 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 48.59 E-value: 1.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 211 NQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQA-ETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEA 289
Cdd:pfam07111 58 SQALSQQAELISRQLQELRRLEEEVRLLRETSLQQKMRLEAQAmELDALAVAEKAGQAEAEGLRAALAGAEMVRKNLEEG 137
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 290 EHK-LQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLE--RANQRAE--VAQREAETLREQLSSANHSLQlaS 364
Cdd:pfam07111 138 SQReLEEIQRLHQEQLSSLTQAHEEALSSLTSKAEGLEKSLNSLEtkRAGEAKQlaEAQKEAELLRKQLSKTQEELE--A 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 365 QIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQ---------ASLTKLRENSASQISQL-EQQLNAKNSTLKQ 434
Cdd:pfam07111 216 QVTLVESLRKYVGEQVPPEVHSQTWELERQELLDTMQHLQedradlqatVELLQVRVQSLTHMLALqEEELTRKIQPSDS 295
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 435 LEEKLKgqadyEEVKKELNTLKSMEFA-----PSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAE--- 506
Cdd:pfam07111 296 LEPEFP-----KKCRSLLNRWREKVFAlmvqlKAQDLEHRDSVKQLRGQVAELQEQVTSQSQEQAILQRALQDKAAEvev 370
|
330 340 350
....*....|....*....|....*....|...
gi 601984296 507 -------LQIHLTEATAKAVEQKELIARLEQDL 532
Cdd:pfam07111 371 ermsakgLQMELSRAQEARRRQQQQTASAEEQL 403
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
190-614 |
1.74e-05 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 48.20 E-value: 1.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 190 ALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKefaevknqevtikALKEKIREYEQTLKSqaetiaLEKEQKLQNDFA 269
Cdd:pfam05557 36 ASALKRQLDRESDRNQELQKRIRLLEKREAEAEE-------------ALREQAELNRLKKKY------LEALNKKLNEKE 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 270 EKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEettakadeiemimtdleranQRAEVAqrEAETL 349
Cdd:pfam05557 97 SQLADAREVISCLKNELSELRRQIQRAELELQSTNSELEELQERLDL--------------------LKAKAS--EAEQL 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 350 REQLSSANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKERE-IAQLVEDVQRLQASLTKLRENSASqISQLEQQLNAK 428
Cdd:pfam05557 155 RQNLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELArIPELEKELERLREHNKHLNENIEN-KLLLKEEVEDL 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 429 NSTLKQLE----EKLKGQADYEEVKKELNTLKSMEFA-------PSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISN 497
Cdd:pfam05557 234 KRKLEREEkyreEAATLELEKEKLEQELQSWVKLAQDtglnlrsPEDLSRRIEQLQQREIVLKEENSSLTSSARQLEKAR 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 498 SDLSGRCAELQIHLTEATAKAVEQKELIARLEQDLSTI--------QSIQRPDAEGASEQGLEKIPEPIKEAT------- 562
Cdd:pfam05557 314 RELEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLtkerdgyrAILESYDKELTMSNYSPQLLERIEEAEdmtqkmq 393
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 563 --------------------------------ALFYGPSMSSSGTLPEGqVDSL---LSIISSQRERFRTRNQELeaESR 607
Cdd:pfam05557 394 ahneemeaqlsvaeeelggykqqaqtlerelqALRQQESLADPSYSKEE-VDSLrrkLETLELERQRLREQKNEL--EME 470
|
....*..
gi 601984296 608 MAQHTIQ 614
Cdd:pfam05557 471 LERRCLQ 477
|
|
| ClyA_NheA-like |
cd22654 |
Bacillus cereus non-hemolytic enterotoxin (Nhe) component A (NheA), and similar proteins; This ... |
190-455 |
2.19e-05 |
|
Bacillus cereus non-hemolytic enterotoxin (Nhe) component A (NheA), and similar proteins; This model contains Bacillus cereus tripartite non-hemolytic enterotoxin (Nhe) component A (NheA), a member of the cytolysin A (ClyA) family of alpha pore-forming toxins (alpha-PFTs). Non-hemolytic enterotoxin (Nhe), despite its name, is hemolytic and able to lyse erythrocytes from various mammalian organisms. It consists of three proteins, NheA, NheB and NheC, encoded by one operon containing three genes nheA, nheB and nheC, respectively. Separately, these three proteins show no toxicity; maximal activity is seen only when all three components are presented. The NheB and NheC components are able to bind to cell membranes while NheA is not; NheC primes the host cell for the formation of ion permeable NheB/C pores. Binding of NheA to NheB/NheC is thought to be the final stage of pore formation. Structure of NheA shows an elongated, almost entirely alpha-helical protein with an enlarged "head" domain compared with other cytolysins, displaying on its surface an enlarged beta-tongue which is of amphipathic rather than hydrophobic nature. It has been proposed that NheA could even form beta-barrel pores.
Pssm-ID: 439152 [Multi-domain] Cd Length: 333 Bit Score: 47.26 E-value: 2.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 190 ALDVGQQLEIKVQRLHDIeTENQKL-----RETLEEYNKEFAEVkNQEvtIKALKEKIREYEQTLKSQAETIALEKEQKl 264
Cdd:cd22654 18 ALVILKQPNVKIEAMPSL-TNHQQTakenvREWLDEYNPKLIDL-NQD--MINFSQRFNNYYDKLYDLAGKINEDEQAK- 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 265 qNDFAEKERKLQEtqmsttskleeaehKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANqrAEVAQr 344
Cdd:cd22654 93 -EDFLNGINKLQS--------------QLQTIQNSMEQTSSNLNRFKTLLDADSKNFSTDAKKAIDSLSGSN--GEIAQ- 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 345 eaetLREQLSSANHSLQ-------------------LASQI---------QKAPDVAI------EVLTRSSLEVELAAKe 390
Cdd:cd22654 155 ----LRTQIKTINDEIQeeltkilnrpievgdgsinIGKQVftitittatTKTVDVTSigglinGIGNASDDEVKEAAN- 229
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 601984296 391 rEIAQLVEDVQRLQASLTKLrENSASQISQLEQQLNAKNSTLKqleeklKGQADYEEVKKELNTL 455
Cdd:cd22654 230 -KIQQKQKELVDLIKKLSDA-EIQATQLTLVEDQVNGFTELIK------RQIATLENLVEDWEML 286
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
114-386 |
2.21e-05 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 48.15 E-value: 2.21e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 114 TVLANRQDESEQSRKRLIE---QSREFK--KNTPEDLRKQVAPLLKSFQGEIDALSKrSKEAEAAFLTVYKRLIDVPDPv 188
Cdd:COG5022 868 TIYLQSAQRVELAERQLQElkiDVKSISslKLVNLELESEIIELKKSLSSDLIENLE-FKTELIARLKKLLNNIDLEEG- 945
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 189 PALDVGQQLEikVQRLHdieTENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKS--------QAETIALEK 260
Cdd:COG5022 946 PSIEYVKLPE--LNKLH---EVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAElskqygalQESTKQLKE 1020
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 261 EQK----LQNDF-------AEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEmiM 329
Cdd:COG5022 1021 LPVevaeLQSASkiissesTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKTIN--V 1098
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 601984296 330 TDLERANQRAEVAQREAETLREQLSSANHSLQLASQI-QKAPDVAIEVLTRSSLEVEL 386
Cdd:COG5022 1099 KDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKFLsQLVNTLEPVFQKLSVLQLEL 1156
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
281-474 |
3.09e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 47.13 E-value: 3.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 281 STTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEettaKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSL 360
Cdd:COG3883 20 AKQKELSELQAELEAAQAELDALQAELEELNEEYNE----LQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARAL 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 361 QLASQIQKAPDV---------------AIEVLTRSSLEV--ELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQ 423
Cdd:COG3883 96 YRSGGSVSYLDVllgsesfsdfldrlsALSKIADADADLleELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEA 175
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 601984296 424 QLNAKNSTLKQLEEKLK-GQADYEEVKKELNTLKSMEFAPSEGAGTQDSTKP 474
Cdd:COG3883 176 QQAEQEALLAQLSAEEAaAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAA 227
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
208-638 |
3.25e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 47.34 E-value: 3.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 208 ETENQKLRETLEEYNKEFAEVKnqevtikalkEKIREYEqtlkSQAETiALEKEQKLQNDFAEKERKLQETQmsttsKLE 287
Cdd:PRK02224 198 EKEEKDLHERLNGLESELAELD----------EEIERYE----EQREQ-ARETRDEADEVLEEHEERREELE-----TLE 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 288 EAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQ 367
Cdd:PRK02224 258 AEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAA 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 368 KAPDVAIEVLTRSSLEVELAAKE-REIAQLVEDvqRLQASLTKLREnSASQISQLEQQLNAKNSTLKQLEEKLKGQADY- 445
Cdd:PRK02224 338 QAHNEEAESLREDADDLEERAEElREEAAELES--ELEEAREAVED-RREEIEELEEEIEELRERFGDAPVDLGNAEDFl 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 446 EEVKKELN-----------TLKSMEFAPSEGAGTQDSTK------PLE----VLLLEKNR----SLQSENATLRISNSDL 500
Cdd:PRK02224 415 EELREERDelrereaeleaTLRTARERVEEAEALLEAGKcpecgqPVEgsphVETIEEDRerveELEAELEDLEEEVEEV 494
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 501 SGRCAELQihlteataKAVEQKELIARLEQDLSTIQSIQRPDAEGASEQGlEKIPEPIKEATALfygpsmSSSGTLPEGQ 580
Cdd:PRK02224 495 EERLERAE--------DLVEAEDRIERLEERREDLEELIAERRETIEEKR-ERAEELRERAAEL------EAEAEEKREA 559
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 581 VDSLLSIISSQRERFRTRNQELEA--ESRMAQHTIQALQSELDSLRADNIKLFEKIKFLQ 638
Cdd:PRK02224 560 AAEAEEEAEEAREEVAELNSKLAElkERIESLERIRTLLAAIADAEDEIERLREKREALA 619
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
204-531 |
3.91e-05 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 47.12 E-value: 3.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 204 LHDIETENQKLRETLEEYNKEFAEVKN----------QEVTIKALKEKIREYEQTLKSQAETIALEKEQKlqndfaEKER 273
Cdd:pfam10174 201 LDQKEKENIHLREELHRRNQLQPDPAKtkalqtviemKDTKISSLERNIRDLEDEVQMLKTNGLLHTEDR------EEEI 274
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 274 KLQETQMSTTSKLeeaEHKLQTLQTALEKTRTELFDLKTKYDEETTAKAD---EIEMIMTDLERANQRAEVAQREAETLR 350
Cdd:pfam10174 275 KQMEVYKSHSKFM---KNKIDQLKQELSKKESELLALQTKLETLTNQNSDckqHIEVLKESLTAKEQRAAILQTEVDALR 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 351 EQLSSANHSLQLAS-QIQKAPD----VAIEVltrSSLEVELAAKEREIAQLVEDVQRLQASL---TKLRENSASQISQLE 422
Cdd:pfam10174 352 LRLEEKESFLNKKTkQLQDLTEekstLAGEI---RDLKDMLDVKERKINVLQKKIENLQEQLrdkDKQLAGLKERVKSLQ 428
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 423 QQLNAKNSTLKQLEEKLkgqADYEEVKKELNTLKSMEfapsegagtqdstkplevlllekNRSLQSENATLRISNSDLSG 502
Cdd:pfam10174 429 TDSSNTDTALTTLEEAL---SEKERIIERLKEQRERE-----------------------DRERLEELESLKKENKDLKE 482
|
330 340
....*....|....*....|....*....
gi 601984296 503 RCAELQIHLTEATAKAVEQKELIARLEQD 531
Cdd:pfam10174 483 KVSALQPELTEKESSLIDLKEHASSLASS 511
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
191-458 |
3.99e-05 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 47.27 E-value: 3.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 191 LDVGQQLEIKVQRLHDIETENQKLRETL--EEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQA----ETIA--LEKEQ 262
Cdd:TIGR00618 628 QDVRLHLQQCSQELALKLTALHALQLTLtqERVREHALSIRVLPKELLASRQLALQKMQSEKEQLtywkEMLAqcQTLLR 707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 263 KLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRtELFDLKTKYDEETTAKADE----IEMIMTDLERANQR 338
Cdd:TIGR00618 708 ELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELM-HQARTVLKARTEAHFNNNEevtaALQTGAELSHLAAE 786
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 339 AEVAQREAETLREQLSsanhslQLASQIQKAPDVAIEVLTrssLEVELAAKEREiaqlvedvqrlqASLTKLRENSASQI 418
Cdd:TIGR00618 787 IQFFNRLREEDTHLLK------TLEAEIGQEIPSDEDILN---LQCETLVQEEE------------QFLSRLEEKSATLG 845
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 601984296 419 sQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLKSM 458
Cdd:TIGR00618 846 -EITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQI 884
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
193-437 |
5.10e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 46.98 E-value: 5.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 193 VGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIalekeQKLQNDFAEKE 272
Cdd:PRK03918 198 KEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKI-----RELEERIEELK 272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 273 RKLQ--ETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKtKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLR 350
Cdd:PRK03918 273 KEIEelEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIE-KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELE 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 351 EQLSSANHSLQLASQIqKAPDVAIEvltrsSLEVELAAKERE-IAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKN 429
Cdd:PRK03918 352 KRLEELEERHELYEEA-KAKKEELE-----RLKKRLTGLTPEkLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK 425
|
....*...
gi 601984296 430 STLKQLEE 437
Cdd:PRK03918 426 KAIEELKK 433
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
110-443 |
5.70e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 46.57 E-value: 5.70e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 110 DATATVLANRQDE----SEQSRKRLIEQ--SREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRlid 183
Cdd:PRK02224 306 DADAEAVEARREEledrDEELRDRLEECrvAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDR--- 382
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 184 vPDPVPALDvgQQLEIKVQRLHDIETEnqklRETLEEYNKEFAEVKNqevtikALKEKIREYEQTLKSQAETIALEKEQK 263
Cdd:PRK02224 383 -REEIEELE--EEIEELRERFGDAPVD----LGNAEDFLEELREERD------ELREREAELEATLRTARERVEEAEALL 449
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 264 LQNDFAEKERKLQETQMSTTskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAK--ADEIEMIMTDLERANQRAEV 341
Cdd:PRK02224 450 EAGKCPECGQPVEGSPHVET--IEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVeaEDRIERLEERREDLEELIAE 527
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 342 AQREAETLREQLSSANHSLQ-LASQIQKAPDVAIEVLTRS-SLEVELAAKEREIAQLVEDVQRLQ--ASLTKLRENSASQ 417
Cdd:PRK02224 528 RRETIEEKRERAEELRERAAeLEAEAEEKREAAAEAEEEAeEAREEVAELNSKLAELKERIESLEriRTLLAAIADAEDE 607
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 601984296 418 ISQLEQQ--------------LNAKNSTLKQLEEKLKGQA 443
Cdd:PRK02224 608 IERLREKrealaelnderrerLAEKRERKRELEAEFDEAR 647
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
198-440 |
6.52e-05 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 45.67 E-value: 6.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 198 EIKVQRLhDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLksQAETIALEKEQKLQNDFAEKERKLQE 277
Cdd:COG1340 75 ELKEERD-ELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQ--QTEVLSPEEEKELVEKIKELEKELEK 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 278 TQmsttsKLEEAEHKLQTLQTALEKTRTELFDLKTKYdEETTAKADEIEMIMTDLeranqraevaQREAETLREQLSSAN 357
Cdd:COG1340 152 AK-----KALEKNEKLKELRAELKELRKEAEEIHKKI-KELAEEAQELHEEMIEL----------YKEADELRKEADELH 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 358 HSLQLASQiqkapdvaievltrsslevELAAKEREIAQLVEDVQRLQASLTKLRENSASqiSQLEQQLNAKNSTLKQLEE 437
Cdd:COG1340 216 KEIVEAQE-------------------KADELHEEIIELQKELRELRKELKKLRKKQRA--LKREKEKEELEEKAEEIFE 274
|
...
gi 601984296 438 KLK 440
Cdd:COG1340 275 KLK 277
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
202-344 |
9.15e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 44.53 E-value: 9.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 202 QRLHDIETENQKLRETLEEYNKEF----AEVKNQEVTIKALKEKIREYEQTL---KSQAETIALEKEQklqnDFAEKERK 274
Cdd:COG1579 31 AELAELEDELAALEARLEAAKTELedleKEIKRLELEIEEVEARIKKYEEQLgnvRNNKEYEALQKEI----ESLKRRIS 106
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 601984296 275 LQETQMS-TTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTdlERANQRAEVAQR 344
Cdd:COG1579 107 DLEDEILeLMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEA--EREELAAKIPPE 175
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
141-466 |
1.62e-04 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 45.21 E-value: 1.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 141 TPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFltvykrlidvpdpvpaldvgQQLEIKVQRLHDietENQKLRETLEE 220
Cdd:pfam12128 615 SAREKQAAAEEQLVQANGELEKASREETFARTAL--------------------KNARLDLRRLFD---EKQSEKDKKNK 671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 221 YNKEFAEVKNQEVT-----IKALKEKIREYEQTLKSQAETIALEKEQKLQNdfAEKERKLQETQMSTTSKLEEAEHKLQt 295
Cdd:pfam12128 672 ALAERKDSANERLNsleaqLKQLDKKHQAWLEEQKEQKREARTEKQAYWQV--VEGALDAQLALLKAAIAARRSGAKAE- 748
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 296 lQTALEKTRTElfDLKTK-YDEETTAK-ADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQlasqiqkapdva 373
Cdd:pfam12128 749 -LKALETWYKR--DLASLgVDPDVIAKlKREIRTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLA------------ 813
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 374 ievltrssleVELAAKEREIAQLVEDVQRLQASlTKLRensasqISQLEQQLNAKNSTLKQLEEKLKGQADYEEvkkELN 453
Cdd:pfam12128 814 ----------TQLSNIERAISELQQQLARLIAD-TKLR------RAKLEMERKASEKQQVRLSENLRGLRCEMS---KLA 873
|
330
....*....|...
gi 601984296 454 TLKsmEFAPSEGA 466
Cdd:pfam12128 874 TLK--EDANSEQA 884
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
215-632 |
1.63e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 45.42 E-value: 1.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 215 RETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETialekEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQ 294
Cdd:TIGR00606 237 REIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSR-----KKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQ 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 295 TLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLaSQIQKAPDVAI 374
Cdd:TIGR00606 312 RTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLEL-DGFERGPFSER 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 375 EVLTRSSLEVE-LAAKEREIAQLVEDVQRLQASLTKlrensasQISQLEQQLNAKNSTLKQLEEKL-KGQADYEEVKKEL 452
Cdd:TIGR00606 391 QIKNFHTLVIErQEDEAKTAAQLCADLQSKERLKQE-------QADEIRDEKKGLGRTIELKKEILeKKQEELKFVIKEL 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 453 NTL---------KSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATakaveQKE 523
Cdd:TIGR00606 464 QQLegssdrileLDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRT-----QME 538
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 524 LIARLEQDlsTIQSIQRPDAEGASEQglekipepIKEATALFYGPSMSSSgtlpegqVDSLLSIISSQRERFRTRNQELE 603
Cdd:TIGR00606 539 MLTKDKMD--KDEQIRKIKSRHSDEL--------TSLLGYFPNKKQLEDW-------LHSKSKEINQTRDRLAKLNKELA 601
|
410 420
....*....|....*....|....*....
gi 601984296 604 AESRMAQHTIQALQSELDSLRADNIKLFE 632
Cdd:TIGR00606 602 SLEQNKNHINNELESKEEQLSSYEDKLFD 630
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
207-623 |
1.98e-04 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 45.21 E-value: 1.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 207 IETENQKLRETLEEYNKEFAEVKNQevtikaLKEKIREYEQTLKSQAETIALEKEQ---------KLQNDFAEKERKLQE 277
Cdd:pfam12128 274 IASRQEERQETSAELNQLLRTLDDQ------WKEKRDELNGELSAADAAVAKDRSElealedqhgAFLDADIETAAADQE 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 278 TQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETtakADEIEMIMTDLE-----RANQRAE---VAQREAETL 349
Cdd:pfam12128 348 QLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIKEQN---NRDIAGIKDKLAkireaRDRQLAVaedDLQALESEL 424
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 350 REQLSSANHSL-----QLASQI------QKAPDVAIEVLTRSSLEVELAAKEREIAQL----VEDVQRLQASLTKLRENS 414
Cdd:pfam12128 425 REQLEAGKLEFneeeyRLKSRLgelklrLNQATATPELLLQLENFDERIERAREEQEAanaeVERLQSELRQARKRRDQA 504
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 415 ASQISQLEQQLNAKNSTLKQLEEKLKGQAdyeevkkelNTLksMEFAPSEGAGTQDST-KPLEVLLLEKNRSLQSENATL 493
Cdd:pfam12128 505 SEALRQASRRLEERQSALDELELQLFPQA---------GTL--LHFLRKEAPDWEQSIgKVISPELLHRTDLDPEVWDGS 573
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 494 RISNSDLSGrcAELQIHLTEATAKAVEQKELIARLEQDLSTIQSiQRPDAEGASEQgLEKIPEPIKEATAlfygpSMSSS 573
Cdd:pfam12128 574 VGGELNLYG--VKLDLKRIDVPEWAASEEELRERLDKAEEALQS-AREKQAAAEEQ-LVQANGELEKASR-----EETFA 644
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|
gi 601984296 574 GTLPEGQVDSLLSIISSQRERFRTRNQELEAESRMAQHTIQALQSELDSL 623
Cdd:pfam12128 645 RTALKNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQL 694
|
|
| Spc7 |
smart00787 |
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ... |
203-312 |
2.18e-04 |
|
Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.
Pssm-ID: 197874 [Multi-domain] Cd Length: 312 Bit Score: 43.85 E-value: 2.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 203 RLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAET---IALEKEQKLQNDFA---EKERKLQ 276
Cdd:smart00787 138 RMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQlkqLEDELEDCDPTELDrakEKLKKLL 217
|
90 100 110
....*....|....*....|....*....|....*.
gi 601984296 277 ETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKT 312
Cdd:smart00787 218 QEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
116-322 |
2.53e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 44.62 E-value: 2.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 116 LANRQDESEQSRKRLieqsREFKKNT----PEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPAL 191
Cdd:COG3206 184 LPELRKELEEAEAAL----EEFRQKNglvdLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPEL 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 192 DVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAET---IALEKEQKLQNDF 268
Cdd:COG3206 260 LQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQ---IAALRAQLQQEAQRILASLEAeleALQAREASLQAQL 336
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 601984296 269 AEKERKLQEtqmsttskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKA 322
Cdd:COG3206 337 AQLEARLAE--------LPELEAELRRLEREVEVARELYESLLQRLEEARLAEA 382
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
213-520 |
2.64e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 44.58 E-value: 2.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 213 KLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTL--KSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAE 290
Cdd:pfam02463 230 DYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKlaQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 291 HKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAP 370
Cdd:pfam02463 310 VDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSA 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 371 DVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQADYEEVKK 450
Cdd:pfam02463 390 AKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKK 469
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 451 ELNTLKSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATAKAVE 520
Cdd:pfam02463 470 SEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVE 539
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
200-438 |
6.33e-04 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 43.40 E-value: 6.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 200 KVQRLH-DIETENQKLRE---TLEEYNKEFAEVKNQ----EVTIKALKEKIREYEQTLKSQaETIALEKEQKLQNdFAEK 271
Cdd:COG3096 348 KIERYQeDLEELTERLEEqeeVVEEAAEQLAEAEARleaaEEEVDSLKSQLADYQQALDVQ-QTRAIQYQQAVQA-LEKA 425
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 272 ERKLQETQMSttskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKA--DE----IEMIMTDLER--ANQRAE--- 340
Cdd:COG3096 426 RALCGLPDLT----PENAEDYLAAFRAKEQQATEEVLELEQKLSVADAARRqfEKayelVCKIAGEVERsqAWQTARell 501
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 341 --------VAQReAETLREQLSSANhslQLASQIQKAPDVAIEVLTRSSLEVELAAK-EREIAQLVEDVQRLQASLTKLR 411
Cdd:COG3096 502 rryrsqqaLAQR-LQQLRAQLAELE---QRLRQQQNAERLLEEFCQRIGQQLDAAEElEELLAELEAQLEELEEQAAEAV 577
|
250 260
....*....|....*....|....*..
gi 601984296 412 EnsasQISQLEQQLNAKNSTLKQLEEK 438
Cdd:COG3096 578 E----QRSELRQQLEQLRARIKELAAR 600
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
199-410 |
6.94e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 43.08 E-value: 6.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 199 IKVQRLHdIETENQKLRETLEEYNKEFAEVKNQEVTIKA----LKEKIREYEQTLKSQAETIalekeQKLQNDFAEKERK 274
Cdd:PHA02562 197 IKTYNKN-IEEQRKKNGENIARKQNKYDELVEEAKTIKAeieeLTDELLNLVMDIEDPSAAL-----NKLNTAAAKIKSK 270
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 275 LQ----ETQMST--------TSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETtakadeiemimtdlERANQRAEvA 342
Cdd:PHA02562 271 IEqfqkVIKMYEkggvcptcTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELE--------------EIMDEFNE-Q 335
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984296 343 QREAETLREQLSSANHSLQLASQIQKAPDVAIEvltrsSLEVELAAKEREIAQLVEDVQRLQASLTKL 410
Cdd:PHA02562 336 SKKLLELKNKISTNKQSLITLVDKAKKVKAAIE-----ELQAEFVDNAEELAKLQDELDKIVKTKSEL 398
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
103-444 |
7.81e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.98 E-value: 7.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 103 AKLKRELDATATVLANRQDESEQSRKRLiEQSREFKKNTPEDLRkqvaplLKSFQGEIDALSKRSKEAEAAFLTVykrli 182
Cdd:COG4913 620 AELEEELAEAEERLEALEAELDALQERR-EALQRLAEYSWDEID------VASAEREIAELEAELERLDASSDDL----- 687
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 183 dvpdpvpaldvgQQLEikvQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALEKEQ 262
Cdd:COG4913 688 ------------AALE---EQLEELEAELEELEEELDELKGEIGRLEKE---LEQAEEELDELQDRLEAAEDLARLELRA 749
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 263 KLQNDFAEKERKLQETQMSttsklEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEiemIMTDLERANQ-RAEV 341
Cdd:COG4913 750 LLEERFAAALGDAVERELR-----ENLEERIDALRARLNRAEEELERAMRAFNREWPAETAD---LDADLESLPEyLALL 821
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 342 AQREAETL-------REQLSSANHS--LQLASQIQKAPDVAIEVLTR--SSLE---------VELAAKEREiaqlVEDVQ 401
Cdd:COG4913 822 DRLEEDGLpeyeerfKELLNENSIEfvADLLSKLRRAIREIKERIDPlnDSLKripfgpgryLRLEARPRP----DPEVR 897
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 601984296 402 RLQASLTKLRENSASQIsqlEQQLNAKNSTLKQLEEKLKGQAD 444
Cdd:COG4913 898 EFRQELRAVTSGASLFD---EELSEARFAALKRLIERLRSEEE 937
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
207-437 |
9.11e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 42.70 E-value: 9.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 207 IETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYeqtlKSQAETIALEKEQKLqndFAEKERKLQETQMSTTSKL 286
Cdd:COG3206 166 LELRREEARKALEFLEEQLPELRKE---LEEAEAALEEF----RQKNGLVDLSEEAKL---LLQQLSELESQLAEARAEL 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 287 EEAEHKLQTLQTALEKT----------------RTELFDLKTKYDEE-TTAKADEIEMIMTDLERANQRAEVAQREAETL 349
Cdd:COG3206 236 AEAEARLAALRAQLGSGpdalpellqspviqqlRAQLAELEAELAELsARYTPNHPDVIALRAQIAALRAQLQQEAQRIL 315
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 350 REQLSSANHSLQLASQIQKApdvaIEVLTRSSLEveLAAKEREIAQLVEDVQRLQASLTKLREnsasQISQLEQQLNAKN 429
Cdd:COG3206 316 ASLEAELEALQAREASLQAQ----LAQLEARLAE--LPELEAELRRLEREVEVARELYESLLQ----RLEEARLAEALTV 385
|
....*...
gi 601984296 430 STLKQLEE 437
Cdd:COG3206 386 GNVRVIDP 393
|
|
| EzrA |
pfam06160 |
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ... |
306-456 |
9.72e-04 |
|
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.
Pssm-ID: 428797 [Multi-domain] Cd Length: 542 Bit Score: 42.53 E-value: 9.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 306 ELFD-LKTKYDEETTAKADEIEMIMTDLERANQRAEV--AQREAETLREQLSSANHSLQlasQIQKAPDVAIEVLTRSSL 382
Cdd:pfam06160 45 EKFEeWRKKWDDIVTKSLPDIEELLFEAEELNDKYRFkkAKKALDEIEELLDDIEEDIK---QILEELDELLESEEKNRE 121
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984296 383 EVElaakereiaQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEkLKGQADYEEVKKELNTLK 456
Cdd:pfam06160 122 EVE---------ELKDKYRELRKTLLANRFSYGPAIDELEKQLAEIEEEFSQFEE-LTESGDYLEAREVLEKLE 185
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
210-529 |
1.24e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 42.27 E-value: 1.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 210 ENQKLRETLE---EYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKL 286
Cdd:pfam02463 174 ALKKLIEETEnlaELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEI 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 287 EEAEhKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEmimTDLERANQRAEVAQREAETLREQLSSANHSLQLASQI 366
Cdd:pfam02463 254 ESSK-QEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLA---KEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKE 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 367 QKAPDVAIEvLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQADYE 446
Cdd:pfam02463 330 LKKEKEEIE-ELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQ 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 447 EVKKELNTLKSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATAKAVEQKELIA 526
Cdd:pfam02463 409 LLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLEL 488
|
...
gi 601984296 527 RLE 529
Cdd:pfam02463 489 LLS 491
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
114-512 |
1.31e-03 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 42.35 E-value: 1.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 114 TVLANRQDESEQSRKRLiEQSREFKknTPEDlrkqvAPLLKSFQGEIDALSKRSKEAEAAflTVYKRLIDvpdpvpaldv 193
Cdd:PRK10929 16 GAYAATAPDEKQITQEL-EQAKAAK--TPAQ-----AEIVEALQSALNWLEERKGSLERA--KQYQQVID---------- 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 194 gqqleikvqrlhDIETENQKLRETLEEYNKEFAEVkNQEVTIKALKEKIREYEQTLKSQAETiaLEKEQKLQNDFAEKER 273
Cdd:PRK10929 76 ------------NFPKLSAELRQQLNNERDEPRSV-PPNMSTDALEQEILQVSSQLLEKSRQ--AQQEQDRAREISDSLS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 274 KLQETQMSTTSKLEEAEHKLQTL--------QTALEKTRTELFDLKtkydeettAKADEIEMIMTDL----ERANQRAEV 341
Cdd:PRK10929 141 QLPQQQTEARRQLNEIERRLQTLgtpntplaQAQLTALQAESAALK--------ALVDELELAQLSAnnrqELARLRSEL 212
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 342 AQREAETLREQLSSANHslQLASQIQKAPDVAIEvltrsslEVELAAKE---------------REIAQ-LVEDVQRLQA 405
Cdd:PRK10929 213 AKKRSQQLDAYLQALRN--QLNSQRQREAERALE-------STELLAEQsgdlpksivaqfkinRELSQaLNQQAQRMDL 283
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 406 SLTKLREnSASQISQLEQQLN-----------------AKNSTLKQLEEKLKGQA-DYEEVKKELNTLK---SMEFAPSE 464
Cdd:PRK10929 284 IASQQRQ-AASQTLQVRQALNtlreqsqwlgvsnalgeALRAQVARLPEMPKPQQlDTEMAQLRVQRLRyedLLNKQPQL 362
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 601984296 465 GAGTQDSTKPLEVlllEKNRSLQSENATLR-ISNSDLSGrCAELQIHLT 512
Cdd:PRK10929 363 RQIRQADGQPLTA---EQNRILDAQLRTQReLLNSLLSG-GDTLILELT 407
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
157-531 |
1.41e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 42.12 E-value: 1.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 157 QGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPAL-----DVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQ 231
Cdd:pfam10174 344 QTEVDALRLRLEEKESFLNKKTKQLQDLTEEKSTLageirDLKDMLDVKERKINVLQKKIENLQEQLRDKDKQLAGLKER 423
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 232 evtIKALKEKIREYEQTLKSQAETIAlEKE---QKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELF 308
Cdd:pfam10174 424 ---VKSLQTDSSNTDTALTTLEEALS-EKEriiERLKEQREREDRERLEELESLKKENKDLKEKVSALQPELTEKESSLI 499
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 309 DLKTKydeettAKADEIEMIMTDLERANQRAEVAQREAETLReqlssanhslqLASQIQKAPDVAIEVLTRSSLEVELAA 388
Cdd:pfam10174 500 DLKEH------ASSLASSGLKKDSKLKSLEIAVEQKKEECSK-----------LENQLKKAHNAEEAVRTNPEINDRIRL 562
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 389 KEREIAQLVEDVQRLQASLTKL------RENSAS----QISQLE---------QQLNAKNSTLKQLEEKLKGQADYEEVK 449
Cdd:pfam10174 563 LEQEVARYKEESGKAQAEVERLlgilreVENEKNdkdkKIAELEsltlrqmkeQNKKVANIKHGQQEMKKKGAQLLEEAR 642
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 450 KELNTLKSmefapsegagtQDSTKPLEVLLLEKNRSLQSENAT-LRISNSDLSgrCAELQIHLTEATAKAVEQKELIARL 528
Cdd:pfam10174 643 RREDNLAD-----------NSQQLQLEELMGALEKTRQELDATkARLSSTQQS--LAEKDGHLTNLRAERRKQLEEILEM 709
|
...
gi 601984296 529 EQD 531
Cdd:pfam10174 710 KQE 712
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
187-629 |
1.88e-03 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 41.81 E-value: 1.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 187 PVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQN 266
Cdd:PLN02939 38 RRRGFSSQQKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQT 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 267 DFAEKErklqetQMSTTsKLEEAEHKLQTLQT---ALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQ 343
Cdd:PLN02939 118 NSKDGE------QLSDF-QLEDLVGMIQNAEKnilLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAA 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 344 RE---AETLREQLSSANHSLqlasqiqkAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASqISQ 420
Cdd:PLN02939 191 QEkihVEILEEQLEKLRNEL--------LIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAELIEVAETEER-VFK 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 421 LEQQLNAKNSTLKQLEEKLkgQADYEEVKKeLNTLKsmefapsegagtQDStkplevlLLEKNRSLQsenatlrisnsdl 500
Cdd:PLN02939 262 LEKERSLLDASLRELESKF--IVAQEDVSK-LSPLQ------------YDC-------WWEKVENLQ------------- 306
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 501 sgrcaelqiHLTEATAKAVEQKELIARLEQDLstiqsiqrpdaegasEQGLEKIPEPIKEATALFYGPSMsssgtlpegq 580
Cdd:PLN02939 307 ---------DLLDRATNQVEKAALVLDQNQDL---------------RDKVDKLEASLKEANVSKFSSYK---------- 352
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 601984296 581 VDSLLSIISSQRERFRTRNQELEAESRMAQHTIQALQSELDSLRADNIK 629
Cdd:PLN02939 353 VELLQQKLKLLEERLQASDHEIHSYIQLYQESIKEFQDTLSKLKEESKK 401
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
118-459 |
2.38e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 41.67 E-value: 2.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 118 NRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAflTVYKRLIDVPDPVPALDVGQQL 197
Cdd:PTZ00121 1087 NRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKA--EEARKAEDAKRVEIARKAEDAR 1164
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 198 EIKVQRLHDIETENQKLRETLE----EYNKEFAEVKNQEVTIKALK----EKIREYEQTLKSQAETIALEKEQKLQN-DF 268
Cdd:PTZ00121 1165 KAEEARKAEDAKKAEAARKAEEvrkaEELRKAEDARKAEAARKAEEerkaEEARKAEDAKKAEAVKKAEEAKKDAEEaKK 1244
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 269 AEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQ--RAEVAQREA 346
Cdd:PTZ00121 1245 AEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEakKADEAKKKA 1324
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 347 ETLREQlssANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLN 426
Cdd:PTZ00121 1325 EEAKKK---ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE 1401
|
330 340 350
....*....|....*....|....*....|...
gi 601984296 427 AKNSTLKQLEEKLKGQADYEEVKKELNTLKSME 459
Cdd:PTZ00121 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKAD 1434
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
289-457 |
2.78e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 40.97 E-value: 2.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 289 AEHKLQTLQTALEKTRTELFDLKTKYDeettAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQL------ 362
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELD----ALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEErreelg 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 363 ----ASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEK 438
Cdd:COG3883 90 erarALYRSGGSVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAA 169
|
170 180
....*....|....*....|
gi 601984296 439 LKG-QADYEEVKKELNTLKS 457
Cdd:COG3883 170 KAElEAQQAEQEALLAQLSA 189
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
204-461 |
2.80e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 41.35 E-value: 2.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 204 LHDIETENQKLRETL----EEYNKEFAEVKNQEVTIKALKEKIREYEQTL-KSQAETIALE-KEQKLQNDFAEKERKLQE 277
Cdd:pfam10174 249 IRDLEDEVQMLKTNGllhtEDREEEIKQMEVYKSHSKFMKNKIDQLKQELsKKESELLALQtKLETLTNQNSDCKQHIEV 328
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 278 TQMSTTSKleeaEHKLQTLQTALEKTRTEL------FDLKTKY----DEETTAKADEIEMIMTDLERANQRAEVAQREAE 347
Cdd:pfam10174 329 LKESLTAK----EQRAAILQTEVDALRLRLeekesfLNKKTKQlqdlTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE 404
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 348 TLREQLSSANhslqlaSQIQKAPDVAIEVLTRSS--------LEVELAAKEREIAQLVEDVQRLQ-------ASLTKLRE 412
Cdd:pfam10174 405 NLQEQLRDKD------KQLAGLKERVKSLQTDSSntdtalttLEEALSEKERIIERLKEQREREDrerleelESLKKENK 478
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 601984296 413 NSASQISQLEQQLNAKNSTLKQLEEKLKGQAdyEEVKKELNTLKSMEFA 461
Cdd:pfam10174 479 DLKEKVSALQPELTEKESSLIDLKEHASSLA--SSGLKKDSKLKSLEIA 525
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
249-451 |
3.16e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 40.92 E-value: 3.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 249 LKSQAETIALEKEQKlqndfAEKerklqetqmsttsKLEEAEHKLQTL-QTALEKTRTELFDLKTKYDEETTAKADEIEm 327
Cdd:PRK12704 25 RKKIAEAKIKEAEEE-----AKR-------------ILEEAKKEAEAIkKEALLEAKEEIHKLRNEFEKELRERRNELQ- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 328 imtDLERANQraevaQREaETLREQLSSANHSlqlasqiqkapdvaievltrsslEVELAAKEREIAQLVEDVQRLQASL 407
Cdd:PRK12704 86 ---KLEKRLL-----QKE-ENLDRKLELLEKR-----------------------EEELEKKEKELEQKQQELEKKEEEL 133
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 601984296 408 TKLREnsaSQISQLEQQLN-----AKNSTLKQLEEKLKGQA-----DYEEVKKE 451
Cdd:PRK12704 134 EELIE---EQLQELERISGltaeeAKEILLEKVEEEARHEAavlikEIEEEAKE 184
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
103-456 |
3.21e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 41.08 E-value: 3.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLI 182
Cdd:COG1196 425 ELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYE 504
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEynkefaevknqeVTIKALKEKIREYEQTLksqAETIALEKEQ 262
Cdd:COG1196 505 GFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEA------------ALAAALQNIVVEDDEVA---AAAIEYLKAA 569
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 263 KLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVA 342
Cdd:COG1196 570 KAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREV 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 343 QREAETLREQLSSANHSLQLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQI---- 418
Cdd:COG1196 650 TLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEAleeq 729
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 601984296 419 ------SQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLK 456
Cdd:COG1196 730 leaereELLEELLEEEELLEEEALEELPEPPDLEELERELERLE 773
|
|
| GAS |
pfam13851 |
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ... |
216-435 |
3.23e-03 |
|
Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.
Pssm-ID: 464001 [Multi-domain] Cd Length: 200 Bit Score: 39.50 E-value: 3.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 216 ETLEEYNKEFAEVKN--QEVT------IKALKEKIREyeqtLKSQAEtialEKEQKLQNDFAEKERkLQEtqmsttsKLE 287
Cdd:pfam13851 1 ELMKNHEKAFNEIKNyyNDITrnnlelIKSLKEEIAE----LKKKEE----RNEKLMSEIQQENKR-LTE-------PLQ 64
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 288 EAEHKLQTLQTAL---EKTRTELfdlktkydEETTAKADEIEMIMTDLERAN----QRAEVAQREAETLREQLSSANHSL 360
Cdd:pfam13851 65 KAQEEVEELRKQLenyEKDKQSL--------KNLKARLKVLEKELKDLKWEHevleQRFEKVERERDELYDKFEAAIQDV 136
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984296 361 QlasqiQKApDVAIEVLTR--SSLEVELAAKEREIAQLvedvqrLQASltKLRENSASQISQ-LEQQLNAKNSTLKQL 435
Cdd:pfam13851 137 Q-----QKT-GLKNLLLEKklQALGETLEKKEAQLNEV------LAAA--NLDPDALQAVTEkLEDVLESKNQLIKDL 200
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
195-452 |
3.68e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 40.72 E-value: 3.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEK--------EQKLQN 266
Cdd:TIGR00618 239 QQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQieqqaqriHTELQS 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 267 DFAEKERKLQETQ--MSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADE--IEMIMTDLERANQRAEVA 342
Cdd:TIGR00618 319 KMRSRAKLLMKRAahVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTqhIHTLQQQKTTLTQKLQSL 398
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 343 QREAETLREQLSSAN--HSLQLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLREnsasqisq 420
Cdd:TIGR00618 399 CKELDILQREQATIDtrTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKE-------- 470
|
250 260 270
....*....|....*....|....*....|..
gi 601984296 421 LEQQLNAKNSTLKQLEEKLKGQADYEEVKKEL 452
Cdd:TIGR00618 471 REQQLQTKEQIHLQETRKKAVVLARLLELQEE 502
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
216-523 |
4.09e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 40.89 E-value: 4.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 216 ETLEEYNKEFAEVKNQEVtIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQT 295
Cdd:PTZ00121 1030 EELTEYGNNDDVLKEKDI-IDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTET 1108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 296 LQTALEKTRtelfdlktkydEETTAKADEIEMImTDLERAN--QRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVA 373
Cdd:PTZ00121 1109 GKAEEARKA-----------EEAKKKAEDARKA-EEARKAEdaRKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAK 1176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 374 IEVLTRSSLEVELAAKEREiaqlVEDVQRLQASLTKLRENSASQISQLEQQLNAKnsTLKQLEEKLKGQADYEEVKKELN 453
Cdd:PTZ00121 1177 KAEAARKAEEVRKAEELRK----AEDARKAEAARKAEEERKAEEARKAEDAKKAE--AVKKAEEAKKDAEEAKKAEEERN 1250
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984296 454 TLKSMEFAPSEGAG---------TQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATAKAVEQKE 523
Cdd:PTZ00121 1251 NEEIRKFEEARMAHfarrqaaikAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
196-531 |
4.10e-03 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 40.80 E-value: 4.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 196 QLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKireyeqtlKSQAETIALEKEQKLQNDFAEKERKL 275
Cdd:TIGR00606 232 QLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSR--------KKQMEKDNSELELKMEKVFQGTDEQL 303
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 276 QETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADeiEMIMTDLERANQRAEVAQREAETLREQLSS 355
Cdd:TIGR00606 304 NDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGR--LQLQADRHQEHIRARDSLIQSLATRLELDG 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 356 ANHSLQLASQIQKAPDVAIEVLTR-----SSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNS 430
Cdd:TIGR00606 382 FERGPFSERQIKNFHTLVIERQEDeaktaAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIK 461
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 431 TLKQLEEKLKgqadyEEVKKELNTLKSMEFAPSEGAGTQDSTKPLEVLlleknrSLQSENATLRISNSDLSGRCAELQIH 510
Cdd:TIGR00606 462 ELQQLEGSSD-----RILELDQELRKAERELSKAEKNSLTETLKKEVK------SLQNEKADLDRKLRKLDQEMEQLNHH 530
|
330 340
....*....|....*....|.
gi 601984296 511 LTEATakaveQKELIARLEQD 531
Cdd:TIGR00606 531 TTTRT-----QMEMLTKDKMD 546
|
|
| Wtap |
pfam17098 |
WTAP/Mum2p family; The Wtap family includes female-lethal(2)D from Drosophila and ... |
380-537 |
4.11e-03 |
|
WTAP/Mum2p family; The Wtap family includes female-lethal(2)D from Drosophila and pre-mRNA-splicing regulator WTAP from mammals. The former is required for female-specific splicing of Sex-lethal RNA, and the latter is a regulatory subunit of the RNA N6-methyladenosine methyltransferase. The family also includes the yeast Mum2p protein which is part of the Mis complex.
Pssm-ID: 465345 [Multi-domain] Cd Length: 155 Bit Score: 38.43 E-value: 4.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 380 SSLEVELAAKEREIAQLVEDVQRLQASLTKlrENSASQISQLEQQLNAKNSTLKQLEEKLKgqADYEEVKKELNTLKsme 459
Cdd:pfam17098 7 NLLLARLAEKEQEIQELKAQLQDLKQSLQP--PSSQLRSLLLDPAVNLEFLRLKKELEEKK--KKLKEAQLELAAWK--- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 460 FAPSEGAGTQdstkplevlLLEKNRSLQSENATLRISNSdlSGRCAELQIHLteATAKAV---------EQKELIARLEQ 530
Cdd:pfam17098 80 FTPDSTTGKR---------LMAKCRLLQQENEELGRQLS--EGRIAKLEIEL--ALQKKVveelkksleELDEFLIELDE 146
|
....*..
gi 601984296 531 DLSTIQS 537
Cdd:pfam17098 147 ELEGLQS 153
|
|
| PLN03229 |
PLN03229 |
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional |
211-578 |
4.56e-03 |
|
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
Pssm-ID: 178768 [Multi-domain] Cd Length: 762 Bit Score: 40.61 E-value: 4.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 211 NQKLRETleeynkEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDF------AEKERKLQETQMSTTS 284
Cdd:PLN03229 420 NMKKREA------VKTPVRELEGEVEKLKEQILKAKESSSKPSELALNEMIEKLKKEIdleyteAVIAMGLQERLENLRE 493
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 285 KLEEAEHKLQTLQTALektrtelfdlktkydeettakADEIEMIMTDLERANQRAevaqreaetlrEQLSSANHSLQLAS 364
Cdd:PLN03229 494 EFSKANSQDQLMHPVL---------------------MEKIEKLKDEFNKRLSRA-----------PNYLSLKYKLDMLN 541
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 365 QIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLeqqlnakNSTLKQLEEKLKGQAD 444
Cdd:PLN03229 542 EFSRAKALSEKKSKAEKLKAEINKKFKEVMDRPEIKEKMEALKAEVASSGASSGDEL-------DDDLKEKVEKMKKEIE 614
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 445 YE--EVKK----ELNTLKSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLqsENAtlrISNSDLSGRCAELQIHLTEATAKA 518
Cdd:PLN03229 615 LElaGVLKsmglEVIGVTKKNKDTAEQTPPPNLQEKIESLNEEINKKI--ERV---IRSSDLKSKIELLKLEVAKASKTP 689
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 601984296 519 -VEQKELIARLEQDlstiqsIQRPDAEGASEQGLEKIPEPIKEATALFYGPSMSSSGTLPE 578
Cdd:PLN03229 690 dVTEKEKIEALEQQ------IKQKIAEALNSSELKEKFEELEAELAAARETAAESNGSLKN 744
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
312-635 |
4.71e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 40.31 E-value: 4.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 312 TKYD---EETTAKADEIEmimTDLERANQ-RAEVAQReAETLREQLSSANHSLQLASQIQKApDVAIEVLTRSSLEVELA 387
Cdd:COG1196 168 SKYKerkEEAERKLEATE---ENLERLEDiLGELERQ-LEPLERQAEKAERYRELKEELKEL-EAELLLLKLRELEAELE 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 388 AKEREIAQLVEDVQRLQASLTKLREnsasQISQLEQQLNAKNSTLKQLeeklkgQADYEEVKKELNTLksmefapsegAG 467
Cdd:COG1196 243 ELEAELEELEAELEELEAELAELEA----ELEELRLELEELELELEEA------QAEEYELLAELARL----------EQ 302
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 468 TQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGRCAELQIHLTEATAKAVEQKELIARLEQDLSTIQSIQRpDAEGAS 547
Cdd:COG1196 303 DIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELA-EAEEEL 381
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 548 EQGLEKIPEPIKEATALfygpsmsssgTLPEGQVDSLLSIISSQRERFRTRNQELEAESRMAQHTIQALQSELDSLRADN 627
Cdd:COG1196 382 EELAEELLEALRAAAEL----------AAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEE 451
|
....*...
gi 601984296 628 IKLFEKIK 635
Cdd:COG1196 452 AELEEEEE 459
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
262-449 |
4.83e-03 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 40.06 E-value: 4.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 262 QKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTA---LEKTRTELFDLKTKYDEETTAKAdeiemimtdleranqR 338
Cdd:PRK11637 50 KSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQAsrkLRETQNTLNQLNKQIDELNASIA---------------K 114
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 339 AEVAQREAET-LREQLSSA----NHS-LQL-----ASQ-----------IQKAPDVAIEVLTRSSleVELAAKEREIaql 396
Cdd:PRK11637 115 LEQQQAAQERlLAAQLDAAfrqgEHTgLQLilsgeESQrgerilayfgyLNQARQETIAELKQTR--EELAAQKAEL--- 189
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 601984296 397 vEDVQRLQASLtkLRENSASQiSQLEQQLNAKNSTLKQLEEKL-KGQADYEEVK 449
Cdd:PRK11637 190 -EEKQSQQKTL--LYEQQAQQ-QKLEQARNERKKTLTGLESSLqKDQQQLSELR 239
|
|
| KpsE |
COG3524 |
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis]; |
320-444 |
4.98e-03 |
|
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442746 [Multi-domain] Cd Length: 370 Bit Score: 39.83 E-value: 4.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 320 AKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQkapdvaievlTRSSLEVELAAKEREIAQLved 399
Cdd:COG3524 174 AREDAVRFAEEEVERAEERLRDAREALLAFRNRNGILDPEATAEALLQ----------LIATLEGQLAELEAELAAL--- 240
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 601984296 400 vqrlqasLTKLRENSAsQISQLEQQLNAKNSTLKQLEEKLKGQAD 444
Cdd:COG3524 241 -------RSYLSPNSP-QVRQLRRRIAALEKQIAAERARLTGASG 277
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
227-395 |
5.01e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 39.95 E-value: 5.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 227 EVKNQEVTIKALKEKIREYEQTLksqaetiALEKEQKLQndfaekerkLQETQMSTTSKLEEAEH---KLQTLQTALEKT 303
Cdd:PRK09039 47 EISGKDSALDRLNSQIAELADLL-------SLERQGNQD---------LQDSVANLRASLSAAEAersRLQALLAELAGA 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 304 RTELfdlkTKYDEETTAKADEIEMIMtdlERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVAIEVLTRsSLE 383
Cdd:PRK09039 111 GAAA----EGRAGELAQELDSEKQVS---ARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGR-RLN 182
|
170
....*....|..
gi 601984296 384 VELAAKEREIAQ 395
Cdd:PRK09039 183 VALAQRVQELNR 194
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
195-637 |
5.50e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 40.00 E-value: 5.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 195 QQLEIKVQRL-HDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIalekeQKLQND--FAEK 271
Cdd:TIGR04523 36 KQLEKKLKTIkNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKI-----NKLNSDlsKINS 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 272 ERKLQETQMSTT----SKLE----EAEHKLQTLQTALEKTRTELFDLKTKYDEETTakadEIEMIMTDLERANQRAEVAQ 343
Cdd:TIGR04523 111 EIKNDKEQKNKLevelNKLEkqkkENKKNIDKFLTEIKKKEKELEKLNNKYNDLKK----QKEELENELNLLEKEKLNIQ 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 344 REAETLREQLSSANHSL-QLASQIQKAPDVAIEVL----TRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQI 418
Cdd:TIGR04523 187 KNIDKIKNKLLKLELLLsNLKKKIQKNKSLESQISelkkQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIK 266
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 419 SQLE---QQLNAKNSTLKQLEEKLKgqadyeEVKKELNTLKSMEfapsegagTQDSTKPLEVLLLEKNRSLQSENATLRI 495
Cdd:TIGR04523 267 KQLSekqKELEQNNKKIKELEKQLN------QLKSEISDLNNQK--------EQDWNKELKSELKNQEKKLEEIQNQISQ 332
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 496 SNSDLSGRCAELQIHLTEATAKAVEQKELIARLEQDLSTIQSIQRPDAEgaSEQGLEKIpepikeatalfygpsmsssgt 575
Cdd:TIGR04523 333 NNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQS--YKQEIKNL--------------------- 389
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 601984296 576 lpEGQVDSLLSIISSQRErfrtRNQELEAESRMAQHTIQALQSELDSLRADNIKLFEKIKFL 637
Cdd:TIGR04523 390 --ESQINDLESKIQNQEK----LNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDL 445
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
113-356 |
5.84e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 39.88 E-value: 5.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 113 ATVLANRQDESEQSRKRLIeQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTvykrlidvpdpvpALD 192
Cdd:pfam07888 29 AELLQNRLEECLQERAELL-QAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQ-------------SRE 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 193 VGQQLEIKVQRLHDIETENQKLRETLEEYNKE-FAEVKNQEVTIKALKEKIREYEQTLKSQAETIalEKEQKLQNDFAEK 271
Cdd:pfam07888 95 KHEELEEKYKELSASSEELSEEKDALLAQRAAhEARIRELEEDIKTLTQRVLERETELERMKERA--KKAGAQRKEEEAE 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 272 ERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLK----TKYDEETTA--KADEIEMIMTDLERANQRAEVAQRE 345
Cdd:pfam07888 173 RKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQdtitTLTQKLTTAhrKEAENEALLEELRSLQERLNASERK 252
|
250
....*....|.
gi 601984296 346 AETLREQLSSA 356
Cdd:pfam07888 253 VEGLGEELSSM 263
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
200-452 |
7.97e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 39.72 E-value: 7.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 200 KVQRLHDIETENQKLRETLEEYNKEFaevKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQ 279
Cdd:pfam17380 235 KMERRKESFNLAEDVTTMTPEYTVRY---NGQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEKMEQERLRQEKEEKARE 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 280 MSTTSKLEEAEHKLQTL------------QTALEKTRT-ELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREA 346
Cdd:pfam17380 312 VERRRKLEEAEKARQAEmdrqaaiyaeqeRMAMERERElERIRQEERKRELERIRQEEIAMEISRMRELERLQMERQQKN 391
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 347 ETLREQLSSANHSLQLASQIQKAPDVAIEVLTRSSLEVElAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLN 426
Cdd:pfam17380 392 ERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQE-EARQREVRRLEEERAREMERVRLEEQERQQQVERLRQQEE 470
|
250 260
....*....|....*....|....*.
gi 601984296 427 AKNSTLKQLEEKLKGQADYEEVKKEL 452
Cdd:pfam17380 471 ERKRKKLELEKEKRDRKRAEEQRRKI 496
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
257-420 |
8.44e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 38.75 E-value: 8.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 257 ALEKEQKLQNDFAEKERKLQETQMsttsKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEettaKADEIEMIMTDLERAN 336
Cdd:COG1579 8 ALLDLQELDSELDRLEHRLKELPA----ELAELEDELAALEARLEAAKTELEDLEKEIKR----LELEIEEVEARIKKYE 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 337 QRAEVA--QREAETLREQLSSANHSLQLASQIQKAPDVAIEVLT---------RSSLEVELAAKEREIAQLVEDVQRLQA 405
Cdd:COG1579 80 EQLGNVrnNKEYEALQKEIESLKRRISDLEDEILELMERIEELEeelaeleaeLAELEAELEEKKAELDEELAELEAELE 159
|
170
....*....|....*
gi 601984296 406 SLTKLRENSASQISQ 420
Cdd:COG1579 160 ELEAEREELAAKIPP 174
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
157-441 |
8.62e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 39.56 E-value: 8.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 157 QGEIDALSKRSKEAEAAFLTVYKRLIDVpdpVPALDVGQ----QLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQE 232
Cdd:PRK04863 375 DEQQEENEARAEAAEEEVDELKSQLADY---QQALDVQQtraiQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKE 451
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 233 vtiKALKEKIREYEQTLkSQAETIALEKEQKLQ---------------NDFAEKERKLQETQMSTTSkLEEAEHKLQTLQ 297
Cdd:PRK04863 452 ---QEATEELLSLEQKL-SVAQAAHSQFEQAYQlvrkiagevsrseawDVARELLRRLREQRHLAEQ-LQQLRMRLSELE 526
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 298 TALEKTRTeLFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSL-QLASQIQK----APD- 371
Cdd:PRK04863 527 QRLRQQQR-AERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLeQLQARIQRlaarAPAw 605
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984296 372 -VAIEVLTRSSLEVELAAKERE-IAQLVEDVQRLQASLTKLRENSASQISQLEQQLN-------AKNSTLKQLEEKLKG 441
Cdd:PRK04863 606 lAAQDALARLREQSGEEFEDSQdVTEYMQQLLERERELTVERDELAARKQALDEEIErlsqpggSEDPRLNALAERFGG 684
|
|
| PTZ00108 |
PTZ00108 |
DNA topoisomerase 2-like protein; Provisional |
203-499 |
9.13e-03 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 39.64 E-value: 9.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 203 RLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALkEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQmST 282
Cdd:PTZ00108 1047 RFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEEL-GAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLK-NT 1124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 283 TSK---LEEaehkLQTLQTALEKTRtelfdlktKYDEETTAKADEIEMIMTDLERANQRAEvAQREAETLREqlSSANHS 359
Cdd:PTZ00108 1125 TPKdmwLED----LDKFEEALEEQE--------EVEEKEIAKEQRLKSKTKGKASKLRKPK-LKKKEKKKKK--SSADKS 1189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 360 LQLASQIQKAPDVAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKL 439
Cdd:PTZ00108 1190 KKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEG 1269
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984296 440 KGQADYEE---VKKELNTLKSMEFAPSEGAGTQDST------KPLEVLLLEKNRSLQSENATLRISNSD 499
Cdd:PTZ00108 1270 KPKNAPKRvsaVQYSPPPPSKRPDGESNGGSKPSSPtkkkvkKRLEGSLAALKKKKKSEKKTARKKKSK 1338
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
382-456 |
9.36e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 39.51 E-value: 9.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 382 LEVELAAKEREIAQLVEDVQRLQASLTKLRENS-------------------ASQISQLEQQ---LNAKNSTLKQLEEKL 439
Cdd:COG4913 615 LEAELAELEEELAEAEERLEALEAELDALQERRealqrlaeyswdeidvasaEREIAELEAElerLDASSDDLAALEEQL 694
|
90
....*....|....*...
gi 601984296 440 KG-QADYEEVKKELNTLK 456
Cdd:COG4913 695 EElEAELEELEEELDELK 712
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
117-317 |
9.82e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 39.67 E-value: 9.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 117 ANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPL---LKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPALDv 193
Cdd:TIGR02169 817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIekeIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE- 895
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984296 194 gQQLEIKVQRLHDIETENQKLRETLEEynkefaevknQEVTIKALKEKIREYEQTLKS----QAETIALEKEQKLQNDFA 269
Cdd:TIGR02169 896 -AQLRELERKIEELEAQIEKKRKRLSE----------LKAKLEALEEELSEIEDPKGEdeeiPEEELSLEDVQAELQRVE 964
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 601984296 270 EKERKLQETQMSTTSKLEEAEHKLQTLQ---TALEKTRTELFDLKTKYDEE 317
Cdd:TIGR02169 965 EEIRALEPVNMLAIQEYEEVLKRLDELKekrAKLEEERKAILERIEEYEKK 1015
|
|
|