|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
43-199 |
1.56e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.57 E-value: 1.56e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVP-SPSPGLRDQASSPFPKTAAPtaQAPRTGPPRTTVRKTGATTpSAGSPEIIPPLRTSAQPAATPFPALDL--- 118
Cdd:PHA03247 2745 PAGPATPgGPARPARPPTTAGPPAPAPP--AAPAAGPPRRLTRPAVASL-SESRESLPSPWDPADPPAAVLAPAAALppa 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 119 -SPATPSE--DGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGvPPTAPVTEAPTS 195
Cdd:PHA03247 2822 aSPAGPLPppTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR-PAVSRSTESFAL 2900
|
....
gi 111598761 196 PPPE 199
Cdd:PHA03247 2901 PPDQ 2904
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
251-295 |
6.48e-09 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 51.97 E-value: 6.48e-09
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 111598761 251 PCHCSPHGAVSILCNS-SGNCQCKVGVTGSMCDKCQDGHYGFGKTG 295
Cdd:cd00055 1 PCDCNGHGSLSGQCDPgTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
202-245 |
1.11e-08 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 51.20 E-value: 1.11e-08
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 111598761 202 CNCSEVGSLDvKRCNQTTGQCDCHVGYQGLHCDTCKEGFYLNHT 245
Cdd:pfam00053 1 CDCNPHGSLS-DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPS 43
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
49-198 |
2.25e-08 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 57.47 E-value: 2.25e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPS---------AGSPEIIPPLRTSAQPAATPFPALDLS 119
Cdd:pfam03154 197 AGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQrlpsphpplQPMTQPPPPSQVSPQPLPQPSLHGQMP 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 120 PATPSEDGHTPTTESP-PSRPAPTTLASTVGQ-PPTTSVVTTAQASSTPGTPtaesPDRSSNSSGVPP------TAPVTE 191
Cdd:pfam03154 277 PMPHSLQTGPSHMQHPvPPQPFPLTPQSSQSQvPPGPSPAAPGQSQQRIHTP----PSQSQLQSQQPPreqplpPAPLSM 352
|
....*..
gi 111598761 192 APTSPPP 198
Cdd:pfam03154 353 PHIKPPP 359
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
252-299 |
2.62e-08 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 50.04 E-value: 2.62e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 111598761 252 CHCSPHGAVSILCNSS-GNCQCKVGVTGSMCDKCQDGHYGFGKTGCLPC 299
Cdd:pfam00053 1 CDCNPHGSLSDTCDPEtGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
252-296 |
3.44e-08 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 49.62 E-value: 3.44e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 111598761 252 CHCSPHGAVSILCNS-SGNCQCKVGVTGSMCDKCQDGHYGFGKTGC 296
Cdd:smart00180 1 CDCDPGGSASGTCDPdTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
202-247 |
5.99e-08 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 49.27 E-value: 5.99e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 111598761 202 CNCSEVGSLDvKRCNQTTGQCDCHVGYQGLHCDTCKEGFYLNHTVG 247
Cdd:cd00055 2 CDCNGHGSLS-GQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
202-241 |
1.03e-07 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 48.46 E-value: 1.03e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 111598761 202 CNCSEVGSLDvKRCNQTTGQCDCHVGYQGLHCDTCKEGFY 241
Cdd:smart00180 1 CDCDPGGSAS-GTCDPDTGQCECKPNVTGRRCDRCAPGYY 39
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
298-345 |
3.82e-07 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 46.96 E-value: 3.82e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 111598761 298 PCQCNNR---SDSCDVHTGACLnCQENSKGEHCEECKEGFYPSPDAAKQCH 345
Cdd:cd00055 1 PCDCNGHgslSGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
299-335 |
1.11e-06 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 45.81 E-value: 1.11e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 111598761 299 CQCNNR---SDSCDVHTGACLnCQENSKGEHCEECKEGFY 335
Cdd:pfam00053 1 CDCNPHgslSDTCDPETGQCL-CKPGVTGRHCDRCKPGYY 39
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
37-198 |
5.59e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 5.59e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 37 VTGGGGAEGQVVPSPSPGlrdqASSPFPKTAAPTAQAPRTGPPRTTVrKTGATTPSAGSPEIIPPLRTSAQPAATPFPAL 116
Cdd:COG3469 55 GSAGSGTGTTAASSTAAT----SSTTSTTATATAAAAAATSTSATLV-ATSTASGANTGTSTVTTTSTGAGSVTSTTSST 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 117 DLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPdrsSNSSGVPPTAPVTEAPTSP 196
Cdd:COG3469 130 AGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA---TTASGATTPSATTTATTTG 206
|
..
gi 111598761 197 PP 198
Cdd:COG3469 207 PP 208
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
398-442 |
8.46e-06 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 43.11 E-value: 8.46e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 111598761 398 CECHGHVDPIKTpkiCKPESGECInCLHNTTGFWCEKCLEGYVRD 442
Cdd:pfam00053 1 CDCNPHGSLSDT---CDPETGQCL-CKPGVTGRHCDRCKPGYYGL 41
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
299-335 |
1.64e-05 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 42.30 E-value: 1.64e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 111598761 299 CQCN---NRSDSCDVHTGACLnCQENSKGEHCEECKEGFY 335
Cdd:smart00180 1 CDCDpggSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYY 39
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
398-442 |
7.98e-05 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 40.42 E-value: 7.98e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 111598761 398 CECHGHVDPiktPKICKPESGECInCLHNTTGFWCEKCLEGYVRD 442
Cdd:cd00055 2 CDCNGHGSL---SGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGL 42
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
398-447 |
2.55e-04 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 38.83 E-value: 2.55e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 111598761 398 CECH--GHVDPIktpkiCKPESGECInCLHNTTGFWCEKCLEGYVRDLQRNC 447
Cdd:smart00180 1 CDCDpgGSASGT-----CDPDTGQCE-CKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| rad23 |
TIGR00601 |
UV excision repair protein Rad23; All proteins in this family for which functions are known ... |
83-165 |
3.06e-04 |
|
UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273167 [Multi-domain] Cd Length: 378 Bit Score: 43.35 E-value: 3.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 83 VRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPS--EDGhTPTTESPPSrPAPTTLASTV---GQPPTTSVV 157
Cdd:TIGR00601 74 VSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASavEEK-SPSEESATA-TAPESPSTSVpssGSDAASTLV 151
|
....*...
gi 111598761 158 TTAQASST 165
Cdd:TIGR00601 152 VGSERETT 159
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ... |
41-184 |
5.27e-04 |
|
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 42.45 E-value: 5.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 41 GGAEGQVVPSPSPglrdqASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSP 120
Cdd:NF040712 193 GRPLRPLATVPRL-----AREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPA 267
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 121 ATPSEDGHTPTTESPPSRPAPTTLAST---VGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVP 184
Cdd:NF040712 268 AEPDEATRDAGEPPAPGAAETPEAAEPpapAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVP 334
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
50-199 |
7.27e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 42.45 E-value: 7.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 50 SPSPGLRDQASSPFPKTAA-PTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRT-----SAQPAaTPFPALDLSPATP 123
Cdd:NF033839 316 TPKPEVKPQLEKPKPEVKPqPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEKpkpevKPQPE-TPKPEVKPQPEKP 394
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 124 SEDGH-TPTTESPPSRPAPTTLASTVG---QPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:NF033839 395 KPEVKpQPEKPKPEVKPQPEKPKPEVKpqpEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPE 474
|
|
| KREPA2 |
cd23959 |
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ... |
61-202 |
1.12e-03 |
|
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.
Pssm-ID: 467780 [Multi-domain] Cd Length: 424 Bit Score: 41.78 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 61 SPFPKTAAPTAQAPRTGPPRTTVRKTgATTPSAGSPEIipPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPA 140
Cdd:cd23959 117 NPFSASSSTQRETHKTAQVAPPKAEP-QTAPVTPFGQL--PMFGQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSA 193
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 111598761 141 PTTLASTVGQPPTTSVVTTAQASSTPGTPTAESpdrSSNSSGVPPTAPvteaPTSPPPEHMC 202
Cdd:cd23959 194 SPFATATDTAPSSGAPDGFPAEASAPSPFAAPA---SAASFPAAPVAN----GEAATPTHAC 248
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
346-399 |
1.83e-03 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 36.56 E-value: 1.83e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 111598761 346 RCPCSAV-TSTGNCTIESGEleptCdQCKDGYTGQNCNKCENGYYNSDSICTQCE 399
Cdd:cd00055 1 PCDCNGHgSLSGQCDPGTGQ----C-ECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ... |
49-197 |
1.85e-03 |
|
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 40.91 E-value: 1.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRK--TGATTPSAGSPEIIPPLRTSAQPAATPfpaldlsPATPSED 126
Cdd:NF040712 190 PDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGApaTDSDPAEAGTPDDLASARRRRAGVEQP-------EDEPVGP 262
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 127 GHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASStPGTPTAEspdrssnsSGVPPTAPVTEAPTSPP 197
Cdd:NF040712 263 GAAPAAEPDEATRDAGEPPAPGAAETPEAAEPPAPAPA-APAAPAA--------PEAEEPARPEPPPAPKP 324
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
347-392 |
5.63e-03 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 34.98 E-value: 5.63e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 111598761 347 CPCSAV-TSTGNCTIESGEleptCdQCKDGYTGQNCNKCENGYYNSD 392
Cdd:smart00180 1 CDCDPGgSASGTCDPDTGQ----C-ECKPNVTGRRCDRCAPGYYGDG 42
|
|
|
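Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the Laminin_EGF description states that the family has 8 conserved cysteines. The following is a minimal sketch, not an NCBI tool, of how those two pieces of text could be checked programmatically. The names SUMMARY_RE, parse_summary, and count_cysteines are hypothetical helpers introduced only for this illustration, and the regular expression assumes the summary line looks exactly like the ones shown in this listing.

```python
import re

# Assumed layout, taken from the summary lines in this listing:
#   Pssm-ID: <int> [optional flag] Cd Length: <int> Bit Score: <float> E-value: <float>
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one hit summary line as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def count_cysteines(aligned_row):
    """Count cysteine residues in one alignment row.

    Both upper- and lower-case letters are counted; gap characters are ignored
    simply because they are not the letter C.
    """
    return aligned_row.upper().count("C")


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 395007 Cd Length: 49 Bit Score: 50.04 E-value: 2.62e-08"
    )
    print(hit)
    # Cdd:pfam00053 row of the 252-299 hit, positions 1-49 of the model.
    print(count_cysteines("CDCNPHGSLSDTCDPEtGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC"))
```

Applied to the full-length pfam00053 model row (positions 1-49), the count agrees with the 8 cysteines named in the family description. Shorter hits that cover only part of the model will naturally show fewer.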
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
43-199 |
1.56e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.57 E-value: 1.56e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVP-SPSPGLRDQASSPFPKTAAPtaQAPRTGPPRTTVRKTGATTpSAGSPEIIPPLRTSAQPAATPFPALDL--- 118
Cdd:PHA03247 2745 PAGPATPgGPARPARPPTTAGPPAPAPP--AAPAAGPPRRLTRPAVASL-SESRESLPSPWDPADPPAAVLAPAAALppa 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 119 -SPATPSE--DGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGvPPTAPVTEAPTS 195
Cdd:PHA03247 2822 aSPAGPLPppTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR-PAVSRSTESFAL 2900
|
....
gi 111598761 196 PPPE 199
Cdd:PHA03247 2901 PPDQ 2904
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
48-198 |
1.44e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 1.44e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 48 VPSPSPGLRDQASSPfPKTAAPTAQAPRTGP-----PRTTVRKTGATTPSAGSPEIIPPlrTSAQPAATPFPALDLSPAT 122
Cdd:PHA03247 2719 TPLPPGPAAARQASP-ALPAAPAPPAVPAGPatpggPARPARPPTTAGPPAPAPPAAPA--AGPPRRLTRPAVASLSESR 2795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 123 -----PSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAqasstPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPP 197
Cdd:PHA03247 2796 eslpsPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA-----PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
|
.
gi 111598761 198 P 198
Cdd:PHA03247 2871 P 2871
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
47-198 |
1.87e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 1.87e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 47 VVPSPSPGLRDQAS----SPFPKTAAPTAQAPRTGPPRTT------------VRKTGATTPSAGSPEII--PPLRTSAQP 108
Cdd:PHA03247 2810 AVLAPAAALPPAASpagpLPPPTSAQPTAPPPPPGPPPPSlplggsvapggdVRRRPPSRSPAAKPAAParPPVRRLARP 2889
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 109 AAT----PFPALDLSPA---TPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSS 181
Cdd:PHA03247 2890 AVSrsteSFALPPDQPErppQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVP 2969
|
170 180
....*....|....*....|....*
gi 111598761 182 G--------VPPTAPVTEAPTSPPP 198
Cdd:PHA03247 2970 GrvavprfrVPQPAPSREAPASSTP 2994
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
44-198 |
2.13e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.72 E-value: 2.13e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 44 EGQVVPSPSPGLRDQASSP---------FPKTAAPTAQAPRTGPPRTTVRKTGATT-------PSAGSPEIIPPLRTSAQ 107
Cdd:PHA03247 2640 PHPPPTVPPPERPRDDPAPgrvsrprraRRLGRAAQASSPPQRPRRRAARPTVGSLtsladppPPPPTPEPAPHALVSAT 2719
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 108 PAAT-------PFPALDLSPATPS--EDGHTPTTESPPSRP-------APTTLASTVGQPPTTSVVtTAQASSTPGTPTA 171
Cdd:PHA03247 2720 PLPPgpaaarqASPALPAAPAPPAvpAGPATPGGPARPARPpttagppAPAPPAAPAAGPPRRLTR-PAVASLSESRESL 2798
|
170 180 190
....*....|....*....|....*....|...
gi 111598761 172 ESP-DRSSNSSGVPPTAPV-----TEAPTSPPP 198
Cdd:PHA03247 2799 PSPwDPADPPAAVLAPAAAlppaaSPAGPLPPP 2831
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
49-199 |
2.85e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.34 E-value: 2.85e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGlRDQASSPFPKTAAPTA-----------QAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALD 117
Cdd:PHA03247 2767 PAPAPP-AAPAAGPPRRLTRPAVaslsesreslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 118 LSPATPSEDGHTP---TTESPPSRPAPTTLAsTVGQPPTTSVvtTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPT 194
Cdd:PHA03247 2846 PPPSLPLGGSVAPggdVRRRPPSRSPAAKPA-APARPPVRRL--ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
|
....*
gi 111598761 195 SPPPE 199
Cdd:PHA03247 2923 PPPPP 2927
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
37-197 |
4.10e-09 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 59.48 E-value: 4.10e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 37 VTGGGGAEGQVVPSPSPGLRdqASSPFPKTAAPTAQAPRTGPPrtTVRKTGATTPSAGSPeiIPPLRTSAqPAATPFPAL 116
Cdd:PRK07003 362 VTGGGAPGGGVPARVAGAVP--APGARAAAAVGASAVPAVTAV--TGAAGAALAPKAAAA--AAATRAEA-PPAAPAPPA 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 117 DLSPATPSEDGHTPTTESPPSR-PAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAE-SPDRSSNSSGVPPTAPVTEAPT 194
Cdd:PRK07003 435 TADRGDDAADGDAPVPAKANARaSADSRCDERDAQPPADSGSASAPASDAPPDAAFEpAPRAAAPSAATPAAVPDARAPA 514
|
...
gi 111598761 195 SPP 197
Cdd:PRK07003 515 AAS 517
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
251-295 |
6.48e-09 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 51.97 E-value: 6.48e-09
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 111598761 251 PCHCSPHGAVSILCNS-SGNCQCKVGVTGSMCDKCQDGHYGFGKTG 295
Cdd:cd00055 1 PCDCNGHGSLSGQCDPgTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
202-245 |
1.11e-08 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 51.20 E-value: 1.11e-08
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 111598761 202 CNCSEVGSLDvKRCNQTTGQCDCHVGYQGLHCDTCKEGFYLNHT 245
Cdd:pfam00053 1 CDCNPHGSLS-DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPS 43
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
53-201 |
1.55e-08 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 57.58 E-value: 1.55e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 53 PGLRDQASSPFPKTAAPTAQ-APRTGPPRTTV-RKTGATTPSAGSPEIIPPLRTSAQPAATPFPALD-LSPATPSEDGHT 129
Cdd:PRK12323 365 PGQSGGGAGPATAAAAPVAQpAPAAAAPAAAApAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEaLAAARQASARGP 444
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 111598761 130 PTTESPPSRPAPTTLASTvgQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPEHM 201
Cdd:PRK12323 445 GGAPAPAPAPAAAPAAAA--RPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQP 514
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
41-198 |
1.74e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 58.03 E-value: 1.74e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 41 GGAEGQVVPSPSP--------------------GLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAgSPEiiP 100
Cdd:PHA03247 2606 GDPRGPAPPSPLPpdthapdppppspspaanepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS-PPQ--R 2682
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 101 PLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNS 180
Cdd:PHA03247 2683 PRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT 2762
|
170
....*....|....*...
gi 111598761 181 SGVPPTAPVTEAPTSPPP 198
Cdd:PHA03247 2763 TAGPPAPAPPAAPAAGPP 2780
|
|
| PHA03381 |
PHA03381 |
tegument protein VP22; Provisional |
49-196 |
2.20e-08 |
|
tegument protein VP22; Provisional
Pssm-ID: 177618 [Multi-domain] Cd Length: 290 Bit Score: 55.79 E-value: 2.20e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPT---AQAPRTGPPRTTVRK-----TGATTPSAGSPEIIPPLRTSAQPAAT-PFPALDLS 119
Cdd:PHA03381 30 ASPARVSFEEPADRARRGAGQArgrSQAERRFHHYDEARAdypyyTGSSSEDERPADPRPSRRPHAQPEASgPGPARGAR 109
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 120 PATPSEdGHTPTTESPPSRPAPTTLASTvgqPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSP 196
Cdd:PHA03381 110 GPAGSR-GRGRRAESPSPRDPPNPKGAS---APRGRKSACADSAALLDAPAPAAPKRQKTPAGLARKLHFSTAPTSP 182
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
49-198 |
2.25e-08 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 57.47 E-value: 2.25e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPS---------AGSPEIIPPLRTSAQPAATPFPALDLS 119
Cdd:pfam03154 197 AGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQrlpsphpplQPMTQPPPPSQVSPQPLPQPSLHGQMP 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 120 PATPSEDGHTPTTESP-PSRPAPTTLASTVGQ-PPTTSVVTTAQASSTPGTPtaesPDRSSNSSGVPP------TAPVTE 191
Cdd:pfam03154 277 PMPHSLQTGPSHMQHPvPPQPFPLTPQSSQSQvPPGPSPAAPGQSQQRIHTP----PSQSQLQSQQPPreqplpPAPLSM 352
|
....*..
gi 111598761 192 APTSPPP 198
Cdd:pfam03154 353 PHIKPPP 359
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
252-299 |
2.62e-08 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 50.04 E-value: 2.62e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 111598761 252 CHCSPHGAVSILCNSS-GNCQCKVGVTGSMCDKCQDGHYGFGKTGCLPC 299
Cdd:pfam00053 1 CDCNPHGSLSDTCDPEtGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
252-296 |
3.44e-08 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 49.62 E-value: 3.44e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 111598761 252 CHCSPHGAVSILCNS-SGNCQCKVGVTGSMCDKCQDGHYGFGKTGC 296
Cdd:smart00180 1 CDCDPGGSASGTCDPdTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
49-198 |
5.78e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 5.78e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPfPKTAAPTAQAPRTGPPRTTVrkTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEdGH 128
Cdd:PHA03247 2704 PPPTPEPAPHALVS-ATPLPPGPAAARQASPALPA--APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA-GP 2779
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 129 TPTTESPPSRPAPTTLAStVGQPPTTSVVTTAQASSTPGTPTAESPdrssnSSGVPPtaPVTEAPTSPPP 198
Cdd:PHA03247 2780 PRRLTRPAVASLSESRES-LPSPWDPADPPAAVLAPAAALPPAASP-----AGPLPP--PTSAQPTAPPP 2841
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
202-247 |
5.99e-08 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 49.27 E-value: 5.99e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 111598761 202 CNCSEVGSLDvKRCNQTTGQCDCHVGYQGLHCDTCKEGFYLNHTVG 247
Cdd:cd00055 2 CDCNGHGSLS-GQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
49-198 |
6.48e-08 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 55.65 E-value: 6.48e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTV-RKTGATTPSAGSPEIIPPLRTSAQPAAT-PFPALDLSPATPSED 126
Cdd:PRK12323 415 AARAVAAAPARRSPAPEALAAARQASARGPGGAPApAPAPAAAPAAAARPAAAGPRPVAAAAAAaPARAAPAAAPAPADD 494
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 111598761 127 GHTPTTESPPSRPAPTTLASTVGQPPttsvvttAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPP 198
Cdd:PRK12323 495 DPPPWEELPPEFASPAPAQPDAAPAG-------WVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEP 559
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
202-241 |
1.03e-07 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 48.46 E-value: 1.03e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 111598761 202 CNCSEVGSLDvKRCNQTTGQCDCHVGYQGLHCDTCKEGFY 241
Cdd:smart00180 1 CDCDPGGSAS-GTCDPDTGQCECKPNVTGRRCDRCAPGYY 39
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
38-199 |
1.25e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 54.99 E-value: 1.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 38 TGGGGAEGQVVPSPSPGLR----------DQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQ 107
Cdd:PRK07764 590 PAPGAAGGEGPPAPASSGPpeeaarpaapAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 108 PAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPttsvvtTAQASSTPGTPTAESPDRSSNSSGVP-PT 186
Cdd:PRK07764 670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA------DDPAAQPPQAAQGASAPSPAADDPVPlPP 743
|
170
....*....|...
gi 111598761 187 APVTEAPTSPPPE 199
Cdd:PRK07764 744 EPDDPPDPAGAPA 756
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
43-199 |
1.45e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.94 E-value: 1.45e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVPSPSPglrdqasSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPAT 122
Cdd:PHA03247 2563 APDRSVPPPRP-------APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA 2635
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 123 PSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPgtptaesPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-------PQRPRRRAARPTVGSLTSLADPPPPP 2705
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
48-197 |
1.76e-07 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 54.41 E-value: 1.76e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 48 VPSPSPGLRDQAS--------SPFPKTAAPTAQAPRTGPP-RTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDL 118
Cdd:PHA03307 70 GPPPGPGTEAPANesrstptwSLSTLAPASPAREGSPTPPgPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPA 149
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 111598761 119 SPATPSEDGHTPTTESPPSRPAPTTLASTVGQppttsvvtTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPP 197
Cdd:PHA03307 150 ASPPAAGASPAAVASDAASSRQAALPLSSPEE--------TARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSP 220
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
42-214 |
1.78e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 1.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 42 GAEGQVVPSPSPglRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPsagspeiiPPLRT--SAQPAATPFPAldls 119
Cdd:PHA03247 332 GAMEVVSPLPRP--RQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRAS--------LPTRKrrSARHAATPFAR---- 397
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 120 pATPSEDGHTPTTESPPSRPAPTTLASTVGQPPttsvvttaqassTPGTPTAeSPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:PHA03247 398 -GPGGDDQTRPAAPVPASVPTPAPTPVPASAPP------------PPATPLP-SAEPGSDDGPAPPPERQPPAPATEPAP 463
|
170
....*....|....*
gi 111598761 200 HMCNCSEVGSLDVKR 214
Cdd:PHA03247 464 DDPDDATRKALDALR 478
|
|
| TALPID3 |
pfam15324 |
Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for ... |
64-200 |
1.91e-07 |
|
Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for Hedgehog signalling. Mutations in this gene noticed first in chickens lead to multiple abnormalities of development.
Pssm-ID: 434634 [Multi-domain] Cd Length: 1288 Bit Score: 54.51 E-value: 1.91e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 64 PKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPfpalDLSPAtPSEDGHTPTTESPPSRPAPTT 143
Cdd:pfam15324 964 QREPPVAASVPGDLPTKETLLPTPVPTPQPTPPCSPPSPLKEPSPVKTP----DSSPC-VSEHDFFPVKEIPPEKGADTG 1038
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 144 LASTVGQPPTTsvvttaqasstpgTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPEH 200
Cdd:pfam15324 1039 PAVSLVITPTV-------------TPIATPPPAATPTPPLSENSIDKLKSPSPELPK 1082
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
39-197 |
2.05e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 54.11 E-value: 2.05e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 39 GGGGAEGQVVPSPSPGLRDQASSPFPKTAAPTAQ--APRTGPPRTTVRKTGATTPSAGSP--EIIPPLRTS--------- 105
Cdd:PRK12323 369 GGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPpaAPAAAPAAAAAARAVAAAPARRSPapEALAAARQAsargpggap 448
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 106 ---AQPAATPFPAldLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQ--------PPTTSVVTTAQASSTPGTPTAESP 174
Cdd:PRK12323 449 apaPAPAAAPAAA--ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDdpppweelPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180
....*....|....*....|...
gi 111598761 175 DRSSNSSGVPPTAPVTEAPTSPP 197
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAAAP 549
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
46-197 |
2.16e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 2.16e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 46 QVVPSPS-PGLRDQASSPF--PKTAAPTA------QAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPAL 116
Cdd:PHA03247 2572 RPAPRPSePAVTSRARRPDapPQSARPRApvddrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPER 2651
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 117 DLSPATP---------SEDGHTPTTESPPSRPAPTTLASTVGQ-------PPTTSVVTTAQASSTPGTPTAESPDRSSNS 180
Cdd:PHA03247 2652 PRDDPAPgrvsrprraRRLGRAAQASSPPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQA 2731
|
170
....*....|....*..
gi 111598761 181 SGVPPTAPVTEAPTSPP 197
Cdd:PHA03247 2732 SPALPAAPAPPAVPAGP 2748
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
64-199 |
2.30e-07 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 53.95 E-value: 2.30e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 64 PKTAAPTAQAPRTGPPrttvrktgaTTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSedghtPTTESPPSRPAPTT 143
Cdd:PRK14951 366 PAAAAEAAAPAEKKTP---------ARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAP-----PAAAPPAPVAAPAA 431
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 111598761 144 LASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:PRK14951 432 AAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAAR 487
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
49-198 |
2.40e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 2.40e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTG-PPRTTVRKTGATTPSagsPeiipPLRTSAQPAATPFPALDLSPATPSedG 127
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPlPPPTSAQPTAPPPPP---G----PPPPSLPLGGSVAPGGDVRRRPPS--R 2869
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 128 HTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSsnssgvPPTAPVTEAPTSPPP 198
Cdd:PHA03247 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQP------QPQPPPPPQPQPPPP 2934
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
39-169 |
2.41e-07 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 53.92 E-value: 2.41e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 39 GGGGAEGQVVPSPSPGlrDQASSPFPKTAAPTAQAPRTGpprttvrktgaTTPSAGSPeiipplrTSAQPAATPFPAL-- 116
Cdd:PRK14959 377 GASAPSGSAAEGPASG--GAATIPTPGTQGPQGTAPAAG-----------MTPSSAAP-------ATPAPSAAPSPRVpw 436
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 111598761 117 DLSPATPSEDGHTP-----TTESPPSRPAPTTLASTVGQPPTTSvVTTAQASSTPGTP 169
Cdd:PRK14959 437 DDAPPAPPRSGIPPrpaprMPEASPVPGAPDSVASASDAPPTLG-DPSDTAEHTPSGP 493
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
36-199 |
3.11e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.45 E-value: 3.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 36 NVTGGGGAEGQVVPSPS-----PGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGsPEIIPPLRTSAQPAA 110
Cdd:PRK07764 602 APASSGPPEEAARPAAPaapaaPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG-GDGWPAKAGGAAPAA 680
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 111 TPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPP----TTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPT 186
Cdd:PRK07764 681 PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPqaaqGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPP 760
|
170
....*....|...
gi 111598761 187 APVTEAPTSPPPE 199
Cdd:PRK07764 761 PPAPAPAAAPAAA 773
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
298-345 |
3.82e-07 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 46.96 E-value: 3.82e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 111598761 298 PCQCNNR---SDSCDVHTGACLnCQENSKGEHCEECKEGFYPSPDAAKQCH 345
Cdd:cd00055 1 PCDCNGHgslSGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
49-198 |
4.47e-07 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 53.25 E-value: 4.47e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDqASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGH 128
Cdd:PHA03307 104 GSPTPPGPS-SPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEET 182
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 111598761 129 TPTTESPPS----RPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPP 198
Cdd:PHA03307 183 ARAPSSPPAepppSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECP 256
|
|
| PHA03325 |
PHA03325 |
nuclear-egress-membrane-like protein; Provisional |
62-198 |
8.71e-07 |
|
nuclear-egress-membrane-like protein; Provisional
Pssm-ID: 223044 Cd Length: 418 Bit Score: 51.42 E-value: 8.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 62 PFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPlrtsAQPAATPF-PALDLSPATPSEDGHTPTTESPPSRPA 140
Cdd:PHA03325 266 SSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPE----PLPASLPPpPVRRPRVKHPEAGKEEPDGARNAEAKE 341
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 141 PTTLASTVGQPPTTSVVTTAQASSTPGTP--TAESPDRSSNSSGVP----------PTAPVTEAPTSPPP 198
Cdd:PHA03325 342 PAQPATSTSSKGSSSAQNKDSGSTGPGSSlaAASSFLEDDDFGSPPldlttslrhmPSPSVTSAPEPPSI 411
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
46-198 |
9.99e-07 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 51.84 E-value: 9.99e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 46 QVVPSPSPglRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPA-TPS 124
Cdd:pfam05109 418 KVIFSKAP--ESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSpSPR 495
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 125 EDGH-----------------TPTTESPP---SRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVP 184
Cdd:pfam05109 496 DNGTeskapdmtsptsavttpTPNATSPTpavTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170
....*....|....
gi 111598761 185 PTAPvTEAPTSPPP 198
Cdd:pfam05109 576 KTSP-TSAVTTPTP 588
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
299-335 |
1.11e-06 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 45.81 E-value: 1.11e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 111598761 299 CQCNNR---SDSCDVHTGACLnCQENSKGEHCEECKEGFY 335
Cdd:pfam00053 1 CDCNPHgslSDTCDPETGQCL-CKPGVTGRHCDRCKPGYY 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
38-195 |
1.26e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 51.46 E-value: 1.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 38 TGGGGAEGQVVPSPSPGLRDQAS-SPFPKTAAPTAQAPRTGP------PRTTVRKTGATTPSAGSPEIIPPLRTSAQPAA 110
Cdd:pfam05109 477 TPAGTTSGASPVTPSPSPRDNGTeSKAPDMTSPTSAVTTPTPnatsptPAVTTPTPNATSPTLGKTSPTSAVTTPTPNAT 556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 111 TPFPALdlspATPSEDGHTPT--TESPPS---RPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPP 185
Cdd:pfam05109 557 SPTPAV----TTPTPNATIPTlgKTSPTSavtTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTG 632
|
170
....*....|
gi 111598761 186 TAPVTEAPTS 195
Cdd:pfam05109 633 QHNITSSSTS 642
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
47-201 |
1.67e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 51.14 E-value: 1.67e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 47 VVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPeiipplrtSAQPAATPFPALDLSPATPSED 126
Cdd:PRK07764 586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAA--------PAEASAAPAPGVAAPEHHPKHV 657
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 111598761 127 GHTPTTESPPSRPAPTTLASTVGQPPTTSvvTTAQASSTPGTPTAESPDrssnssgvPPTAPVTEAPTSPPPEHM 201
Cdd:PRK07764 658 AVPDASDGGDGWPAKAGGAAPAAPPPAPA--PAAPAAPAGAAPAQPAPA--------PAATPPAGQADDPAAQPP 722
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
102-199 |
1.94e-06 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 51.01 E-value: 1.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 102 LRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLAstvgqPPTTSVVTTAQASSTPGTPTAESPDRSSNSS 181
Cdd:PRK11907 18 LTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTET-----PVAATTAAEAPSSSETAETSDPTSEATDTTT 92
|
90
....*....|....*...
gi 111598761 182 GVPPTAPVTEAPTSPPPE 199
Cdd:PRK11907 93 SEARTVTPAATETSKPVE 110
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
66-199 |
2.06e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 50.75 E-value: 2.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 66 TAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPAldlspatpsedghtpttesPPSRPAPTTLA 145
Cdd:PRK07764 387 VAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA-------------------PAPAPPSPAGN 447
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 111598761 146 STVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTE--APTSPPPE 199
Cdd:PRK07764 448 APAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAApaAPAAPAAP 503
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
66-199 |
2.09e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 50.75 E-value: 2.09e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 66 TAAPTAQAPRTGPPRTTVRKTGATTPSAGSPeiipPLRTSAQPAATPFPALD--LSPATPSEDGHTPTTESPPSRPAPTT 143
Cdd:PRK07764 368 SDDERGLLARLERLERRLGVAGGAGAPAAAA----PSAAAAAPAAAPAPAAAapAAAAAPAPAAAPQPAPAPAPAPAPPS 443
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 111598761 144 --LASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:PRK07764 444 paGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
89-196 |
2.37e-06 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 50.10 E-value: 2.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 89 TTPSAGSPEIIPPLRTSAQP---AATPFPALDLSPATPSEdgHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASST 165
Cdd:PRK12799 298 TVPVAAVTPSSAVTQSSAITpssAAIPSPAVIPSSVTTQS--ATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQ 375
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 111598761 166 PGTPTAESPDRSSNSSGVP---------PTAPVTEAPTSP 196
Cdd:PRK12799 376 PQPMSTTETQQSSTGNITStangpttslPAAPASNIPVSP 415
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
40-200 |
2.62e-06 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 50.48 E-value: 2.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 40 GGGAEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPrttvrKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLS 119
Cdd:PRK14951 367 AAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAA-----PAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAA 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 120 PATPSEDghtpttESPPSRPAPTTLASTVgqppttsvvttaQASSTPGTPTAESPdrssnssgvPPTAPVtEAPTSPPPE 199
Cdd:PRK14951 442 PAAVALA------PAPPAQAAPETVAIPV------------RVAPEPAVASAAPA---------PAAAPA-AARLTPTEE 493
|
.
gi 111598761 200 H 200
Cdd:PRK14951 494 G 494
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
37-198 |
2.73e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 50.55 E-value: 2.73e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 37 VTGGGGAEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPL--------RTSAQP 108
Cdd:PHA03307 183 ARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGwgpenecpLPRPAP 262
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 109 AATPFPALDLSPATPSEDGHTPTTE-SPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSS---GVP 184
Cdd:PHA03307 263 ITLPTRIWEASGWNGPSSRPGPASSsSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrgaAVS 342
|
170
....*....|....
gi 111598761 185 PTAPVTEAPTSPPP 198
Cdd:PHA03307 343 PGPSPSRSPSPSRP 356
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
49-198 |
4.16e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.15 E-value: 4.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTA-------------QAPRTGPPRTTVRKTGATT----PSAGSP--EIIPPLRTSAQPA 109
Cdd:pfam03154 349 PLSMPHIKPPPTTPIPQLPNPQShkhpphlsgpspfQMNSNLPPPPALKPLSSLSthhpPSAHPPplQLMPQSQQLPPPP 428
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 110 ATPfPALDLSPATPSEDGHTPTTES----PPSRPAPTTLASTVGQPPTTSVvTTAQASSTPGTPTAESPDRSSNSSGVP- 184
Cdd:pfam03154 429 AQP-PVLTQSQSLPPPAASHPPTSGlhqvPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSASVSSSGPv 506
|
170 180
....*....|....*....|....*....
gi 111598761 185 ---PTAPV------------TEAPTSPPP 198
Cdd:pfam03154 507 paaVSCPLppvqikeealdeAEEPESPPP 535
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
37-177 |
4.37e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.98 E-value: 4.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 37 VTGGGGAEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEiipplrtSAQPAATPFPAl 116
Cdd:PRK07764 398 APSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP-------AAAPSAQPAPA- 469
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 117 dlspATPSEdghtPTTESPPSRPAPttlastvGQPPTTSVVTTAQASSTPGTPTAESPDRS 177
Cdd:PRK07764 470 ----PAAAP----EPTAAPAPAPPA-------APAPAAAPAAPAAPAAPAGADDAATLRER 515
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
50-188 |
5.41e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 49.78 E-value: 5.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 50 SPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPAL------------- 116
Cdd:PHA03307 158 SPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPgrsaaddagasss 237
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 111598761 117 DLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQP-PTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAP 188
Cdd:PHA03307 238 DSSSSESSGCGWGPENECPLPRPAPITLPTRIWEAsGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPS 310
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
37-198 |
5.59e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 5.59e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 37 VTGGGGAEGQVVPSPSPGlrdqASSPFPKTAAPTAQAPRTGPPRTTVrKTGATTPSAGSPEIIPPLRTSAQPAATPFPAL 116
Cdd:COG3469 55 GSAGSGTGTTAASSTAAT----SSTTSTTATATAAAAAATSTSATLV-ATSTASGANTGTSTVTTTSTGAGSVTSTTSST 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 117 DLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPdrsSNSSGVPPTAPVTEAPTSP 196
Cdd:COG3469 130 AGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA---TTASGATTPSATTTATTTG 206
|
..
gi 111598761 197 PP 198
Cdd:COG3469 207 PP 208
|
|
| PRK13335 |
PRK13335 |
superantigen-like protein SSL3; Reviewed; |
66-166 |
5.80e-06 |
|
superantigen-like protein SSL3; Reviewed;
Pssm-ID: 139494 [Multi-domain] Cd Length: 356 Bit Score: 48.97 E-value: 5.80e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 66 TAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPA---ATPFPALDLSPATPSEDGHTPTTESPPSRPAPT 142
Cdd:PRK13335 63 TQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLnisATPAPKQEQSQTTTESTTPKTKVTTPPSTNTPQ 142
|
90 100
....*....|....*....|....
gi 111598761 143 TLASTVGQPPTTSVVTTAQASSTP 166
Cdd:PRK13335 143 PMQSTKSDTPQSPTIKQAQTDMTP 166
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
49-199 |
7.38e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 49.09 E-value: 7.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPrttvrktgATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGH 128
Cdd:PRK07994 364 PLPEPEVPPQSAAPAASAQATAAPTAAVAPP--------QAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGA 435
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 111598761 129 TPTTESPPS-----RPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:PRK07994 436 TKAKKSEPAaasraRPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPE 511
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
398-442 |
8.46e-06 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 43.11 E-value: 8.46e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 111598761 398 CECHGHVDPIKTpkiCKPESGECInCLHNTTGFWCEKCLEGYVRD 442
Cdd:pfam00053 1 CDCNPHGSLSDT---CDPETGQCL-CKPGVTGRHCDRCKPGYYGL 41
|
|
| PRK13042 |
PRK13042 |
superantigen-like protein SSL4; Reviewed; |
76-174 |
1.08e-05 |
|
superantigen-like protein SSL4; Reviewed;
Pssm-ID: 183854 [Multi-domain] Cd Length: 291 Bit Score: 47.71 E-value: 1.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 76 TGPPRTTVRKTGATTPSA---GSPEIIPPLRTSAQPAATPfpaldlSPATPSedghTPTTESPPSRPAPTTlastvgqpP 152
Cdd:PRK13042 18 TGVITTTTQAANATTPSStkvEAPQSTPPSTKVEAPQSKP------NATTPP----STKVEAPQQTPNATT--------P 79
|
90 100
....*....|....*....|..
gi 111598761 153 TTSVVTTAQASSTPGTPTAESP 174
Cdd:PRK13042 80 SSTKVETPQSPTTKQVPTEINP 101
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
49-199 |
1.12e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 1.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRT------TVRKTGATTPSAGSPEiipPLRTSAQPAATPFPALDLSPAT 122
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPqpplapTTDPAGAGEPSGAVPQ---PWLGALVPGRVAVPRFRVPQPA 2983
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 123 PSEDGHTPTTESPPSRPAP--TTLASTVG-----QPPTTSVVTTAQAS-------------STPGTPTAESPDRSSNssg 182
Cdd:PHA03247 2984 PSREAPASSTPPLTGHSLSrvSSWASSLAlheetDPPPVSLKQTLWPPddtedsdadslfdSDSERSDLEALDPLPP--- 3060
|
170
....*....|....*..
gi 111598761 183 vPPTAPVTEAPTSPPPE 199
Cdd:PHA03247 3061 -EPHDPFAHEPDPATPE 3076
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
39-183 |
1.37e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 48.06 E-value: 1.37e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 39 GGGGAEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDL 118
Cdd:PRK07764 658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 111598761 119 SPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTS-VVTTAQASSTPGTPTAESPDRSSNSSGV 183
Cdd:PRK07764 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPpPSPPSEEEEMAEDDAPSMDDEDRRDAEE 803
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
108-200 |
1.61e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 47.88 E-value: 1.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 108 PAATPFPALDLSPAtPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTtAQASSTPGTPTAESPDRSSNSSGVPPTA 187
Cdd:PRK14950 364 PAPQPAKPTAAAPS-PVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVP-PRPVAPPVPHTPESAPKLTRAAIPVDEK 441
|
90
....*....|...
gi 111598761 188 PVTEAPTSPPPEH 200
Cdd:PRK14950 442 PKYTPPAPPKEEE 454
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
299-335 |
1.64e-05 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 42.30 E-value: 1.64e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 111598761 299 CQCN---NRSDSCDVHTGACLnCQENSKGEHCEECKEGFY 335
Cdd:smart00180 1 CDCDpggSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYY 39
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
49-214 |
2.18e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 2.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGL---RDQASSPFPKTAAPT----AQAPRTGPPRTTVRKTGA---TTPSAGSPEiiPPLRTSAQPAAT----PFP 114
Cdd:PHA03247 2494 AAPDPGGggpPDPDAPPAPSRLAPAilpdEPVGEPVHPRMLTWIRGLeelASDDAGDPP--PPLPPAAPPAAPdrsvPPP 2571
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 115 ALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPvteapt 194
Cdd:PHA03247 2572 RPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT------ 2645
|
170 180
....*....|....*....|
gi 111598761 195 SPPPEHMCNCSEVGSLDVKR 214
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPR 2665
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
69-196 |
2.34e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 47.37 E-value: 2.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 69 PTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPfpALDLSPATPSedghtPTTESPPSRPAPTTLASTV 148
Cdd:PRK14959 367 PVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAP--AAGMTPSSAA-----PATPAPSAAPSPRVPWDDA 439
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 111598761 149 GQPPTTSVVTTAQASSTPGT-PTAESPDRSSNSSGVPPT--APVTEAPTSP 196
Cdd:PRK14959 440 PPAPPRSGIPPRPAPRMPEAsPVPGAPDSVASASDAPPTlgDPSDTAEHTP 490
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
38-174 |
2.38e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 47.46 E-value: 2.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 38 TGGGGAEGQVVPSPSpgLRDQASSPFPKTAAPTAQAPRTgpprTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPald 117
Cdd:PRK14971 368 DASGGRGPKQHIKPV--FTQPAAAPQPSAAAAASPSPSQ----SSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVN--- 438
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 118 lsPATPSEDGHTPTTESPPSRPAPTTLASTVgqPPTTSVVTTAQASSTPGTPTAESP 174
Cdd:PRK14971 439 --PPSTAPQAVRPAQFKEEKKIPVSKVSSLG--PSTLRPIQEKAEQATGNIKEAPTG 491
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
43-198 |
2.63e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 47.15 E-value: 2.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIP------PLRTSAQPAATPFPAl 116
Cdd:PRK07003 423 AEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPasdappDAAFEPAPRAAAPSA- 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 117 dlSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTsvvTTAQASSTPGTPTAE-----------SPDRSSNSSGVP- 184
Cdd:PRK07003 502 --ATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTP---AAAAPAARAGGAAAAldvlrnagmrvSSDRGARAAAAAk 576
|
170
....*....|....
gi 111598761 185 PTAPVTEAPTSPPP 198
Cdd:PRK07003 577 PAAAPAAAPKPAAP 590
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
64-200 |
3.19e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 46.78 E-value: 3.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 64 PKTAAPTAQAPRTG--PPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALD-----LSPATPSEDGHTPTTESPP 136
Cdd:PRK07994 361 PAAPLPEPEVPPQSaaPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPettsqLLAARQQLQRAQGATKAKK 440
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 111598761 137 SRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDR--SSNSSGVPPTAPVTEAPTSPPPEH 200
Cdd:PRK07994 441 SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkATNPVEVKKEPVATPKALKKALEH 506
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
30-187 |
4.24e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 46.52 E-value: 4.24e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 30 STASAGNVTGGGGAEGqVVPSPSPGLRDQASSPFPKTAAPTAQAPR-TGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQP 108
Cdd:PRK07764 640 SAAPAPGVAAPEHHPK-HVAVPDASDGGDGWPAKAGGAAPAAPPPApAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPA 718
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 111598761 109 AATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTA 187
Cdd:PRK07764 719 AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
41-197 |
4.73e-05 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 46.58 E-value: 4.73e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 41 GGAEGQ-----VVPSPSPGLRDQASSPFPK--------------------------TAAPTAQAPRTG-PPRTTVRKTGA 88
Cdd:COG5665 232 VGVEWWgdpslLATPPATPATEEKSSQQPKsqptspsggttppstnqlttsntptsTAKAQPQPPTKKqPAKEPPSDTAS 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 89 TTPSAGSPEIIP-------------PL----RTSAQPAATPFPALDlsPATPSEDGHTPTTESPPSRPAPTTLASTVGQP 151
Cdd:COG5665 312 GNPSAPSVLINSdsptsedpatasvPTteetTAFTTPSSVPSTPAE--KDTPATDLATPVSPTPPETSVDKKVSPDSATS 389
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 111598761 152 PTTSVVTTAQASS--TPGT-------PTAESPdrSSNSSGVPPTAPVTEAPTSPP 197
Cdd:COG5665 390 STKSEKEGGTASSpmPPNIaigakddVDATDP--SQEAKEYTKNAPMTPEADSAP 442
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
51-193 |
4.91e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 46.25 E-value: 4.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 51 PSPGLRDQASSPFPKTAAPTAQAPRTGPPrttvrktgATTPSAGSPEIIPPlrTSAQPAATPFPALDLSPATPSEDGHTP 130
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPV--------AQAAAAPAPAAAPA--AAASAPAAPPAAAPPAPVAAPAAAAPA 435
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 111598761 131 TTESPPSRPAPTTLASTVgQPPTTSVVTTAQASSTPGTPTAESPdrssnSSGVPPTAPVTEAP 193
Cdd:PRK14951 436 AAPAAAPAAVALAPAPPA-QAAPETVAIPVRVAPEPAVASAAPA-----PAAAPAAARLTPTE 492
|
|
| FimV |
COG3170 |
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures]; |
43-200 |
5.62e-05 |
|
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];
Pssm-ID: 442403 [Multi-domain] Cd Length: 508 Bit Score: 45.94 E-value: 5.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVPSPSPGLRDQASSPFPKTAA---PTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQ------PAATP- 112
Cdd:COG3170 108 AYAAAAAAPAAAPAPAPAAPAAAAAAadqPAAEAAPAASGEYYPVRPGDTLWSIAARPVRPSSGVSLDqmmvalYRANPd 187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 113 -FPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTE 191
Cdd:COG3170 188 aFIDGNINRLKAGAVLRVPAAEEVAALSPAEARQEVQAQSADWAAYRARLAAAVEPAPAAAAPAAPPAAAAAAGPVPAAA 267
|
....*....
gi 111598761 192 APTSPPPEH 200
Cdd:COG3170 268 EDTLSPEVT 276
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. |
64-195 |
6.09e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 45.72 E-value: 6.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 64 PKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPA---TPSEDGHTPTTESPPSRPA 140
Cdd:pfam17823 158 PRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAataTGHPAAGTALAAVGNSSPA 237
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 141 PTTLASTVGQPPTTSVVTTAQASSTPGT----------------PTAESPDRSSNSSGVPPTAPVTEAPTS 195
Cdd:pfam17823 238 AGTVTAAVGTVTPAALATLAAAAGTVASaagtinmgdpharrlsPAKHMPSDTMARNPAAPMGAQAQGPII 308
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
59-200 |
6.28e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.90 E-value: 6.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 59 ASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEiipplrTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSR 138
Cdd:COG3469 90 TSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSS------TAGSTTTSGASATSSAGSTTTTTTVSGTETATGGT 163
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 111598761 139 PAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESpdrssnssgvPPTAPVTEAPTSPPPEH 200
Cdd:COG3469 164 TTTSTTTTTTSASTTPSATTTATATTASGATTPSA----------TTTATTTGPPTPGLPKH 215
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
59-183 |
7.22e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 45.57 E-value: 7.22e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 59 ASSPFPKTAAPTAQAPRTGPPRTTVrktgATTPSAGSPEIIPPLRTSAQPAaTPfpaldlsPATPSEDGHTPTTESPPSR 138
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSPVRPTPAP----STRPKAAAAANIPPKEPVRETA-TP-------PPVPPRPVAPPVPHTPESA 428
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 111598761 139 PAPTTLASTVGQPPTtsvvttaqasSTPGTPTAESpDRSSNSSGV 183
Cdd:PRK14950 429 PKLTRAAIPVDEKPK----------YTPPAPPKEE-EKALIADGD 462
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
398-442 |
7.98e-05 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 40.42 E-value: 7.98e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 111598761 398 CECHGHVDPiktPKICKPESGECInCLHNTTGFWCEKCLEGYVRD 442
Cdd:cd00055 2 CDCNGHGSL---SGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGL 42
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
59-198 |
9.11e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 45.47 E-value: 9.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 59 ASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPE--IIPPLRTSAQPAATPFPALDLSPATPSedghtptTESPP 136
Cdd:PRK08691 391 AKKPQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQEnnDVPPWEDAPDEAQTAAGTAQTSAKSIQ-------TASEA 463
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 111598761 137 SRPAPttlaSTVGQPPTTSVVTTAQASSTPgtptAESPDRSS-NSSGVPPTAPVTEAPTSPPP 198
Cdd:PRK08691 464 ETPPE----NQVSKNKAADNETDAPLSEVP----SENPIQATpNDEAVETETFAHEAPAEPFY 518
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. |
66-195 |
9.47e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 45.34 E-value: 9.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 66 TAAPTAQAPRTGPPRTTvrktGATTPSAGSPEIIPPLRTSAQPAATPFPALdlSPATPSEDGHTPTTESP-------PSR 138
Cdd:pfam17823 87 TAEHTPHGTDLSEPATR----EGAADGAASRALAAAASSSPSSAAQSLPAA--IAALPSEAFSAPRAAACranasaaPRA 160
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 111598761 139 PAPTTLASTVGQP-PTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTS 195
Cdd:pfam17823 161 AIAAASAPHAASPaPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATA 218
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
35-198 |
9.94e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 45.45 E-value: 9.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 35 GNVTGGGGAEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGP-PRTTVRKTgaTTPSAGSPEIIPPLRTSAQPAATPF 113
Cdd:PTZ00449 555 GEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKrPRSAQRPT--RPKSPKLPELLDIPKSPKRPESPKS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 114 PALDLSPATPS-----EDGHTPTTESPPSRPAP--------------TTLASTVGQPPTTSVVTTAQASSTPGTPTAESP 174
Cdd:PTZ00449 633 PKRPPPPQRPSsperpEGPKIIKSPKPPKSPKPpfdpkfkekfyddyLDAAAKSKETKTTVVLDESFESILKETLPETPG 712
|
170 180
....*....|....*....|....
gi 111598761 175 DRSSNSSGVPPTAPVTEAPTSPPP 198
Cdd:PTZ00449 713 TPFTTPRPLPPKLPRDEEFPFEPI 736
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
44-126 |
2.13e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 44.16 E-value: 2.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 44 EGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTP-SAGSPEIIPPLRTSA--QPAATPFPALDLSP 120
Cdd:PRK14954 377 DGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPaSAPTPEQQPPVARSAplPPSPQASAPRNVAS 456
|
....*.
gi 111598761 121 ATPSED 126
Cdd:PRK14954 457 GKPGVD 462
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
43-198 |
2.22e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.39 E-value: 2.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPS--AGSPEIIPPLRTSAQPAATPFPALDLSP 120
Cdd:PHA03307 272 ASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSreSSSSSTSSSSESSRGAAVSPGPSPSRSP 351
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 121 aTPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDR--SSNSSGVPPTAPVT-EAPTSPP 197
Cdd:PHA03307 352 -SPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRfpAGRPRPSPLDAGAAsGAFYARY 430
|
.
gi 111598761 198 P 198
Cdd:PHA03307 431 P 431
|
|
| DUF4045 |
pfam13254 |
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ... |
67-198 |
2.44e-04 |
|
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.
Pssm-ID: 433066 [Multi-domain] Cd Length: 415 Bit Score: 43.62 E-value: 2.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 67 AAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAAT---PFPALDLSPATPSEDGHTPTTESPPsrPAPTT 143
Cdd:pfam13254 202 EVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADTLSTdkeQSPAPTSASEPPPKTKELPKDSEEP--AAPSK 279
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 111598761 144 LASTVGQPPTTSVVTTAQASSTPGTPTAESPdrSSNSSGVPPTAPVTEAPTSPPP 198
Cdd:pfam13254 280 SAEASTEKKEPDTESSPETSSEKSAPSLLSP--VSKASIDKPLSSPDRDPLSPKP 332
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
398-447 |
2.55e-04 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 38.83 E-value: 2.55e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 111598761 398 CECH--GHVDPIktpkiCKPESGECInCLHNTTGFWCEKCLEGYVRDLQRNC 447
Cdd:smart00180 1 CDCDpgGSASGT-----CDPDTGQCE-CKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
48-204 |
2.63e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 2.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 48 VPSPSPGLRDQASS----------PFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAgspeiiPPLRTSAQPAATPFPALD 117
Cdd:pfam03154 148 IPSPQDNESDSDSSaqqqilqtqpPVLQAQSGAASPPSPPPPGTTQAATAGPTPSA------PSVPPQGSPATSQPPNQT 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 118 LSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVtTAQASSTPGTPTAESPDRSSNSSGvPPTAPVTEAPTS-P 196
Cdd:pfam03154 222 QSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQV-SPQPLPQPSLHGQMPPMPHSLQTG-PSHMQHPVPPQPfP 299
|
....*...
gi 111598761 197 PPEHMCNC 204
Cdd:pfam03154 300 LTPQSSQS 307
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
43-200 |
2.70e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 2.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVPSPSPGLRDQASSpFPKTAAPTAQAPRTGPPRTTvrktgattpsagspeiipPLrtsaQPAATPFPALDLSPAT 122
Cdd:pfam03154 305 SQSQVPPGPSPAAPGQSQQ-RIHTPPSQSQLQSQQPPREQ------------------PL----PPAPLSMPHIKPPPTT 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 123 PSEDGHTPTTESPP---SRPAPTTLASTVGQPPT----TSVVT----------------TAQASSTPGTPT--AESPDRS 177
Cdd:pfam03154 362 PIPQLPNPQSHKHPphlSGPSPFQMNSNLPPPPAlkplSSLSThhppsahppplqlmpqSQQLPPPPAQPPvlTQSQSLP 441
|
170 180
....*....|....*....|....
gi 111598761 178 SNSSGVPPTAPVTEAPTSPP-PEH 200
Cdd:pfam03154 442 PPAASHPPTSGLHQVPSQSPfPQH 465
|
|
| rad23 |
TIGR00601 |
UV excision repair protein Rad23; All proteins in this family for which functions are known ... |
83-165 |
3.06e-04 |
|
UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273167 [Multi-domain] Cd Length: 378 Bit Score: 43.35 E-value: 3.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 83 VRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPS--EDGhTPTTESPPSrPAPTTLASTV---GQPPTTSVV 157
Cdd:TIGR00601 74 VSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASavEEK-SPSEESATA-TAPESPSTSVpssGSDAASTLV 151
|
....*...
gi 111598761 158 TTAQASST 165
Cdd:TIGR00601 152 VGSERETT 159
|
|
| PRK10118 |
PRK10118 |
flagellar hook length control protein FliK; |
86-193 |
3.37e-04 |
|
flagellar hook length control protein FliK;
Pssm-ID: 236652 [Multi-domain] Cd Length: 408 Bit Score: 43.32 E-value: 3.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 86 TGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGH-TPTTESPPSRPAPTTLASTVGQPPTTSVVTT---AQ 161
Cdd:PRK10118 153 QDNTTPVADAPSTVLPAEKPTLLTKDMPSAPQDETHTLSSDEHeKGLTSAQLTTAQPDDAPGTPAQPLTPLAAEAqakAE 232
|
90 100 110
....*....|....*....|....*....|....*.
gi 111598761 162 ASSTPgTPTAESPDRSSNSSGVPP----TAPVTEAP 193
Cdd:PRK10118 233 VISTP-SPVTAAASPTITPHQTQPlptaAAPVLSAP 267
|
|
| SAP130_C |
pfam16014 |
Histone deacetylase complex subunit SAP130 C-terminus; |
64-197 |
3.43e-04 |
|
Histone deacetylase complex subunit SAP130 C-terminus;
Pssm-ID: 464973 [Multi-domain] Cd Length: 371 Bit Score: 43.38 E-value: 3.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 64 PKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTT 143
Cdd:pfam16014 33 PVTVAVEALPGQNSEQQTASASPPSQHPAQAIPTILAPAAPPSQPSVVLSTLPAAMAVTPPIPASMANVVAPPTQPAASS 112
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 111598761 144 LASTvgqpPTTSVVttaqasstPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPP 197
Cdd:pfam16014 113 TAAC----AVSSVL--------PEIKIKQEAEPMDTSQSVPPLTPTSISPALTS 154
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
44-186 |
3.58e-04 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 43.56 E-value: 3.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 44 EGQVVPSPSPGLRDQASSPFPKTA-APTAQAPRTGPPRTTVRKTGATTPsagspeiipplrtsaQPAATPFPALDLSPaT 122
Cdd:PHA03269 20 ANLNTNIPIPELHTSAATQKPDPApAPHQAASRAPDPAVAPTSAASRKP---------------DLAQAPTPAASEKF-D 83
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 123 PSEDGHTPTTESPPSRPAPTTLASTVGQP--PTTSVVTTAQASSTPGTPTA-ESPDRSSNSSGVPPT 186
Cdd:PHA03269 84 PAPAPHQAASRAPDPAVAPQLAAAPKPDAaeAFTSAAQAHEAPADAGTSAAsKKPDPAAHTQHSPPP 150
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
36-184 |
4.03e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 42.20 E-value: 4.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 36 NVTGGGGAEGQVVPSPSP-GLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFP 114
Cdd:PHA03255 33 SAGNVTGTTAVTTPSPSAsGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNIT 112
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 115 ALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGT-PTAESPDRSSNSSGVP 184
Cdd:PHA03255 113 ATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAElPTVPDERQPSLSYGLP 183
|
|
| FimV |
COG3170 |
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures]; |
45-197 |
4.21e-04 |
|
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];
Pssm-ID: 442403 [Multi-domain] Cd Length: 508 Bit Score: 43.24 E-value: 4.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 45 GQVVPSPSPglrDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGA--TTPSAGSPEIIPPlrtsAQPAATPFPALDLSPAT 122
Cdd:COG3170 200 GAVLRVPAA---EEVAALSPAEARQEVQAQSADWAAYRARLAAAvePAPAAAAPAAPPA----AAAAAGPVPAAAEDTLS 272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 123 PsEDGHTPTTESPPSRPAP--------TTLASTVGQ---------PPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPP 185
Cdd:COG3170 273 P-EVTAAAAAEEADALPEAaaelaerlAALEAQLAElqrllalknPAPAAAVSAPAAAAAAATVEAAAPAAAAQPAAAAP 351
|
170
....*....|..
gi 111598761 186 tAPVTEAPTSPP 197
Cdd:COG3170 352 -APALDNPLLLA 362
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
52-183 |
4.25e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 4.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 52 SPGLRDQASSPFPKT---AAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQP--AATPFPaldlSPATPSED 126
Cdd:PHA03247 369 SAGRHHPKRASLPTRkrrSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPppPATPLP----SAEPGSDD 444
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 111598761 127 GHTPTTESPPSRPA--PTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGV 183
Cdd:PHA03247 445 GPAPPPERQPPAPAtePAPDDPDDATRKALDALRERRPPEPPGADLAELLGRHPDTAGT 503
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
58-164 |
4.60e-04 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 42.91 E-value: 4.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 58 QASSPFPKTAAPTAQAPRTGPPRTTVRKTgATTPSAGSPEIIPplRTSAQPAATPFPAldlSPATPSEDGHTP--TTESP 135
Cdd:COG3266 261 ASSASAPATTSLGEQQEVSLPPAVAAQPA-AAAAAQPSAVALP--AAPAAAAAAAAPA---EAAAPQPTAAKPvvTETAA 334
|
90 100
....*....|....*....|....*....
gi 111598761 136 PSRPAPTTLASTvgQPPTTSVVTTAQASS 164
Cdd:COG3266 335 PAAPAPEAAAAA--AAPAAPAVAKKLAAD 361
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ... |
41-184 |
5.27e-04 |
|
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 42.45 E-value: 5.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 41 GGAEGQVVPSPSPglrdqASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSP 120
Cdd:NF040712 193 GRPLRPLATVPRL-----AREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPA 267
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 121 ATPSEDGHTPTTESPPSRPAPTTLAST---VGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVP 184
Cdd:NF040712 268 AEPDEATRDAGEPPAPGAAETPEAAEPpapAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVP 334
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
87-210 |
6.70e-04 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 42.23 E-value: 6.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 87 GATTPSagSPEIIPPLRTSAQPAATPFPALDLSPATP-SEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASST 165
Cdd:PRK10905 120 NSTLPT--EPATVAPVRNGNASRQTAKTQTAERPATTrPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKA 197
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 111598761 166 PGTPTAESPDRSSNSSGVPPTAPVteAPTSPPPEHMCNCSEVGSL 210
Cdd:PRK10905 198 PAATSTPAPKETATTAPVQTASPA--QTTATPAAGGKTAGNVGSL 240
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
50-199 |
7.27e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 42.45 E-value: 7.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 50 SPSPGLRDQASSPFPKTAA-PTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRT-----SAQPAaTPFPALDLSPATP 123
Cdd:NF033839 316 TPKPEVKPQLEKPKPEVKPqPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEKpkpevKPQPE-TPKPEVKPQPEKP 394
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 124 SEDGH-TPTTESPPSRPAPTTLASTVG---QPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:NF033839 395 KPEVKpQPEKPKPEVKPQPEKPKPEVKpqpEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPE 474
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
101-197 |
8.23e-04 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 41.94 E-value: 8.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 101 PLRTSAQPAATPFPAldlsPATPsedghTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNS 180
Cdd:PRK10856 163 PLDTSTTTDPATTPA----PAAP-----VDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDG 233
|
90
....*....|....*..
gi 111598761 181 SGVPPTApvTEAPTSPP 197
Cdd:PRK10856 234 AAPLPTD--QAGVSTPA 248
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
64-197 |
8.56e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.36 E-value: 8.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 64 PKTAAPTAQAPRTGPP---RTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPA 140
Cdd:PHA03378 676 PSPTGANTMLPIQWAPgtmQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 755
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 111598761 141 PTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPP 197
Cdd:PHA03378 756 PPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPR 812
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
62-226 |
9.87e-04 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 41.96 E-value: 9.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 62 PFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPlrTSAQPAATPFPALDLSPATPSEDG-------------- 127
Cdd:pfam05539 188 TYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTS--SNPEPQTEPPPSQRGPSGSPQHPPsttsqdqsttgdgq 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 128 -HTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNS----SGVPPTAPVTEAPTSPPPEHMC 202
Cdd:pfam05539 266 eHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSppgvQANPTTQNLVDCKELDPPKPNS 345
|
170 180
....*....|....*....|....
gi 111598761 203 NCSEVGSLDvkrcNQTTGQCDCHV 226
Cdd:pfam05539 346 ICYGVGIYN----EALPRGCDIVV 365
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
49-198 |
1.05e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.36 E-value: 1.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGP------PRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPAT 122
Cdd:PHA03378 596 PWPVPHPSQTPEPPTTQSHIPETSAPRQWPmplrpiPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQ 675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 123 PSEDGHT--------PTTESPPSRpAPTTL-----ASTVGQPPTTSVVTTAQASSTPGT--PTAESPDRSSNSSGVPPTA 187
Cdd:PHA03378 676 PSPTGANtmlpiqwaPGTMQPPPR-APTPMrppaaPPGRAQRPAAATGRARPPAAAPGRarPPAAAPGRARPPAAAPGRA 754
|
170
....*....|..
gi 111598761 188 -PVTEAPTSPPP 198
Cdd:PHA03378 755 rPPAAAPGRARP 766
|
|
| KREPA2 |
cd23959 |
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ... |
61-202 |
1.12e-03 |
|
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.
Pssm-ID: 467780 [Multi-domain] Cd Length: 424 Bit Score: 41.78 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 61 SPFPKTAAPTAQAPRTGPPRTTVRKTgATTPSAGSPEIipPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPA 140
Cdd:cd23959 117 NPFSASSSTQRETHKTAQVAPPKAEP-QTAPVTPFGQL--PMFGQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSA 193
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 111598761 141 PTTLASTVGQPPTTSVVTTAQASSTPGTPTAESpdrSSNSSGVPPTAPvteaPTSPPPEHMC 202
Cdd:cd23959 194 SPFATATDTAPSSGAPDGFPAEASAPSPFAAPA---SAASFPAAPVAN----GEAATPTHAC 248
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
47-171 |
1.32e-03 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 41.62 E-value: 1.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 47 VVPSPSPGLRDQASSPFPKTAAPTAQA----PRTGPPRTTVRKTGATTPSAGspeiipplrtsAQPAATPFPALDLSPAT 122
Cdd:PRK12799 301 VAAVTPSSAVTQSSAITPSSAAIPSPAvipsSVTTQSATTTQASAVALSSAG-----------VLPSDVTLPGTVALPAA 369
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 111598761 123 PSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSvVTTAQASSTPGTPTA 171
Cdd:PRK12799 370 EPVNMQPQPMSTTETQQSSTGNITSTANGPTTS-LPAAPASNIPVSPTS 417
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
3-177 |
1.39e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.90 E-value: 1.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 3 GGAERAMRSLPSLGGLALLCCAAAAAASTASAGNVTGGGGAEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTT 82
Cdd:PRK07764 635 APAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 83 VRKTGAtTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPttsvVTTAQA 162
Cdd:PRK07764 715 DDPAAQ-PPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE----EMAEDD 789
|
170
....*....|....*
gi 111598761 163 SSTPGTPTAESPDRS 177
Cdd:PRK07764 790 APSMDDEDRRDAEEV 804
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
44-140 |
1.44e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 41.17 E-value: 1.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 44 EGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPrTTVRKTGATTPSAgsPEIIPPLRTSAQPAATPFPAldlSPATP 123
Cdd:PRK10856 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPA-VATAPAPAVDPQQ--NAVVAPSQANVDTAATPAPA---APATP 231
|
90
....*....|....*..
gi 111598761 124 SEDGHTPTTESPPSRPA 140
Cdd:PRK10856 232 DGAAPLPTDQAGVSTPA 248
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
73-173 |
1.49e-03 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 41.80 E-value: 1.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 73 APRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPAldlSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPP 152
Cdd:PRK12270 39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA---APPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
|
90 100
....*....|....*....|....*..
gi 111598761 153 TTSVVTTAQA------SSTPGTPTAES 173
Cdd:PRK12270 116 EVTPLRGAAAavaknmDASLEVPTATS 142
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
88-195 |
1.67e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 41.09 E-value: 1.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 88 ATTPSAGSPEIIPPLRTSA----QPAATPF-----PALDLSPATPSEDGHT---PTTESPPSRPAPTTLASTVGQPPTTS 155
Cdd:PHA03291 164 AAFPAEGTLAAPPLGEGSAdgscDPALPLSaprlgPADVFVPATPRPTPRTtasPETTPTPSTTTSPPSTTIPAPSTTIA 243
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 111598761 156 VVTTAQASSTPGTPTAESPdrssnssGVPPTAPVTEAPTS 195
Cdd:PHA03291 244 APQAGTTPEAEGTPAPPTP-------GGGEAPPANATPAP 276
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
346-399 |
1.83e-03 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 36.56 E-value: 1.83e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 111598761 346 RCPCSAV-TSTGNCTIESGEleptCdQCKDGYTGQNCNKCENGYYNSDSICTQCE 399
Cdd:cd00055 1 PCDCNGHgSLSGQCDPGTGQ----C-ECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ... |
49-197 |
1.85e-03 |
|
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 40.91 E-value: 1.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRK--TGATTPSAGSPEIIPPLRTSAQPAATPfpaldlsPATPSED 126
Cdd:NF040712 190 PDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGApaTDSDPAEAGTPDDLASARRRRAGVEQP-------EDEPVGP 262
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 127 GHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASStPGTPTAEspdrssnsSGVPPTAPVTEAPTSPP 197
Cdd:NF040712 263 GAAPAAEPDEATRDAGEPPAPGAAETPEAAEPPAPAPA-APAAPAA--------PEAEEPARPEPPPAPKP 324
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
67-146 |
1.87e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 41.09 E-value: 1.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 67 AAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSedghtpTTESPPSRPAPTTLAS 146
Cdd:PHA03291 206 ATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPG------GGEAPPANATPAPEAS 279
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
46-200 |
1.90e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 41.20 E-value: 1.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 46 QVVPSPSPGLRDQASSPFPKTAAPTAQAPRTgPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATP-------FPALDL 118
Cdd:COG5180 183 KVLTEPRDALKDSPEKLDRPKVEVKDEAQEE-PPDLTGGADHPRPEAASSPKVDPPSTSEARSRPATvdaqpemRPPADA 261
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 119 SPATPSEDGHTPTTEsPPSRPA------PTTLASTVGQPPTTSV--VTTAQASSTPGTPTAESPD----RSSNSSGVPPT 186
Cdd:COG5180 262 KERRRAAIGDTPAAE-PPGLPVleagsePQSDAPEAETARPIDVkgVASAPPATRPVRPPGGARDpgtpRPGQPTERPAG 340
|
170
....*....|....
gi 111598761 187 APVTEAPTSPPPEH 200
Cdd:COG5180 341 VPEAASDAGQPPSA 354
|
|
| half-pint |
TIGR01645 |
poly-U binding splicing factor, half-pint family; The proteins represented by this model ... |
68-199 |
2.03e-03 |
|
poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Pssm-ID: 130706 [Multi-domain] Cd Length: 612 Bit Score: 41.21 E-value: 2.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 68 APTAQAPRT--GPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESP----PSRPAP 141
Cdd:TIGR01645 325 GPRAQSPATpsSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPglvaPTEINP 404
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 111598761 142 TTLAS-----TVGQPPTTSVVTTAQ-ASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:TIGR01645 405 SFLASprkkmKREKLPVTFGALDDTlAWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKE 468
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
43-168 |
2.16e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 40.71 E-value: 2.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 43 AEGQVVPSPspglRDQASSPFPKTAAPTAQAPRTGPprttvrkTGATTPSAGspeiIPPLRTSAQPAATPFPALDLSPAT 122
Cdd:PHA03291 168 AEGTLAAPP----LGEGSADGSCDPALPLSAPRLGP-------ADVFVPATP----RPTPRTTASPETTPTPSTTTSPPS 232
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 111598761 123 PSEDGHTPTteSPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGT 168
Cdd:PHA03291 233 TTIPAPSTT--IAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAP 276
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
50-187 |
2.18e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.10 E-value: 2.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 50 SPSPGLRDQASSPFPKTAA-----PTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSA-QPAATPFPALDLSPATP 123
Cdd:pfam17823 294 NPAAPMGAQAQGPIIQVSTdqpvhNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAkEPSASPVPVLHTSMIPE 373
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 111598761 124 SEdghtptTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTA 187
Cdd:pfam17823 374 VE------ATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDPKTLAMA 431
|
|
| MSA-2c |
pfam12238 |
Merozoite surface antigen 2c; This family of proteins is found in eukaryotes. Proteins in this ... |
129-182 |
2.24e-03 |
|
Merozoite surface antigen 2c; This family of proteins is found in eukaryotes. Proteins in this family are typically between 263 and 318 amino acids in length. There is a conserved SFT sequence motif. MSA-2 is a plasma membrane glycoprotein which can be found in Babesia bovis species.
Pssm-ID: 289042 Cd Length: 216 Bit Score: 39.73 E-value: 2.24e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 111598761 129 TPTTESPPSRPAPTTLASTVGQPPttsvvttaqASSTPGTPTAESPDRSSNSSG 182
Cdd:pfam12238 153 KPSRTSSTETPAPGDAESGVQQPP---------ASTPPQGPAPTTPSPSPESSG 197
|
|
| PRK12495 |
PRK12495 |
hypothetical protein; Provisional |
37-168 |
2.28e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 183558 [Multi-domain] Cd Length: 226 Bit Score: 39.85 E-value: 2.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 37 VTGGGGAEGQvvPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEiiPPLRTSAQPAATPfpal 116
Cdd:PRK12495 92 SQASPDDDAQ--PAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPAT--PDERRSPRQRPPV---- 163
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 111598761 117 dlsPATPSEDGHTPTTESPPSRPAPTTLASTVgqpptTSVVTTAQASSTPGT 168
Cdd:PRK12495 164 ---SGEPPTPSTPDAHVAGTLQAARESLVETL-----ARFARRAAATDDPRR 207
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
49-189 |
2.39e-03 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 40.70 E-value: 2.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPrTGPPRTTVRKTGATTPSAGSPEiiPPLRTSAQPAATPFPALDlSPATPSEDGH 128
Cdd:PTZ00436 222 PAKAAAAPAKAAAPPAKAAAAPAKAA-AAPAKAAAPPAKAAAPPAKAAA--PPAKAAAPPAKAAAPPAK-AAAPPAKAAA 297
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 129 TPTTESPPSRPAPTTLASTVGqPPTTSVVTTAQASSTPGTpTAESPDRSSnssgvppTAPV 189
Cdd:PTZ00436 298 APAKAAAAPAKAAAAPAKAAA-PPAKAAAPPAKAATPPAK-AAAPPAKAA-------AAPV 349
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
83-182 |
2.56e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 40.70 E-value: 2.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 83 VRKTGATTPSagsPEIIPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTE-SPPSRPAPTtlastvGQPPTTsvvttaQ 161
Cdd:PRK14954 374 VRNDGGVAPS---PAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELpSPASAPTPE------QQPPVA------R 438
|
90 100
....*....|....*....|.
gi 111598761 162 ASSTPGTPTAESPDRSSNSSG 182
Cdd:PRK14954 439 SAPLPPSPQASAPRNVASGKP 459
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
46-195 |
2.61e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 40.71 E-value: 2.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 46 QVVPSPSPGLRDQASSPfPKTAAPtaQAPRTGPPRTTVrkTGATTPSAGSPEiiPPLRTSAQPAATPFPALDLSPATPSE 125
Cdd:pfam17823 128 QSLPAAIAALPSEAFSA-PRAAAC--RANASAAPRAAI--AAASAPHAASPA--PRTAASSTTAASSTTAASSAPTTAAS 200
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 126 DghTPTTESPpSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTS 195
Cdd:pfam17823 201 S--APATLTP-ARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAA 267
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
47-199 |
2.66e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 2.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 47 VVPSPSPGLRDQASSPfPKTAAPTAQAPRTGPPRTtvrktgattpsAGSPEIIPPLRTSAQPAATPFPALDlSPATPSED 126
Cdd:PRK10263 340 VTQTPPVASVDVPPAQ-PTVAWQPVPGPQTGEPVI-----------APAPEGYPQQSQYAQPAVQYNEPLQ-QPVQPQQP 406
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 111598761 127 GHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTA---PVTEAPTSPPPE 199
Cdd:PRK10263 407 YYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTyqqPAAQEPLYQQPQ 482
|
|
| rad23 |
TIGR00601 |
UV excision repair protein Rad23; All proteins in this family for which functions are known ... |
141-199 |
2.68e-03 |
|
UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273167 [Multi-domain] Cd Length: 378 Bit Score: 40.26 E-value: 2.68e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 141 PTTLASTVGQPPTTSvvtTAQASSTPGTPTAESPDRS-SNSSGVPPTAPVTEAPTSPPPE 199
Cdd:TIGR00601 77 PKTGTGKVAPPAATP---TSAPTPTPSPPASPASGMSaAPASAVEEKSPSEESATATAPE 133
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
66-192 |
2.69e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 40.75 E-value: 2.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 66 TAAPTAQAPRTGPPRTTvrktgaTTPSAGSPEIIPPLRTSAQPA---ATPFPALDLSPATPSEDGHTPttesppsrpaPT 142
Cdd:PHA03369 359 VLAAAAKVAVIAAPQTH------TGPADRQRPQRPDGIPYSVPArspMTAYPPVPQFCGDPGLVSPYN----------PQ 422
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 111598761 143 TLASTVGQPPTTSVvttaqasstPGTPTAESPDRSSNSSGVPPTAPVTEA 192
Cdd:PHA03369 423 SPGTSYGPEPVGPV---------PPQPTNPYVMPISMANMVYPGHPQEHG 463
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
107-199 |
3.20e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 40.53 E-value: 3.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 107 QPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSV----VTTAQASSTPGTPTAESPDRSSNSSG 182
Cdd:PRK14971 358 QLAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAaaqpSAPQSATQPAGTPPTVSVDPPAAVPV 437
|
90
....*....|....*..
gi 111598761 183 VPPTAPVTEAPTSPPPE 199
Cdd:PRK14971 438 NPPSTAPQAVRPAQFKE 454
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
64-188 |
3.68e-03 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 39.92 E-value: 3.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 64 PKTAAPT--AQAPRTgPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAATPFPAldlsPATPSedgHTPTTESPPSRPAP 141
Cdd:PRK10905 127 PATVAPVrnGNASRQ-TAKTQTAERPATTRPARKQAVIEPKKPQATAKTEPKPV----AQTPK---RTEPAAPVASTKAP 198
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 111598761 142 TTlASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAP 188
Cdd:PRK10905 199 AA-TSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAP 244
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
74-199 |
3.92e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.44 E-value: 3.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 74 PRTGPPRTTVRKTGATT---PSAGSPEIIPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQ 150
Cdd:PHA03378 676 PSPTGANTMLPIQWAPGtmqPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 755
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 111598761 151 PPTTSVVTTAQASSTPGTPTAESPDRssnssgVPPTA---PVTEAPTSPPPE 199
Cdd:PHA03378 756 PPAAAPGRARPPAAAPGAPTPQPPPQ------APPAPqqrPRGAPTPQPPPQ 801
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
44-163 |
4.20e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 40.18 E-value: 4.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 44 EGQVVPSPSPgLRDQASSPFPKTAAPTaQAPRTGPPRTTVRKTGATTPsagSPEIIPPLRTSAQPAATPFPaldlspaTP 123
Cdd:PRK14950 357 EALLVPVPAP-QPAKPTAAAPSPVRPT-PAPSTRPKAAAAANIPPKEP---VRETATPPPVPPRPVAPPVP-------HT 424
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 111598761 124 SEDGHTPTTESPPSRPAPTTLAStvgqPPTTSVVTTAQAS 163
Cdd:PRK14950 425 PESAPKLTRAAIPVDEKPKYTPP----APPKEEEKALIAD 460
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domain; |
347-392 |
5.63e-03 |
|
Laminin-type epidermal growth factor-like domain;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 34.98 E-value: 5.63e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 111598761 347 CPCSAV-TSTGNCTIESGEleptCdQCKDGYTGQNCNKCENGYYNSD 392
Cdd:smart00180 1 CDCDPGgSASGTCDPDTGQ----C-ECKPNVTGRRCDRCAPGYYGDG 42
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
47-153 |
6.45e-03 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 39.45 E-value: 6.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 47 VVPSPSPGLRDQASSPFPK-----TAAPTAQAPRTGPPRTTVRKTGATTP-SAGSPEII--PPLRTSAQPAATPFPALDL 118
Cdd:COG3266 265 SAPATTSLGEQQEVSLPPAvaaqpAAAAAAQPSAVALPAAPAAAAAAAAPaEAAAPQPTaaKPVVTETAAPAAPAPEAAA 344
|
90 100 110
....*....|....*....|....*....|....*
gi 111598761 119 SPATPsedghTPTTESPPSRPAPTTLAStvgQPPT 153
Cdd:COG3266 345 AAAAP-----AAPAVAKKLAADEQWLAS---QPAS 371
|
|
| PHA03264 |
PHA03264 |
envelope glycoprotein D; Provisional |
98-198 |
6.82e-03 |
|
envelope glycoprotein D; Provisional
Pssm-ID: 223029 [Multi-domain] Cd Length: 416 Bit Score: 39.22 E-value: 6.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 98 IIPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTlASTVGQPPTTSVVTTAQASstPGTPTAeSPDRS 177
Cdd:PHA03264 253 VVPPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPG-RETGGEGEGPEPAGRDGAA--GGEPKP-GPPRP 328
|
90 100
....*....|....*....|.
gi 111598761 178 SNSSGVPPTAPVTEAPTSPPP 198
Cdd:PHA03264 329 APDADRPEGWPSLEAITFPPP 349
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
49-199 |
6.98e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 39.52 E-value: 6.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 49 PSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPlrtsAQPAATPFPALDLSPATPSEDGH 128
Cdd:PLN03209 398 SKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKPP----TSPSPTAPTGVSPSVSSTSSVPA 473
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 111598761 129 TPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGTPTAESPDRSSNSSGVPPTAPVTEAPTSPPPE 199
Cdd:PLN03209 474 VPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADE 544
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
42-198 |
8.08e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 39.45 E-value: 8.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 42 GAEGQVVPSPSPGLRDQASSPFPKTAAPTAQAPRTG---PPRTTVRKTGATTPSAGSPEIIP-PLRTSAQPAATPfPALD 117
Cdd:PRK07003 467 DAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAAtpaAVPDARAPAAASREDAPAAAAPPaPEARPPTPAAAA-PAAR 545
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 118 LSPATPSED---GHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPgTPTAESPDRSSNSSGVPPTAPVTEAPT 194
Cdd:PRK07003 546 AGGAAAALDvlrNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVP-TPRARAATGDAPPNGAARAEQAAESRG 624
|
....
gi 111598761 195 SPPP 198
Cdd:PRK07003 625 APPP 628
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
69-196 |
9.11e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 39.14 E-value: 9.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 69 PTAQAPRTGPPRTTVRKTGATTPSAGSPEIIPPLRTSAQPAAT-PFPaldLSPATPSEDghtpttESPPSRPAPTtlaST 147
Cdd:PLN03209 324 PSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVvPRP---LSPYTAYED------LKPPTSPIPT---PP 391
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 111598761 148 VGQPPTTSVVTTAQASSTPGTPTAespdrSSNSSGVPPTAPV-----TEAPTSP 196
Cdd:PLN03209 392 SSSPASSKSVDAVAKPAEPDVVPS-----PGSASNVPEVEPAqveakKTRPLSP 440
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
46-201 |
9.36e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 39.28 E-value: 9.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 46 QVVPSPSPGLRDQASSPFPKTAAPTAQAPRTGPPRTTVRKTGATTP------SAGSPEIIPPlrTSAQPAATPFPALDLS 119
Cdd:PHA03378 718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRarppaaAPGAPTPQPP--PQAPPAPQQRPRGAPT 795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 120 PATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASSTPGT------PTAESPDRSSNSSGVPPTAPVTEAP 193
Cdd:PHA03378 796 PQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAalerqaAAGPTPSPGSGTSDKIVQAPVFYPP 875
|
....*...
gi 111598761 194 TSPPPEHM 201
Cdd:PHA03378 876 VLQPIQVM 883
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
87-201 |
9.59e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 39.00 E-value: 9.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 111598761 87 GATTPSAGSPeiiPPLRTSAQPAATPFPALDLSPATPSEDGHTPTTESPPSRPAPTTLASTVGQPPTTSVVTTAQASST- 165
Cdd:PHA03307 31 AADDLLSGSQ---GQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPt 107
|
90 100 110
....*....|....*....|....*....|....*..
gi 111598761 166 -PGTPTAESPDRSSNSSGVPPTAPvteaPTSPPPEHM 201
Cdd:PHA03307 108 pPGPSSPDPPPPTPPPASPPPSPA----PDLSEMLRP 140
|
|
|