|
Name |
Accession |
Description |
Interval |
E-value |
| MIF4G |
pfam02854 |
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ... |
758-986 |
2.56e-63 |
|
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. :
Pssm-ID: 397130 Cd Length: 203 Bit Score: 214.15 E-value: 2.56e-63
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 758 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 837
Cdd:pfam02854 1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 838 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 917
Cdd:pfam02854 72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 751130496 918 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 986
Cdd:pfam02854 125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
|
|
| W2_eIF4G1_like |
cd11559 |
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ... |
1427-1556 |
6.07e-55 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E. :
Pssm-ID: 211397 Cd Length: 134 Bit Score: 187.49 E-value: 6.07e-55
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1427 EELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYLC-DEQ 1505
Cdd:cd11559 4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDdDEQ 83
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 751130496 1506 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1556
Cdd:cd11559 84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
|
|
| MA3 |
pfam02847 |
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ... |
1228-1340 |
4.99e-35 |
|
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. :
Pssm-ID: 397128 Cd Length: 113 Bit Score: 129.70 E-value: 4.99e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:pfam02847 1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
|
90 100 110
....*....|....*....|....*....|...
gi 751130496 1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:pfam02847 81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
4-456 |
7.38e-09 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 7.38e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS----QPRQHFYPSR------------------AQPPSS 61
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdEPVGEPVHPRmltwirgleelasddagdPPPPLP 2556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 62 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 140
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 141 YAGAYYPAQGVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 220
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 221 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSV 300
Cdd:PHA03247 2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 301 EESTPISCETGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNG--VIPSEDLE---PE 375
Cdd:PHA03247 2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRrrpPS 2868
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 376 VESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAPPVAPSDTSPA 455
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP-----PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
|
.
gi 751130496 456 Q 456
Cdd:PHA03247 2944 A 2944
|
|
| rad2 super family |
cl36701 |
DNA excision repair protein (rad2); All proteins in this family for which functions are known ... |
366-605 |
1.44e-03 |
|
DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair] The actual alignment was detected with superfamily member TIGR00600:
Pssm-ID: 273166 [Multi-domain] Cd Length: 1034 Bit Score: 43.35 E-value: 1.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 366 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 444
Cdd:TIGR00600 520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 445 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVpaqlSQNLEVAAAtQVAVSVPKRRRKIkELNKKE 522
Cdd:TIGR00600 589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQ----TTNVGESAD-LLLISNPMEVEPM-ESEKEE 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 523 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 595
Cdd:TIGR00600 654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
|
250
....*....|
gi 751130496 596 DQWKPLNLEE 605
Cdd:TIGR00600 731 NEWQDISLEE 740
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MIF4G |
pfam02854 |
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ... |
758-986 |
2.56e-63 |
|
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
Pssm-ID: 397130 Cd Length: 203 Bit Score: 214.15 E-value: 2.56e-63
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 758 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 837
Cdd:pfam02854 1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 838 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 917
Cdd:pfam02854 72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 751130496 918 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 986
Cdd:pfam02854 125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
|
|
| W2_eIF4G1_like |
cd11559 |
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ... |
1427-1556 |
6.07e-55 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.
Pssm-ID: 211397 Cd Length: 134 Bit Score: 187.49 E-value: 6.07e-55
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1427 EELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYLC-DEQ 1505
Cdd:cd11559 4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDdDEQ 83
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 751130496 1506 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1556
Cdd:cd11559 84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
|
|
| MIF4G |
smart00543 |
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ... |
759-986 |
7.53e-53 |
|
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)
Pssm-ID: 214713 Cd Length: 200 Bit Score: 184.10 E-value: 7.53e-53
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 759 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 838
Cdd:smart00543 2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 839 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 918
Cdd:smart00543 72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 751130496 919 KLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRQSN 986
Cdd:smart00543 123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
|
|
| MA3 |
pfam02847 |
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ... |
1228-1340 |
4.99e-35 |
|
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.
Pssm-ID: 397128 Cd Length: 113 Bit Score: 129.70 E-value: 4.99e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:pfam02847 1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
|
90 100 110
....*....|....*....|....*....|...
gi 751130496 1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:pfam02847 81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
|
|
| MA3 |
smart00544 |
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ... |
1228-1340 |
2.63e-34 |
|
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press
Pssm-ID: 214714 Cd Length: 113 Bit Score: 127.75 E-value: 2.63e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:smart00544 1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
|
90 100 110
....*....|....*....|....*....|...
gi 751130496 1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:smart00544 81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
|
|
| eIF5C |
smart00515 |
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5; |
1496-1578 |
2.05e-27 |
|
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
Pssm-ID: 214705 Cd Length: 83 Bit Score: 106.99 E-value: 2.05e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1496 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFNWL 1575
Cdd:smart00515 3 LLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTWL 80
|
...
gi 751130496 1576 REA 1578
Cdd:smart00515 81 QEA 83
|
|
| W2 |
pfam02020 |
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ... |
1507-1583 |
4.01e-23 |
|
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.
Pssm-ID: 460415 Cd Length: 76 Bit Score: 94.52 E-value: 4.01e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 751130496 1507 ELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFNWLREAEDEES 1583
Cdd:pfam02020 1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
4-456 |
7.38e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 7.38e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS----QPRQHFYPSR------------------AQPPSS 61
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdEPVGEPVHPRmltwirgleelasddagdPPPPLP 2556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 62 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 140
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 141 YAGAYYPAQGVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 220
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 221 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSV 300
Cdd:PHA03247 2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 301 EESTPISCETGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNG--VIPSEDLE---PE 375
Cdd:PHA03247 2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRrrpPS 2868
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 376 VESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAPPVAPSDTSPA 455
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP-----PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
|
.
gi 751130496 456 Q 456
Cdd:PHA03247 2944 A 2944
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
196-455 |
1.77e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 1.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 196 IMSGARTASTPTPPQTGGSLEPQPNGESPQVaviirpdDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEP 275
Cdd:pfam05109 404 IITRTATNATTTTHKVIFSKAPESTTTSPTL-------NTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSP 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 276 ---GSESNLGVLS---IPGDTMTTGMIPMSVEESTPISCETgePYCLSPEPTLAEPILE-VEVTLSKPIPESEFsSSPLQ 348
Cdd:pfam05109 477 tpaGTTSGASPVTpspSPRDNGTESKAPDMTSPTSAVTTPT--PNATSPTPAVTTPTPNaTSPTLGKTSPTSAV-TTPTP 553
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 349 VSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESlVPIAPTAQpeELLNGAPSPPAVDLSP---VSEP 425
Cdd:pfam05109 554 NATSPTP-AVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-SPQANTTN--HTLGGTSSTPVVTSPPknaTSAV 629
|
250 260 270
....*....|....*....|....*....|
gi 751130496 426 EEQAKKVSSAALASiLSPAPPVAPSDTSPA 455
Cdd:pfam05109 630 TTGQHNITSSSTSS-MSLRPSSISETLSPS 658
|
|
| rad2 |
TIGR00600 |
DNA excision repair protein (rad2); All proteins in this family for which functions are known ... |
366-605 |
1.44e-03 |
|
DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273166 [Multi-domain] Cd Length: 1034 Bit Score: 43.35 E-value: 1.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 366 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 444
Cdd:TIGR00600 520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 445 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVpaqlSQNLEVAAAtQVAVSVPKRRRKIkELNKKE 522
Cdd:TIGR00600 589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQ----TTNVGESAD-LLLISNPMEVEPM-ESEKEE 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 523 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 595
Cdd:TIGR00600 654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
|
250
....*....|
gi 751130496 596 DQWKPLNLEE 605
Cdd:TIGR00600 731 NEWQDISLEE 740
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MIF4G |
pfam02854 |
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ... |
758-986 |
2.56e-63 |
|
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
Pssm-ID: 397130 Cd Length: 203 Bit Score: 214.15 E-value: 2.56e-63
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 758 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 837
Cdd:pfam02854 1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 838 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 917
Cdd:pfam02854 72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 751130496 918 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 986
Cdd:pfam02854 125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
|
|
| W2_eIF4G1_like |
cd11559 |
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ... |
1427-1556 |
6.07e-55 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.
Pssm-ID: 211397 Cd Length: 134 Bit Score: 187.49 E-value: 6.07e-55
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1427 EELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYLC-DEQ 1505
Cdd:cd11559 4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDdDEQ 83
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 751130496 1506 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1556
Cdd:cd11559 84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
|
|
| MIF4G |
smart00543 |
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ... |
759-986 |
7.53e-53 |
|
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)
Pssm-ID: 214713 Cd Length: 200 Bit Score: 184.10 E-value: 7.53e-53
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 759 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 838
Cdd:smart00543 2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 839 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 918
Cdd:smart00543 72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 751130496 919 KLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRQSN 986
Cdd:smart00543 123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
|
|
| MA3 |
pfam02847 |
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ... |
1228-1340 |
4.99e-35 |
|
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.
Pssm-ID: 397128 Cd Length: 113 Bit Score: 129.70 E-value: 4.99e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:pfam02847 1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
|
90 100 110
....*....|....*....|....*....|...
gi 751130496 1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:pfam02847 81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
|
|
| MA3 |
smart00544 |
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ... |
1228-1340 |
2.63e-34 |
|
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press
Pssm-ID: 214714 Cd Length: 113 Bit Score: 127.75 E-value: 2.63e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:smart00544 1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
|
90 100 110
....*....|....*....|....*....|...
gi 751130496 1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:smart00544 81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
|
|
| eIF5C |
smart00515 |
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5; |
1496-1578 |
2.05e-27 |
|
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
Pssm-ID: 214705 Cd Length: 83 Bit Score: 106.99 E-value: 2.05e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1496 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFNWL 1575
Cdd:smart00515 3 LLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTWL 80
|
...
gi 751130496 1576 REA 1578
Cdd:smart00515 81 QEA 83
|
|
| W2 |
pfam02020 |
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ... |
1507-1583 |
4.01e-23 |
|
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.
Pssm-ID: 460415 Cd Length: 76 Bit Score: 94.52 E-value: 4.01e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 751130496 1507 ELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFNWLREAEDEES 1583
Cdd:pfam02020 1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
|
|
| W2 |
cd11473 |
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ... |
1427-1550 |
2.40e-19 |
|
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211395 Cd Length: 135 Bit Score: 85.61 E-value: 2.40e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1427 EELRRQLEKLLK-DGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFE----TPLRVDVQVLKVRARLLQKYL 1501
Cdd:cd11473 4 KKLRDSLLKELEeDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADsislTQKEQLVLVLKKYGPVLRELL 83
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 751130496 1502 CD-EQKELQALYALQALVVT--LEQPANLLRMFFDALYDEDVVKEDAFYSWE 1550
Cdd:cd11473 84 KLiKKDQLYLLLKIEKLCLQlkLSELISLLEKILDLLYDADVLSEEAILSWF 135
|
|
| W2_eIF2B_epsilon |
cd11558 |
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ... |
1496-1583 |
1.16e-13 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211396 Cd Length: 169 Bit Score: 70.37 E-value: 1.16e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1496 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEQQGKGVALKSVTAFFNWL 1575
Cdd:cd11558 82 LLENYVKSQDDQVELLLALEEFCLESEEGGPLFAKLLHALYDLDILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWL 161
|
....*...
gi 751130496 1576 REAEDEES 1583
Cdd:cd11558 162 EEAEEESD 169
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
4-456 |
7.38e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 7.38e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS----QPRQHFYPSR------------------AQPPSS 61
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdEPVGEPVHPRmltwirgleelasddagdPPPPLP 2556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 62 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 140
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 141 YAGAYYPAQGVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 220
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 221 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSV 300
Cdd:PHA03247 2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 301 EESTPISCETGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNG--VIPSEDLE---PE 375
Cdd:PHA03247 2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRrrpPS 2868
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 376 VESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAPPVAPSDTSPA 455
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP-----PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
|
.
gi 751130496 456 Q 456
Cdd:PHA03247 2944 A 2944
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
9-563 |
1.02e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.72 E-value: 1.02e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 9 GPPPARSPGLPQPA----FPPGQTAPVVfSTPQAT----------QMNTPSQPRQHFYPSRAQPPSSAASrvqsaaparp 74
Cdd:PHA03247 2550 DPPPPLPPAAPPAApdrsVPPPRPAPRP-SEPAVTsrarrpdappQSARPRAPVDDRGDPRGPAPPSPLP---------- 2618
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 75 gpaphvyPAGSQVMMIPSQISYSASQgayyiPGQGRSTYVVPTQQYPVQPGAPGFYPGASPTEFGTYAGAYYPAQGVQQ- 153
Cdd:PHA03247 2619 -------PDTHAPDPPPPSPSPAANE-----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRr 2686
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 154 -FPASVAPAPVLMNQPPQIAPKRERktirirdPNQGGKDITEEIMSGARTASTPTPPQTGGslePQPNGESPQVAVIIRP 232
Cdd:PHA03247 2687 aARPTVGSLTSLADPPPPPPTPEPA-------PHALVSATPLPPGPAAARQASPALPAAPA---PPAVPAGPATPGGPAR 2756
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 233 DDRSQGAAIGGRPGLP-GPEHSP--GTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSVEESTPISCE 309
Cdd:PHA03247 2757 PARPPTTAGPPAPAPPaAPAAGPprRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP 2836
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 310 TgepyclsPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSP 389
Cdd:PHA03247 2837 T-------APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKP-AAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 390 CASESLVPIAPTAQPEELLNGAPSP-----------PAVDLSPVSEPEEQAKKVSSAALASILSPA-----PPVAPSDTS 453
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPpppprpqpplaPTTDPAGAGEPSGAVPQPWLGALVPGRVAVprfrvPQPAPSREA 2988
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 454 PAQEEEMEEDDDDEEGGEAESEKGgedVPLDSTPVPAQLSQNLEVAAATQVAvSVPKRRRKIKELNKKEAVgDLLDAFKE 533
Cdd:PHA03247 2989 PASSTPPLTGHSLSRVSSWASSLA---LHEETDPPPVSLKQTLWPPDDTEDS-DADSLFDSDSERSDLEAL-DPLPPEPH 3063
|
570 580 590
....*....|....*....|....*....|
gi 751130496 534 VDPAVPEVENQPPTGSNPSPESEGSMVPTQ 563
Cdd:PHA03247 3064 DPFAHEPDPATPEAGARESPSSQFGPPPLS 3093
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
5-353 |
5.41e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 58.03 E-value: 5.41e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 5 PQPTGPPPARSPGLPQP--------AFPPGQTAPVVFSTPQATQmnTPSQPRQHFYP-SRAQPPSSAASRVQSAAPARPG 75
Cdd:PHA03247 2706 PTPEPAPHALVSATPLPpgpaaarqASPALPAAPAPPAVPAGPA--TPGGPARPARPpTTAGPPAPAPPAAPAAGPPRRL 2783
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 76 PAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYP----VQPGAPGFYPGASPTEFgTYAGAYYPAQGV 151
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPpptsAQPTAPPPPPGPPPPSL-PLGGSVAPGGDV 2862
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 152 QQFPASVAPAPVL----------MNQP-------PQIAPKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQTGGS 214
Cdd:PHA03247 2863 RRRPPSRSPAAKPaaparppvrrLARPavsrsteSFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 215 LEPQPNGESPQVAVIIRPDDRSqGAAIGGR------------PGLPGPEHSPGTESQPSSPSPTPSPPPIL-----EPGS 277
Cdd:PHA03247 2943 LAPTTDPAGAGEPSGAVPQPWL-GALVPGRvavprfrvpqpaPSREAPASSTPPLTGHSLSRVSSWASSLAlheetDPPP 3021
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 751130496 278 ESNLGVLSIPGDtmTTGMIPMSVEESTPISCETGEPYCLSPEPTLA---EPILEVEVTLSKPIPESEFSSSPLQVSTAL 353
Cdd:PHA03247 3022 VSLKQTLWPPDD--TEDSDADSLFDSDSERSDLEALDPLPPEPHDPfahEPDPATPEAGARESPSSQFGPPPLSANAAL 3098
|
|
| W2_eIF5 |
cd11561 |
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ... |
1425-1583 |
3.27e-07 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211399 Cd Length: 157 Bit Score: 51.46 E-value: 3.27e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1425 AFEELRRQLEKLLKDGGSNQrvfdwIDANLNEQQIASNTLVRALmttvcysAIIFETPLRVD-VQVLKVRARLLQKYLCD 1503
Cdd:cd11561 7 RVDELGEFLKKNKDESGLSE-----LKEILKEAERLDVVKDKAV-------LVLAEVLFDENiVKEIKKRKALLLKLVTD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1504 EQKELQALYALQALVVtlEQPANLLRMF---FDALYDEDVVKEDAFYSW---ESSKDPAEQQGKGVaLKSVTAFFNWLRE 1577
Cdd:cd11561 75 EKAQKALLGGIERFCG--KHSPELLKKVpliLKALYDNDILEEEVILKWyekVSKKYVSKEKSKKV-RKAAEPFVEWLEE 151
|
....*.
gi 751130496 1578 AEDEES 1583
Cdd:cd11561 152 AEEEEE 157
|
|
| W2_eIF5C_like |
cd11560 |
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ... |
1421-1581 |
2.17e-06 |
|
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211398 [Multi-domain] Cd Length: 194 Bit Score: 49.90 E-value: 2.17e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1421 QRTLAFEELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASN--------TLVRALMTTVCYSA---IIFETPLRVdvqv 1489
Cdd:cd11560 29 YRKQASQEIKKELQQELKEMIAEEEPVKEIIAAVKEQMKKSSlpehevvgLLWTALMDAVEWSKkedQIAEQALRH---- 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1490 LKVRARLLQKYLCDEQKELQALYALQalVVTLEQpANLLRMFFD---ALYDEDVVKEDAFYSWesSKDPAEQQGKGVALK 1566
Cdd:cd11560 105 LKKYAPLLAAFCTTARAELALLNKIQ--EYCYEN-MKFMKVFQKivkLLYKADVLSEDAILKW--YKKGHSPKGKQVFLK 179
|
170
....*....|....*
gi 751130496 1567 SVTAFFNWLREAEDE 1581
Cdd:cd11560 180 QMEPFVEWLQEAEEE 194
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1-214 |
1.71e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 46.60 E-value: 1.71e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1 MNKAPQPTGP----PPARSPGLPQPafPPgqTAPVVFSTPQATQmnTPSQPRQHfYPSRAQPPSSAASRVQSaaparpgp 76
Cdd:PHA03378 672 IPYQPSPTGAntmlPIQWAPGTMQP--PP--RAPTPMRPPAAPP--GRAQRPAA-ATGRARPPAAAPGRARP-------- 736
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 77 aphvyPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPgapgfyPGASPTEFGTYAGAYYPAQGVQQFPA 156
Cdd:PHA03378 737 -----PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP------PQAPPAPQQRPRGAPTPQPPPQAGPT 805
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 157 SVAPAPvlMNQPPQIAPKRERKTIRIRDPNQGGKDITEEIMSGARTAST-PTP-PQTGGS 214
Cdd:PHA03378 806 SMQLMP--RAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAgPTPsPGSGTS 863
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
196-455 |
1.77e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 1.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 196 IMSGARTASTPTPPQTGGSLEPQPNGESPQVaviirpdDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEP 275
Cdd:pfam05109 404 IITRTATNATTTTHKVIFSKAPESTTTSPTL-------NTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSP 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 276 ---GSESNLGVLS---IPGDTMTTGMIPMSVEESTPISCETgePYCLSPEPTLAEPILE-VEVTLSKPIPESEFsSSPLQ 348
Cdd:pfam05109 477 tpaGTTSGASPVTpspSPRDNGTESKAPDMTSPTSAVTTPT--PNATSPTPAVTTPTPNaTSPTLGKTSPTSAV-TTPTP 553
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 349 VSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESlVPIAPTAQpeELLNGAPSPPAVDLSP---VSEP 425
Cdd:pfam05109 554 NATSPTP-AVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-SPQANTTN--HTLGGTSSTPVVTSPPknaTSAV 629
|
250 260 270
....*....|....*....|....*....|
gi 751130496 426 EEQAKKVSSAALASiLSPAPPVAPSDTSPA 455
Cdd:pfam05109 630 TTGQHNITSSSTSS-MSLRPSSISETLSPS 658
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
3-172 |
2.45e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.80 E-value: 2.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 3 KAPQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPSQPRQHFYPSRAQPPSSAASRVQSAAparpgpaphvyp 82
Cdd:pfam09770 206 QAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQ------------ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 83 agsqvmMIPSQIsySASQGAYYIPGQGRSTYVVPTQ--QYPVQPGAPGF-YPGAsptefGTYAGAYYPAQGVQQFPASVA 159
Cdd:pfam09770 274 ------PDPAQP--SIQPQAQQFHQQPPPVPVQPTQilQNPNRLSAARVgYPQN-----PQPGVQPAPAHQAHRQQGSFG 340
|
170
....*....|...
gi 751130496 160 PAPVLMNQPPQIA 172
Cdd:pfam09770 341 RQAPIITHPQQLA 353
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
155-568 |
7.23e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 7.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 155 PASVAPAPVLMNQPPQIAPK--------RERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQTGgSLEPQPNGESPqv 226
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRpsepavtsRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH-APDPPPPSPSP-- 2633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 227 aviirpddRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSVEESTPi 306
Cdd:PHA03247 2634 --------AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP- 2704
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 307 scetgEPyclSPEPTlaePILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNGVIPSEDLEPEVESSTEPAPPP 386
Cdd:PHA03247 2705 -----PP---TPEPA---PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 387 LSPCASESLVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEeqakkvSSAALASILSPAPPVAPSDTS-PAQEEEMEEDDD 465
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA------PAAALPPAASPAGPLPPPTSAqPTAPPPPPGPPP 2847
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 466 DEEGGEAESEKGGedvPLDSTPVPAQlsqnlevAAATQVAVSVPKRRRKikelnKKEAVGDLLDAFKE-VDPAVPEVENQ 544
Cdd:PHA03247 2848 PSLPLGGSVAPGG---DVRRRPPSRS-------PAAKPAAPARPPVRRL-----ARPAVSRSTESFALpPDQPERPPQPQ 2912
|
410 420
....*....|....*....|....
gi 751130496 545 PPTGSNPSPESEGSMVPTQPEETE 568
Cdd:PHA03247 2913 APPPPQPQPQPPPPPQPQPPPPPP 2936
|
|
| rad2 |
TIGR00600 |
DNA excision repair protein (rad2); All proteins in this family for which functions are known ... |
366-605 |
1.44e-03 |
|
DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273166 [Multi-domain] Cd Length: 1034 Bit Score: 43.35 E-value: 1.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 366 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 444
Cdd:TIGR00600 520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 445 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVpaqlSQNLEVAAAtQVAVSVPKRRRKIkELNKKE 522
Cdd:TIGR00600 589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQ----TTNVGESAD-LLLISNPMEVEPM-ESEKEE 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 523 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 595
Cdd:TIGR00600 654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
|
250
....*....|
gi 751130496 596 DQWKPLNLEE 605
Cdd:TIGR00600 731 NEWQDISLEE 740
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
320-480 |
4.03e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 42.00 E-value: 4.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 320 PTLAEPILEVEVTLSKPIPESEFSSSPLQV-STALVPHKVETHEPNGVIPSEDLEP---EVESSTEPAPPPLSPCASESL 395
Cdd:PRK08691 380 PSAQTAEKETAAKKPQPRPEAETAQTPVQTaSAAAMPSEGKTAGPVSNQENNDVPPwedAPDEAQTAAGTAQTSAKSIQT 459
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 396 VPIAPTAQPEEL-------------LNGAPSPPAVDLSPVSEPEEQAKKVSSAalasilsPAPPVA----PSDTSPAQEE 458
Cdd:PRK08691 460 ASEAETPPENQVsknkaadnetdapLSEVPSENPIQATPNDEAVETETFAHEA-------PAEPFYgygfPDNDCPPEDG 532
|
170 180
....*....|....*....|..
gi 751130496 459 EMEEDDDDEEGGEAESEKGGED 480
Cdd:PRK08691 533 AEIPPPDWEHAAPADTAGGGAD 554
|
|
| Rib_recp_KP_reg |
pfam05104 |
Ribosome receptor lysine/proline rich region; This highly conserved region is found towards ... |
361-455 |
6.04e-03 |
|
Ribosome receptor lysine/proline rich region; This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.
Pssm-ID: 461548 [Multi-domain] Cd Length: 140 Bit Score: 38.95 E-value: 6.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 361 HEPNGVIPseDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPAVDlSPVSEPEEQAKKVSSAALAsi 440
Cdd:pfam05104 44 EKPNGKLP--ESEQADESEEEPREFKTPDEAPSAALEPEPVPTPVPAPVEPEPAPPSE-SPAPSPKEKKKKEKKSAKV-- 118
|
90
....*....|....*
gi 751130496 441 lSPAPPVAPSDTSPA 455
Cdd:pfam05104 119 -EPAETPEAVQPKPA 132
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
339-455 |
8.76e-03 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 39.60 E-value: 8.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 339 ESEFSSSPLqvstalVPHKVETHEPNGV---------IPSEDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEElln 409
Cdd:PRK11633 35 QDEFAAIPL------VPKPGDRDEPDMMpaatqalptQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVE--- 105
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 751130496 410 gAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAP-PVAPSDTSPA 455
Cdd:PRK11633 106 -PPKPK-----PVEKPKPKPKPQQKVEAPPAPKPEPkPVVEEKAAPT 146
|
|
|