Conserved domains on  [gi|751130496|ref|NP_001291361|]

eukaryotic translation initiation factor 4 gamma 1 isoform c [Mus musculus]


List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
758-986 2.56e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.56e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   758 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 837
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   838 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 917
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 751130496   918 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 986
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1427-1556 6.07e-55

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 187.49  E-value: 6.07e-55
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1427 EELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYLC-DEQ 1505
Cdd:cd11559     4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDdDEQ 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 751130496 1506 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1556
Cdd:cd11559    84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1228-1340 4.99e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 129.70  E-value: 4.99e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 751130496  1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PHA03247 super family cl33720
large tegument protein UL36; Provisional
4-456 7.38e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 7.38e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS----QPRQHFYPSR------------------AQPPSS 61
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdEPVGEPVHPRmltwirgleelasddagdPPPPLP 2556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   62 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 140
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  141 YAGAYYPAQGVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 220
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  221 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSV 300
Cdd:PHA03247 2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  301 EESTPISCETGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNG--VIPSEDLE---PE 375
Cdd:PHA03247 2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRrrpPS 2868
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  376 VESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAPPVAPSDTSPA 455
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP-----PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943

                  .
gi 751130496  456 Q 456
Cdd:PHA03247 2944 A 2944
rad2 super family cl36701
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
366-605 1.44e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


The actual alignment was detected with superfamily member TIGR00600:

Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 43.35  E-value: 1.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   366 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 444
Cdd:TIGR00600  520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   445 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVpaqlSQNLEVAAAtQVAVSVPKRRRKIkELNKKE 522
Cdd:TIGR00600  589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQ----TTNVGESAD-LLLISNPMEVEPM-ESEKEE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   523 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 595
Cdd:TIGR00600  654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
                          250
                   ....*....|
gi 751130496   596 DQWKPLNLEE 605
Cdd:TIGR00600  731 NEWQDISLEE 740
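
Each hit in the list above carries a summary line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, so the numbers can be pulled into records with a small parser. The following is a minimal Python sketch, not an NCBI tool: the function name `parse_hit_summary` and the dictionary keys are hypothetical, and the pattern assumes the layout seen on this page (including the optional `[Multi-domain]` tag).

```python
import re

# Pattern assumed from the summary lines on this page; "[Multi-domain]" is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.56e-63"
    print(parse_hit_summary(line))
```

Sorting the resulting records by `e_value` should reproduce the order in which the hits are listed, as long as every summary line matches the assumed pattern.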
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
758-986 2.56e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.56e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   758 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 837
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   838 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 917
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 751130496   918 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 986
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1427-1556 6.07e-55

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 187.49  E-value: 6.07e-55
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1427 EELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYLC-DEQ 1505
Cdd:cd11559     4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLDdDEQ 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 751130496 1506 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1556
Cdd:cd11559    84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
759-986 7.53e-53

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TIBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214713  Cd Length: 200  Bit Score: 184.10  E-value: 7.53e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    759 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 838
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    839 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 918
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 751130496    919 KLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRQSN 986
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1228-1340 4.99e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 129.70  E-value: 4.99e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 751130496  1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1228-1340 2.63e-34

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TIBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 127.75  E-value: 2.63e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   1228 VEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1307
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 751130496   1308 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1340
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1496-1578 2.05e-27

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 106.99  E-value: 2.05e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   1496 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFNWL 1575
Cdd:smart00515    3 LLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTWL 80

                    ...
gi 751130496   1576 REA 1578
Cdd:smart00515   81 QEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1507-1583 4.01e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 94.52  E-value: 4.01e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 751130496  1507 ELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFNWLREAEDEES 1583
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-456 7.38e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 7.38e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS----QPRQHFYPSR------------------AQPPSS 61
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdEPVGEPVHPRmltwirgleelasddagdPPPPLP 2556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   62 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 140
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  141 YAGAYYPAQGVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 220
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  221 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSV 300
Cdd:PHA03247 2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  301 EESTPISCETGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNG--VIPSEDLE---PE 375
Cdd:PHA03247 2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRrrpPS 2868
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  376 VESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAPPVAPSDTSPA 455
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP-----PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943

                  .
gi 751130496  456 Q 456
Cdd:PHA03247 2944 A 2944
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
196-455 1.77e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   196 IMSGARTASTPTPPQTGGSLEPQPNGESPQVaviirpdDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEP 275
Cdd:pfam05109  404 IITRTATNATTTTHKVIFSKAPESTTTSPTL-------NTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSP 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   276 ---GSESNLGVLS---IPGDTMTTGMIPMSVEESTPISCETgePYCLSPEPTLAEPILE-VEVTLSKPIPESEFsSSPLQ 348
Cdd:pfam05109  477 tpaGTTSGASPVTpspSPRDNGTESKAPDMTSPTSAVTTPT--PNATSPTPAVTTPTPNaTSPTLGKTSPTSAV-TTPTP 553
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   349 VSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESlVPIAPTAQpeELLNGAPSPPAVDLSP---VSEP 425
Cdd:pfam05109  554 NATSPTP-AVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-SPQANTTN--HTLGGTSSTPVVTSPPknaTSAV 629
                          250       260       270
                   ....*....|....*....|....*....|
gi 751130496   426 EEQAKKVSSAALASiLSPAPPVAPSDTSPA 455
Cdd:pfam05109  630 TTGQHNITSSSTSS-MSLRPSSISETLSPS 658
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
366-605 1.44e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 43.35  E-value: 1.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   366 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 444
Cdd:TIGR00600  520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   445 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVpaqlSQNLEVAAAtQVAVSVPKRRRKIkELNKKE 522
Cdd:TIGR00600  589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQ----TTNVGESAD-LLLISNPMEVEPM-ESEKEE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   523 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 595
Cdd:TIGR00600  654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
                          250
                   ....*....|
gi 751130496   596 DQWKPLNLEE 605
Cdd:TIGR00600  731 NEWQDISLEE 740
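
The alignment rows above can be sanity-checked against their printed coordinates, and a rough identity figure can be read off each query/model pair. The following Python sketch is illustrative only. It assumes that `-` marks a gap and that lowercase letters mark residues not aligned to the domain model, which is a reading of this display rather than something the page states. The function names `ungapped_length` and `identity_fraction` are hypothetical.

```python
def ungapped_length(row):
    """Count residues in an alignment row, ignoring '-' gap characters."""
    return sum(1 for ch in row if ch != "-")


def identity_fraction(query_row, model_row):
    """Fraction of aligned columns in which query and model residues match.

    Columns with a gap in either row, or a lowercase (assumed unaligned)
    letter in either row, are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have equal length")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical / aligned if aligned else 0.0


if __name__ == "__main__":
    # Final row of the rad2 (TIGR00600) alignment: query residues 596-605.
    query = "DQWKPLNLEE"
    model = "NEWQDISLEE"
    print(ungapped_length(query) == 605 - 596 + 1)
    print(identity_fraction(query, model))
```

The coordinate check relies only on the start and end numbers printed on each row: the number of non-gap characters should equal end minus start plus one. A mismatch would usually point to a row that was damaged when the page was copied.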
 
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1427-1550 2.40e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 85.61  E-value: 2.40e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1427 EELRRQLEKLLK-DGGSNQRVFDWIDANLNEQQIASNTLVRALMTTVCYSAIIFE----TPLRVDVQVLKVRARLLQKYL 1501
Cdd:cd11473     4 KKLRDSLLKELEeDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADsislTQKEQLVLVLKKYGPVLRELL 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 751130496 1502 CD-EQKELQALYALQALVVT--LEQPANLLRMFFDALYDEDVVKEDAFYSWE 1550
Cdd:cd11473    84 KLiKKDQLYLLLKIEKLCLQlkLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1496-1583 1.16e-13

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 70.37  E-value: 1.16e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1496 LLQKYLCDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEQQGKGVALKSVTAFFNWL 1575
Cdd:cd11558    82 LLENYVKSQDDQVELLLALEEFCLESEEGGPLFAKLLHALYDLDILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWL 161

                  ....*...
gi 751130496 1576 REAEDEES 1583
Cdd:cd11558   162 EEAEEESD 169
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-456 7.38e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 7.38e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS----QPRQHFYPSR------------------AQPPSS 61
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdEPVGEPVHPRmltwirgleelasddagdPPPPLP 2556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   62 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 140
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  141 YAGAYYPAQGVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 220
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  221 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSV 300
Cdd:PHA03247 2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  301 EESTPISCETGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNG--VIPSEDLE---PE 375
Cdd:PHA03247 2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVRrrpPS 2868
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  376 VESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAPPVAPSDTSPA 455
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP-----PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943

                  .
gi 751130496  456 Q 456
Cdd:PHA03247 2944 A 2944
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-563 1.02e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 1.02e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    9 GPPPARSPGLPQPA----FPPGQTAPVVfSTPQAT----------QMNTPSQPRQHFYPSRAQPPSSAASrvqsaaparp 74
Cdd:PHA03247 2550 DPPPPLPPAAPPAApdrsVPPPRPAPRP-SEPAVTsrarrpdappQSARPRAPVDDRGDPRGPAPPSPLP---------- 2618
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   75 gpaphvyPAGSQVMMIPSQISYSASQgayyiPGQGRSTYVVPTQQYPVQPGAPGFYPGASPTEFGTYAGAYYPAQGVQQ- 153
Cdd:PHA03247 2619 -------PDTHAPDPPPPSPSPAANE-----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRr 2686
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  154 -FPASVAPAPVLMNQPPQIAPKRERktirirdPNQGGKDITEEIMSGARTASTPTPPQTGGslePQPNGESPQVAVIIRP 232
Cdd:PHA03247 2687 aARPTVGSLTSLADPPPPPPTPEPA-------PHALVSATPLPPGPAAARQASPALPAAPA---PPAVPAGPATPGGPAR 2756
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  233 DDRSQGAAIGGRPGLP-GPEHSP--GTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSVEESTPISCE 309
Cdd:PHA03247 2757 PARPPTTAGPPAPAPPaAPAAGPprRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP 2836
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  310 TgepyclsPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSP 389
Cdd:PHA03247 2837 T-------APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKP-AAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  390 CASESLVPIAPTAQPEELLNGAPSP-----------PAVDLSPVSEPEEQAKKVSSAALASILSPA-----PPVAPSDTS 453
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPpppprpqpplaPTTDPAGAGEPSGAVPQPWLGALVPGRVAVprfrvPQPAPSREA 2988
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  454 PAQEEEMEEDDDDEEGGEAESEKGgedVPLDSTPVPAQLSQNLEVAAATQVAvSVPKRRRKIKELNKKEAVgDLLDAFKE 533
Cdd:PHA03247 2989 PASSTPPLTGHSLSRVSSWASSLA---LHEETDPPPVSLKQTLWPPDDTEDS-DADSLFDSDSERSDLEAL-DPLPPEPH 3063
                         570       580       590
                  ....*....|....*....|....*....|
gi 751130496  534 VDPAVPEVENQPPTGSNPSPESEGSMVPTQ 563
Cdd:PHA03247 3064 DPFAHEPDPATPEAGARESPSSQFGPPPLS 3093
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-353 5.41e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 5.41e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    5 PQPTGPPPARSPGLPQP--------AFPPGQTAPVVFSTPQATQmnTPSQPRQHFYP-SRAQPPSSAASRVQSAAPARPG 75
Cdd:PHA03247 2706 PTPEPAPHALVSATPLPpgpaaarqASPALPAAPAPPAVPAGPA--TPGGPARPARPpTTAGPPAPAPPAAPAAGPPRRL 2783
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   76 PAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYP----VQPGAPGFYPGASPTEFgTYAGAYYPAQGV 151
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPpptsAQPTAPPPPPGPPPPSL-PLGGSVAPGGDV 2862
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  152 QQFPASVAPAPVL----------MNQP-------PQIAPKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQTGGS 214
Cdd:PHA03247 2863 RRRPPSRSPAAKPaaparppvrrLARPavsrsteSFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  215 LEPQPNGESPQVAVIIRPDDRSqGAAIGGR------------PGLPGPEHSPGTESQPSSPSPTPSPPPIL-----EPGS 277
Cdd:PHA03247 2943 LAPTTDPAGAGEPSGAVPQPWL-GALVPGRvavprfrvpqpaPSREAPASSTPPLTGHSLSRVSSWASSLAlheetDPPP 3021
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 751130496  278 ESNLGVLSIPGDtmTTGMIPMSVEESTPISCETGEPYCLSPEPTLA---EPILEVEVTLSKPIPESEFSSSPLQVSTAL 353
Cdd:PHA03247 3022 VSLKQTLWPPDD--TEDSDADSLFDSDSERSDLEALDPLPPEPHDPfahEPDPATPEAGARESPSSQFGPPPLSANAAL 3098
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1425-1583 3.27e-07

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 51.46  E-value: 3.27e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1425 AFEELRRQLEKLLKDGGSNQrvfdwIDANLNEQQIASNTLVRALmttvcysAIIFETPLRVD-VQVLKVRARLLQKYLCD 1503
Cdd:cd11561     7 RVDELGEFLKKNKDESGLSE-----LKEILKEAERLDVVKDKAV-------LVLAEVLFDENiVKEIKKRKALLLKLVTD 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1504 EQKELQALYALQALVVtlEQPANLLRMF---FDALYDEDVVKEDAFYSW---ESSKDPAEQQGKGVaLKSVTAFFNWLRE 1577
Cdd:cd11561    75 EKAQKALLGGIERFCG--KHSPELLKKVpliLKALYDNDILEEEVILKWyekVSKKYVSKEKSKKV-RKAAEPFVEWLEE 151

                  ....*.
gi 751130496 1578 AEDEES 1583
Cdd:cd11561   152 AEEEEE 157
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1421-1581 2.17e-06

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 49.90  E-value: 2.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1421 QRTLAFEELRRQLEKLLKDGGSNQRVFDWIDANLNEQQIASN--------TLVRALMTTVCYSA---IIFETPLRVdvqv 1489
Cdd:cd11560    29 YRKQASQEIKKELQQELKEMIAEEEPVKEIIAAVKEQMKKSSlpehevvgLLWTALMDAVEWSKkedQIAEQALRH---- 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496 1490 LKVRARLLQKYLCDEQKELQALYALQalVVTLEQpANLLRMFFD---ALYDEDVVKEDAFYSWesSKDPAEQQGKGVALK 1566
Cdd:cd11560   105 LKKYAPLLAAFCTTARAELALLNKIQ--EYCYEN-MKFMKVFQKivkLLYKADVLSEDAILKW--YKKGHSPKGKQVFLK 179
                         170
                  ....*....|....*
gi 751130496 1567 SVTAFFNWLREAEDE 1581
Cdd:cd11560   180 QMEPFVEWLQEAEEE 194
PHA03378 PHA03378
EBNA-3B; Provisional
1-214 1.71e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.60  E-value: 1.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    1 MNKAPQPTGP----PPARSPGLPQPafPPgqTAPVVFSTPQATQmnTPSQPRQHfYPSRAQPPSSAASRVQSaaparpgp 76
Cdd:PHA03378  672 IPYQPSPTGAntmlPIQWAPGTMQP--PP--RAPTPMRPPAAPP--GRAQRPAA-ATGRARPPAAAPGRARP-------- 736
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   77 aphvyPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPgapgfyPGASPTEFGTYAGAYYPAQGVQQFPA 156
Cdd:PHA03378  737 -----PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP------PQAPPAPQQRPRGAPTPQPPPQAGPT 805
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  157 SVAPAPvlMNQPPQIAPKRERKTIRIRDPNQGGKDITEEIMSGARTAST-PTP-PQTGGS 214
Cdd:PHA03378  806 SMQLMP--RAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAgPTPsPGSGTS 863
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
196-455 1.77e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   196 IMSGARTASTPTPPQTGGSLEPQPNGESPQVaviirpdDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEP 275
Cdd:pfam05109  404 IITRTATNATTTTHKVIFSKAPESTTTSPTL-------NTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSP 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   276 ---GSESNLGVLS---IPGDTMTTGMIPMSVEESTPISCETgePYCLSPEPTLAEPILE-VEVTLSKPIPESEFsSSPLQ 348
Cdd:pfam05109  477 tpaGTTSGASPVTpspSPRDNGTESKAPDMTSPTSAVTTPT--PNATSPTPAVTTPTPNaTSPTLGKTSPTSAV-TTPTP 553
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   349 VSTALVPhKVETHEPNGVIPSEDLEPEVESSTEPAPPPLSPCASESlVPIAPTAQpeELLNGAPSPPAVDLSP---VSEP 425
Cdd:pfam05109  554 NATSPTP-AVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-SPQANTTN--HTLGGTSSTPVVTSPPknaTSAV 629
                          250       260       270
                   ....*....|....*....|....*....|
gi 751130496   426 EEQAKKVSSAALASiLSPAPPVAPSDTSPA 455
Cdd:pfam05109  630 TTGQHNITSSSTSS-MSLRPSSISETLSPS 658
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
3-172 2.45e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.80  E-value: 2.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496     3 KAPQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPSQPRQHFYPSRAQPPSSAASRVQSAAparpgpaphvyp 82
Cdd:pfam09770  206 QAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQ------------ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496    83 agsqvmMIPSQIsySASQGAYYIPGQGRSTYVVPTQ--QYPVQPGAPGF-YPGAsptefGTYAGAYYPAQGVQQFPASVA 159
Cdd:pfam09770  274 ------PDPAQP--SIQPQAQQFHQQPPPVPVQPTQilQNPNRLSAARVgYPQN-----PQPGVQPAPAHQAHRQQGSFG 340
                          170
                   ....*....|...
gi 751130496   160 PAPVLMNQPPQIA 172
Cdd:pfam09770  341 RQAPIITHPQQLA 353
PHA03247 PHA03247
large tegument protein UL36; Provisional
155-568 7.23e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 7.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  155 PASVAPAPVLMNQPPQIAPK--------RERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQTGgSLEPQPNGESPqv 226
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRpsepavtsRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH-APDPPPPSPSP-- 2633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  227 aviirpddRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPMSVEESTPi 306
Cdd:PHA03247 2634 --------AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP- 2704
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  307 scetgEPyclSPEPTlaePILEVEVTLSKPIPESEFSSSPLQVSTALVPHKVETHEPNGVIPSEDLEPEVESSTEPAPPP 386
Cdd:PHA03247 2705 -----PP---TPEPA---PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  387 LSPCASESLVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEeqakkvSSAALASILSPAPPVAPSDTS-PAQEEEMEEDDD 465
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA------PAAALPPAASPAGPLPPPTSAqPTAPPPPPGPPP 2847
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  466 DEEGGEAESEKGGedvPLDSTPVPAQlsqnlevAAATQVAVSVPKRRRKikelnKKEAVGDLLDAFKE-VDPAVPEVENQ 544
Cdd:PHA03247 2848 PSLPLGGSVAPGG---DVRRRPPSRS-------PAAKPAAPARPPVRRL-----ARPAVSRSTESFALpPDQPERPPQPQ 2912
                         410       420
                  ....*....|....*....|....
gi 751130496  545 PPTGSNPSPESEGSMVPTQPEETE 568
Cdd:PHA03247 2913 APPPPQPQPQPPPPPQPQPPPPPP 2936
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
366-605 1.44e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 43.35  E-value: 1.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   366 VIPSEDlEPEVESSTEPAPPPLSpCASESLVPIAPTAQPEELLNG-APSPPAVDLSPVSepeeqakkvSSAALASILSPA 444
Cdd:TIGR00600  520 VKPVSS-EFGLPSQREDKLAIPT-EGTQNLQGISDHPEQFEFQNElSPLETKNNESNLS---------SDAETEGSPNPE 588
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   445 PPVAPSDTSPAQEEEMEEDDDDEeggeaesekGGEDV--PLDSTPVpaqlSQNLEVAAAtQVAVSVPKRRRKIkELNKKE 522
Cdd:TIGR00600  589 MPSWSSVTVPSEALDNYETTNPS---------NAKEVrnFAETGIQ----TTNVGESAD-LLLISNPMEVEPM-ESEKEE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   523 AVGDllDAFKEVDPAVPEVENQPPTGSNPSPESE-------GSMVPTQPEETEEtWDSKEDKIHNAENIQPGEQKYEYKS 595
Cdd:TIGR00600  654 SESD--GSFIEVDSVSSTLELQVPSKSQPTDESEenaenkvASIEGEHRKEIED-LLFDESEEDNIVGMIEEEKDADDFK 730
                          250
                   ....*....|
gi 751130496   596 DQWKPLNLEE 605
Cdd:TIGR00600  731 NEWQDISLEE 740
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
320-480 4.03e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 42.00  E-value: 4.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  320 PTLAEPILEVEVTLSKPIPESEFSSSPLQV-STALVPHKVETHEPNGVIPSEDLEP---EVESSTEPAPPPLSPCASESL 395
Cdd:PRK08691  380 PSAQTAEKETAAKKPQPRPEAETAQTPVQTaSAAAMPSEGKTAGPVSNQENNDVPPwedAPDEAQTAAGTAQTSAKSIQT 459
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  396 VPIAPTAQPEEL-------------LNGAPSPPAVDLSPVSEPEEQAKKVSSAalasilsPAPPVA----PSDTSPAQEE 458
Cdd:PRK08691  460 ASEAETPPENQVsknkaadnetdapLSEVPSENPIQATPNDEAVETETFAHEA-------PAEPFYgygfPDNDCPPEDG 532
                         170       180
                  ....*....|....*....|..
gi 751130496  459 EMEEDDDDEEGGEAESEKGGED 480
Cdd:PRK08691  533 AEIPPPDWEHAAPADTAGGGAD 554
Rib_recp_KP_reg pfam05104
Ribosome receptor lysine/proline rich region; This highly conserved region is found towards ...
361-455 6.04e-03

Ribosome receptor lysine/proline rich region; This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.


Pssm-ID: 461548 [Multi-domain]  Cd Length: 140  Bit Score: 38.95  E-value: 6.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496   361 HEPNGVIPseDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEELLNGAPSPPAVDlSPVSEPEEQAKKVSSAALAsi 440
Cdd:pfam05104   44 EKPNGKLP--ESEQADESEEEPREFKTPDEAPSAALEPEPVPTPVPAPVEPEPAPPSE-SPAPSPKEKKKKEKKSAKV-- 118
                           90
                   ....*....|....*
gi 751130496   441 lSPAPPVAPSDTSPA 455
Cdd:pfam05104  119 -EPAETPEAVQPKPA 132
PRK11633 PRK11633
cell division protein DedD; Provisional
339-455 8.76e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 39.60  E-value: 8.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 751130496  339 ESEFSSSPLqvstalVPHKVETHEPNGV---------IPSEDLEPEVESSTEPAPPPLSPCASESLVPIAPTAQPEElln 409
Cdd:PRK11633   35 QDEFAAIPL------VPKPGDRDEPDMMpaatqalptQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVE--- 105
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 751130496  410 gAPSPPavdlsPVSEPEEQAKKVSSAALASILSPAP-PVAPSDTSPA 455
Cdd:PRK11633  106 -PPKPK-----PVEKPKPKPKPQQKVEAPPAPKPEPkPVVEEKAAPT 146
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
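
The E-value threshold above can be checked against the per-hit statistics lines ("Pssm-ID: ... E-value: ...") shown for each domain hit. The following is a minimal sketch, not part of CD-Search itself: the function names `parse_stats` and `passes_threshold` are hypothetical, the regular expression only assumes the line layout as it appears on this page, and treating the threshold as inclusive (`<=`) is an assumption.

```python
import re

# Layout of a per-hit statistics line as printed on this page, e.g.
# "Pssm-ID: 214714  Cd Length: 113  Bit Score: 127.75  E-value: 2.63e-34"
# The "[Multi-domain]" flag after the Pssm-ID is optional.
STATS_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    match = STATS_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value does not exceed the threshold."""
    return hit["e_value"] <= threshold


# Two statistics lines copied from the hit list on this page.
lines = [
    "Pssm-ID: 214714  Cd Length: 113  Bit Score: 127.75  E-value: 2.63e-34",
    "Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 39.60  E-value: 8.76e-03",
]
hits = [parse_stats(line) for line in lines]
reported = [hit for hit in hits if passes_threshold(hit)]
```

Both sample lines fall under the 0.01 threshold, which is consistent with the weakest hit on this page (E-value 8.76e-03) still being listed.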
