NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907154453|ref|XP_036019642|]
View 

transcription factor HIVEP3 isoform X1 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1845-2315 3.62e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 3.62e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1845 PCSEAAPPCLPPTLQENSSPVEGPQAPDSTSDEVPQGSSISEATHLTASSCSTPSRGTqglprLGLAPLEKDmSSAPSPK 1924
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP-----APPSPLPPD-THAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1925 ATSPRRpwSPSKEAGSRPSLTRKHSLTKNDSSPQQCSPAREA--QASVTSTPGPQMGPGRdlgphlcgsprlelscltpy 2002
Cdd:PHA03247  2628 PPSPSP--AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRArrLGRAAQASSPPQRPRR-------------------- 2685
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2003 PIGREAPAGLERATDTGTPRYSPtrrwslgqaESPPQTVLPGKWALAGPCSPSADKSGLGLGPVPRALLQPVPLPHTLLS 2082
Cdd:PHA03247  2686 RAARPTVGSLTSLADPPPPPPTP---------EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPAR 2756
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2083 R-SPETCTSAWRKTESRSPSAGPAPLFPRPFSAP-HDFHGHLPSRSEENLFSHL---PLHSQLLSRAPCPLIPiggiqmv 2157
Cdd:PHA03247  2757 PaRPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlSESRESLPSPWDPADPPAAvlaPAAALPPAASPAGPLP------- 2829
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2158 qARPGAQPTVLPGPCAAWVSGFSGGGSDLTGAREAQERSRWSPTESPSASVSPVAK------VSKFTLSSELEEERTGRG 2231
Cdd:PHA03247  2830 -PPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpaVSRSTESFALPPDQPERP 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2232 PGRPPDWEPH-RAEAPPGPMGTHSPCSPQLPQ-------GHQVAPSWRGLLGSPHTLANLKASSFPPLDRSSSMDCLAET 2303
Cdd:PHA03247  2909 PQPQAPPPPQpQPQPPPPPQPQPPPPPPPRPQpplapttDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA 2988
                          490
                   ....*....|..
gi 1907154453 2304 STYSPPRSRNLS 2315
Cdd:PHA03247  2989 PASSTPPLTGHS 3000
PHA03247 super family cl33720
large tegument protein UL36; Provisional
733-1278 1.10e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.10e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  733 PLGSTKSPAEASKSAPslegPTSFQPRTPKPGAGSepgKERRTMSKEISVIQHTSSFEKSDPPEQPSglEEDKPPAQFSS 812
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVP----PPRPAPRPSEPAVTS---RARRPDAPPQSARPRAPVDDRGDPRGPAP--PSPLPPDTHAP 2624
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  813 PPPAPHGRSAHSLQPRlvrqpniqvPEILVTEEPDRPDTEPEPPPKEPekteefqwPQRSQTLAQLPAEKLPPKKKRLR- 891
Cdd:PHA03247  2625 DPPPPSPSPAANEPDP---------HPPPTVPPPERPRDDPAPGRVSR--------PRRARRLGRAAQASSPPQRPRRRa 2687
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  892 -------LAEMAQSSGESSFESSVPLSRSPSQESSISLSGSSRSASFDREDHGKAEAP-GPFSDTRSKTLGSHMLTVPSH 963
Cdd:PHA03247  2688 arptvgsLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPaGPATPGGPARPARPPTTAGPP 2767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  964 HPHAREMRRSASEQSPNVPHSSHMTETRsksfdygslsPTGPSLAVPAAPPPPAAPPERRKCFLVRQASLNRPPEAELEA 1043
Cdd:PHA03247  2768 APAPPAAPAAGPPRRLTRPAVASLSESR----------ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT 2837
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1044 VPkgkqesseePAASKPSTKSSVPQISVgtTQGGPSGGKSQMQDRPPLGSSPPYTEALQVFQPlgtQLPPPaslfslqql 1123
Cdd:PHA03247  2838 AP---------PPPPGPPPPSLPLGGSV--APGGDVRRRPPSRSPAAKPAAPARPPVRRLARP---AVSRS--------- 2894
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1124 lpqeqeqsseffptqamagllSSPYSMPPLPPSLFQAPPLPLQPTvlHPSQLHLPQLLPHAADIPFQQPPSFLPMPCPAP 1203
Cdd:PHA03247  2895 ---------------------TESFALPPDQPERPPQPQAPPPPQ--PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907154453 1204 STLSGYFLPLQSQFALqLPGEIeshlPPVKTSLPPLATGPPGPSSSTEYSSDIQLPPVTPQATSPA---PTSAPPLAL 1278
Cdd:PHA03247  2952 AGEPSGAVPQPWLGAL-VPGRV----AVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLAlheETDPPPVSL 3024
zf-H2C2_2 pfam13465
Zinc-finger double domain;
200-224 3.30e-06

Zinc-finger double domain;


:

Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 45.44  E-value: 3.30e-06
                           10        20
                   ....*....|....*....|....*
gi 1907154453  200 LQKHIRSHTGERPYPCGPCGFSFKT 224
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1734-1759 1.10e-05

Zinc-finger double domain;


:

Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.90  E-value: 1.10e-05
                           10        20
                   ....*....|....*....|....*.
gi 1907154453 1734 MLKKHIRTHTDVRPYVCKHCHFAFKT 1759
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
1720-1742 1.20e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


:

Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 40.75  E-value: 1.20e-04
                           10        20
                   ....*....|....*....|...
gi 1907154453 1720 YVCEECGIRCKKPSMLKKHIRTH 1742
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
1745-1778 8.97e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


:

Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 35.69  E-value: 8.97e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1907154453  1745 VRPYVCKHCHFAFKTKGNLTKHMKSKAHSKKCQE 1778
Cdd:smart00451    1 TGGFYCKLCNVTFTDEISVEAHLKGKKHKKNVKK 34
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
1845-2315 3.62e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 3.62e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1845 PCSEAAPPCLPPTLQENSSPVEGPQAPDSTSDEVPQGSSISEATHLTASSCSTPSRGTqglprLGLAPLEKDmSSAPSPK 1924
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP-----APPSPLPPD-THAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1925 ATSPRRpwSPSKEAGSRPSLTRKHSLTKNDSSPQQCSPAREA--QASVTSTPGPQMGPGRdlgphlcgsprlelscltpy 2002
Cdd:PHA03247  2628 PPSPSP--AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRArrLGRAAQASSPPQRPRR-------------------- 2685
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2003 PIGREAPAGLERATDTGTPRYSPtrrwslgqaESPPQTVLPGKWALAGPCSPSADKSGLGLGPVPRALLQPVPLPHTLLS 2082
Cdd:PHA03247  2686 RAARPTVGSLTSLADPPPPPPTP---------EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPAR 2756
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2083 R-SPETCTSAWRKTESRSPSAGPAPLFPRPFSAP-HDFHGHLPSRSEENLFSHL---PLHSQLLSRAPCPLIPiggiqmv 2157
Cdd:PHA03247  2757 PaRPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlSESRESLPSPWDPADPPAAvlaPAAALPPAASPAGPLP------- 2829
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2158 qARPGAQPTVLPGPCAAWVSGFSGGGSDLTGAREAQERSRWSPTESPSASVSPVAK------VSKFTLSSELEEERTGRG 2231
Cdd:PHA03247  2830 -PPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpaVSRSTESFALPPDQPERP 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2232 PGRPPDWEPH-RAEAPPGPMGTHSPCSPQLPQ-------GHQVAPSWRGLLGSPHTLANLKASSFPPLDRSSSMDCLAET 2303
Cdd:PHA03247  2909 PQPQAPPPPQpQPQPPPPPQPQPPPPPPPRPQpplapttDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA 2988
                          490
                   ....*....|..
gi 1907154453 2304 STYSPPRSRNLS 2315
Cdd:PHA03247  2989 PASSTPPLTGHS 3000
PHA03247 PHA03247
large tegument protein UL36; Provisional
733-1278 1.10e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.10e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  733 PLGSTKSPAEASKSAPslegPTSFQPRTPKPGAGSepgKERRTMSKEISVIQHTSSFEKSDPPEQPSglEEDKPPAQFSS 812
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVP----PPRPAPRPSEPAVTS---RARRPDAPPQSARPRAPVDDRGDPRGPAP--PSPLPPDTHAP 2624
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  813 PPPAPHGRSAHSLQPRlvrqpniqvPEILVTEEPDRPDTEPEPPPKEPekteefqwPQRSQTLAQLPAEKLPPKKKRLR- 891
Cdd:PHA03247  2625 DPPPPSPSPAANEPDP---------HPPPTVPPPERPRDDPAPGRVSR--------PRRARRLGRAAQASSPPQRPRRRa 2687
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  892 -------LAEMAQSSGESSFESSVPLSRSPSQESSISLSGSSRSASFDREDHGKAEAP-GPFSDTRSKTLGSHMLTVPSH 963
Cdd:PHA03247  2688 arptvgsLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPaGPATPGGPARPARPPTTAGPP 2767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  964 HPHAREMRRSASEQSPNVPHSSHMTETRsksfdygslsPTGPSLAVPAAPPPPAAPPERRKCFLVRQASLNRPPEAELEA 1043
Cdd:PHA03247  2768 APAPPAAPAAGPPRRLTRPAVASLSESR----------ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT 2837
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1044 VPkgkqesseePAASKPSTKSSVPQISVgtTQGGPSGGKSQMQDRPPLGSSPPYTEALQVFQPlgtQLPPPaslfslqql 1123
Cdd:PHA03247  2838 AP---------PPPPGPPPPSLPLGGSV--APGGDVRRRPPSRSPAAKPAAPARPPVRRLARP---AVSRS--------- 2894
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1124 lpqeqeqsseffptqamagllSSPYSMPPLPPSLFQAPPLPLQPTvlHPSQLHLPQLLPHAADIPFQQPPSFLPMPCPAP 1203
Cdd:PHA03247  2895 ---------------------TESFALPPDQPERPPQPQAPPPPQ--PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907154453 1204 STLSGYFLPLQSQFALqLPGEIeshlPPVKTSLPPLATGPPGPSSSTEYSSDIQLPPVTPQATSPA---PTSAPPLAL 1278
Cdd:PHA03247  2952 AGEPSGAVPQPWLGAL-VPGRV----AVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLAlheETDPPPVSL 3024
zf-H2C2_2 pfam13465
Zinc-finger double domain;
200-224 3.30e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 45.44  E-value: 3.30e-06
                           10        20
                   ....*....|....*....|....*
gi 1907154453  200 LQKHIRSHTGERPYPCGPCGFSFKT 224
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1734-1759 1.10e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.90  E-value: 1.10e-05
                           10        20
                   ....*....|....*....|....*.
gi 1907154453 1734 MLKKHIRTHTDVRPYVCKHCHFAFKT 1759
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
1720-1742 1.20e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 40.75  E-value: 1.20e-04
                           10        20
                   ....*....|....*....|...
gi 1907154453 1720 YVCEECGIRCKKPSMLKKHIRTH 1742
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
ZnF_C2H2 smart00355
zinc finger;
1720-1742 5.36e-03

zinc finger;


Pssm-ID: 197676  Cd Length: 23  Bit Score: 36.29  E-value: 5.36e-03
                            10        20
                    ....*....|....*....|...
gi 1907154453  1720 YVCEECGIRCKKPSMLKKHIRTH 1742
Cdd:smart00355    1 YRCPECGKVFKSKSALREHMRTH 23
ZnF_C2H2 smart00355
zinc finger;
1748-1769 5.85e-03

zinc finger;


Pssm-ID: 197676  Cd Length: 23  Bit Score: 36.29  E-value: 5.85e-03
                            10        20
                    ....*....|....*....|..
gi 1907154453  1748 YVCKHCHFAFKTKGNLTKHMKS 1769
Cdd:smart00355    1 YRCPECGKVFKSKSALREHMRT 22
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
1745-1778 8.97e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 35.69  E-value: 8.97e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1907154453  1745 VRPYVCKHCHFAFKTKGNLTKHMKSKAHSKKCQE 1778
Cdd:smart00451    1 TGGFYCKLCNVTFTDEISVEAHLKGKKHKKNVKK 34
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
1845-2315 3.62e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 3.62e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1845 PCSEAAPPCLPPTLQENSSPVEGPQAPDSTSDEVPQGSSISEATHLTASSCSTPSRGTqglprLGLAPLEKDmSSAPSPK 1924
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP-----APPSPLPPD-THAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1925 ATSPRRpwSPSKEAGSRPSLTRKHSLTKNDSSPQQCSPAREA--QASVTSTPGPQMGPGRdlgphlcgsprlelscltpy 2002
Cdd:PHA03247  2628 PPSPSP--AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRArrLGRAAQASSPPQRPRR-------------------- 2685
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2003 PIGREAPAGLERATDTGTPRYSPtrrwslgqaESPPQTVLPGKWALAGPCSPSADKSGLGLGPVPRALLQPVPLPHTLLS 2082
Cdd:PHA03247  2686 RAARPTVGSLTSLADPPPPPPTP---------EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPAR 2756
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2083 R-SPETCTSAWRKTESRSPSAGPAPLFPRPFSAP-HDFHGHLPSRSEENLFSHL---PLHSQLLSRAPCPLIPiggiqmv 2157
Cdd:PHA03247  2757 PaRPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlSESRESLPSPWDPADPPAAvlaPAAALPPAASPAGPLP------- 2829
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2158 qARPGAQPTVLPGPCAAWVSGFSGGGSDLTGAREAQERSRWSPTESPSASVSPVAK------VSKFTLSSELEEERTGRG 2231
Cdd:PHA03247  2830 -PPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarpaVSRSTESFALPPDQPERP 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2232 PGRPPDWEPH-RAEAPPGPMGTHSPCSPQLPQ-------GHQVAPSWRGLLGSPHTLANLKASSFPPLDRSSSMDCLAET 2303
Cdd:PHA03247  2909 PQPQAPPPPQpQPQPPPPPQPQPPPPPPPRPQpplapttDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA 2988
                          490
                   ....*....|..
gi 1907154453 2304 STYSPPRSRNLS 2315
Cdd:PHA03247  2989 PASSTPPLTGHS 3000
PHA03247 PHA03247
large tegument protein UL36; Provisional
733-1278 1.10e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.10e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  733 PLGSTKSPAEASKSAPslegPTSFQPRTPKPGAGSepgKERRTMSKEISVIQHTSSFEKSDPPEQPSglEEDKPPAQFSS 812
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVP----PPRPAPRPSEPAVTS---RARRPDAPPQSARPRAPVDDRGDPRGPAP--PSPLPPDTHAP 2624
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  813 PPPAPHGRSAHSLQPRlvrqpniqvPEILVTEEPDRPDTEPEPPPKEPekteefqwPQRSQTLAQLPAEKLPPKKKRLR- 891
Cdd:PHA03247  2625 DPPPPSPSPAANEPDP---------HPPPTVPPPERPRDDPAPGRVSR--------PRRARRLGRAAQASSPPQRPRRRa 2687
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  892 -------LAEMAQSSGESSFESSVPLSRSPSQESSISLSGSSRSASFDREDHGKAEAP-GPFSDTRSKTLGSHMLTVPSH 963
Cdd:PHA03247  2688 arptvgsLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPaGPATPGGPARPARPPTTAGPP 2767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  964 HPHAREMRRSASEQSPNVPHSSHMTETRsksfdygslsPTGPSLAVPAAPPPPAAPPERRKCFLVRQASLNRPPEAELEA 1043
Cdd:PHA03247  2768 APAPPAAPAAGPPRRLTRPAVASLSESR----------ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT 2837
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1044 VPkgkqesseePAASKPSTKSSVPQISVgtTQGGPSGGKSQMQDRPPLGSSPPYTEALQVFQPlgtQLPPPaslfslqql 1123
Cdd:PHA03247  2838 AP---------PPPPGPPPPSLPLGGSV--APGGDVRRRPPSRSPAAKPAAPARPPVRRLARP---AVSRS--------- 2894
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1124 lpqeqeqsseffptqamagllSSPYSMPPLPPSLFQAPPLPLQPTvlHPSQLHLPQLLPHAADIPFQQPPSFLPMPCPAP 1203
Cdd:PHA03247  2895 ---------------------TESFALPPDQPERPPQPQAPPPPQ--PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907154453 1204 STLSGYFLPLQSQFALqLPGEIeshlPPVKTSLPPLATGPPGPSSSTEYSSDIQLPPVTPQATSPA---PTSAPPLAL 1278
Cdd:PHA03247  2952 AGEPSGAVPQPWLGAL-VPGRV----AVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLAlheETDPPPVSL 3024
zf-H2C2_2 pfam13465
Zinc-finger double domain;
200-224 3.30e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 45.44  E-value: 3.30e-06
                           10        20
                   ....*....|....*....|....*
gi 1907154453  200 LQKHIRSHTGERPYPCGPCGFSFKT 224
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1734-1759 1.10e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.90  E-value: 1.10e-05
                           10        20
                   ....*....|....*....|....*.
gi 1907154453 1734 MLKKHIRTHTDVRPYVCKHCHFAFKT 1759
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
1720-1742 1.20e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 40.75  E-value: 1.20e-04
                           10        20
                   ....*....|....*....|...
gi 1907154453 1720 YVCEECGIRCKKPSMLKKHIRTH 1742
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
1748-1769 3.60e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.59  E-value: 3.60e-04
                           10        20
                   ....*....|....*....|..
gi 1907154453 1748 YVCKHCHFAFKTKGNLTKHMKS 1769
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRT 22
PHA03247 PHA03247
large tegument protein UL36; Provisional
1036-1293 2.05e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1036 PPEAELEAVPKGKQESSEEPAASKPSTKSSVPQISV---GTTQGGPSGGKSQMQdRPPLGSSPPYTEALQVFQPLGTQLP 1112
Cdd:PHA03247  2628 PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRprrARRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPP 2706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1113 PPASlfslqqlLPQEQEQSSEFFPTQAMAGLLSSPYSMPPLPPSLFQAPPLPLQPTVLHPSQLHLPQLLPHAADIPFQQP 1192
Cdd:PHA03247  2707 TPEP-------APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1193 PSFLPMPCPAPSTLSGYFLPLqsqfalqlpgeieshlpPVKTSLPPLATGPPGPSSSTEYSSDIQLPPVTPQATSPAPTS 1272
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPS-----------------PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
                          250       260
                   ....*....|....*....|.
gi 1907154453 1273 APPLALPACPDAMVSLVVPVR 1293
Cdd:PHA03247  2843 PGPPPPSLPLGGSVAPGGDVR 2863
PHA03247 PHA03247
large tegument protein UL36; Provisional
939-1354 2.37e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453  939 AEAPGPFSDTRSKTLGSHMLTVPSHHPHAREMRRSASEQSPNVPHSSHMTETRSKSfdyGSLSPTGPSLAVPAAPPPPAA 1018
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR---RAARPTVGSLTSLADPPPPPP 2706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1019 PPERRKCFLVRQASLNRPPEAELEAVPKGKQESSEEPAASKPSTKSSVPQISVGTTqggPSGGKSQMQDRPPLGSSPPYT 1098
Cdd:PHA03247  2707 TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT---TAGPPAPAPPAAPAAGPPRRL 2783
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1099 EALQVfQPLGTQLPPPASLFSLQQLLPQEQEQSSEFFPTQAMAGLLSSPYSMPPLPPSLfqaPPLPLQPTvlhpsqlhLP 1178
Cdd:PHA03247  2784 TRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP---PPGPPPPS--------LP 2851
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1179 QLLPHAADIPF-QQPPSFLPMPCPAPSTlsgyFLPLQSQFALQLPGEIESHlppvktSLPPLATGPPGPSSSTEYSSDIQ 1257
Cdd:PHA03247  2852 LGGSVAPGGDVrRRPPSRSPAAKPAAPA----RPPVRRLARPAVSRSTESF------ALPPDQPERPPQPQAPPPPQPQP 2921
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1258 LPPVTPQATSPAPTSA------PPLALPACPDAMVSLVVPVRIQTHMPSYGSAMYTTLSQILVTQSPGSPASTALTKYEE 1331
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPrpqpplAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSL 3001
                          410       420
                   ....*....|....*....|...
gi 1907154453 1332 PSSKSMTVCEADVYEAEPGPSSI 1354
Cdd:PHA03247  3002 SRVSSWASSLALHEETDPPPVSL 3024
ZnF_C2H2 smart00355
zinc finger;
1720-1742 5.36e-03

zinc finger;


Pssm-ID: 197676  Cd Length: 23  Bit Score: 36.29  E-value: 5.36e-03
                            10        20
                    ....*....|....*....|...
gi 1907154453  1720 YVCEECGIRCKKPSMLKKHIRTH 1742
Cdd:smart00355    1 YRCPECGKVFKSKSALREHMRTH 23
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1899-2136 5.55e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 5.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1899 SRGTQGLPRLGLAPLEKDMSSAPSPKATSPRRPWSPSKEAGSRPSLTRKHSLTkndSSPQQCSPAREAQASVTSTPGpqM 1978
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVA---AAPARRSPAPEALAAARQASA--R 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 1979 GPGRDLGPHLCGSPrlelsclTPYPIGREAPAGLERATDTGT---PRYSPTRRWSLGQAESPPQTVLPGKWALAGPCSPS 2055
Cdd:PRK12323   443 GPGGAPAPAPAPAA-------APAAAARPAAAGPRPVAAAAAaapARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPD 515
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907154453 2056 ADKSGLGLGPVPRALLQPVPLPHTLLSRSPeTCTSAWRKTESRSPSAGPAPLFPRPFSAPHDFHGHLPSrseenLFSHLP 2135
Cdd:PRK12323   516 AAPAGWVAESIPDPATADPDDAFETLAPAP-AAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPA-----LAARLP 589

                   .
gi 1907154453 2136 L 2136
Cdd:PRK12323   590 V 590
ZnF_C2H2 smart00355
zinc finger;
1748-1769 5.85e-03

zinc finger;


Pssm-ID: 197676  Cd Length: 23  Bit Score: 36.29  E-value: 5.85e-03
                            10        20
                    ....*....|....*....|..
gi 1907154453  1748 YVCKHCHFAFKTKGNLTKHMKS 1769
Cdd:smart00355    1 YRCPECGKVFKSKSALREHMRT 22
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
1745-1778 8.97e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 35.69  E-value: 8.97e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1907154453  1745 VRPYVCKHCHFAFKTKGNLTKHMKSKAHSKKCQE 1778
Cdd:smart00451    1 TGGFYCKLCNVTFTDEISVEAHLKGKKHKKNVKK 34
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH