NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907138010|ref|XP_030105628|]
View 

protein transport protein Sec16A isoform X1 [Mus musculus]

Protein Classification

ACE1-Sec16-like domain-containing protein( domain architecture ID 10173993)

ACE1-Sec16-like domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
1512-1874 7.38e-127

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


:

Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 402.02  E-value: 7.38e-127
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1512 FPGPLGKDDTHKVDVINFAQNKATKCLQNESLIDKESASLLWKFIILLCRQNGTVVGTDIAElllrdhrtvwlpgkspne 1591
Cdd:cd09233      1 FPGPLIKGKTKKKDVLKWLEEKIAELEENEGYLDLEDKLLLWKLLKLLVRQNGKLVGTDIAE------------------ 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1592 anlidftneaveqveeeesgeaqlsfltdsqtvttsvlEKETERFRELLLYGRKKDALESAMKNGLWGHALLLASKMDSR 1671
Cdd:cd09233     63 --------------------------------------QKALNRFRNLLLTGNRKEALELALDNGLWAHALLLASSLGKE 104
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1672 THARVMTRFANSL-PINDPLQTVYQLMSGRMPAASTCCGDE------KWGDWRPHLAMVLSNLNNNMDVEsrTMATMGDT 1744
Cdd:cd09233    105 TWAEVVSRFARSEsKLNDPLQTLYQLFSGNSPEAITELADNpaeaewALGNWREHLAIILSNRTSNLDLE--ALVELGDL 182
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1745 LASKGLLDAAHFCYLMAQVGFGVYTKKTTKLVLIGSNHSLPFLKFATNEAIQRTEAYEYAQSLGAHTCSLPNFQVFKFIY 1824
Cdd:cd09233    183 LAQRGLVEAAHICYLLAGVPLGPYPSSPSSCLLGGAVHNKSPRTFATPEAIQLTEIYEYALSLGNPQFGLPHLQPYKLIH 262
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907138010 1825 LCRLAEMGLATQAFHYCEVIAKSV--LTQPGAYSPVLISQLTQMASQLRLFD 1874
Cdd:cd09233    263 AARLAELGLVSEALKYCEAIASSLksLTKSPYYDPNLLAQLQDLSERLSGTS 314
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2-291 5.90e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 5.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    2 QPPPQAVPSGVAGPPPAGNPRSMFWAN--SPYRKPANNAPVAPI--TRPLQPVT-----------DPFAFNRQTLQNTPV 66
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAARQASPALpaAPAPPAVPAGPATPGgpARPARPPTtagppapappaAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   67 GSSSKS--SLPNLPGPALSVFSQWPGLPVTPTNAGdSSTGLHEPLSGTLSQPRADASLFPPASTP--SSLPGLEVSRNAE 142
Cdd:PHA03247  2789 ASLSESreSLPSPWDPADPPAAVLAPAAALPPAAS-PAGPLPPPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPP 2867
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  143 ADPSSGhevqmLPHSAHYIPGVGPEQPLGGQMNDSGSGPDQPMNRHAPHDGAVTHAASPFLPQPQMPGQWGPAQGGPQPS 222
Cdd:PHA03247  2868 SRSPAA-----KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907138010  223 YQhhspylegPVQNMGLQAASLPHFPPPSSLHQGPGhESHAPQTFTPASLASGEGNEIVHQQSKNHPLS 291
Cdd:PHA03247  2943 LA--------PTTDPAGAGEPSGAVPQPWLGALVPG-RVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
PHA03247 super family cl33720
large tegument protein UL36; Provisional
669-1241 7.15e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  669 AAVAPPDATSGNLEQPpDNMETPCAPQACPLPLSTTGEA------------GQLVSNTAGTPLDTVRP----------VP 726
Cdd:PHA03247  2491 AAGAAPDPGGGGPPDP-DAPPAPSRLAPAILPDEPVGEPvhprmltwirglEELASDDAGDPPPPLPPaappaapdrsVP 2569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  727 DKRPSARAQGP-VKCESPATTLWAQNELP--------DFGGNVLLAPAAPALYVPVKPKPSEVVHHPEKGMSGQKAWKQ- 796
Cdd:PHA03247  2570 PPRPAPRPSEPaVTSRARRPDAPPQSARPrapvddrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPp 2649
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  797 ---------GSVPPLQNQDPPGASENLENPPKVGEEEALP--VQASSGYASLLSSPPTESLHNQPVLIAQPDQSYNLAQP 865
Cdd:PHA03247  2650 erprddpapGRVSRPRRARRLGRAAQASSPPQRPRRRAARptVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR 2729
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  866 INFSVSLLNPNEKNQSWGDAV-VGERSivsnnwalggdPEERAALSGVPASAVTGAslPSSIPQNCAPQGSGSSEMIASQ 944
Cdd:PHA03247  2730 QASPALPAAPAPPAVPAGPATpGGPAR-----------PARPPTTAGPPAPAPPAA--PAAGPPRRLTRPAVASLSESRE 2796
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  945 SASwlvqqlSPQTPqSPHPNAEKGPSEFVSsPAGNTSVMLVPPASSTLVPNSNKAKHSSNQEEAVGALdftlnrTLENPV 1024
Cdd:PHA03247  2797 SLP------SPWDP-ADPPAAVLAPAAALP-PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV------APGGDV 2862
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1025 RMYSPSPSdgPASQQPLPNHPRQSgpglhnqdhfyqQVTKDAQDQHRLERAQPELVPPRPQnSPQVPQascpepsnpesp 1104
Cdd:PHA03247  2863 RRRPPSRS--PAAKPAAPARPPVR------------RLARPAVSRSTESFALPPDQPERPP-QPQAPP------------ 2915
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1105 ptQGQSESLAQPPASPASVNTGQLLPQPPQASSASVTSTNSSQAAVRSEQLW-------------LHPPPPNTFGPAPQD 1171
Cdd:PHA03247  2916 --PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGalvpgrvavprfrVPQPAPSREAPASST 2993
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1172 LASYYYYRPLYDAYQSQYPSPYPSDPGTASLyYQDMYglyepryrPYDSSASAYAENHRYSEPERPSSRA 1241
Cdd:PHA03247  2994 PPLTGHSLSRVSSWASSLALHEETDPPPVSL-KQTLW--------PPDDTEDSDADSLFDSDSERSDLEA 3054
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1934-2308 1.37e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1934 GLNQQAGPQADNPllmpstEPLMHGVQLLPTAPQTLPDGQPAHLsrvPMFPVPMSRgplELSPAYGPPGSALGFPESSRS 2013
Cdd:PHA03247  2539 GLEELASDDAGDP------PPPLPPAAPPAAPDRSVPPPRPAPR---PSEPAVTSR---ARRPDAPPQSARPRAPVDDRG 2606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2014 DPAVLHPGQALPPTTLSLQesglPPQEAKSPDPEMVPRGSPVRHSPPELSQEefgesfaDPGSSRTAQDLETSpvwdlgs 2093
Cdd:PHA03247  2607 DPRGPAPPSPLPPDTHAPD----PPPPSPSPAANEPDPHPPPTVPPPERPRD-------DPAPGRVSRPRRAR------- 2668
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2094 sSLTRAPSLTSDSEGKKPaqavkkepkepkkteswfsRWLPGKKRTEAYLPDDKNKSIVWDEKKNQWVNLNEPEEEKKAP 2173
Cdd:PHA03247  2669 -RLGRAAQASSPPQRPRR-------------------RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA 2728
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2174 PPPPTSFPRVPqVAPTGPAGPPTAsvnvfsrkagGSRARYVDVLNPSGTQRSEPALAPADFFAPLAPLPIPSNLFVPNPD 2253
Cdd:PHA03247  2729 RQASPALPAAP-APPAVPAGPATP----------GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907138010 2254 AEEPQ-PADGTG-CRGQAPAGTQSKAESTLEPKVGSSTVSAPGPELLPSKPDGSQGG 2308
Cdd:PHA03247  2798 LPSPWdPADPPAaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
 
Name Accession Description Interval E-value
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
1512-1874 7.38e-127

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 402.02  E-value: 7.38e-127
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1512 FPGPLGKDDTHKVDVINFAQNKATKCLQNESLIDKESASLLWKFIILLCRQNGTVVGTDIAElllrdhrtvwlpgkspne 1591
Cdd:cd09233      1 FPGPLIKGKTKKKDVLKWLEEKIAELEENEGYLDLEDKLLLWKLLKLLVRQNGKLVGTDIAE------------------ 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1592 anlidftneaveqveeeesgeaqlsfltdsqtvttsvlEKETERFRELLLYGRKKDALESAMKNGLWGHALLLASKMDSR 1671
Cdd:cd09233     63 --------------------------------------QKALNRFRNLLLTGNRKEALELALDNGLWAHALLLASSLGKE 104
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1672 THARVMTRFANSL-PINDPLQTVYQLMSGRMPAASTCCGDE------KWGDWRPHLAMVLSNLNNNMDVEsrTMATMGDT 1744
Cdd:cd09233    105 TWAEVVSRFARSEsKLNDPLQTLYQLFSGNSPEAITELADNpaeaewALGNWREHLAIILSNRTSNLDLE--ALVELGDL 182
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1745 LASKGLLDAAHFCYLMAQVGFGVYTKKTTKLVLIGSNHSLPFLKFATNEAIQRTEAYEYAQSLGAHTCSLPNFQVFKFIY 1824
Cdd:cd09233    183 LAQRGLVEAAHICYLLAGVPLGPYPSSPSSCLLGGAVHNKSPRTFATPEAIQLTEIYEYALSLGNPQFGLPHLQPYKLIH 262
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907138010 1825 LCRLAEMGLATQAFHYCEVIAKSV--LTQPGAYSPVLISQLTQMASQLRLFD 1874
Cdd:cd09233    263 AARLAELGLVSEALKYCEAIASSLksLTKSPYYDPNLLAQLQDLSERLSGTS 314
Sec16_C pfam12931
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ...
1637-1871 6.51e-45

Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.


Pssm-ID: 432884  Cd Length: 279  Bit Score: 165.04  E-value: 6.51e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1637 RELLLYGRKKDALESAMKNGLWGHALLLASKMDSRTHARVMTRFA------NSLPINDPLQTVYQLMSGRMPAA----ST 1706
Cdd:pfam12931    2 RALLLTGDREKALWLALDKKLWAHALLIASTLGKEKWKEVVQEFVrsefkgSNNKSGESLAALYQVFAGNSEEAvdelVP 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1707 CCGDEKWG--DWRPHLAMVLSNLNNNmDVESRTmaTMGDTLASKGLLDAAHFCYLMAQVGFGVytkkttkLVLIGSNHSL 1784
Cdd:pfam12931   82 PSKNALWAldNWRETLALVLSNRSPG-DVEALL--ALGDLLAQYGRTEAAHICFLLAGLPLSQ-------TVLLGADHVR 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1785 PFLKFATN-EAIQRTEAYEYAQSLGAH---TCSLPNFQVFKFIYLCRLAEMGLATQAFHYCEVIAKSV--LTQPGAY-SP 1857
Cdd:pfam12931  152 FPSTFGNDlESILLTEIYEYALSLSPPqppFVGLPHLLPYKLQHAAVLAEYGLVSEAQKYCDAITASLksLTKKSPYyHP 231
                          250
                   ....*....|....
gi 1907138010 1858 VLISQLTQMASQLR 1871
Cdd:pfam12931  232 TLLAQLEDLSNRLS 245
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-291 5.90e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 5.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    2 QPPPQAVPSGVAGPPPAGNPRSMFWAN--SPYRKPANNAPVAPI--TRPLQPVT-----------DPFAFNRQTLQNTPV 66
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAARQASPALpaAPAPPAVPAGPATPGgpARPARPPTtagppapappaAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   67 GSSSKS--SLPNLPGPALSVFSQWPGLPVTPTNAGdSSTGLHEPLSGTLSQPRADASLFPPASTP--SSLPGLEVSRNAE 142
Cdd:PHA03247  2789 ASLSESreSLPSPWDPADPPAAVLAPAAALPPAAS-PAGPLPPPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPP 2867
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  143 ADPSSGhevqmLPHSAHYIPGVGPEQPLGGQMNDSGSGPDQPMNRHAPHDGAVTHAASPFLPQPQMPGQWGPAQGGPQPS 222
Cdd:PHA03247  2868 SRSPAA-----KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907138010  223 YQhhspylegPVQNMGLQAASLPHFPPPSSLHQGPGhESHAPQTFTPASLASGEGNEIVHQQSKNHPLS 291
Cdd:PHA03247  2943 LA--------PTTDPAGAGEPSGAVPQPWLGALVPG-RVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
PHA03247 PHA03247
large tegument protein UL36; Provisional
669-1241 7.15e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  669 AAVAPPDATSGNLEQPpDNMETPCAPQACPLPLSTTGEA------------GQLVSNTAGTPLDTVRP----------VP 726
Cdd:PHA03247  2491 AAGAAPDPGGGGPPDP-DAPPAPSRLAPAILPDEPVGEPvhprmltwirglEELASDDAGDPPPPLPPaappaapdrsVP 2569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  727 DKRPSARAQGP-VKCESPATTLWAQNELP--------DFGGNVLLAPAAPALYVPVKPKPSEVVHHPEKGMSGQKAWKQ- 796
Cdd:PHA03247  2570 PPRPAPRPSEPaVTSRARRPDAPPQSARPrapvddrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPp 2649
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  797 ---------GSVPPLQNQDPPGASENLENPPKVGEEEALP--VQASSGYASLLSSPPTESLHNQPVLIAQPDQSYNLAQP 865
Cdd:PHA03247  2650 erprddpapGRVSRPRRARRLGRAAQASSPPQRPRRRAARptVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR 2729
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  866 INFSVSLLNPNEKNQSWGDAV-VGERSivsnnwalggdPEERAALSGVPASAVTGAslPSSIPQNCAPQGSGSSEMIASQ 944
Cdd:PHA03247  2730 QASPALPAAPAPPAVPAGPATpGGPAR-----------PARPPTTAGPPAPAPPAA--PAAGPPRRLTRPAVASLSESRE 2796
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  945 SASwlvqqlSPQTPqSPHPNAEKGPSEFVSsPAGNTSVMLVPPASSTLVPNSNKAKHSSNQEEAVGALdftlnrTLENPV 1024
Cdd:PHA03247  2797 SLP------SPWDP-ADPPAAVLAPAAALP-PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV------APGGDV 2862
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1025 RMYSPSPSdgPASQQPLPNHPRQSgpglhnqdhfyqQVTKDAQDQHRLERAQPELVPPRPQnSPQVPQascpepsnpesp 1104
Cdd:PHA03247  2863 RRRPPSRS--PAAKPAAPARPPVR------------RLARPAVSRSTESFALPPDQPERPP-QPQAPP------------ 2915
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1105 ptQGQSESLAQPPASPASVNTGQLLPQPPQASSASVTSTNSSQAAVRSEQLW-------------LHPPPPNTFGPAPQD 1171
Cdd:PHA03247  2916 --PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGalvpgrvavprfrVPQPAPSREAPASST 2993
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1172 LASYYYYRPLYDAYQSQYPSPYPSDPGTASLyYQDMYglyepryrPYDSSASAYAENHRYSEPERPSSRA 1241
Cdd:PHA03247  2994 PPLTGHSLSRVSSWASSLALHEETDPPPVSL-KQTLW--------PPDDTEDSDADSLFDSDSERSDLEA 3054
PHA03247 PHA03247
large tegument protein UL36; Provisional
1934-2308 1.37e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1934 GLNQQAGPQADNPllmpstEPLMHGVQLLPTAPQTLPDGQPAHLsrvPMFPVPMSRgplELSPAYGPPGSALGFPESSRS 2013
Cdd:PHA03247  2539 GLEELASDDAGDP------PPPLPPAAPPAAPDRSVPPPRPAPR---PSEPAVTSR---ARRPDAPPQSARPRAPVDDRG 2606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2014 DPAVLHPGQALPPTTLSLQesglPPQEAKSPDPEMVPRGSPVRHSPPELSQEefgesfaDPGSSRTAQDLETSpvwdlgs 2093
Cdd:PHA03247  2607 DPRGPAPPSPLPPDTHAPD----PPPPSPSPAANEPDPHPPPTVPPPERPRD-------DPAPGRVSRPRRAR------- 2668
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2094 sSLTRAPSLTSDSEGKKPaqavkkepkepkkteswfsRWLPGKKRTEAYLPDDKNKSIVWDEKKNQWVNLNEPEEEKKAP 2173
Cdd:PHA03247  2669 -RLGRAAQASSPPQRPRR-------------------RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA 2728
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2174 PPPPTSFPRVPqVAPTGPAGPPTAsvnvfsrkagGSRARYVDVLNPSGTQRSEPALAPADFFAPLAPLPIPSNLFVPNPD 2253
Cdd:PHA03247  2729 RQASPALPAAP-APPAVPAGPATP----------GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907138010 2254 AEEPQ-PADGTG-CRGQAPAGTQSKAESTLEPKVGSSTVSAPGPELLPSKPDGSQGG 2308
Cdd:PHA03247  2798 LPSPWdPADPPAaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2-295 2.90e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 2.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    2 QPPPQAVPSGVAGPPPAGNPRSmfwanspyrkpannapvapitrplqpvtdpfafnrqTLQNTPVGSSSKSSLPNLPGPA 81
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPPPGT------------------------------------TQAATAGPTPSAPSVPPQGSPA 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   82 LSVFSQWPGLPVTPTNAGDSSTGLHEPlsgTLSQPRADASLFPPASTPSSLPglevsrnAEADPSSGHEVQM--LPHSAH 159
Cdd:pfam03154  214 TSQPPNQTQSTAAPHTLIQQTPTLHPQ---RLPSPHPPLQPMTQPPPPSQVS-------PQPLPQPSLHGQMppMPHSLQ 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  160 ----YIPGVGPEQPLGGQMNDSGSGPDQPMNRHAPHDGAVTHAASPFLPQPQMPgqwGPAQGGPQPSYQHHSPYLEGPvq 235
Cdd:pfam03154  284 tgpsHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ---QPPREQPLPPAPLSMPHIKPP-- 358
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  236 nmglQAASLPHFPPPSSlHQGPGHESHAPQTFTPASLASGEGNEIVHQQSKNHPLSSFPP 295
Cdd:pfam03154  359 ----PTTPIPQLPNPQS-HKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPP 413
 
Name Accession Description Interval E-value
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
1512-1874 7.38e-127

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 402.02  E-value: 7.38e-127
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1512 FPGPLGKDDTHKVDVINFAQNKATKCLQNESLIDKESASLLWKFIILLCRQNGTVVGTDIAElllrdhrtvwlpgkspne 1591
Cdd:cd09233      1 FPGPLIKGKTKKKDVLKWLEEKIAELEENEGYLDLEDKLLLWKLLKLLVRQNGKLVGTDIAE------------------ 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1592 anlidftneaveqveeeesgeaqlsfltdsqtvttsvlEKETERFRELLLYGRKKDALESAMKNGLWGHALLLASKMDSR 1671
Cdd:cd09233     63 --------------------------------------QKALNRFRNLLLTGNRKEALELALDNGLWAHALLLASSLGKE 104
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1672 THARVMTRFANSL-PINDPLQTVYQLMSGRMPAASTCCGDE------KWGDWRPHLAMVLSNLNNNMDVEsrTMATMGDT 1744
Cdd:cd09233    105 TWAEVVSRFARSEsKLNDPLQTLYQLFSGNSPEAITELADNpaeaewALGNWREHLAIILSNRTSNLDLE--ALVELGDL 182
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1745 LASKGLLDAAHFCYLMAQVGFGVYTKKTTKLVLIGSNHSLPFLKFATNEAIQRTEAYEYAQSLGAHTCSLPNFQVFKFIY 1824
Cdd:cd09233    183 LAQRGLVEAAHICYLLAGVPLGPYPSSPSSCLLGGAVHNKSPRTFATPEAIQLTEIYEYALSLGNPQFGLPHLQPYKLIH 262
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907138010 1825 LCRLAEMGLATQAFHYCEVIAKSV--LTQPGAYSPVLISQLTQMASQLRLFD 1874
Cdd:cd09233    263 AARLAELGLVSEALKYCEAIASSLksLTKSPYYDPNLLAQLQDLSERLSGTS 314
Sec16_C pfam12931
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ...
1637-1871 6.51e-45

Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.


Pssm-ID: 432884  Cd Length: 279  Bit Score: 165.04  E-value: 6.51e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1637 RELLLYGRKKDALESAMKNGLWGHALLLASKMDSRTHARVMTRFA------NSLPINDPLQTVYQLMSGRMPAA----ST 1706
Cdd:pfam12931    2 RALLLTGDREKALWLALDKKLWAHALLIASTLGKEKWKEVVQEFVrsefkgSNNKSGESLAALYQVFAGNSEEAvdelVP 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1707 CCGDEKWG--DWRPHLAMVLSNLNNNmDVESRTmaTMGDTLASKGLLDAAHFCYLMAQVGFGVytkkttkLVLIGSNHSL 1784
Cdd:pfam12931   82 PSKNALWAldNWRETLALVLSNRSPG-DVEALL--ALGDLLAQYGRTEAAHICFLLAGLPLSQ-------TVLLGADHVR 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1785 PFLKFATN-EAIQRTEAYEYAQSLGAH---TCSLPNFQVFKFIYLCRLAEMGLATQAFHYCEVIAKSV--LTQPGAY-SP 1857
Cdd:pfam12931  152 FPSTFGNDlESILLTEIYEYALSLSPPqppFVGLPHLLPYKLQHAAVLAEYGLVSEAQKYCDAITASLksLTKKSPYyHP 231
                          250
                   ....*....|....
gi 1907138010 1858 VLISQLTQMASQLR 1871
Cdd:pfam12931  232 TLLAQLEDLSNRLS 245
Sec16 pfam12932
Vesicle coat trafficking protein Sec16 mid-region; Sec16 is a multi-domain vesicle coat ...
1464-1564 1.22e-05

Vesicle coat trafficking protein Sec16 mid-region; Sec16 is a multi-domain vesicle coat protein. This central region is the functional part of the molecules and thus is vital for the family's role in mediating the movement of protein-cargo between the organelles of the secretory pathway.


Pssm-ID: 432885  Cd Length: 119  Bit Score: 46.44  E-value: 1.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1464 HVCARFGPGGQLL----KVIPNLPS-------EGQPALVEIHSLETLLqhtPEQEEMRSFPGPLGKDDTHKVDVI----- 1527
Cdd:pfam12932    1 HPIFSFGFGGKLVtmfpKRVPRYSTgqdvpmiKRSPGEVKIRNLKDVV---PLSEDLAKFPGPLVKGKSKKKEVLkwlse 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1907138010 1528 ---NFAQNKATKCLQNESLIDKESAS--LLWKFIILLCRQNG 1564
Cdd:pfam12932   78 rieELEQSLPYSDGSLESDEKKRAEEklLLWKLLKILVEHDG 119
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-291 5.90e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 5.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    2 QPPPQAVPSGVAGPPPAGNPRSMFWAN--SPYRKPANNAPVAPI--TRPLQPVT-----------DPFAFNRQTLQNTPV 66
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAARQASPALpaAPAPPAVPAGPATPGgpARPARPPTtagppapappaAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   67 GSSSKS--SLPNLPGPALSVFSQWPGLPVTPTNAGdSSTGLHEPLSGTLSQPRADASLFPPASTP--SSLPGLEVSRNAE 142
Cdd:PHA03247  2789 ASLSESreSLPSPWDPADPPAAVLAPAAALPPAAS-PAGPLPPPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPP 2867
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  143 ADPSSGhevqmLPHSAHYIPGVGPEQPLGGQMNDSGSGPDQPMNRHAPHDGAVTHAASPFLPQPQMPGQWGPAQGGPQPS 222
Cdd:PHA03247  2868 SRSPAA-----KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907138010  223 YQhhspylegPVQNMGLQAASLPHFPPPSSLHQGPGhESHAPQTFTPASLASGEGNEIVHQQSKNHPLS 291
Cdd:PHA03247  2943 LA--------PTTDPAGAGEPSGAVPQPWLGALVPG-RVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
PHA03247 PHA03247
large tegument protein UL36; Provisional
669-1241 7.15e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 7.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  669 AAVAPPDATSGNLEQPpDNMETPCAPQACPLPLSTTGEA------------GQLVSNTAGTPLDTVRP----------VP 726
Cdd:PHA03247  2491 AAGAAPDPGGGGPPDP-DAPPAPSRLAPAILPDEPVGEPvhprmltwirglEELASDDAGDPPPPLPPaappaapdrsVP 2569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  727 DKRPSARAQGP-VKCESPATTLWAQNELP--------DFGGNVLLAPAAPALYVPVKPKPSEVVHHPEKGMSGQKAWKQ- 796
Cdd:PHA03247  2570 PPRPAPRPSEPaVTSRARRPDAPPQSARPrapvddrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPp 2649
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  797 ---------GSVPPLQNQDPPGASENLENPPKVGEEEALP--VQASSGYASLLSSPPTESLHNQPVLIAQPDQSYNLAQP 865
Cdd:PHA03247  2650 erprddpapGRVSRPRRARRLGRAAQASSPPQRPRRRAARptVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR 2729
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  866 INFSVSLLNPNEKNQSWGDAV-VGERSivsnnwalggdPEERAALSGVPASAVTGAslPSSIPQNCAPQGSGSSEMIASQ 944
Cdd:PHA03247  2730 QASPALPAAPAPPAVPAGPATpGGPAR-----------PARPPTTAGPPAPAPPAA--PAAGPPRRLTRPAVASLSESRE 2796
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  945 SASwlvqqlSPQTPqSPHPNAEKGPSEFVSsPAGNTSVMLVPPASSTLVPNSNKAKHSSNQEEAVGALdftlnrTLENPV 1024
Cdd:PHA03247  2797 SLP------SPWDP-ADPPAAVLAPAAALP-PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV------APGGDV 2862
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1025 RMYSPSPSdgPASQQPLPNHPRQSgpglhnqdhfyqQVTKDAQDQHRLERAQPELVPPRPQnSPQVPQascpepsnpesp 1104
Cdd:PHA03247  2863 RRRPPSRS--PAAKPAAPARPPVR------------RLARPAVSRSTESFALPPDQPERPP-QPQAPP------------ 2915
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1105 ptQGQSESLAQPPASPASVNTGQLLPQPPQASSASVTSTNSSQAAVRSEQLW-------------LHPPPPNTFGPAPQD 1171
Cdd:PHA03247  2916 --PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGalvpgrvavprfrVPQPAPSREAPASST 2993
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1172 LASYYYYRPLYDAYQSQYPSPYPSDPGTASLyYQDMYglyepryrPYDSSASAYAENHRYSEPERPSSRA 1241
Cdd:PHA03247  2994 PPLTGHSLSRVSSWASSLALHEETDPPPVSL-KQTLW--------PPDDTEDSDADSLFDSDSERSDLEA 3054
PHA03247 PHA03247
large tegument protein UL36; Provisional
3-354 8.97e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 8.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    3 PPPQAVPSGVAGPPPAGNPRSMFWANSPYRKPANNAPVAPITRPLQPVTDPFAFNRQTLQNTPVGSSSKSSLPNLPGPAL 82
Cdd:PHA03247  2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   83 SVfsqwPGLPVTPTNAGDSStglhEPLSGTLSqPRADASLFPP----ASTPSSLPGLEVSRNAEADPSSGHEVQMLPHSA 158
Cdd:PHA03247  2834 AQ----PTAPPPPPGPPPPS----LPLGGSVA-PGGDVRRRPPsrspAAKPAAPARPPVRRLARPAVSRSTESFALPPDQ 2904
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  159 hyiPGVGPEQPLGGQMNDSGSGPDQPMNRHAPHdgavthaaSPFLPQPQMPGQWGPAqGGPQPSYQHHSPYLEGPVQnmG 238
Cdd:PHA03247  2905 ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPP--------PPPRPQPPLAPTTDPA-GAGEPSGAVPQPWLGALVP--G 2970
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  239 LQAASLPHFPPPSSLHQGPGHESHAPQTFTPASLASGEGNEIVHQQSKNHPLSSfppKHTFEQNSRIGNMWASPELKQNP 318
Cdd:PHA03247  2971 RVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSL---KQTLWPPDDTEDSDADSLFDSDS 3047
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1907138010  319 GVNKEHLLDPAHVNPftqgNSPENQAHHPPVAATNH 354
Cdd:PHA03247  3048 ERSDLEALDPLPPEP----HDPFAHEPDPATPEAGA 3079
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-275 4.80e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 4.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    4 PPQAvPSGVAGPPPAGNPRSMFWANSPYRKPANNAPVAPITRPLQPVTDPFAFNRQTLQNTPVGSSSKSSLPNLPGPALS 83
Cdd:PHA03247  2679 PPQR-PRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP 2757
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   84 VFSQWPGLPVTPTNAGDSSTG----LHEPLSGTLSQPRADASLFP-PASTPSSLPGLEVSRNAEADPSSGHEVQMLPHSA 158
Cdd:PHA03247  2758 ARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT 2837
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  159 HYIPGVGPEQP---LGGqmndsGSGPDQPMNRHAPHDGAVTHAASPF------LPQPQMPGQWGP-AQGGPQPSYQHHSP 228
Cdd:PHA03247  2838 APPPPPGPPPPslpLGG-----SVAPGGDVRRRPPSRSPAAKPAAPArppvrrLARPAVSRSTESfALPPDQPERPPQPQ 2912
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1907138010  229 YLEGPVQNMGLQAASLPHFPPPSslhQGPGHESHAPQTFT-PASLASG 275
Cdd:PHA03247  2913 APPPPQPQPQPPPPPQPQPPPPP---PPRPQPPLAPTTDPaGAGEPSG 2957
PHA03379 PHA03379
EBNA-3A; Provisional
3-216 8.54e-04

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 44.66  E-value: 8.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    3 PPPQAVPSGVAGPPPAGNPRSMFWANSPYRKPannaPVAPITRPLQPVTDPFAFNRQTLQNTPVGSSSK----SSLPNLP 78
Cdd:PHA03379   579 PPRSPSQMSVRDRLARLRAEAQPYQASVEVQP----PQLTQVSPQQPMEYPLEPEQQMFPGSPFSQVADvmraGGVPAMQ 654
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   79 GPALSVFSQWP---GLPVTPTNAG---------DSSTGLHEPLSGTLSQPRADASLFPPA-STPSSLPGLEVSRNAEADP 145
Cdd:PHA03379   655 PQYFDLPLQQPisqGAPLAPLRASmgpvppvpaTQPQYFDIPLTEPINQGASAAHFLPQQpMEGPLVPERWMFQGATLSQ 734
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907138010  146 SSGhevqmlphsahyiPGVGPEQPLGGQMNdsgsgpdQPMNRHAPhdgavthaASPFLPQPQMPGQWGPAQ 216
Cdd:PHA03379   735 SVR-------------PGVAQSQYFDLPLT-------QPINHGAP--------AAHFLHQPPMEGPWVPEQ 777
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1015-1246 1.11e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.05  E-value: 1.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1015 TLNRTLENPVRMYSP-SPSDGPASQQPlPNHPRQSGPGLHNQDhfyqqvtKDAQDQHRLERAQPelVPPRPQNSPQVPQA 1093
Cdd:PRK14086    73 TLSRELGRPIRIAITvDPSAGEPAPPP-PHARRTSEPELPRPG-------RRPYEGYGGPRADD--RPPGLPRQDQLPTA 142
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1094 scpepsnpespptqgqseslaqPPASPASVNTGQLLPQPPQASSASvtstnssqaavrSEQLWLHPPPPNTFGPAPQDLA 1173
Cdd:PRK14086   143 ----------------------RPAYPAYQQRPEPGAWPRAADDYG------------WQQQRLGFPPRAPYASPASYAP 188
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907138010 1174 SYYYYRPLYDAYQSQYPSPYPSDPGTASLYYQDMyglYEPRYRPYDSSASAYAENHRYSEPERPSSRASHYSD 1246
Cdd:PRK14086   189 EQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPR---RDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRP 258
PHA03247 PHA03247
large tegument protein UL36; Provisional
1934-2308 1.37e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1934 GLNQQAGPQADNPllmpstEPLMHGVQLLPTAPQTLPDGQPAHLsrvPMFPVPMSRgplELSPAYGPPGSALGFPESSRS 2013
Cdd:PHA03247  2539 GLEELASDDAGDP------PPPLPPAAPPAAPDRSVPPPRPAPR---PSEPAVTSR---ARRPDAPPQSARPRAPVDDRG 2606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2014 DPAVLHPGQALPPTTLSLQesglPPQEAKSPDPEMVPRGSPVRHSPPELSQEefgesfaDPGSSRTAQDLETSpvwdlgs 2093
Cdd:PHA03247  2607 DPRGPAPPSPLPPDTHAPD----PPPPSPSPAANEPDPHPPPTVPPPERPRD-------DPAPGRVSRPRRAR------- 2668
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2094 sSLTRAPSLTSDSEGKKPaqavkkepkepkkteswfsRWLPGKKRTEAYLPDDKNKSIVWDEKKNQWVNLNEPEEEKKAP 2173
Cdd:PHA03247  2669 -RLGRAAQASSPPQRPRR-------------------RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA 2728
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 2174 PPPPTSFPRVPqVAPTGPAGPPTAsvnvfsrkagGSRARYVDVLNPSGTQRSEPALAPADFFAPLAPLPIPSNLFVPNPD 2253
Cdd:PHA03247  2729 RQASPALPAAP-APPAVPAGPATP----------GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907138010 2254 AEEPQ-PADGTG-CRGQAPAGTQSKAESTLEPKVGSSTVSAPGPELLPSKPDGSQGG 2308
Cdd:PHA03247  2798 LPSPWdPADPPAaVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1001-1199 2.40e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.54  E-value: 2.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1001 HSSNQEEAVGALDFTLNRTLENPVRMYSPSPS---DGPASQQPLPNHprQSGPGLHNQDhfyqqvtkdaqdqhrleraqp 1077
Cdd:PRK10263   314 APITEPVAVAAAATTATQSWAAPVEPVTQTPPvasVDVPPAQPTVAW--QPVPGPQTGE--------------------- 370
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010 1078 ELVPPRPQNSPQVPQAScpepsnpesPPTQGQSESLAQP--PASPASVNTGQLLPQPPQASSASVTSTNSSQAAVRSEQ- 1154
Cdd:PRK10263   371 PVIAPAPEGYPQQSQYA---------QPAVQYNEPLQQPvqPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQp 441
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1907138010 1155 ----LWLHPPPPNTFGPAPQDLASYYYYRPLYDAYQSQYPSPYPSDPGT 1199
Cdd:PRK10263   442 vagnAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVV 490
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2-295 2.90e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 2.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010    2 QPPPQAVPSGVAGPPPAGNPRSmfwanspyrkpannapvapitrplqpvtdpfafnrqTLQNTPVGSSSKSSLPNLPGPA 81
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPPPGT------------------------------------TQAATAGPTPSAPSVPPQGSPA 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010   82 LSVFSQWPGLPVTPTNAGDSSTGLHEPlsgTLSQPRADASLFPPASTPSSLPglevsrnAEADPSSGHEVQM--LPHSAH 159
Cdd:pfam03154  214 TSQPPNQTQSTAAPHTLIQQTPTLHPQ---RLPSPHPPLQPMTQPPPPSQVS-------PQPLPQPSLHGQMppMPHSLQ 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  160 ----YIPGVGPEQPLGGQMNDSGSGPDQPMNRHAPHDGAVTHAASPFLPQPQMPgqwGPAQGGPQPSYQHHSPYLEGPvq 235
Cdd:pfam03154  284 tgpsHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ---QPPREQPLPPAPLSMPHIKPP-- 358
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907138010  236 nmglQAASLPHFPPPSSlHQGPGHESHAPQTFTPASLASGEGNEIVHQQSKNHPLSSFPP 295
Cdd:pfam03154  359 ----PTTPIPQLPNPQS-HKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPP 413
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH