NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907195911|ref|XP_036010639|]
View 

paired amphipathic helix protein Sin3a isoform X2 [Mus musculus]

Protein Classification

Sin3 family protein( domain architecture ID 1001704)

Sin3 family protein similar to Homo sapiens paired amphipathic helix proteins Sin3a and Sin3b, which act as transcriptional repressors

CATH:  1.20.1160.11
Gene Ontology:  GO:0003714

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Sin3 super family cl35027
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
8-1214 6.17e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


The actual alignment was detected with superfamily member COG5602:

Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 533.73  E-value: 6.17e-169
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911    8 QESPVYAAQQRRIPGSTEAFSHQHRVLAPAPPVYEAVSEtMQSATGIQYSvapnyQVSAVPQSSGSHGPAIAAVHSSHHH 87
Cdd:COG5602     37 RDHDVNTSLQNSEPWPDNGSARKPSYSPESMSSYGGPMD-ERSGTRRYYE-----DVEQILSSLAGDNPDPSTDHGSASN 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911   88 ptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISR 167
Cdd:COG5602    111 ------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIER 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  168 VSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpssqsaptpaqpap 243
Cdd:COG5602    173 VSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP------------------------------- 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  244 qptAAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVNKIKNRFQGQPD 323
Cdd:COG5602    222 ---PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVNKVKVRFQNNPE 288
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  324 IYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSvRND 403
Cdd:COG5602    289 MYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTANAQKPSK-RLP 354
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  404 HGGTVKKPqlnnkPQRPSQNGCQIRRHSGTGATPPVKKKPKLM-SLKESSMADA----SKHGVGTESLFFDKVRKALRSA 478
Cdd:COG5602    355 PIGSFSLP-----TAAPEQNRPSLLWESPRSISNISRYRADLLtSFSRNFVPIRihmpISHLSNESEDFFERIKQYLKDK 429
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHlESFPKERategiameIDYASCKRLGSS 558
Cdd:COG5602    430 NLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEK-EPLRETR--------KDLEKFEKCGPS 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  559 YRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAI 637
Cdd:COG5602    501 YRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQRTIKALEQI 580
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  638 QKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNK 716
Cdd:COG5602    581 IDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWRSCKREWNK 660
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  717 VWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQILEDAAALII 796
Cdd:COG5602    661 IWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIFVNILDLKD 736
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  797 HHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEEemdvdeatgapKKHNGVGGSP------------ 862
Cdd:COG5602    737 TLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLNV-----------KGLNIDGLKAdrssrsdesaqr 805
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  863 ------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQILCLRLLRic 911
Cdd:COG5602    806 yskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDSYERLYE-- 882
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  912 sqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIH 991
Cdd:COG5602    883 ----------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKYLRPFYNNT 931
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  992 AYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlnsqtsRSLLESAYQRKAEQLMS-DENCFKLMFI 1070
Cdd:COG5602    932 CYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSK---------RKQDQTKYRLEVESLLNpDEILFRFCWI 1002
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911 1071 QSQGQVQLTVELLD--TEEENSDDpveAERWSDYVERYMSSDTTspELREHLAQKPVFLPRNLRRIRKC-QRGREQQEke 1147
Cdd:COG5602   1003 NKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQLVSRLQT-- 1075
                         1210      1220      1230      1240      1250      1260
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907195911 1148 gkegnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1214
Cdd:COG5602   1076 -------KLLRSHELVSGLQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
 
Name Accession Description Interval E-value
Sin3 COG5602
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
8-1214 6.17e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 533.73  E-value: 6.17e-169
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911    8 QESPVYAAQQRRIPGSTEAFSHQHRVLAPAPPVYEAVSEtMQSATGIQYSvapnyQVSAVPQSSGSHGPAIAAVHSSHHH 87
Cdd:COG5602     37 RDHDVNTSLQNSEPWPDNGSARKPSYSPESMSSYGGPMD-ERSGTRRYYE-----DVEQILSSLAGDNPDPSTDHGSASN 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911   88 ptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISR 167
Cdd:COG5602    111 ------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIER 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  168 VSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpssqsaptpaqpap 243
Cdd:COG5602    173 VSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP------------------------------- 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  244 qptAAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVNKIKNRFQGQPD 323
Cdd:COG5602    222 ---PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVNKVKVRFQNNPE 288
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  324 IYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSvRND 403
Cdd:COG5602    289 MYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTANAQKPSK-RLP 354
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  404 HGGTVKKPqlnnkPQRPSQNGCQIRRHSGTGATPPVKKKPKLM-SLKESSMADA----SKHGVGTESLFFDKVRKALRSA 478
Cdd:COG5602    355 PIGSFSLP-----TAAPEQNRPSLLWESPRSISNISRYRADLLtSFSRNFVPIRihmpISHLSNESEDFFERIKQYLKDK 429
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHlESFPKERategiameIDYASCKRLGSS 558
Cdd:COG5602    430 NLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEK-EPLRETR--------KDLEKFEKCGPS 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  559 YRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAI 637
Cdd:COG5602    501 YRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQRTIKALEQI 580
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  638 QKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNK 716
Cdd:COG5602    581 IDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWRSCKREWNK 660
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  717 VWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQILEDAAALII 796
Cdd:COG5602    661 IWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIFVNILDLKD 736
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  797 HHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEEemdvdeatgapKKHNGVGGSP------------ 862
Cdd:COG5602    737 TLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLNV-----------KGLNIDGLKAdrssrsdesaqr 805
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  863 ------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQILCLRLLRic 911
Cdd:COG5602    806 yskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDSYERLYE-- 882
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  912 sqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIH 991
Cdd:COG5602    883 ----------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKYLRPFYNNT 931
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  992 AYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlnsqtsRSLLESAYQRKAEQLMS-DENCFKLMFI 1070
Cdd:COG5602    932 CYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSK---------RKQDQTKYRLEVESLLNpDEILFRFCWI 1002
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911 1071 QSQGQVQLTVELLD--TEEENSDDpveAERWSDYVERYMSSDTTspELREHLAQKPVFLPRNLRRIRKC-QRGREQQEke 1147
Cdd:COG5602   1003 NKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQLVSRLQT-- 1075
                         1210      1220      1230      1240      1250      1260
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907195911 1148 gkegnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1214
Cdd:COG5602   1076 -------KLLRSHELVSGLQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
Sin3a_C pfam16879
C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at ...
887-1187 1.70e-103

C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at the C-terminus of the co-repressor Sin3a, and downstream of family Sin3_corepress, pfam08295.


Pssm-ID: 465294  Cd Length: 281  Bit Score: 329.18  E-value: 1.70e-103
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  887 FYVNNNWYIFMRLHQILCLRLLRIcSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLkepmdVDVEDYYPAFLDM 966
Cdd:pfam16879    1 FFANTTIYVFFRLFQILYERLLKI-KQAEEEVAEEIKRRKANKPAKDLGLISKELEDFGLDL-----SDPEDYYPQLLEL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  967 VRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlNSQTsrsl 1046
Cdd:pfam16879   75 CERLIDGEIDQSQFEDSLRQMYGNKAYKLYTIDKLIQALVKQLQTIVSDSKSQELLELFEKERSKEET----TAQD---- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911 1047 lESAYQRKAEQLM-SDENCFKLMFIQSqgQVQLTVELLDTEEENSDDPV--EAERWSDYVERYMSSDTTspELREHLAQK 1123
Cdd:pfam16879  147 -EILYRKQAEKLLgKDENLFRIEWNPQ--SKTVTIQLLDKDDLTFDDDEltAEERWQYYVSSYVLSHPT--EGVPRSKLR 221
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907195911 1124 PVFLPRNLRRIRKCQRGREQQEKEGKEgnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDYM 1187
Cdd:pfam16879  222 KPFLKRNLKAEREEEEDEEEEPKADAE----EPEGPLEFESGLEIRICLNSYKLFYVPGTEDYF 281
HDAC_interact smart00761
Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It ...
551-651 2.80e-52

Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 214808 [Multi-domain]  Cd Length: 102  Bit Score: 178.27  E-value: 2.80e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911   551 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 629
Cdd:smart00761    1 NCERCGPSYRLLPKSEKQPKCSGRDELCKEVLNDTWVSHPTWaSEDSGFVAHRKNQYEEALFRCEDERFELDMVIESNSS 80
                            90       100
                    ....*....|....*....|..
gi 1907195911   630 TIRVLEAIQKKLSRLSAEEQAK 651
Cdd:smart00761   81 TIKLLEEILNKIEDMSDEERAN 102
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
48-119 1.24e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 42.72  E-value: 1.24e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907195911   48 MQSATGIQYSVAPNYQVSAVPQSSGSHGPAIAAVHSSHHHPTAVQPHGgqvVQSHAHPAPPVAPvQGQQQFQ 119
Cdd:cd22056    213 MHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQY---QYMNAPYPPHYAH-QGAPQFH 280
 
Name Accession Description Interval E-value
Sin3 COG5602
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
8-1214 6.17e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 533.73  E-value: 6.17e-169
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911    8 QESPVYAAQQRRIPGSTEAFSHQHRVLAPAPPVYEAVSEtMQSATGIQYSvapnyQVSAVPQSSGSHGPAIAAVHSSHHH 87
Cdd:COG5602     37 RDHDVNTSLQNSEPWPDNGSARKPSYSPESMSSYGGPMD-ERSGTRRYYE-----DVEQILSSLAGDNPDPSTDHGSASN 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911   88 ptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISR 167
Cdd:COG5602    111 ------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIER 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  168 VSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpssqsaptpaqpap 243
Cdd:COG5602    173 VSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP------------------------------- 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  244 qptAAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVNKIKNRFQGQPD 323
Cdd:COG5602    222 ---PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVNKVKVRFQNNPE 288
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  324 IYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSvRND 403
Cdd:COG5602    289 MYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTANAQKPSK-RLP 354
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  404 HGGTVKKPqlnnkPQRPSQNGCQIRRHSGTGATPPVKKKPKLM-SLKESSMADA----SKHGVGTESLFFDKVRKALRSA 478
Cdd:COG5602    355 PIGSFSLP-----TAAPEQNRPSLLWESPRSISNISRYRADLLtSFSRNFVPIRihmpISHLSNESEDFFERIKQYLKDK 429
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHlESFPKERategiameIDYASCKRLGSS 558
Cdd:COG5602    430 NLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEK-EPLRETR--------KDLEKFEKCGPS 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  559 YRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAI 637
Cdd:COG5602    501 YRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQRTIKALEQI 580
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  638 QKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNK 716
Cdd:COG5602    581 IDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWRSCKREWNK 660
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  717 VWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQILEDAAALII 796
Cdd:COG5602    661 IWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIFVNILDLKD 736
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  797 HHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEEemdvdeatgapKKHNGVGGSP------------ 862
Cdd:COG5602    737 TLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLNV-----------KGLNIDGLKAdrssrsdesaqr 805
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  863 ------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQILCLRLLRic 911
Cdd:COG5602    806 yskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDSYERLYE-- 882
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  912 sqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIH 991
Cdd:COG5602    883 ----------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKYLRPFYNNT 931
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  992 AYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlnsqtsRSLLESAYQRKAEQLMS-DENCFKLMFI 1070
Cdd:COG5602    932 CYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSK---------RKQDQTKYRLEVESLLNpDEILFRFCWI 1002
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911 1071 QSQGQVQLTVELLD--TEEENSDDpveAERWSDYVERYMSSDTTspELREHLAQKPVFLPRNLRRIRKC-QRGREQQEke 1147
Cdd:COG5602   1003 NKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQLVSRLQT-- 1075
                         1210      1220      1230      1240      1250      1260
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907195911 1148 gkegnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1214
Cdd:COG5602   1076 -------KLLRSHELVSGLQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
Sin3a_C pfam16879
C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at ...
887-1187 1.70e-103

C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at the C-terminus of the co-repressor Sin3a, and downstream of family Sin3_corepress, pfam08295.


Pssm-ID: 465294  Cd Length: 281  Bit Score: 329.18  E-value: 1.70e-103
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  887 FYVNNNWYIFMRLHQILCLRLLRIcSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLkepmdVDVEDYYPAFLDM 966
Cdd:pfam16879    1 FFANTTIYVFFRLFQILYERLLKI-KQAEEEVAEEIKRRKANKPAKDLGLISKELEDFGLDL-----SDPEDYYPQLLEL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  967 VRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlNSQTsrsl 1046
Cdd:pfam16879   75 CERLIDGEIDQSQFEDSLRQMYGNKAYKLYTIDKLIQALVKQLQTIVSDSKSQELLELFEKERSKEET----TAQD---- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911 1047 lESAYQRKAEQLM-SDENCFKLMFIQSqgQVQLTVELLDTEEENSDDPV--EAERWSDYVERYMSSDTTspELREHLAQK 1123
Cdd:pfam16879  147 -EILYRKQAEKLLgKDENLFRIEWNPQ--SKTVTIQLLDKDDLTFDDDEltAEERWQYYVSSYVLSHPT--EGVPRSKLR 221
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907195911 1124 PVFLPRNLRRIRKCQRGREQQEKEGKEgnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDYM 1187
Cdd:pfam16879  222 KPFLKRNLKAEREEEEDEEEEPKADAE----EPEGPLEFESGLEIRICLNSYKLFYVPGTEDYF 281
HDAC_interact smart00761
Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It ...
551-651 2.80e-52

Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 214808 [Multi-domain]  Cd Length: 102  Bit Score: 178.27  E-value: 2.80e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911   551 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 629
Cdd:smart00761    1 NCERCGPSYRLLPKSEKQPKCSGRDELCKEVLNDTWVSHPTWaSEDSGFVAHRKNQYEEALFRCEDERFELDMVIESNSS 80
                            90       100
                    ....*....|....*....|..
gi 1907195911   630 TIRVLEAIQKKLSRLSAEEQAK 651
Cdd:smart00761   81 TIKLLEEILNKIEDMSDEERAN 102
Sin3_corepress pfam08295
Sin3 family co-repressor; This domain is found on transcriptional regulators. It forms ...
553-647 5.87e-51

Sin3 family co-repressor; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 462418  Cd Length: 97  Bit Score: 174.34  E-value: 5.87e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907195911  553 KRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDS-TFVSSKKTQYEEHIYRCEDERFELDVVLETNLAT 630
Cdd:pfam08295    1 KRCGPSYRLLPKSEPQLPCSGRDELCWEVLNDEWVSHPTWaSEDSgSFVAHRKNQYEEALFRCEDERYEYDMHIESNLRT 80
                           90
                   ....*....|....*..
gi 1907195911  631 IRVLEAIQKKLSRLSAE 647
Cdd:pfam08295   81 IKLLEPIAEKINNMSPE 97
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
142-186 9.33e-20

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 83.63  E-value: 9.33e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1907195911  142 QVYNDFLDIMKEFKSQSIDTPGVISRVSQLFKGHPDLIMGFNTFL 186
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAEVIARVSELLKGHPDLLEGFNTFL 45
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
479-523 2.04e-13

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 65.52  E-value: 2.04e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1907195911  479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFL 523
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAEVIARVSELLKGHPDLLEGFNTFL 45
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
323-380 3.78e-12

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 62.05  E-value: 3.78e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907195911  323 DIYKAFLEILHTYQKEQRNAKEaggnytpalteqeVYAQVARLFKNQEDLLSEFGQFL 380
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAE-------------VIARVSELLKGHPDLLEGFNTFL 45
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
48-119 1.24e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 42.72  E-value: 1.24e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907195911   48 MQSATGIQYSVAPNYQVSAVPQSSGSHGPAIAAVHSSHHHPTAVQPHGgqvVQSHAHPAPPVAPvQGQQQFQ 119
Cdd:cd22056    213 MHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQY---QYMNAPYPPHYAH-QGAPQFH 280
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH