U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination
    • Showing Current items.

    Cenpa centromere protein A [ Mus musculus (house mouse) ]

    Gene ID: 12615, updated on 11-Apr-2024

    Summary

    Official Symbol
    Cenpaprovided by MGI
    Official Full Name
    centromere protein Aprovided by MGI
    Primary source
    MGI:MGI:88375
    See related
    Ensembl:ENSMUSG00000029177 AllianceGenome:MGI:88375
    Gene type
    protein coding
    RefSeq status
    REVIEWED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    Cenp-A
    Summary
    Centromeres are the differentiated chromosomal domains that specify the mitotic behavior of chromosomes. This gene encodes a centromere protein which contains a histone H3 related histone fold domain that is required for targeting to the centromere. Centromere protein A is proposed to be a component of a modified nucleosome or nucleosome-like structure in which it replaces 1 or both copies of conventional histone H3 in the (H3-H4)2 tetrameric core of the nucleosome particle. The protein is a replication-independent histone that is a member of the histone H3 family. Alternative splicing results in multiple transcript variants encoding distinct isoforms. [provided by RefSeq, Nov 2015]
    Expression
    Broad expression in CNS E11.5 (RPKM 43.5), liver E14.5 (RPKM 41.9) and 20 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    Location:
    5 B1; 5 16.76 cM
    Exon count:
    9
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 5 NC_000071.7 (30824214..30832181)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 5 NC_000071.6 (30666877..30674837)

    Chromosome 5 - NC_000071.7Genomic Context describing neighboring genes Neighboring gene predicted gene 9899 Neighboring gene potassium channel, subfamily K, member 3 Neighboring gene STARR-seq mESC enhancer starr_12757 Neighboring gene solute carrier family 35, member F6 Neighboring gene microRNA 5625 Neighboring gene STARR-seq mESC enhancer starr_12759 Neighboring gene STARR-positive B cell enhancer ABC_E4748 Neighboring gene predicted gene, 57741 Neighboring gene autophagy-related 3 pseudogene

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Bibliography

    GeneRIFs: Gene References Into Functions

    What's a GeneRIF?

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Function Evidence Code Pubs
    enables DNA binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables protein heterodimerization activity IEA
    Inferred from Electronic Annotation
    more info
     
    enables structural constituent of chromatin IEA
    Inferred from Electronic Annotation
    more info
     
    Component Evidence Code Pubs
    part_of CENP-A containing chromatin IDA
    Inferred from Direct Assay
    more info
    PubMed 
    part_of CENP-A containing nucleosome ISO
    Inferred from Sequence Orthology
    more info
     
    located_in chromosome IEA
    Inferred from Electronic Annotation
    more info
     
    located_in chromosome, centromeric region IDA
    Inferred from Direct Assay
    more info
    PubMed 
    located_in chromosome, centromeric region ISO
    Inferred from Sequence Orthology
    more info
    PubMed 
    located_in condensed chromosome, centromeric region IDA
    Inferred from Direct Assay
    more info
    PubMed 
    located_in condensed chromosome, centromeric region ISO
    Inferred from Sequence Orthology
    more info
     
    located_in nucleoplasm ISO
    Inferred from Sequence Orthology
    more info
     
    part_of nucleosome ISO
    Inferred from Sequence Orthology
    more info
     
    is_active_in nucleus IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    located_in nucleus ISO
    Inferred from Sequence Orthology
    more info
     
    part_of pericentric heterochromatin IDA
    Inferred from Direct Assay
    more info
    PubMed 

    General protein information

    Preferred Names
    histone H3-like centromeric protein A
    Names
    centromere autoantigen A
    centrosomin A

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001302129.1NP_001289058.1  histone H3-like centromeric protein A isoform 2

      See identical proteins and their annotated locations for NP_001289058.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (2) contains an alternate exon in the 5' coding region and uses a downstream start codon compared to variant 1. The resulting isoform (2) has a distinct shorter N-terminus, compared to isoform 1. Variants 2, 3 and 4 encode the same isoform (2).
      Source sequence(s)
      AC105298, AF012709, AK011399, AK041138, BQ748418, BY136144
      UniProtKB/Swiss-Prot
      O35216
      Conserved Domains (1) summary
      smart00428
      Location:3105
      H3; Histone H3
    2. NM_001302130.1NP_001289059.1  histone H3-like centromeric protein A isoform 2

      See identical proteins and their annotated locations for NP_001289059.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (3) contains two alternate exons in the 5' coding region and uses a downstream start codon compared to variant 1. The resulting isoform (2) has a distinct shorter N-terminus, compared to isoform 1. Variants 2, 3 and 4 encode the same isoform (2).
      Source sequence(s)
      AC105298, AF012709, AK011399, AK041138, BQ748418, BY136144
      UniProtKB/Swiss-Prot
      O35216
      Conserved Domains (1) summary
      smart00428
      Location:3105
      H3; Histone H3
    3. NM_001302131.1NP_001289060.1  histone H3-like centromeric protein A isoform 2

      See identical proteins and their annotated locations for NP_001289060.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (4) contains two alternate exons in the 5' coding region and uses a downstream start codon compared to variant 1. The resulting isoform (2) has a distinct shorter N-terminus, compared to isoform 1. Variants 2, 3 and 4 encode the same isoform (2).
      Source sequence(s)
      AC105298, AF012709, AK011399, AK041138, BQ748418, BY136144
      UniProtKB/Swiss-Prot
      O35216
      Conserved Domains (1) summary
      smart00428
      Location:3105
      H3; Histone H3
    4. NM_001302132.1NP_001289061.1  histone H3-like centromeric protein A isoform 3

      Status: REVIEWED

      Description
      Transcript Variant: This variant (5) lacks a 3' exon, which results in a frameshift, compared to variant 1. The resulting isoform (3) has a shorter and distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AA016357, AC105298, AF012709, AK011399, BQ748418, BY136144
      UniProtKB/TrEMBL
      A0A0G2JGI2
      Related
      ENSMUSP00000143575.2, ENSMUST00000199320.5
      Conserved Domains (1) summary
      cl23735
      Location:2890
      H4; Histone H4, one of the four histones, along with H2A, H2B and H3, which forms the eukaryotic nucleosome core; along with H3, it plays a central role in nucleosome formation; histones bind to DNA and wrap the genetic material into "beads on a string" in ...
    5. NM_001421447.1NP_001408376.1  histone H3-like centromeric protein A isoform 2

      Status: REVIEWED

      Source sequence(s)
      AC105298
    6. NM_001421448.1NP_001408377.1  histone H3-like centromeric protein A isoform 4

      Status: REVIEWED

      Source sequence(s)
      AC105298
    7. NM_001421449.1NP_001408378.1  histone H3-like centromeric protein A isoform 5

      Status: REVIEWED

      Source sequence(s)
      AC105298
    8. NM_007681.3NP_031707.1  histone H3-like centromeric protein A isoform 1

      See identical proteins and their annotated locations for NP_031707.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (1) encodes the longest isoform (1).
      Source sequence(s)
      AC105298, AF012709, AK011399, BQ748418, BY136144
      Consensus CDS
      CCDS19162.1
      UniProtKB/Swiss-Prot
      O35216, Q545C9
      Related
      ENSMUSP00000122831.2, ENSMUST00000144742.6
      Conserved Domains (2) summary
      smart00428
      Location:28131
      H3; Histone H3
      pfam00125
      Location:1127
      Histone; Core histone H2A/H2B/H3/H4

    RNA

    1. NR_126074.1 RNA Sequence

      Status: REVIEWED

      Description
      Transcript Variant: This variant (6) uses an alternate splice site in the 3' region compared to variant 1. This variant is represented as non-coding because the use of the 5'-most expected translational start codon renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
      Source sequence(s)
      AC105298, AF012709, AK011399, BQ748418, BY136144
    2. NR_185302.1 RNA Sequence

      Status: REVIEWED

      Source sequence(s)
      AC105298
    3. NR_185303.1 RNA Sequence

      Status: REVIEWED

      Source sequence(s)
      AC105298
    4. NR_185304.1 RNA Sequence

      Status: REVIEWED

      Source sequence(s)
      AC105298

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000071.7 Reference GRCm39 C57BL/6J

      Range
      30824214..30832181
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_036164762.1XP_036020655.1  histone H3-like centromeric protein A isoform X2

      Conserved Domains (1) summary
      smart00428
      Location:3105
      H3; Histone H3

    RNA

    1. XR_004942431.1 RNA Sequence