In silico mining of EST databases for novel pre-implantation embryo-specific zinc finger protein genes

Mol Reprod Dev. 2001 Jul;59(3):249-55. doi: 10.1002/mrd.1029.

Abstract

Progress in the understanding of early mammalian embryo development has been severely hampered by scarcity of study materials. To circumvent such a constraint, we have developed a strategy that involves a combination of in silico mining of new genes from expressed sequence tags (EST) databases and rapid determination of expression profiles of the dbEST-derived genes using a PCR-based assay and a panel of cDNA libraries derived from different developmental stages and somatic tissues. We demonstrate that in a random sample of 49 independent dbEST-derived zinc finger protein genes mined from a mouse embryonic 2-cell cDNA library, more than three-quarters of these genes are novel. Examination of characteristics of the human orthologues derived from these mouse genes reveals that many of them are associated with human malignancies. Expression studies have further led to the identification of three novel genes that are exclusively expressed in mouse embryos before or up to the 8-cell stage. Two of the genes, designated 2czf45 and 2czf48 (2czf for 2-cell zinc finger), are zinc finger protein genes coding for a RBCC protein with a RFP domain and a protein with three C2H2 fingers, respectively. The third gene, designated 2cpoz56, codes for a protein with a POZ domain that is often associated with zinc finger proteins. These three genes are candidate genes for regulatory or other functions in early embryogenesis. The strategy described in this report should generally be applicable to rapid and large-scale mining of other classes of rare genes involved in other biological and pathological processes. Mol. Reprod. Dev. 59:249-255, 2001.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Databases, Factual*
  • Embryo, Mammalian / physiology*
  • Expressed Sequence Tags*
  • Gene Expression Profiling*
  • Gene Library
  • Humans
  • Mice
  • Molecular Sequence Data
  • Polymerase Chain Reaction / methods
  • Zinc Fingers / genetics*

Associated data

  • GENBANK/AA422810
  • GENBANK/AA549412
  • GENBANK/AA666887