ElDorado
Introduction
ElDorado is the Genomatix genome annotation. It is based on the publicly available reference genome assemblies of 18 different organisms. Information from a variety of different sources together with data generated by Genomatix' proprietary algorithms is used to set up a database of more than 500 gigabytes of quality checked data. The individual elements are highly linked to external data sources (e.g. NCBI, Ensembl, TIGR, ...).
Representing a common sequence basis ElDorado allows us to integrate literature based information about gene-gene interactions and pathway annotation from BiblioSphere PathwayEdition together with experimental gene expression data (e.g. from ChipInspector).
Features
- The gene name query of ElDorado supports extended lists of gene name synonyms and variants
- Very fast algorithm for exon mapping evaluating the cDNA for canonical splice sites
- Detailed gene information, integrating information from different public sources (e.g. EntrezGene, GeneOntology) with manually curated gene-gene interactions by Genomatix.
- Comparative Genomics (patent pending) allows you to compare the annotation available for a group of orthologous genes and to identify corresponding promoter regions for the analysis of phylogenetically conserved promoter elements
- Literature analysis gives you access to the genes cocited in the scientific literature providing direct links to BiblioSphere PathwayEdition for further in-depth analysis
Contents
- Transcripts with exon/intron structure, UTRs, CDS, and protein sequence
- Promoters for each transcript analysed for transcription factor binding sites and regulatory modules
- CAGE tags, providing additional experimental support for full length transcripts
- PromoterInspector predictions
- microRNAs
- SNPs and their effects on transcription factor binding sites and protein sequences
- genomic repeats
- SMARs
- genomic location for individual probes from Affymetrix chips

Example Output
For each gene of interest, ElDorado lists information based on the gene sequence and information based on literature analysis.
