Detailed Transcription Information for Regions

After you get transcription details for a region, you can:

The first image is generated through the UCSC Genome Browser. The requested region is shown with behavioral and physiological QTL from the Rat Genome Database or the Mouse Genome Database, annotated RefSeq genes/transcripts, and expression data on normal tissue from the UCSC Genome Browser database. The image can be altered using the Image control options above the image to show or hide transcripts and to change the track options.

Take a look

Genes Physically Located in a Region

All Ensembl genes (including protein-coding and non-coding) are listed in the table. For regions in the rat genome expressed in brain, derived from genome-guided transcriptome reconstruction using brain polyA+ selected RNA from the BN-Lx/CubPrin and SHR/OlaPrin, strains that do not overlap an Ensembl gene are also listed. See "Transcriptome Reconstruction (Rat Only)".

The table has a row for each Ensembl gene, and each Ensembl gene may have multiple transcript (i.e., splice variants).

Take a look

# Ensembl Transcripts

The # Ensembl Transcripts column lists the number of transcripts annotated by Ensembl that are associated with a particular gene.

Transcripts (RNA-Seq)

The Transcripts RNA-Seq column reports the number of transcripts from the transcriptome reconstruction in rat brain (i.e., the transcript must be expressed in brain to be identified in the reconstruction) that overlap the annotated gene. The RNA-Seq transcripts identified as being associated with a given gene are only required to be on the same strand and have some overlap in either exonic or intronic regions. There is no requirement that exonic regions of either overlap, so it is possible a transcript in this count might be located in an intronic region of the Ensembl transcript.

Total Probesets

The Total Probesets column reports the number of probe sets from the Affymetrix Exon 1.0 ST array that overlap the gene. The count includes any probe sets that fall within the start base pair and stop base pair, such as any associated RNA-Seq transcript start and stops that may extend past the start or stop of the annotated gene, and are on the same strand as the annotated gene. This count includes probe sets that fall in an annotated intron as long as the probe set is on the same strand as the gene. The count includes probe sets that have not been masked due to unreliable data (i.e., SNPs or non-uniquely aligning probe sets).

Probesets > 0.33 Heritability

For each probe set on the Affymetrix Exon 1.0 ST Array (mouse or rat), we calculated a broad-sense heritability using an ANOVA model and expression data from the ILSXISS panel (mouse) or the HXB/BXH panel (rat). See "Public Datasets" for details. The heritability threshold of 0.33 was chosen arbitrarily to represent an expression estimate with at least modest heritability. In the rat, we include the number of probe sets at least modestly heritable in the four available tissues: brain, heart, liver, and brown adipose.

Probesets > 1% DABG

For each probe set on the Affymetrix Exon 1.0 ST Array (mouse or rat) and each sample, we calculated a p-value associated with the expression of the probe set above background (DABG – detection above background). Using a p-value threshold of 0.0001, we calculated the proportion of samples from the ILSXISS panel (mouse) or HXB/BXH panel (rat) that had expression values significantly different from background for a given probe set. In the table, we report the number of probe sets whose expression values were detected above background in more than 1% of samples.

eQTLs (Gene/Transcript Cluster ID)

eQTL were calculated for each transcript cluster (approximately gene) on the Affymetrix Exon 1.0 ST Array using the ILSXISS panel (mouse) and the HXB/BXH panel (rat). See "Public Datasets" for details. Locus-specific p-values were calculated using permutation methods for each transcript cluster/SNP pair.

Transcript Cluster – The numeric ID that is used by Affymetrix to specify a group of probe sets that have been combined to estimate expression at the gene level rather than at the individual exon level.

Annotation Level – Affymetrix indicates its confidence in annotation of a particular transcript cluster by classifying it as core (most confidence), extended, or full (least confidence). This designation is based on the source of annotation for a gene. For more details, see the Affymetrix website at: http://www.affymetrix.com.

View Genome-Wide Associations – A Circos graphic displays for this particular gene/transcript cluster when you click the link in this column. The graphic displays the locus-specific (location-specific), p-values across the genome for the gene. When examining data from rat, the locus-specific eQTL displays for each of the four tissues available.

Total # of Locations P-value < 0.001 – In this column, we calculated the number of locations (loci) within the genome that the transcript cluster was significantly associated with (locus-specific p<0.001). For rat, this is calculated for each of the four tissues separately.

Minimum P-Value Location – This column gives the location of the locus with the smallest p-value for this transcript. This is the strongest eQTL for that gene. If no loci have a p-value less than 0.001, no results are displayed in this column. For rat, this is calculated for each of the four tissues separately. The p-value cutoff may be adjusted in the filtering section.

bQTLs Overlapping Region

The bQTL Overlapping Region table lists all the behavioral and physiological QTL reported in the Rat Genome Database (RGD) or the Mouse Genome Database (MGD) that overlap the genomic region selected. The column identified as References includes two links. The top link takes you to the related entry at the RGD or MGD and the bottom link takes you to the related journal article through PubMed. The second-to-last column is the whole region reported for the bQTL. When you click this link, a new window with the detailed transcriptome information for the region of that particular bQTL.

Take a look

Transcripts Controlled From Region (eQTL)

The Transcripts Controlled from Region tab displays a Circos Plot that shows the physical location of genes controlled from the selected region. P-values less than 0.001 are highlighted in yellow, and links are drawn to connect the physical location of the transcript with the associated loci within the specified region. For rat, information on the four tissues available displays in different colors.

  1. Click Filter List and Circos Plot to select from a subset of chromosomes and tissues (for rat), and generate the plot again.
  2. Click to download the plot as a PDF.
Take a look

The table below the Circos Plot lists the genes whose expression is controlled from the chosen region. The first several columns are identical to the table in the first tab. See "Genes Physically Located in a Region".

The columns that differ are:

P-Value from Region

The minimum p-value in the chosen region is reported for each transcript cluster. For the rat, separate columns are reported for each of the four tissues. If a gene has a significant eQTL in this region in any tissue, minimum p-values are reported for all four tissues. eQTL that reach the p-value threshold are highlighted in blue. You can filter on a number of parameters to obtain a more reasonably sized list. You can decreases the minimum p-value to look for more significant eQTLs, or you can limit your search to particular chromosomes to view only genes located on the selected chromosomes.

The exact region may be larger or smaller than the region you entered. A list of the exact eQTL regions displays above the Circos Plot.

# other locations p-value<0.001

This column counts the number of loci (locations) in the genome outside the chosen region that have a significant association with expression of this gene.