Exploring the Arabidopsis thaliana CCD7 gene

(a) Find the Arabidopsis thaliana CCD7 gene on Ensembl Plants. On which chromosome and which strand of the genome is this gene located?

(b) Where in the cell is the CCD7 protein located?

(c) What is the source of the assigned gene name?

(d) How many transcripts does it have? How long is its longest transcript? How long is the protein it encodes? How many exons does it have? Are any of the exons completely or partially untranslated?

(a) Go to the Ensembl Plants homepage (http://plants.ensembl.org/). Select Arabidopsis thaliana from the species list and type CCD7 in the search box. Click Go. Click on CCD7.

The Arabidopsis CCD7 gene is located on chromosome 2 on the forward strand.
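
The same lookup can also be scripted rather than clicked through. The following is a minimal sketch, assuming the public Ensembl REST server at https://rest.ensembl.org covers Ensembl Plants species and that its lookup/symbol endpoint returns the fields seq_region_name and strand. The helper names fetch_gene and describe_location are made up for this illustration.

```python
import json
import urllib.request

# Assumption: the public Ensembl REST server also serves Ensembl Plants species.
REST = "https://rest.ensembl.org"


def fetch_gene(symbol, species="arabidopsis_thaliana"):
    """Fetch a gene record by symbol. Requires network access."""
    url = f"{REST}/lookup/symbol/{species}/{symbol}?content-type=application/json"
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def describe_location(gene):
    """Turn the seq_region_name and strand fields into a readable phrase."""
    # Assumption: strand is reported as 1 (forward) or -1 (reverse).
    strand = "forward" if gene["strand"] == 1 else "reverse"
    return f"chromosome {gene['seq_region_name']}, {strand} strand"
```

With network access, describe_location(fetch_gene("CCD7")) can be compared against the answer read from the browser.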

(b) Click on GO: cellular component in the side menu.

The protein is located in the chloroplast.

(c) Click on Summary in the side menu.

The gene name is assigned and imported from TAIR (The Arabidopsis Information Resource).

(d) Click on Show transcript table.

There are three transcripts. The longest one is 2005 base pairs and the length of the encoded protein is 622 amino acids.

Click on the Ensembl Transcript ID AT2G44990 in the transcript table.

It has six exons.

Click on Sequence - Exons in the side menu.

The first and last exons are partially untranslated (sequence shown in orange). This can also be seen in the transcript diagrams on the Gene summary and Transcript summary pages, where the boxes representing the first and last exons are partially unfilled.
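
The exon count, transcript length and untranslated exons can be worked out from coordinates as well as read from the browser. The sketch below assumes a transcript record shaped like the output of the Ensembl REST lookup/id endpoint with expand=1: an Exon list with genomic start and end, and a Translation entry with genomic start, end and a length in amino acids. The function names and the toy record are hypothetical and are not real CCD7 data.

```python
def exon_translation_status(exon, cds_start, cds_end):
    """Classify one exon against the genomic span of the coding region."""
    if exon["end"] < cds_start or exon["start"] > cds_end:
        return "untranslated"  # lies entirely outside the coding span
    if exon["start"] >= cds_start and exon["end"] <= cds_end:
        return "translated"    # lies entirely inside the coding span
    return "partial"           # straddles a coding boundary, so part is UTR


def summarise_transcript(transcript):
    """Count exons, sum their lengths and flag untranslated exons."""
    exons = transcript["Exon"]
    summary = {
        "exons": len(exons),
        # Spliced transcript length is the sum of the exon lengths.
        "length_bp": sum(e["end"] - e["start"] + 1 for e in exons),
    }
    translation = transcript.get("Translation")  # absent for non-coding transcripts
    if translation:
        summary["protein_aa"] = translation["length"]
        summary["status"] = [
            exon_translation_status(e, translation["start"], translation["end"])
            for e in exons
        ]
    return summary


# Toy record with invented coordinates, for illustration only.
toy = {
    "Exon": [
        {"start": 1, "end": 100},
        {"start": 201, "end": 300},
        {"start": 401, "end": 500},
    ],
    "Translation": {"start": 51, "end": 451, "length": 66},
}
print(summarise_transcript(toy))
```

Because the comparison uses genomic start and end coordinates, the classification works the same way for genes on either strand. Applying summarise_transcript to each transcript of a real gene record gives figures that can be checked against the transcript table and the Exons view.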