Finding genes by protein domain


Find zebrafish genes on chromosome 9 that encode proteins with transmembrane helices.

As with all BioMart queries, you must select the dataset, set your filters (input) and define your attributes (desired output). For this exercise:

Dataset: Ensembl genes in zebrafish
Filters: Transmembrane helices on chromosome 9
Attributes: Ensembl gene and transcript IDs and gene name
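The same dataset, filter and attribute choices can also be written down as a BioMart XML query, which is a compact way to see how the three parts fit together. The sketch below only builds the XML text and does not contact any server. The internal names used here (`drerio_gene_ensembl`, `chromosome_name`, `with_tmhmm`, `ensembl_gene_id`, `ensembl_transcript_id`, `external_gene_name`) are assumptions about how the current Ensembl mart labels these items and are not taken from this exercise; the XML button on the BioMart results page shows the exact names for a given release. `build_biomart_query` is a hypothetical helper written for this illustration.

```python
import xml.etree.ElementTree as ET


def build_biomart_query(dataset, filters, attributes):
    """Return BioMart XML query text for a single dataset (hypothetical helper)."""
    query = ET.Element(
        "Query",
        virtualSchemaName="default",
        formatter="TSV",
        header="1",
        uniqueRows="1",
        datasetConfigVersion="0.6",
    )
    ds = ET.SubElement(query, "Dataset", name=dataset, interface="default")
    for name, value in filters.items():
        if isinstance(value, bool):
            # Boolean filters: excluded="0" corresponds to "Only",
            # excluded="1" to "Excluded" in the web interface.
            ET.SubElement(ds, "Filter", name=name, excluded="0" if value else "1")
        else:
            ET.SubElement(ds, "Filter", name=name, value=str(value))
    for name in attributes:
        ET.SubElement(ds, "Attribute", name=name)
    body = ET.tostring(query, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n<!DOCTYPE Query>\n' + body


# All internal names below are assumptions; check them against the mart.
query_xml = build_biomart_query(
    dataset="drerio_gene_ensembl",  # assumed name of the zebrafish genes dataset
    filters={
        "chromosome_name": "9",     # REGION > Chromosome
        "with_tmhmm": True,         # assumed name of the transmembrane helices filter, "Only"
    },
    attributes=["ensembl_gene_id", "ensembl_transcript_id", "external_gene_name"],
)
print(query_xml)
```

Query text of this kind is what BioMart's programmatic interfaces accept; submitting it is left out here so that the example stays runnable offline.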

Go to the Ensembl homepage (http://www.ensembl.org) and click on BioMart at the top of the page. Select Ensembl Genes as your database and Zebrafish genes as the dataset. Click on Filters on the left of the screen and expand REGION. Change the Chromosome to 9.

Now expand PROTEIN DOMAINS AND FAMILIES, also under Filters. Select Limit to genes, choose With Transmembrane helices ID(s) from the drop-down, and then select Only. Clicking on Count should show that the filters have narrowed the dataset to 238 genes.

Click on Attributes and expand GENE. The Ensembl gene ID and transcript ID are selected by default, so you only need to add Gene name. Now click on Results. The first 10 results are displayed by default; display all results by selecting ALL from the drop-down menu.

The output will display the Ensembl gene ID, Ensembl transcript ID and gene name for every gene on zebrafish chromosome 9 that encodes a protein with transmembrane helices. If you prefer, you can also export the results as an Excel sheet by using the Export all results to XLS option.
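Because the output lists transcript IDs alongside gene IDs, a gene with several transcripts takes up several rows, so the number of rows can be larger than the gene count reported by Count. The sketch below groups rows by gene to make that visible. It assumes the results were exported as tab-separated text rather than XLS, and that the column headers are `Gene stable ID`, `Transcript stable ID` and `Gene name`; adjust these to match your file. The sample rows are made-up placeholders, not real BioMart results, and `summarise_export` is a hypothetical helper.

```python
import csv
import io

# Placeholder rows in the shape of a tab-separated export.
# The IDs and names are invented for illustration only.
SAMPLE_TSV = (
    "Gene stable ID\tTranscript stable ID\tGene name\n"
    "GENE_A\tTRANSCRIPT_A1\tgeneA\n"
    "GENE_A\tTRANSCRIPT_A2\tgeneA\n"
    "GENE_B\tTRANSCRIPT_B1\tgeneB\n"
)


def summarise_export(handle):
    """Group transcript IDs by (gene ID, gene name) from a TSV export."""
    reader = csv.DictReader(handle, delimiter="\t")
    transcripts_by_gene = {}
    for row in reader:
        gene = (row["Gene stable ID"], row["Gene name"])
        transcripts_by_gene.setdefault(gene, []).append(row["Transcript stable ID"])
    return transcripts_by_gene


summary = summarise_export(io.StringIO(SAMPLE_TSV))
print(len(summary), "genes")
for (gene_id, gene_name), transcripts in summary.items():
    print(gene_id, gene_name, len(transcripts), "transcript(s)")
```

To run this on a real export, replace `io.StringIO(SAMPLE_TSV)` with an open file handle, for example `open("mart_export.txt", newline="")`, where the file name is whatever you saved the export as.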