Ensembl TrainingEnsembl Home

<- Back to exercise page

BioMart Convert IDs

BioMart is a very handy tool when you want to convert IDs from different databases. The following is a list of 27 IDs of pig (Sus scrofa) proteins from the NCBI RefSeq database:

NP_001116455, NP_001191704, NP_001231885 NP_001191292, NP_001230616,NP_001231413, NP_001231746, NP_999129, NP_001231602, NP_001231584, NP_001177096, NP_001231419, NP_001230512, NP_001231165, NP_001167636, NP_001136139, NP_001172069, NP_001011509, NP_999191, NP_001231201, NP_001231786, NP_001231468, NP_001121951, NP_001230557, NP_001177223, NP_999413, NP_999251

Generate a list that shows to which Ensembl Gene IDs and to which gene names these RefSeq IDs correspond. Do these 27 proteins correspond to 27 genes?

Click New. Choose the ENSEMBL Genes database. Choose the Sscrofa 11.1 genes dataset.

Click on Filters in the left panel. Expand the GENE section by clicking on the + box. Select Input external references ID list - RefSeq peptide ID(s) and enter the list of IDs in the text box (either comma separated or as a list). HINT: You may have to scroll down the menu to see these. Count shows 26 genes (remember one gene may have multiple splice variants coding for different proteins, that is the reason why these 27 proteins do not correspond to 27 genes).

Click on Attributes in the left panel. Select the Features attributes page. Expand the External section by clicking on the + box. Select Gene name and RefSeq Peptide ID from the External References section.

Click the Results button on the toolbar. Select View All rows as HTML or export all results to a file.