New FlyMine Tutorials

The FlyMine tutorials have now been updated. Tutorials on using the QueryBuilder, working with bags and building your own templates have been added.


Release 5.0

FlyMine 5.0 adds genome annotation for D. pseudoobscura (release 2.0 from FlyBase) and A. mellifera (release 2.0 from Ensembl) with InParanoid orthologues to the other genomes in FlyMine. Also added are InParanoid orthologues to many other species and GO annotation for some of them. Interface improvements include the ability for users to tag favourite templates and to change their password.

  • Data
    • NEW: Genome annotation for D. pseudoobscura version 2.0 from FlyBase.
    • NEW: Genome annotation for A. Mellifera release 2.0 from Ensembl version 37.
    • NEW: InParanoid orthologues from D. melanogaster, D. pseudoobscura, A. gambiae, C. elegans and A. mellifera to many other organisms.
    • NEW: GO annotation (accessible via orthologues) for M. musculus, R. ratticus, S. cerevisiae and A. mellifera.
    • NEW: Protein interaction and UniProt data for S. cerevisiae.
    • NEW: IntergenicRegion class – objects to describe intergenic regions on chromosomes, sequence and overlapping features available. Genes now have references to upstream and downstream intergenic regions.
    • UPDATE: Genome annotation for A. gambiae updated to build AgamP3 from Ensembl version 37.
    • FIX: INDAC microarray oligos now have sequences.
    • FIX: OMIM disease names are no longer duplicated, new templates added to the Disease aspect.
    • FIX: mRNA sequences no longer include intron sequences.
    • FIX: Protein Interactions – some interactions referenced isoform ids so data didn’t merge correctly with other protein information. Now have correct UniProt accession.
    • FIX: Protein interactions are now correctly assigned to Formstecher et al, 2005 – was fixed in IntAct.
  • User Interface
    • NEW: Favourite templates – logged in users can see a * beside template query names to select then as favourites. They are listed in a new ‘Favourites’ tab on the History page.
    • NEW: Change password – users can now choose there own password when creating an account. Passwords can be changed when logged in by clicking on the username in the top menu bar.
    • FIX: Missing class descriptions completed.
    • NEW: Boolean fields are now constrained by a true/false dropdown.
  • Tools
    • FIX: GBrowse images on object details pages are now oriented correctly for features on reverse strand. Also now show absolute coordinates.
  • Known Issues
    • Due to a change in format of XML from IntAct there are no confidence scores for protein interaction experiments, we will release a fixed version as soon as possible.
    • Gene.goAnnotation collection not filled in for A. Gambiae genes, need to use Gene.annotations instead.

Release 4.0

Major changes to layout and content of summary pages. These pages can now display the results of template queries inline to give immediate access to data. The home page now shows aspect icons and query options. Many other interface improvements. Addition of Arbeitman et al, 2002 Drosophila time course microarray data and human disease gene matches from Homophila.

  • Data
    • NEW: microarray data from “Gene Expression During the Life Cycle of Drosophila melanogaster“, Arbeitman et al 2002.
    • NEW: added data human disease gene matches from Homophila (
    • NEW: more protein interaction experiments from IntAct – see Protein Interaction aspect.
    • NEW: Gene now has an “allGoAnnotation” collection that contains the term assigned to a gene plus all the parents of the term. Using this collection to find genes with a particular term will find all genes with that term *and* any with a more specific term.
    • ENHANCEMENT: MRNA now references threePrimeUTR and fivePrimeUTR.
    • ENHANCEMENT: D. melanogaster translation symbols now set from FlyBase.
    • ENHANCEMENT: D. melanogaster chromosome bands now loaded from FlyBase.
    • UPDATE: D. melanogaster genome annotation updated from 4.2 to 4.2.1.
    • UPDATE: GO annotation and UniProt proteins updated.
    • FIX: D. melanogaster regulatory regions from FlyBase now have chromosome location.
    • FIX: D. melanogaster insertion sites and point mutations from FlyBase reference genes correctly.
    • FIX: A. gambiae genes from Ensembl and InParanoid orthologues now merged correctly.
  • Tools
    • NEW: a graph of interacting proteins on protein summary pages.
    • NEW: graphs of D. melanogaster time course expression on gene details pages (Arbeitman et al, 2002).
    • NEW: FASTA retrieval icon to access sequence of all features with sequence.
    • FIX: insertion sites now displayed correctly on GBrowse.
  • User Interface
    • NEW: Template queries are now viewable inline on object summary pages.
    • NEW: bags saved in history can now be subtracted from one another.
    • NEW: Query constraint logic can now be manipulated within the query builder – can constrain A AND B/A OR B.
    • NEW: Added Run query link to run a query directly from the History page.
    • NEW: Template queries can now include a choice of several different output formats.
    • ENHANCEMENT: Template search terms are highlighted in results lists.
    • ENHANCEMENT: Template building integrated into query builder interface.
  • Known issues
    • DATA: Protein interactions from Formstecher et al, 2005 (PubMed: 15710747) are incorrectly assigned to Rain et al, 2001 (PubMed: 11196647) which in IntAct is a secondary reference.
    • DATA: OMIM disease names contain the same text repeated multiple times.

Release 3.0

  • Data – new and updated
    • UPDATE: FlyBase D. melanogaster genome annotation updated to release 4.2.
    • NEW: Microarray gene expression data – currently five experiments run at the FlyChip facility are available.
    • NEW: Protein domain and family information from InterPro.
    • NEW: DrosDel deletion data added.
    • NEW: D. melanogaster genes now have full name and wild type function set.
    • UPDATE: GO annotation changes:
      • D. melanogaster and C. elegans gene associations are now taken from FlyBase and WormBase respectively.
      • Gene now has a collection of GOAnnotations, an object which contains the GO term id, name, evidence code and text from ‘with’ reference if present. If ‘with’ is a Gene or Protein a reference to the object is added.
      • GOTerm entries now have a description, namespace (‘biological_process’, ‘cellular_component’ or ‘molecular_function’) and a collection of synonyms.

    NOTE – this list does not include all updates to the versions of existing data sources – for a full list of versions see the Datasources section on the FlyMine home page.

  • Data – fixes
    • FIX: is now Gene.symbol, added which has a longer text name.
    • FIX: Transcript.exonCount is now set for all Transcripts.
    • FIX: Only RepeatRegions predicted by TRF are retrieved for A. gambiae from Ensembl.
    • FIX: There are no longer links to UniProt with BDGP identifiers.
    • FIX: Incorrect start and end fields removed from RepeatRegion.
    • FIX: Removed duplicate synonyms for some Proteins.
  • Data – known issues
    • Five genes are annotated incorrectly in the FlyBase release 4.2 annotation. These are eys, Dimp, cp309, Nev3 and kay. It is a known problem with FlyBase and a fix is progress. In FlyMine the symbols(names) are modified so that they are unique and indicate that there is a problem. Examples are: Dimp-duplicate-symbol-1, Dimp-duplicate-symbol-2 instead of Dimp.
    • The latest version of InterPro (11.0) is not yet available as a mysql dump so we still have version 10.0.
    • INDAC microarray oligos were designed to FlyBase version 4.0 transcripts. This means a number now map to old transcripts. These oligos can be queried in FlyMine, however the transcript they are designed to will not have merged with a Gene.
    • A number of proteins from Uniprot have not merged with FlyBase genes. This means that for certain genes there are two objects – one from Uniprot with no other information.
    • Orthologues for A. gambiae did not load.
  • Interface – new features
    • NEW: The start page of FlyMine now has links to a series of ‘aspects’ which are convenient starting points for the different types of data in FlyMine – such as genomics, protein interactions and gene expression. Each aspect lists relevant data sets in FlyMine, template queries and suggested starting points for queries.
    • NEW: Template search box added to the front page.
    • NEW: User templates are now listed on the user’s History page.
    • NEW: Saved bags and saved queries can be renamed from the History page.
    • NEW: Executed queries are now automatically saved to a ‘session history’. The user can permanently save a query from the session history from the History page.
    • NEW: Saved queries and queries in the session history are now displayed with the date and time at which they were created.
    • NEW: Saved queries and queries in the session history are now displayed with their starting point and a summary of the fields selected for output.
    • NEW: You can now edit an existing constraint in the query builder by clicking the blue edit icon next to the constraint.
  • Interface – fixes
    • FIX: History page is now viewable to anonymous users.
    • FIX: Characters from the Greek alphabet within identifier names should now display correctly.
    • FIX: Users can now run separate queries in separate browser windows and view results simultaneously.
    • FIX: Exported results on Windows Internet Explorer now download with the correct file name.

Release 3.1

Data – fixes

  • FIX: D. melanogaster genes from FlyBase and UniProt are now merged correctly.
  • FIX: Missing orthologues from D. melanogaster to A. gambiae are now loaded.