Reference Data

Many Galaxy tools rely on the presence of reference data, such as alignment indexes or reference genome sequences, to efficiently work. A complete set of Reference Data, able to work with most common tools for NGS analysis is available for each Galaxy instance deployed.

The reference data are available for many species and shared among all the instances, avoiding unnecessary and costly data duplication, exploiting a CernVM-FS (CVMFS) repository.

Laniakea automatically configures Galaxy to properly use them.

By default Laniakea exploits the usegalaxy.org reference data, but for specific needs, e.g. new tools, it is possible to enable, using the Laniakea Dahsobard, different repositories:

Reference data indexes

Reference data indexes available for bowite

data.galaxyproject.org

Description:

The usegalaxy.org CVMFS repository hosts more than 4 TB of reference data. There are two primary directories in the reference data repository:

  • /managed: Data generated with Galaxy Data Managers, organized by data table (index format), then by genome build.

  • /byhand: Data generated prior to the existence/use of Data Managers, manually curated.

Currently, the Laniakea instances are preconfigured to mount /byhand data. More information can be found here.

For GDC Somatic Variant flavour (GDC Somatic Variant on Galaxy) Galaxy is configured to use also an additional gdc_tool_data_table_conf.xml, which can be found here.

elixir-italy.covacs.refdata

Description:

This repository hosts specific reference data for CoVaCS pipeline, Laniakea configure the CoVaCS flavours to consume these data.

Reference data cvmfs

Details

cvmfs repository name

elixir-italy.covacs.refdata

cvmfs server url

90.147.75.251

cvmfs config file

elixir-italy.covacs.refdata.conf

cvmfs key file

elixir-italy.covacs.refdata.pub

cvmfs proxy url

DIRECT

galaxy tool data table

tool-data-table.xml

elixir-italy.galaxy.refdata

Description:

This repository is recommended only for testing tools and is currently not available on the Laniakea Dashboard. It is used for those tools need to ship reference data still not in the Galaxy Official CVMFS.

Reference data cvmfs

Details

cvmfs repository name

elixir-italy.galaxy.refdata

cvmfs server url

90.147.102.186

cvmfs config file

elixir-italy.galaxy.refdata.conf

cvmfs key file

elixir-italy.galaxy.refdata.pub

cvmfs proxy url

DIRECT

galaxy tool data table

tool-data-table.xml

Supplementary information

References

Galaxyproject CVMFS

CernVM-FS

CVMFS documentation

Debugging CVMFS