Resources
- FAQ
- Biology-centric
- I want to learn how to do bioinformatics from the command line - how do I get started?
- What is the link to our local instance of Galaxy?
- I can’t find any help pages or tutorials…how can I get started?
- I don’t understand whether my data is of high quality or not…how can I check?
- How can I check genome wide coverage distribution?
- Data-centric
- Biology-centric
- Links
FAQ
Biology-centric
I want to learn how to do bioinformatics from the command line - how do I get started?
There web is a great resource to learn any and all things computational. Including some of the links found at the bottom of this page, a good place to start in the Biostars Handbook, authored by Istvan Albert of Penn State University. It contains practical tips for bioinformatics as well as analyzing biological data.
What is the link to our local instance of Galaxy?
If you have an account, you can access the local instance of Galaxy at tadpole.bx.psu.edu. Otherwise, many of the same tools are also available on the public instance at usegalaxy.org.
I can’t find any help pages or tutorials…how can I get started?
You can learn how to get started using the following links:
I don’t understand whether my data is of high quality or not…how can I check?
See Understanding FastQC Reports (also in the links below). Created by the developers of FastQC, they go on to describe common things that we can see went wrong in the initial QC reports generated using their very versatile tool.
How can I check genome wide coverage distribution?
There are several tools that allow you do qualitatively and quantitatively check this. For viewing purposes, you can view your alignment in IGV. But to quantify this you will want to either:
- Use a tool that checks various statistics for you
- Create a custom pipeline to check for you
Tools that are available to do this for you include deepTools and Picard Tools for whole genome and ChIP-seq data sets or RNA-SeQC, RSeQC, or QoRTs for RNA-seq.
Alternatively, if you know which regions of the genome you are interested in, you could also create a custome pipeline with BEDtools and calculate the statistics you care about.
Data-centric
How can I share my data with others?
Links
Essentials
- GeneDB: http://www.genedb.org
- PlasmoDB: http://plasmodb.org
- Biostars: https://www.biostars.org/
- SEQanswers: http://seqanswers.com
- OmicTools: http://omictools.com/
- SAM/BAM/CRAM/VCF Format Specs: https://samtools.github.io/hts-specs/
- SAMtools Flags Explained: https://broadinstitute.github.io/picard/explain-flags.html
Galaxy Resources
- Twitter: @galaxyproject
- Tutorials: http://galaxyproject.org/wiki/Learn
- Mailing List: http://user.list.galaxyproject/org
- Biostars - Galaxy Explained: https://biostar.usegalaxy.org/
- Galaxy NGS 101: https://wiki.galaxyproject.org/Learn/GalaxyNGS101
- Main Toolshed: https://toolshed.g2.bx.psu.edu/
- Test Toolshed: https://testtoolshed.g2.bx.psu.edu/
Public Galaxy Servers
- deepTools: http://deeptools.ie-freiburg.mpg.de/
Genome Browsers
- Integrative Genomics Viewer (IGV): https://www.broadinstitute.org/igv/
- Integrated Genome Browser (IGB): http://bioviz.org/igb/index.html
- UCSC Genome Browser: https://genome.ucsc.edu/
Useful Blogs
- Getting Genetics Done: http://gettinggeneticsdone.com
Miscellaneous
- Sequencing QC Fail: https://sequencing.qcfail.com/
Updated - 2016-04-08