Search our client documentation below, optionally filtered by one or more systems.
The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC.
The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC.
Search our client documentation below, optionally filtered by one or more systems.
STAR-Fusion is a component of the Trinity Cancer Transcriptome Analysis Toolkit (CTAT). STAR-Fusion uses the STAR aligner to identify candidate fusion transcripts supported by Illumina reads. STAR-Fusion further processes the output generated by the STAR aligner to map junction reads and spanning reads to a reference annotation set.
The following versions of STAR-Fusion are available on OSC clusters:
HOMER (Hypergeometric Optimization of Motif EnRichment) is a suite of tools for Motif Discovery and ChIP-Seq analysis. It is a collection of command line programs for unix-style operating systems written in mostly perl and c++. Homer was primarily written as a de novo motif discovery algorithm that is well suited for finding 8-12 bp motifs in large scale genomics data.
MIRA - Sequence assembler and sequence mapping for whole genome shotgun and EST / RNASeq sequencing data. Can use Sanger, 454, Illumina and IonTorrent data. PacBio: CCS and error corrected data usable, uncorrected not yet.
The following versions of MIRA are available on OSC clusters:
Trimmomatic performs a variety of useful trimming tasks for illumina paired-end and single ended data.The selection of trimming steps and their associated parameters are supplied on the command line.
The following versions of Trimmomatic are available on OSC clusters:
bam2fastq is used to extract raw sequences (with qualities) from programs like SAMtools, Picard, and Bamtools.
The following versions of bam2fastq are available on OSC clusters:
GMAP is a genomic mapping and alignment program for mRNA and EST sequences.
The following versions of GMAP are available on OSC clusters:
The FASTX-Toolkit is a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing.
The following versions of FASTX-Toolkit are available on OSC clusters:
SnpEff is a variant annotation and effect prediction tool. It annotates and predicts the effects of variants on genes (such as amino acid changes).
The following versions of SnpEff are available on OSC clusters:
The Sequence Read Archive (SRA Toolkit) stores raw sequence data from "next-generation" sequencing technologies including 454, IonTorrent, Illumina, SOLiD, Helicos and Complete Genomics. In addition to raw sequence data, SRA now stores alignment information in the form of read placements on a reference sequence. Use SRA Toolkit tools to directly operate on SRA runs.
RNA-SeQC is a java program which computes a series of quality control metrics for RNA-seq data. The input can be one or more BAM files. The output consists of HTML reports and tab delimited files of metrics data. This program can be valuable for comparing sequencing quality across different samples or experiments to evaluate different experimental parameters. It can also be run on individual samples as a means of quality control before continuing with downstream analysis.