We’ve included in the bin directory of the Seal distribution some of the utilities and scripts we use internally. These may not be all maintained (it depends on whether we’re using them at the moment). Use at your own risk!!
Note that some of these utilities don’t follow the Seal usage convention.
A utility useful for uploading files to HDFS.
Under the hood, it uses hadoop distcp, but it avoids you having to prepend file:// to local files.
bin/distcp_files filenames* hdfs_dest_directory
A script to perform read alignment using libbwa (the same one used by Seqal).
bin/align_script [OPTIONS] --reference=REFERENCE PRQFILE PRQFILE
You can also pipe it input in prq format via stdin.
Reads a fastq file and tries to determine whether the base qualities are encoded in Sanger or Illumina format.
Reformat a pair of qseq files (one for reads 1, one for reads 2) into a prq file. Read mates have to appear on the same line of their respective files.
bin/prq_local reads1.qseq reads2.qseq output.prq
Split a prq file into two fastq files.
bin/prq_to_fastq input.prq reads1.fastq reads2.fastq
Realign SNPs. See the documentation in the script file for details.