30 Run bactmap
Learning Objectives
- Generate consensus genomes for pneumococcus using reference-based alignment.
Remember to QC your sequencing reads
Remember, the first step of any analysis of a new sequence dataset is to perform Quality Control. In the interest of time, we’ve run bacQC for you and the results are in preprocessed/bacqc. Before you run bactmap, have a look at the read stats and species composition TSV files and make sure that the data look good before we go ahead and map them.
30.1 Summary
Key Points
- Obtaining genomes for a species such as pneumococcus can be done using reference-based alignment.
- We can use the same workflows and scripts covered so far:
  - avantonder/bacQC to perform sequence quality control on the raw sequencing reads.
  - nf-core/bactmap for generating consensus genome sequences based on mapping reads to a reference genome.
  - Use a custom script to loop through our samples and run seqtk comp, which estimates the fraction of missing data in our genomes.
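To make the idea of "fraction of missing data" concrete, here is a minimal Python sketch of the calculation. It is not the custom script used in this course and it does not call seqtk. It assumes that missing positions in a consensus genome are written as N (or as a gap, -), and the function names read_fasta and missing_fraction are hypothetical names chosen for illustration.

```python
from io import StringIO


def read_fasta(handle):
    """Yield (name, sequence) pairs from a FASTA-formatted file handle."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].split()
            name = header[0] if header else ""
            chunks = []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def missing_fraction(sequence, missing_chars="Nn-"):
    """Return the fraction of positions that are missing.

    Assumption: missing data is encoded as 'N', 'n' or '-'.
    """
    if not sequence:
        return 0.0
    missing = sum(sequence.count(char) for char in missing_chars)
    return missing / len(sequence)


# Toy example: two made-up consensus sequences, not real course data.
example = StringIO(">sampleA\nACGTNNNNAC\nGT\n>sampleB\nNNNN----\n")
for sample, seq in read_fasta(example):
    print(f"{sample}\t{len(seq)}\t{missing_fraction(seq):.3f}")
```

To try this on a real consensus FASTA file, replace the StringIO example with open("your_file.fa") and loop over the records in the same way. A sample with a high fraction of missing data has poor coverage of the reference and may need to be excluded from downstream analyses.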