This tutorial teaches how to mask low complexity regions in a FASTA file using BBMap.
1. How to Mask Low Complexity Regions in a FASTA File
First, we need to install BBMap, which comes with bbmask.sh to mask low complexity regions.
I would recommend using Bioconda to install it using the command below:
$ conda install -c bioconda bbmap
Now that we have bbmask, we can use it to mask a FASTA file using an entropy of 0.7
$ bbmask.sh in=INPUT out=MASKED_OUTPUT entropy=0.7
2. More Resources
- Fast Conversion of Lowercase Sequences to Uppercase in FASTA Format
- Easy NCBI Genome Download
- The Fastest Way to Read a FASTA in Python