SVDSS, Structural Variant Discovery from Sample-specific Strings.
Usage: * Index reference/sample: SVDSS index --fastq/--fasta /path/to/genome/file --index /path/to/output/index/file
Optional arguments: -b, --binary output index in binary format. Allows for another index to be appended to this index later. -a, --append /path/to/binary/index append to existing binary index.
Optional arguments: --clipped calls SVs from clipped SFS. --min-cluster-weight minimum number of supporting superstrings for a call to be reported. --min-sv-length minimum length of reported SVs. Default is 25. Values < 25 are ignored.
General options: --threads sets number of threads, default 4. --version print version information. --helpprint this help message.
先跟着GitHub上的步骤操作一波试试。
All of SVDSS steps must be run in the same directory so we always pass --workdir $PWD for every command.
构建参考基因组索引
1 2
mkdir 1.refer.index SVDSS index --fasta 00.data/genome/nip.t2t.fa --index 01.refer.index/refer.genome.fmds
373M的基因组运行结果如下:
1 2 3 4 5
SVDSS, Structural Variant Discovery from Sample-specific Strings. Mode: index [I] . [M::mr_insert_multi] Turn off parallelization for this batch as too few strings are left. [I] Complete. Runtime: 876 seconds.
Mapping
使用Minimap2将序列比对到参考基因组上获得BAM文件。
1 2 3
minimap2 -ax map-pb ref.fa pacbio-reads.fq > aln.sam # for PacBio CLR reads minimap2 -ax map-ont ref.fa ont-reads.fq > aln.sam # for Oxford Nanopore reads minimap2 -ax map-iclr ref.fa iclr-reads.fq > aln.sam # for Illumina Complete Long Reads