MinCEDは、完全なゲノムまたはメタゲノムからアセンブルされたコンティグなどの環境データセットで、Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) を見つけるプログラムである。
インストール
依存
#bioconda (link)
conda install -c bioconda minced
> minced
$ minced
MinCED, a program to find CRISPRs in shotgun DNA sequences or full genomes
Usage: minced [options] file.fa [outputFile.txt] [outputFile.gff]
Options: -searchWL Length of search window used to discover CRISPRs (range: 6-9). Default: 8
-minNR Minimum number of repeats a CRISPR must contain. Default: 3
-minRL Minimum length of the CRISPR repeats. Default: 23
-maxRL Maximum length of the CRISPR repeats. Default: 47
-minSL Minimum length of the CRISPR spacers. Default: 26
-maxSL Maximum length of the CRISPR spacers. Default: 50
-gff Output summary results in gff format containing
only the positions of the CRISPR arrays. Default: false
-gffFull Output detailed results in gff format containing
positions of CRISPR arrays and all repeat units. Default: false
-spacers Output a fasta formatted file containing the spacers. Default: false
-h --help Output this handy help message
--version Output version information
Examples: minced ecoli.fna
minced metagenome.fna
minced metagenome.fna metagenome.crisprs
minced metagenome.fna metagenome.crisprs metagenome.gff
実行方法
genomeのFASTAファイルを指定する。メタゲノム由来contigも使用できる。出力のgtfはなくても良い。
minced genome.fna out.txt out.gff
出力
引用
https://github.com/ctSkennerton/minced