hisat-3n/evaluation/tests/CODIS
2025-01-18 21:09:52 +08:00
..
genome.fa initial commit 2025-01-18 21:09:52 +08:00
genome.fa.fai initial commit 2025-01-18 21:09:52 +08:00
grch38 initial commit 2025-01-18 21:09:52 +08:00
hisatgenotype_convert_codis.py initial commit 2025-01-18 21:09:52 +08:00
hisatgenotype_extract_codis_data.py initial commit 2025-01-18 21:09:52 +08:00
hisatgenotype_extract_vars.py initial commit 2025-01-18 21:09:52 +08:00
hisatgenotype_locus.py initial commit 2025-01-18 21:09:52 +08:00
hisatgenotype_modules initial commit 2025-01-18 21:09:52 +08:00
README initial commit 2025-01-18 21:09:52 +08:00

To extract HLA alleles and variants

   ./hisatgenotype_extract_vars.py --base codis --inter-gap 30 --intra-gap 50 --min-var-freq 0.1
   ./hisatgenotype_extract_vars.py --base hla --locus-list A --inter-gap 30 --intra-gap 50 --min-var-freq 0.1


To build genotype genome

   ./hisatgenotype_build_genome.py genome.fa genotype_genome -p 3 --verbose


For DNA fingerprinting
   ./hisatgenotype_locus.py --verbose --base codis --locus-list D13S317 --fragment-len 200 --no-assembly --debug paired



To run HISAT-genotype
   ./hisatgenotype_locus.py --base hla --locus-list A --simulate-interval 10 --perbase-errorrate 0.3 --perbase-snprate 0.1 --num-editdist 2 --debug paired

   for i in `ls -l ILMN/NA128*1.fq.gz | awk '{print $NF}' | cut -d'/' -f2 | cut -d'.' -f1`; do echo $i; ./hisatgenotype_locus.py --locus-list A --aligner-list hisat2.graph --num-editdist 2 --reads ILMN/$i.extracted.1.fq.gz,ILMN/$i.extracted.2.fq.gz --assembly-base assembly_graph_$i 2> $i.tmp; done