3 File Formats
Here we have described in detail the acquisition method and formats of the input files and the output files.
Input files
File1: k-mers file (.Fasta)
example:
>1
ATCCGCAGCAGCCATATCCACAACAACCA
>1
AACAACAAGAACAACTCCCACAACAACTC
>1
ACATCATTGGCAAGGTCCGGAGTGCAATG
>1
GGCAATCGCTTTGTGGCCAGATTATCGAC
>1
CCAACTCCGGCGCCAACGCCGAATGGCAC
File2: sequencing data (.Fastq or .Fastq.gz)
-
example:
@A00164:490:HWTCWDSXX:2:1101:14877:1235 1:N:0:GGCTATAG+GTCCATCA
TAGCAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAGCAGCAG
+
FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:F:FFFFFFFFFFFFFFFFFF:FF:FFFFFFFFFFFFFFFF:FFFFFF:FFFFFFF:FFFFF,FFFFF,FFFFF,FF,FF,
@A00164:490:HWTCWDSXX:2:1101:9263:1251 1:N:0:GGCTATAG+GTCCATCA
TGGTGGTGGTGGTGGTGGTGGTGGTGGTGTTGGTGTTGCTGTTGCTGTAGTTGCTGCTGTTGTTGTTATTGTTGTTGTTGTTATTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGT
+
FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:F:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF,FFFFFFFFFFFFFF:FFF:FFF,FFF,
@A00164:490:HWTCWDSXX:2:1101:9254:1266 1:N:0:GGCTATAG+GTCCATCA
TGGTGGTGGTGGTGGTGGTGGTGGTGGTGTTGGTGTTGCTGTTGCTGTAGTTGCTGCTGTTGTTGTTATTGTTGTTGTTGTTATTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGGTGGTGG
Output Files
File1: All reads containing the input k-mer (.Fastq)
-
example:
@A00164:490:HWTCWDSXX:2:1102:24406:6997 1:N:0:GGCTATAG+GTCCCTCA
TCCCGATGCCCGGCCATCCACAACATTGTGCACGCCATCGTCGTTCAACAACAACATGTGGATAGAGGTTTCGGCCAGCCTCAACCACAACAGTTGGGCCAGGGAATGCCCATGCAGCCTCAATATCAATTGGGCCAGGGCTTTATCCTA
+
FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF,FFF:FFF
@A00164:490:HWTCWDSXX:2:1102:24406:6997 1:N:0:GGCTATAG+GTCCCTCA
TCCCGATGCCCGGCCATCCACAACATTGTGCACGCCATCGTCGTTCAACAACAACATGTGGATAGAGGTTTCGGCCAGCCTCAACCACAACAGTTGGGCCAGGGAATGCCCATGCAGCCTCAATATCAATTGGGCCAGGGCTTTATCCTA
+
FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF,FFF:FFF
File2: The counting of the occurrence frequency of k-mer (.txt)
-
example:
AAAAAAAACAACAACAACAACTATCGAGC 0
AAAAAAACAACAACAACAACTATCGAGCC 1
AAAAAAACACCCAACAACCACAACAAATC 0
AAAAAAATTCATTTCAGATGCAGCCAAAC 2
AAAAAACAACAACAACAACTATCGAGCCA 0
AAAAAACACCCAACAACCACAACAAATCC 10
AAAAAATAGGACAAGGGCAACAACCAGAA 0
AAAAAATAGGACAAGGGCAACAACCAGGA 0