Skip to content

Commit e7f46cb

Browse files
committedFeb 12, 2023
benchmarks updated
1 parent 0818558 commit e7f46cb

File tree

4 files changed

+54
-49
lines changed

4 files changed

+54
-49
lines changed
 

‎benchmarks/README.md

+6-1
Original file line numberDiff line numberDiff line change
@@ -15,8 +15,13 @@ The datasets can be downloaded from [zenodo](https://zenodo.org/record/7239205#.
1515

1616
For streaming queries we used FASTQ files downloaded from [ENA](https://www.ebi.ac.uk/ena/browser/home), using accession numbers:
1717

18-
- elegans: SRR16288382
18+
- elegans: SRR16288382
1919
- cod: SRR12858649
2020
- kestrel: SRR11449743
2121
- human: SRR5833294
2222

23+
The query times are relative to the following configuration:
24+
25+
- Processor: Intel i9-9900K @ 3.60 GHz;
26+
- Compiler: gcc 11.2.0;
27+
- OS: GNU/Linux 5.13.0-52-generic x86_64.

‎benchmarks/sshash.canon.streaming_query_log

+21-21
Original file line numberDiff line numberDiff line change
@@ -1,27 +1,27 @@
1-
2023-02-07 12:43:51: performing queries from file '/data2/DNA/queries/SRR5901135.fastq.gz'...
2-
2023-02-07 12:45:27: DONE
1+
2023-02-07 21:35:42: performing queries from file '/data2/DNA/queries/SRR16288382_1.fastq.gz'...
2+
2023-02-07 21:42:26: DONE
33
==== query report:
4-
num_kmers = 792953816
5-
num_positive_kmers = 11201 (0.00141257%)
6-
num_searches = 10923/11201 (97.5181%)
7-
num_extensions = 278/11201 (2.48192%)
8-
elapsed = 96434.8 millisec / 96.4348 sec / 1.60725 min / 121.615 ns/kmer
9-
2023-02-07 12:45:27: performing queries from file '/data2/DNA/queries/SRR5901135.fastq.gz'...
10-
2023-02-07 12:46:29: DONE
4+
num_kmers = 2849620920
5+
num_positive_kmers = 2475259037 (86.8627%)
6+
num_searches = 1340695937/2475259037 (54.1639%)
7+
num_extensions = 1134563100/2475259037 (45.8361%)
8+
elapsed = 404175 millisec / 404.175 sec / 6.73625 min / 141.835 ns/kmer
9+
2023-02-07 21:42:26: performing queries from file '/data2/DNA/queries/SRR16288382_1.fastq.gz'...
10+
2023-02-07 21:47:40: DONE
1111
==== query report:
12-
num_kmers = 719118597
13-
num_positive_kmers = 0 (0%)
14-
num_searches = 0/0 (-nan%)
15-
num_extensions = 0/0 (-nan%)
16-
elapsed = 61845.9 millisec / 61.8459 sec / 1.03077 min / 86.0024 ns/kmer
17-
2023-02-07 12:46:29: performing queries from file '/data2/DNA/queries/SRR5901135.fastq.gz'...
18-
2023-02-07 12:47:21: DONE
12+
num_kmers = 2469671464
13+
num_positive_kmers = 1995797497 (80.8123%)
14+
num_searches = 1050159375/1995797497 (52.6185%)
15+
num_extensions = 945638122/1995797497 (47.3815%)
16+
elapsed = 314614 millisec / 314.614 sec / 5.24357 min / 127.391 ns/kmer
17+
2023-02-07 21:47:40: performing queries from file '/data2/DNA/queries/SRR16288382_1.fastq.gz'...
18+
2023-02-07 21:53:39: DONE
1919
==== query report:
20-
num_kmers = 646255023
21-
num_positive_kmers = 0 (0%)
22-
num_searches = 0/0 (-nan%)
23-
num_extensions = 0/0 (-nan%)
24-
elapsed = 52134 millisec / 52.134 sec / 0.8689 min / 80.6709 ns/kmer
20+
num_kmers = 2089722008
21+
num_positive_kmers = 1567619555 (75.0157%)
22+
num_searches = 814271235/1567619555 (51.9432%)
23+
num_extensions = 753348320/1567619555 (48.0568%)
24+
elapsed = 359057 millisec / 359.057 sec / 5.98428 min / 171.82 ns/kmer
2525

2626
2023-02-07 12:47:21: performing queries from file '/data2/DNA/queries/SRR12858649.fastq.gz'...
2727
2023-02-07 12:47:48: DONE

‎benchmarks/sshash.regular.streaming_query_log

+21-21
Original file line numberDiff line numberDiff line change
@@ -1,27 +1,27 @@
1-
2023-02-06 19:49:08: performing queries from file '/data2/DNA/queries/SRR5901135.fastq.gz'...
2-
2023-02-06 19:54:04: DONE
1+
2023-02-07 21:12:01: performing queries from file '/data2/DNA/queries/SRR16288382_1.fastq.gz'...
2+
2023-02-07 21:19:13: DONE
33
==== query report:
4-
num_kmers = 792953816
5-
num_positive_kmers = 11201 (0.00141257%)
6-
num_searches = 10955/11201 (97.8038%)
7-
num_extensions = 246/11201 (2.19623%)
8-
elapsed = 295397 millisec / 295.397 sec / 4.92328 min / 372.527 ns/kmer
9-
2023-02-06 19:54:04: performing queries from file '/data2/DNA/queries/SRR5901135.fastq.gz'...
10-
2023-02-06 19:59:02: DONE
4+
num_kmers = 2849620920
5+
num_positive_kmers = 2475259037 (86.8627%)
6+
num_searches = 1399102570/2475259037 (56.5235%)
7+
num_extensions = 1076156467/2475259037 (43.4765%)
8+
elapsed = 432031 millisec / 432.031 sec / 7.20051 min / 151.61 ns/kmer
9+
2023-02-07 21:19:13: performing queries from file '/data2/DNA/queries/SRR16288382_1.fastq.gz'...
10+
2023-02-07 21:26:22: DONE
1111
==== query report:
12-
num_kmers = 719118597
13-
num_positive_kmers = 0 (0%)
14-
num_searches = 0/0 (-nan%)
15-
num_extensions = 0/0 (-nan%)
16-
elapsed = 298789 millisec / 298.789 sec / 4.97982 min / 415.493 ns/kmer
17-
2023-02-06 19:59:02: performing queries from file '/data2/DNA/queries/SRR5901135.fastq.gz'...
18-
2023-02-06 20:05:39: DONE
12+
num_kmers = 2469671464
13+
num_positive_kmers = 1995797497 (80.8123%)
14+
num_searches = 1079727026/1995797497 (54.1%)
15+
num_extensions = 916070471/1995797497 (45.9%)
16+
elapsed = 428959 millisec / 428.959 sec / 7.14932 min / 173.691 ns/kmer
17+
2023-02-07 21:26:22: performing queries from file '/data2/DNA/queries/SRR16288382_1.fastq.gz'...
18+
2023-02-07 21:35:42: DONE
1919
==== query report:
20-
num_kmers = 646255023
21-
num_positive_kmers = 0 (0%)
22-
num_searches = 0/0 (-nan%)
23-
num_extensions = 0/0 (-nan%)
24-
elapsed = 396728 millisec / 396.728 sec / 6.61214 min / 613.888 ns/kmer
20+
num_kmers = 2089722008
21+
num_positive_kmers = 1567619555 (75.0157%)
22+
num_searches = 830693880/1567619555 (52.9908%)
23+
num_extensions = 736925675/1567619555 (47.0092%)
24+
elapsed = 559344 millisec / 559.344 sec / 9.3224 min / 267.664 ns/kmer
2525

2626
2023-02-06 20:05:39: performing queries from file '/data2/DNA/queries/SRR12858649.fastq.gz'...
2727
2023-02-06 20:06:07: DONE

‎script/streaming_query.sh

+6-6
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,9 @@
22

33
### regular indexes
44

5-
./sshash query -i celegans.k31.sshash -q /data2/DNA/queries/SRR5901135.fastq.gz >> sshash.regular.streaming_query_log
6-
./sshash query -i celegans.k47.sshash -q /data2/DNA/queries/SRR5901135.fastq.gz >> sshash.regular.streaming_query_log
7-
./sshash query -i celegans.k63.sshash -q /data2/DNA/queries/SRR5901135.fastq.gz >> sshash.regular.streaming_query_log
5+
./sshash query -i celegans.k31.sshash -q /data2/DNA/queries/SRR16288382_1.fastq.gz >> sshash.regular.streaming_query_log
6+
./sshash query -i celegans.k47.sshash -q /data2/DNA/queries/SRR16288382_1.fastq.gz >> sshash.regular.streaming_query_log
7+
./sshash query -i celegans.k63.sshash -q /data2/DNA/queries/SRR16288382_1.fastq.gz >> sshash.regular.streaming_query_log
88

99
./sshash query -i cod.k31.sshash -q /data2/DNA/queries/SRR12858649.fastq.gz >> sshash.regular.streaming_query_log
1010
./sshash query -i cod.k47.sshash -q /data2/DNA/queries/SRR12858649.fastq.gz >> sshash.regular.streaming_query_log
@@ -20,9 +20,9 @@
2020

2121
### canonical indexes
2222

23-
./sshash query -i celegans.k31.canon.sshash -q /data2/DNA/queries/SRR5901135.fastq.gz >> sshash.canon.streaming_query_log
24-
./sshash query -i celegans.k47.canon.sshash -q /data2/DNA/queries/SRR5901135.fastq.gz >> sshash.canon.streaming_query_log
25-
./sshash query -i celegans.k63.canon.sshash -q /data2/DNA/queries/SRR5901135.fastq.gz >> sshash.canon.streaming_query_log
23+
./sshash query -i celegans.k31.canon.sshash -q /data2/DNA/queries/SRR16288382_1.fastq.gz >> sshash.canon.streaming_query_log
24+
./sshash query -i celegans.k47.canon.sshash -q /data2/DNA/queries/SRR16288382_1.fastq.gz >> sshash.canon.streaming_query_log
25+
./sshash query -i celegans.k63.canon.sshash -q /data2/DNA/queries/SRR16288382_1.fastq.gz >> sshash.canon.streaming_query_log
2626

2727
./sshash query -i cod.k31.canon.sshash -q /data2/DNA/queries/SRR12858649.fastq.gz >> sshash.canon.streaming_query_log
2828
./sshash query -i cod.k47.canon.sshash -q /data2/DNA/queries/SRR12858649.fastq.gz >> sshash.canon.streaming_query_log

0 commit comments

Comments
 (0)
Please sign in to comment.