Skip to content

Commit df37715

Browse files
committed
v2.5.0
1 parent bec639c commit df37715

File tree

3 files changed

+65
-40
lines changed

3 files changed

+65
-40
lines changed

CHANGELOG.md

+5-5
Original file line numberDiff line numberDiff line change
@@ -1,19 +1,19 @@
1-
- [SeqKit v2.5.0](https://github.com/shenwei356/seqkit/releases/tag/v2.5.0) - 2023-03-17
1+
- [SeqKit v2.5.0](https://github.com/shenwei356/seqkit/releases/tag/v2.5.0) - 2023-07-16
22
[![Github Releases (by Release)](https://img.shields.io/github/downloads/shenwei356/seqkit/v2.5.0/total.svg)](https://github.com/shenwei356/seqkit/releases/tag/v2.5.0)
33
- new command `seqkit merge-slides`: merge sliding windows generated from seqkit sliding. [#390](https://github.com/shenwei356/seqkit/issues/390)
44
- `seqkit stats`:
5-
- a new flag `-N/--N` for appending other N50-like stats as new columns. [#393](https://github.com/shenwei356/seqkit/issues/393)
5+
- added a new flag `-N/--N` for appending other N50-like stats as new columns. [#393](https://github.com/shenwei356/seqkit/issues/393)
66
- added a progress bar for > 1 input files.
7-
- write the result of each file immediately (no output buffer).
7+
- write the result of each file immediately (no output buffer) when using `-T/--tabular`.
88
- `seqkit translate`:
99
- add options `-s/--out-subseqs` and `-m/--min-len` to write ORFs longer than `x` amino acids as individual records. [#389](https://github.com/shenwei356/seqkit/issues/389)
1010
- `seqkit sum`:
11-
- do not remove possible '*' by default. Thanks to @photocyte. [#399](https://github.com/shenwei356/seqkit/issues/399)
11+
- do not remove possible '*' by default and delete confusing warnings. Thanks to @photocyte. [#399](https://github.com/shenwei356/seqkit/issues/399)
1212
- added a progress bar for > 1 input files.
1313
- `seqkit pair`:
1414
- remove the restriction of requiring FASTQ format, i.e., FASTA files are also supported.
1515
- `seqkit seq`:
16-
- update help message. [#387](https://github.com/shenwei356/seqkit/issues/387)
16+
- update help messages. [#387](https://github.com/shenwei356/seqkit/issues/387)
1717
- `seqkit fxtab`:
1818
- faster alphabet computation (`-a/--alphabet`) with a new data structure. Thanks to @elliotwutingfeng [#388](https://github.com/shenwei356/seqkit/pull/388)
1919
- `seqkit subseq`:

doc/docs/download.md

+52-32
Original file line numberDiff line numberDiff line change
@@ -9,39 +9,26 @@ Please cite: **W Shen**, S Le, Y Li\*, F Hu\*. SeqKit: a cross-platform and ultr
99

1010
## Current Version
1111

12-
- [SeqKit v2.4.0](https://github.com/shenwei356/seqkit/releases/tag/v2.4.0) - 2023-03-17
13-
[![Github Releases (by Release)](https://img.shields.io/github/downloads/shenwei356/seqkit/v2.4.0/total.svg)](https://github.com/shenwei356/seqkit/releases/tag/v2.4.0)
14-
- `seqkit`:
15-
- **support `bzip2` format**. [#361](https://github.com/shenwei356/seqkit/issues/361)
16-
- support setting compression level for `gzip`, `zstd`, and `bzip2` format via `--compress-level`. [#320](https://github.com/shenwei356/seqkit/issues/320)
17-
- the global flag `--infile-list` accepts stdin (`-`) now.
18-
- wrap the help message of flags.
19-
- `seqkit locate`:
20-
- **do not remove embeded regions when searching with regular expressions**. [#368](https://github.com/shenwei356/seqkit/issues/368)
21-
- `seqkit amplicon`:
22-
- fix BED coordinates for amplicons found in the minus strand. [#367](https://github.com/shenwei356/seqkit/issues/367)
23-
- `seqkit split`:
24-
- fix forgetting to add extension for `--two-pass`. [#332](https://github.com/shenwei356/seqkit/issues/332)
12+
- [SeqKit v2.5.0](https://github.com/shenwei356/seqkit/releases/tag/v2.5.0) - 2023-07-16
13+
[![Github Releases (by Release)](https://img.shields.io/github/downloads/shenwei356/seqkit/v2.5.0/total.svg)](https://github.com/shenwei356/seqkit/releases/tag/v2.5.0)
14+
- new command `seqkit merge-slides`: merge sliding windows generated from seqkit sliding. [#390](https://github.com/shenwei356/seqkit/issues/390)
2515
- `seqkit stats`:
26-
- fix compute Q1 and Q3 of sequence length for one record. [#353](https://github.com/shenwei356/seqkit/issues/353)
27-
- `seqkit grep`:
28-
- fix count number (`-C`) for matching with mismatch (`-m > 0`). [#370](https://github.com/shenwei356/seqkit/issues/370)
29-
- `seqkit replace`:
30-
- **add some flags to match partly records to edit**; these flags are transplanted from `seqkit grep`. [#348](https://github.com/shenwei356/seqkit/issues/348)
31-
- `seqkit faidx`:
32-
- **allow empty lines at the end of sequences**.
33-
- `seqkit faidx/sort/shuffle/split/subseq`:
34-
- **new flag `-U/--update-faidx`: update the FASTA index file if it exists, to guarantee the index file matches the FASTA files**. [#364](https://github.com/shenwei356/seqkit/issues/364)
35-
- improve log info and update help message. [#365](https://github.com/shenwei356/seqkit/issues/365)
36-
- `seqkit seq`:
37-
- allow filtering sequences of length zero. thanks to @penglbio.
38-
- `seqkit rename`:
39-
- new flag `-s/--separator` for setting separator between original ID/name and the counter (default "_"). [#360](https://github.com/shenwei356/seqkit/issues/360)
40-
- new flag `-N/--start-num` for setting starting count number for duplicated IDs/names (default 2). [#360](https://github.com/shenwei356/seqkit/issues/360)
41-
- new flag `-1/--rename-1st-rec` for renaming the first record as well. [#360](https://github.com/shenwei356/seqkit/issues/360)
42-
- do not append space if there's no description after the sequene ID.
43-
- `seqkit sliding`:
44-
- new flag `-S/--suffix` for change the suffix added to the sequence ID (default: "_sliding").
16+
- added a new flag `-N/--N` for appending other N50-like stats as new columns. [#393](https://github.com/shenwei356/seqkit/issues/393)
17+
- added a progress bar for > 1 input files.
18+
- write the result of each file immediately (no output buffer) when using `-T/--tabular`.
19+
- `seqkit translate`:
20+
- add options `-s/--out-subseqs` and `-m/--min-len` to write ORFs longer than `x` amino acids as individual records. [#389](https://github.com/shenwei356/seqkit/issues/389)
21+
- `seqkit sum`:
22+
- do not remove possible '*' by default and delete confusing warnings. Thanks to @photocyte. [#399](https://github.com/shenwei356/seqkit/issues/399)
23+
- added a progress bar for > 1 input files.
24+
- `seqkit pair`:
25+
- remove the restriction of requiring FASTQ format, i.e., FASTA files are also supported.
26+
- `seqkit seq`:
27+
- update help messages. [#387](https://github.com/shenwei356/seqkit/issues/387)
28+
- `seqkit fxtab`:
29+
- faster alphabet computation (`-a/--alphabet`) with a new data structure. Thanks to @elliotwutingfeng [#388](https://github.com/shenwei356/seqkit/pull/388)
30+
- `seqkit subseq`:
31+
- accept reverse coordinates in BED/GTF. [#392](https://github.com/shenwei356/seqkit/issues/392)
4532

4633
4734
### Links
@@ -176,6 +163,39 @@ fish:
176163

177164
## Release history
178165

166+
- [SeqKit v2.4.0](https://github.com/shenwei356/seqkit/releases/tag/v2.4.0) - 2023-03-17
167+
[![Github Releases (by Release)](https://img.shields.io/github/downloads/shenwei356/seqkit/v2.4.0/total.svg)](https://github.com/shenwei356/seqkit/releases/tag/v2.4.0)
168+
- `seqkit`:
169+
- **support `bzip2` format**. [#361](https://github.com/shenwei356/seqkit/issues/361)
170+
- support setting compression level for `gzip`, `zstd`, and `bzip2` format via `--compress-level`. [#320](https://github.com/shenwei356/seqkit/issues/320)
171+
- the global flag `--infile-list` accepts stdin (`-`) now.
172+
- wrap the help message of flags.
173+
- `seqkit locate`:
174+
- **do not remove embeded regions when searching with regular expressions**. [#368](https://github.com/shenwei356/seqkit/issues/368)
175+
- `seqkit amplicon`:
176+
- fix BED coordinates for amplicons found in the minus strand. [#367](https://github.com/shenwei356/seqkit/issues/367)
177+
- `seqkit split`:
178+
- fix forgetting to add extension for `--two-pass`. [#332](https://github.com/shenwei356/seqkit/issues/332)
179+
- `seqkit stats`:
180+
- fix compute Q1 and Q3 of sequence length for one record. [#353](https://github.com/shenwei356/seqkit/issues/353)
181+
- `seqkit grep`:
182+
- fix count number (`-C`) for matching with mismatch (`-m > 0`). [#370](https://github.com/shenwei356/seqkit/issues/370)
183+
- `seqkit replace`:
184+
- **add some flags to match partly records to edit**; these flags are transplanted from `seqkit grep`. [#348](https://github.com/shenwei356/seqkit/issues/348)
185+
- `seqkit faidx`:
186+
- **allow empty lines at the end of sequences**.
187+
- `seqkit faidx/sort/shuffle/split/subseq`:
188+
- **new flag `-U/--update-faidx`: update the FASTA index file if it exists, to guarantee the index file matches the FASTA files**. [#364](https://github.com/shenwei356/seqkit/issues/364)
189+
- improve log info and update help message. [#365](https://github.com/shenwei356/seqkit/issues/365)
190+
- `seqkit seq`:
191+
- allow filtering sequences of length zero. thanks to @penglbio.
192+
- `seqkit rename`:
193+
- new flag `-s/--separator` for setting separator between original ID/name and the counter (default "_"). [#360](https://github.com/shenwei356/seqkit/issues/360)
194+
- new flag `-N/--start-num` for setting starting count number for duplicated IDs/names (default 2). [#360](https://github.com/shenwei356/seqkit/issues/360)
195+
- new flag `-1/--rename-1st-rec` for renaming the first record as well. [#360](https://github.com/shenwei356/seqkit/issues/360)
196+
- do not append space if there's no description after the sequene ID.
197+
- `seqkit sliding`:
198+
- new flag `-S/--suffix` for change the suffix added to the sequence ID (default: "_sliding").
179199
- [SeqKit v2.3.1](https://github.com/shenwei356/seqkit/releases/tag/v2.3.1) - 2022-09-22
180200
[![Github Releases (by Release)](https://img.shields.io/github/downloads/shenwei356/seqkit/v2.3.1/total.svg)](https://github.com/shenwei356/seqkit/releases/tag/v2.3.1)
181201
- `seqkit grep/locate`: fix bug of FMIndex building for empty sequences. [#321](https://github.com/shenwei356/seqkit/issues/321)

doc/docs/usage.md

+8-3
Original file line numberDiff line numberDiff line change
@@ -158,7 +158,7 @@ reproduced in different environments with same random seed.
158158
``` text
159159
SeqKit -- a cross-platform and ultrafast toolkit for FASTA/Q file manipulation
160160
161-
Version: 2.4.0
161+
Version: 2.5.0
162162
163163
Author: Wei Shen <[email protected]>
164164
@@ -185,7 +185,7 @@ Compression level:
185185
bzip 1-9 6 https://github.com/dsnet/compress
186186
187187
Usage:
188-
seqkit [command]
188+
seqkit [command]
189189
190190
Available Commands:
191191
amplicon extract amplicon (or specific region around it) via primer(s)
@@ -204,6 +204,7 @@ Available Commands:
204204
head print first N FASTA/Q records
205205
head-genome print sequences of the first genome with common prefixes in name
206206
locate locate subsequences/motifs, mismatch allowed
207+
merge-slides merge sliding windows generated from seqkit sliding
207208
mutate edit sequence (point mutation, insertion, deletion)
208209
pair match up paired-end reads from two fastq files
209210
range print FASTA/Q records in a range (start:end)
@@ -266,7 +267,7 @@ Human genome from [ensembl](http://uswest.ensembl.org/info/data/ftp/index.html)
266267
- [`Homo_sapiens.GRCh38.84.gtf.gz`](ftp://ftp.ensembl.org/pub/release-84/gtf/homo_sapiens/Homo_sapiens.GRCh38.84.gtf.gz)
267268
- `Homo_sapiens.GRCh38.84.bed.gz` is converted from `Homo_sapiens.GRCh38.84.gtf.gz`
268269
by [`gtf2bed`](http://bedops.readthedocs.org/en/latest/content/reference/file-management/conversion/gtf2bed.html?highlight=gtf2bed)
269-
with command
270+
with the command
270271

271272
zcat Homo_sapiens.GRCh38.84.gtf.gz \
272273
| gtf2bed --do-not-sort \
@@ -3899,6 +3900,10 @@ Flags:
38993900
39003901
```
39013902

3903+
Example:
3904+
3905+
seqkit merge-slides sliding_windows.tsv -l 50 -o sliding_windows.merged.tsv
3906+
39023907
## genautocomplete
39033908

39043909
Usage

0 commit comments

Comments
 (0)