Releases: revdotcom/fstalign

Bullseye and Bookworm flavors

07 Jan 22:12

  • Added support for Bullseye and Bookworm based docker images

New major release : v 2.0 !

01 May 20:03
82dec1d

Version 2.0 introduces two major changes:

  1. A new method to traverse the composition graph, which dramatically improves overall speed, especially when the sequences are long and contain many errors. Files that previously took 25 minutes to align can now take about 7 seconds. This is especially noticeable with the adapted composition (the default).
  2. Smarter substitution rules apply when --use-case and --use-punctuation are enabled. By default, punctuation symbols can now only be substituted by other punctuation symbols (or deleted/inserted). Also, words that differ only in the case of their first letter are preferred for substitution.

These behaviors, as well as the beam size (which has a default value of 50.0), can be controlled with the following new parameters:

  --disable-strict-punctuation  Disable strict punctuation alignment (which prevents punctuation aligning with words).
  --disable-favored-subs        Disable favored substitutions (which makes alignment favor substitutions between words which differ only by case).
  --favored-sub-cost FLOAT      Cost for favored substitutions (e.g., case diff). Default: 0.1

See the README.md for more details.
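
As a rough illustration of the two substitution rules described above, here is a minimal Python sketch. It is not fstalign's actual implementation: the function names, the use of an infinite cost to forbid a substitution, and the regular substitution cost of 1.0 are all assumptions made for this example. Only the 0.1 favored-substitution default comes from the release notes.

```python
import math
import string

PUNCTUATION = set(string.punctuation)


def is_punctuation(token: str) -> bool:
    """True when the token consists solely of ASCII punctuation characters."""
    return len(token) > 0 and all(ch in PUNCTUATION for ch in token)


def differs_only_by_first_letter_case(a: str, b: str) -> bool:
    """True for pairs such as "The"/"the": same text except the first letter's case."""
    if a == b or len(a) != len(b) or not a:
        return False
    return a[0] != b[0] and a[0].lower() == b[0].lower() and a[1:] == b[1:]


def substitution_cost(
    ref: str,
    hyp: str,
    strict_punctuation: bool = True,   # hypothetical switch mirroring --disable-strict-punctuation
    favored_subs: bool = True,         # hypothetical switch mirroring --disable-favored-subs
    favored_sub_cost: float = 0.1,     # default stated for --favored-sub-cost
    default_sub_cost: float = 1.0,     # assumed value, not taken from the release notes
) -> float:
    """Return an illustrative cost for substituting `ref` with `hyp`."""
    if ref == hyp:
        return 0.0
    # Strict punctuation: punctuation may only be substituted by punctuation.
    if strict_punctuation and is_punctuation(ref) != is_punctuation(hyp):
        return math.inf
    # Favored substitution: cheaper when only the first letter's case differs.
    if favored_subs and differs_only_by_first_letter_case(ref, hyp):
        return favored_sub_cost
    return default_sub_cost
```

With these assumptions, `substitution_cost("The", "the")` yields the favored cost, while `substitution_cost(",", "word")` is forbidden unless `strict_punctuation=False` is passed.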

1.14.0

31 Oct 18:56
3446afd

What's Changed

  • Allow flexible forwarding of NLP columns to SBS by @nishchalb in #56

Full Changelog: 1.13.0...1.14.0

fstalign-1.13.0

18 Apr 19:30
363deb8

What's Changed

  • Update license copyright year(s) by @github-actions in #53
  • Bug: WER sidecar info not appearing in SBS by @nishchalb in #55

Full Changelog: 1.12.0...1.13.0

1.12.0

19 Oct 19:23
218bb25

What's Changed

  • FSTALIGN-37: Add flag in fstalign to allow for case-sensitive testing by @pique0822 in #51
  • FSTALIGN-37: Add flag in fstalign to allow for case-sensitive testing (cont.) by @pique0822 in #52

Full Changelog: 1.11.0...1.12.0

fstalign-1.11.0

11 Sep 23:28
5c5a150

Release notes - Rev FST Aligner - 1.11.0

Improvement

FSTALIGN-61 Minimize explicit heap allocations (new) and leverage more managed memory semantics

FSTALIGN-62 Migrate to the Debian Buster and newer OpenFST

FSTALIGN-63 Add a way to preserve inserts in NLP output.

fstalign-1.10.0

06 Jun 17:01
9008d82

What's Changed

  • Updated nlp file that was missing confidence column by @dchen579 in #43
  • Accept references in FST format by @dchen579 in #44
  • FSTALIGN-61: Fewer explicit heap allocations by @dchen579 in #45

Full Changelog: 1.9.0...1.10.0

fstalign-1.9.0

20 Apr 21:19
4c579ad

What's Changed

  • Update license copyright year(s) by @github-actions in #38
  • Update FST usage to mention composition approach by @nishchalb in #41
  • Pass through confidence to aligned NLP output by @nishchalb in #42
  • Nerd-1422: Add flag for reading punctuation from nlp as separate tokens by @ajhinsvark in #10

New Contributors

  • @github-actions made their first contribution in #38

Full Changelog: 1.8.0...1.9.0

fstalign-1.8.0

22 Sep 18:28
467e2ea

What's Changed

Full Changelog: 1.7.0...1.8.0

fstalign-1.7.0

27 Jun 15:21
3796629

What's Changed

Full Changelog: 1.6.1...1.7.0