142 Commits

Author SHA1 Message Date
vadimdddd
b8a88cc30c
Add gradio recognizer (#1114)
Demonstrate speech recognition in gradio
2022-08-19 17:31:47 +03:00
Nickolay V. Shmyrev
be0b117711 Add training colab 2022-08-17 19:11:45 +02:00
Nickolay V. Shmyrev
df0ee24084 Add adaptation colab 2022-08-17 12:38:17 +02:00
Nickolay V. Shmyrev
d3d8f53156 Add Kaldi build colab 2022-08-17 10:02:43 +02:00
vadimdddd
6eee303d7e
Fix decoding files with spaces (#1084)
Fix decoding files with spaces
2022-08-02 20:56:04 +03:00
Nickolay Shmyrev
053d71f5aa Bug fix in notebook 2022-08-01 22:59:38 +02:00
Nickolay Shmyrev
4fbbf5882c Add colab demo 2022-08-01 22:52:57 +02:00
vadimdddd
f63b015284
Add new dependency websockets (#1079)
add websockets to dependencies
2022-07-19 19:58:24 +03:00
Nickolay Shmyrev
0a9672d910 Attempt to build manylinux wheel 2022-07-15 00:40:38 +02:00
Nickolay Shmyrev
4ccccd0cd2 Bump version 2022-07-14 21:38:36 +02:00
Nickolay Shmyrev
1475b0e986 Async ffmpeg for trancriber and code refactoring 2022-07-05 01:13:05 +02:00
vadimdddd
8ceab0b9b1
Add vosk-server to transcriber (#1024)
Allow to use vosk-server for transcription
2022-07-04 13:50:38 +03:00
vadimdddd
983519e629
Add compatibility with python 3.5 issue(#1021) (#1041)
Added format for python3.5
2022-06-29 18:00:40 +03:00
Nickolay Shmyrev
a7bc5a22d4 We support Polish now 2022-06-05 23:03:57 +02:00
vadimdddd
630edeb3d6
fixed error if parent directory doesn't exist #988 (#990)
Create missing parent directory
2022-06-01 17:52:16 +03:00
Nickolay Shmyrev
b1b216d4c8 Bump versions 2022-05-26 15:22:16 +02:00
Nickolay Shmyrev
55dd29b0ff Fix transcriber bugs while processing folder 2022-05-26 14:05:54 +02:00
Nickolay Shmyrev
298c86d0d4 Bump version and add python requirements 2022-05-26 12:46:06 +02:00
Nickolay Shmyrev
859420809b Add extra comments 2022-05-26 11:45:16 +02:00
Nickolay Shmyrev
0fe3a89768 Update to 0.3.40 2022-05-26 11:41:14 +02:00
Nickolay Shmyrev
5b892fbfc5 Architecture should be universal2 2022-05-25 19:52:05 +03:00
Nickolay Shmyrev
fb4ed21a7f Shave transcriber code 2022-05-24 22:53:57 +02:00
vadimdddd
4209f3a9fe
Vosk model loader(#871) (#941)
* methods get_model_by_name, get_model_by_lang, get_model were added into the model class

* importing modules changed to using components; introduced constant MODELS_HOME_DIR; simplified code

* added new model folders into init; changed samples and transcriber bin for new mode loader

* changed back in cli.py lang arg to args.lang

* added 3 directories instead of 1 to check for models

* cli.py: added 3 args instead of 1 for model; __init__.py: changed script get_model_path for run get_model_by_name/lang inside current directory

* deleted default env var

* cli.py: changed arg_name; __init__.py: changed const name, changed model loading only for last directory

* deleted unused method

* changed by_name, by_lang methods, added download_model method

* deleted env variable initialization

* deleted print()

* deteled unused modules

* added progress_bar, added folder AppData/Local/vosk for model search

* changed download_model methond; added my_hook method
2022-05-24 21:06:13 +03:00
Nickolay Shmyrev
def8c93711 Remove semicolon 2022-05-24 15:23:03 +03:00
Nickolay Shmyrev
f73088da58 We create universal binary on OSX 2022-05-24 15:11:05 +03:00
Nickolay Shmyrev
97d737a30a Bump version 2022-05-15 23:36:46 +02:00
Nickolay Shmyrev
b0146782d6 Reorganize transcriber binary 2022-04-29 03:01:09 +02:00
vadimdddd
9d94746479
Add transcriber tool (#851)
Add transcriber tool
2022-04-20 14:48:22 +03:00
Nickolay V. Shmyrev
a87f2e1e07 Czech model 2022-04-13 22:31:45 +02:00
Nickolay Shmyrev
7b7d814484 Introduce incremental decoder with confidences in partial results 2022-04-07 01:07:47 +02:00
Nickolay Shmyrev
22cb90de4a Add Hindi 2022-03-17 20:08:39 +01:00
Nickolay Shmyrev
a57a84f90e Refactor GPU API to hide the ID and keep it closer to CPU recognizer 2022-03-03 21:09:09 +01:00
Nickolay Shmyrev
79b8395be0 Add NLSML output 2022-02-03 23:08:09 +01:00
Nickolay Shmyrev
d2c11a611f Read list of files from arguments 2022-01-30 22:57:36 +01:00
Nickolay Shmyrev
a1eac015dc Add Esperanto 2022-01-07 16:27:57 +01:00
Nickolay Shmyrev
64dfc65d51 Merge branch 'master' of github.com:alphacep/vosk-api 2022-01-05 20:32:25 +01:00
Nickolay Shmyrev
70d5cbd0e0 Update README with Japanese 2022-01-05 20:32:08 +01:00
Nickolay Shmyrev
72bf210164 Put the demo into main folder 2021-12-24 01:07:38 +01:00
Nickolay Shmyrev
cb0f8e6411 Per-stream wait API 2021-12-23 22:34:47 +01:00
Nickolay Shmyrev
848b2dc753 Expose results in Python 2021-12-17 22:57:00 +01:00
Nickolay Shmyrev
60f0396fe0 Reset lattice on endpoint 2021-12-17 01:13:09 +01:00
Nickolay Shmyrev
344e137a61 Decoding works, results are empty yet 2021-12-13 01:21:59 +01:00
Nickolay Shmyrev
6977be7fb7 Batch recognizer draft 2021-12-12 21:37:44 +01:00
Nickolay Shmyrev
5e46825474 Add try/catch wrapper for C++ method to raise native exceptions. Python and Java
are implemented, others on the way
2021-10-12 22:31:36 +02:00
Nickolay Shmyrev
fcab5a9581 Revert CFFI bump 2021-10-10 22:09:55 +02:00
Nickolay Shmyrev
e7f5e0ac23 Bump cffi version 2021-10-10 21:21:32 +02:00
Nickolay Shmyrev
e2af710369 Rework rescoring for faster and more accurate results 2021-08-31 21:56:21 +02:00
Lars Kiesow
2349e66a97
Subtitles require word times (#607)
This is a port of the recent addition of commit 7ccf743, adding
`KaldiRecognizer.SetWords(True)` to the other examples dealing with
subtitles to the WebVTT example.

Without this, the example will not work with the most recent `vosk`
(0.3.30) Python package.
2021-06-27 18:06:11 +03:00
Lars Kiesow
02ef49f67e
Allow Saving WebVTT (#605)
This patch is a small extension to the WebVTT example which allows to
directly save the WebVTT output to a file like this:

    ./test_webvtt.py test.wav out.vtt
2021-06-24 19:35:30 +03:00
Lars Kiesow
7cdf8f1d03
Add Python WebVTT Example (#601)
This patch adds an example for using webvtt-py to generate WebVTT files
from Vosk output. This is similar to the SRT example but still very
useful for generating an example video subtitle usable in web contexts.
2021-06-23 01:16:33 +03:00