162 Commits

Author SHA1 Message Date
Nickolay Shmyrev
d497c3a0a5 Move to 3-value endpointer config 2023-12-13 04:29:21 +01:00
Nickolay Shmyrev
f37ee638a4 Add endpointer delays parameter 2023-11-27 15:02:55 +01:00
Nickolay Shmyrev
e0b29bb17f Implement more endpointer modes 2023-11-10 01:58:31 +01:00
Nickolay Shmyrev
c3e3fc002f Add online endpointer reconfiguration 2023-10-16 13:40:32 +02:00
Nickolay Shmyrev
62fc308bdf Sometimes python interpreter destroys the handle on exit before
the object. See

https://foss.heptapod.net/pypy/cffi/-/issues/340
2023-09-04 00:33:42 +02:00
Anton Romanov
128c216c61
Fix vosk-transcriber cli json format (#1304)
Fix error UnboundLocalError: local variable 'monologue' referenced before assignment in transcriber.py line 102
2023-03-29 03:12:16 +03:00
Nickolay Shmyrev
790c564d14 Ignore non-audio files in folder transcription 2023-02-10 00:57:14 +01:00
Nickolay Shmyrev
800d3d7089 Bump version 2022-12-14 17:38:53 +01:00
Edwin Hendriks
25c59b52e3
add -m (language model) argument to test_microphone.py (#1219)
* added language model as parameter

* fix for more help explanation
2022-12-07 22:53:21 +03:00
Nickolay Shmyrev
4c72097478 Allow to configure GPU model path. See https://github.com/alphacep/vosk-server/issues/207 2022-11-19 01:13:42 +01:00
Tomas Mehdi
1053cfa0f8
JSON output_type with texts (#1180) 2022-10-26 22:41:04 +03:00
gmat
21a42cb6cd
new output type json for gecko project (#1166)
add an output type `json` to get an output file compatible with the transcription file format for the gecko project annotation https://github.com/gong-io/gecko
2022-10-06 16:27:29 +03:00
Nickolay Shmyrev
16c4a0d985 Bump version 2022-09-25 21:00:24 +02:00
Nickolay Shmyrev
298253401a Update approach to machine detection on cross-compilation
See for details issue #1155
2022-09-25 21:00:24 +02:00
Nickolay Shmyrev
32aa980069 Allow to change grammar of existing recognizer. Probably can be faster though
if we do not reinit feature pipeline
2022-09-13 23:12:13 +02:00
vadimdddd
7b4d396eb1
fix quotes and import's order (#1143) 2022-09-13 21:51:45 +03:00
vadimdddd
36968fbb30
Fix python style issues (#1140)
* pylint_fix v.2

* init: fix quotes
2022-09-12 23:08:59 +03:00
vadimdddd
7474888801
Fix python style issues (#1131) 2022-09-09 00:13:02 +03:00
vadimdddd
d46b7a43eb
Split results by newlines (#1124) 2022-08-26 17:22:10 +03:00
vadimdddd
2376b32a8a
Add SRTResult to KaldiRecognizer (#1080) 2022-08-24 12:48:18 +03:00
vadimdddd
b8a88cc30c
Add gradio recognizer (#1114)
Demonstrate speech recognition in gradio
2022-08-19 17:31:47 +03:00
Nickolay V. Shmyrev
be0b117711 Add training colab 2022-08-17 19:11:45 +02:00
Nickolay V. Shmyrev
df0ee24084 Add adaptation colab 2022-08-17 12:38:17 +02:00
Nickolay V. Shmyrev
d3d8f53156 Add Kaldi build colab 2022-08-17 10:02:43 +02:00
vadimdddd
6eee303d7e
Fix decoding files with spaces (#1084)
Fix decoding files with spaces
2022-08-02 20:56:04 +03:00
Nickolay Shmyrev
053d71f5aa Bug fix in notebook 2022-08-01 22:59:38 +02:00
Nickolay Shmyrev
4fbbf5882c Add colab demo 2022-08-01 22:52:57 +02:00
vadimdddd
f63b015284
Add new dependency websockets (#1079)
add websockets to dependencies
2022-07-19 19:58:24 +03:00
Nickolay Shmyrev
0a9672d910 Attempt to build manylinux wheel 2022-07-15 00:40:38 +02:00
Nickolay Shmyrev
4ccccd0cd2 Bump version 2022-07-14 21:38:36 +02:00
Nickolay Shmyrev
1475b0e986 Async ffmpeg for transcriber and code refactoring 2022-07-05 01:13:05 +02:00
vadimdddd
8ceab0b9b1
Add vosk-server to transcriber (#1024)
Allow to use vosk-server for transcription
2022-07-04 13:50:38 +03:00
vadimdddd
983519e629
Add compatibility with python 3.5 issue(#1021) (#1041)
Added format for python3.5
2022-06-29 18:00:40 +03:00
Nickolay Shmyrev
a7bc5a22d4 We support Polish now 2022-06-05 23:03:57 +02:00
vadimdddd
630edeb3d6
fixed error if parent directory doesn't exist #988 (#990)
Create missing parent directory
2022-06-01 17:52:16 +03:00
Nickolay Shmyrev
b1b216d4c8 Bump versions 2022-05-26 15:22:16 +02:00
Nickolay Shmyrev
55dd29b0ff Fix transcriber bugs while processing folder 2022-05-26 14:05:54 +02:00
Nickolay Shmyrev
298c86d0d4 Bump version and add python requirements 2022-05-26 12:46:06 +02:00
Nickolay Shmyrev
859420809b Add extra comments 2022-05-26 11:45:16 +02:00
Nickolay Shmyrev
0fe3a89768 Update to 0.3.40 2022-05-26 11:41:14 +02:00
Nickolay Shmyrev
5b892fbfc5 Architecture should be universal2 2022-05-25 19:52:05 +03:00
Nickolay Shmyrev
fb4ed21a7f Shave transcriber code 2022-05-24 22:53:57 +02:00
vadimdddd
4209f3a9fe
Vosk model loader(#871) (#941)
* methods get_model_by_name, get_model_by_lang, get_model were added into the model class

* importing modules changed to using components; introduced constant MODELS_HOME_DIR; simplified code

* added new model folders into init; changed samples and transcriber bin for new model loader

* changed back in cli.py lang arg to args.lang

* added 3 directories instead of 1 to check for models

* cli.py: added 3 args instead of 1 for model; __init__.py: changed script get_model_path for run get_model_by_name/lang inside current directory

* deleted default env var

* cli.py: changed arg_name; __init__.py: changed const name, changed model loading only for last directory

* deleted unused method

* changed by_name, by_lang methods, added download_model method

* deleted env variable initialization

* deleted print()

* deleted unused modules

* added progress_bar, added folder AppData/Local/vosk for model search

* changed download_model method; added my_hook method
2022-05-24 21:06:13 +03:00
Nickolay Shmyrev
def8c93711 Remove semicolon 2022-05-24 15:23:03 +03:00
Nickolay Shmyrev
f73088da58 We create universal binary on OSX 2022-05-24 15:11:05 +03:00
Nickolay Shmyrev
97d737a30a Bump version 2022-05-15 23:36:46 +02:00
Nickolay Shmyrev
b0146782d6 Reorganize transcriber binary 2022-04-29 03:01:09 +02:00
vadimdddd
9d94746479
Add transcriber tool (#851)
Add transcriber tool
2022-04-20 14:48:22 +03:00
Nickolay V. Shmyrev
a87f2e1e07 Czech model 2022-04-13 22:31:45 +02:00
Nickolay Shmyrev
7b7d814484 Introduce incremental decoder with confidences in partial results 2022-04-07 01:07:47 +02:00