Nickolay Shmyrev
d497c3a0a5
Move to 3-value endpointer config
2023-12-13 04:29:21 +01:00
Nickolay Shmyrev
f37ee638a4
Add endpointer delays parameter
2023-11-27 15:02:55 +01:00
Nickolay Shmyrev
e0b29bb17f
Implement more endpointer modes
2023-11-10 01:58:31 +01:00
Nickolay Shmyrev
c3e3fc002f
Add online endpointer reconfiguration
2023-10-16 13:40:32 +02:00
Nickolay Shmyrev
62fc308bdf
Sometimes python interpreter destroys the handle on exit before
...
the object. See
https://foss.heptapod.net/pypy/cffi/-/issues/340
2023-09-04 00:33:42 +02:00
Anton Romanov
128c216c61
Fix vosk-transcriber cli json format ( #1304 )
...
Fix error UnboundLocalError: local variable 'monologue' referenced before assignment in transcriber.py line 102
2023-03-29 03:12:16 +03:00
Nickolay Shmyrev
790c564d14
Ignore non-audio files in folder transcription
2023-02-10 00:57:14 +01:00
Nickolay Shmyrev
4c72097478
Allow to configure GPU model path. See https://github.com/alphacep/vosk-server/issues/207
2022-11-19 01:13:42 +01:00
Tomas Mehdi
1053cfa0f8
JSON output_type with texts ( #1180 )
2022-10-26 22:41:04 +03:00
gmat
21a42cb6cd
new output type json for gecko project ( #1166 )
...
add a output type `json` to get output file compatible with transcription file format for the gecko project annotation https://github.com/gong-io/gecko
2022-10-06 16:27:29 +03:00
Nickolay Shmyrev
32aa980069
Allow to change grammar of existing recognizer. Probably can be faster though
...
if we do not reinit feature pipeline
2022-09-13 23:12:13 +02:00
vadimdddd
7b4d396eb1
fix quotes and import's order ( #1143 )
2022-09-13 21:51:45 +03:00
vadimdddd
36968fbb30
Fix python style issues ( #1140 )
...
* pylint_fix v.2
* init: fix quotes
2022-09-12 23:08:59 +03:00
vadimdddd
7474888801
Fix python style issues ( #1131 )
2022-09-09 00:13:02 +03:00
vadimdddd
d46b7a43eb
Split results by newlines ( #1124 )
2022-08-26 17:22:10 +03:00
vadimdddd
2376b32a8a
Add SRTResult to KaldiRecognizer ( #1080 )
2022-08-24 12:48:18 +03:00
vadimdddd
6eee303d7e
Fix decoding files with spaces ( #1084 )
...
Fix decoding files with spaces
2022-08-02 20:56:04 +03:00
Nickolay Shmyrev
1475b0e986
Async ffmpeg for trancriber and code refactoring
2022-07-05 01:13:05 +02:00
vadimdddd
8ceab0b9b1
Add vosk-server to transcriber ( #1024 )
...
Allow to use vosk-server for transcription
2022-07-04 13:50:38 +03:00
vadimdddd
983519e629
Add compatibility with python 3.5 issue( #1021 ) ( #1041 )
...
Added format for python3.5
2022-06-29 18:00:40 +03:00
vadimdddd
630edeb3d6
fixed error if parent directory doesn't exist #988 ( #990 )
...
Create missing parent directory
2022-06-01 17:52:16 +03:00
Nickolay Shmyrev
55dd29b0ff
Fix transcriber bugs while processing folder
2022-05-26 14:05:54 +02:00
Nickolay Shmyrev
fb4ed21a7f
Shave transcriber code
2022-05-24 22:53:57 +02:00
vadimdddd
4209f3a9fe
Vosk model loader( #871 ) ( #941 )
...
* methods get_model_by_name, get_model_by_lang, get_model were added into the model class
* importing modules changed to using components; introduced constant MODELS_HOME_DIR; simplified code
* added new model folders into init; changed samples and transcriber bin for new mode loader
* changed back in cli.py lang arg to args.lang
* added 3 directories instead of 1 to check for models
* cli.py: added 3 args instead of 1 for model; __init__.py: changed script get_model_path for run get_model_by_name/lang inside current directory
* deleted default env var
* cli.py: changed arg_name; __init__.py: changed const name, changed model loading only for last directory
* deleted unused method
* changed by_name, by_lang methods, added download_model method
* deleted env variable initialization
* deleted print()
* deteled unused modules
* added progress_bar, added folder AppData/Local/vosk for model search
* changed download_model methond; added my_hook method
2022-05-24 21:06:13 +03:00
Nickolay Shmyrev
b0146782d6
Reorganize transcriber binary
2022-04-29 03:01:09 +02:00
Nickolay Shmyrev
7b7d814484
Introduce incremental decoder with confidences in partial results
2022-04-07 01:07:47 +02:00
Nickolay Shmyrev
a57a84f90e
Refactor GPU API to hide the ID and keep it closer to CPU recognizer
2022-03-03 21:09:09 +01:00
Nickolay Shmyrev
79b8395be0
Add NLSML output
2022-02-03 23:08:09 +01:00
Nickolay Shmyrev
cb0f8e6411
Per-stream wait API
2021-12-23 22:34:47 +01:00
Nickolay Shmyrev
848b2dc753
Expose results in Python
2021-12-17 22:57:00 +01:00
Nickolay Shmyrev
60f0396fe0
Reset lattice on endpoint
2021-12-17 01:13:09 +01:00
Nickolay Shmyrev
344e137a61
Decoding works, results are empty yet
2021-12-13 01:21:59 +01:00
Nickolay Shmyrev
6977be7fb7
Batch recognizer draft
2021-12-12 21:37:44 +01:00
Nickolay Shmyrev
5e46825474
Add try/catch wrapper for C++ method to raise native exceptions. Python and Java
...
are implemented, others on the way
2021-10-12 22:31:36 +02:00
Nickolay Shmyrev
75bedfe06d
Add a method to show/hide words and their times
2021-06-07 01:04:37 +02:00
Nickolay Shmyrev
499b2f183a
Introduce new API to set speaker model to already initialized recognizer.
...
Introduce a method to reset recognizer results to start from scratch without
computation of the result.
2021-05-26 00:46:32 +02:00
Nickolay Shmyrev
f8189685e5
Add max alternatives output
2021-05-19 18:47:25 +02:00
sskorol
eea7ca571b
- Added an optional HAVE_CUDA flag to build Vosk with GPU support.
...
- Added missing GpuInit/ThreadInit python wrappers.
2021-02-22 12:30:46 +02:00
Nickolay Shmyrev
db3e31d7ce
Automatically put .so file in resources
2021-02-16 17:33:09 +01:00
Nickolay Shmyrev
d917af21ab
Better search for Windows dlls
2021-02-12 22:24:54 +01:00
Nickolay Shmyrev
c3430e448a
Add build for Windows wheel
2021-02-07 18:11:38 +01:00
Nickolay Shmyrev
481881e59d
Fully move python to cffi
2021-02-07 13:23:03 +01:00
Nickolay Shmyrev
80219066e9
Expose verbose level in the API
2020-05-01 19:02:57 +02:00
Nickolay Shmyrev
aa91ccf68b
Added speaker identification
2020-02-16 17:04:37 +01:00
Nickolay Shmyrev
fe35675eaa
Added basic travis
2020-01-10 20:01:48 +01:00
Nickolay Shmyrev
78e66149f8
Imported Python bindings and Node bindings
2020-01-02 20:46:14 +01:00