alphacep_vosk-api

mirror of https://github.com/alphacep/vosk-api.git synced 2026-03-23 00:01:25 +08:00

Author	SHA1	Message	Date
Nickolay Shmyrev	d497c3a0a5	Move to 3-value endpointer config	2023-12-13 04:29:21 +01:00
Nickolay Shmyrev	f37ee638a4	Add endpointer delays parameter	2023-11-27 15:02:55 +01:00
Nickolay Shmyrev	e0b29bb17f	Implement more endpointer modes	2023-11-10 01:58:31 +01:00
Nickolay Shmyrev	c3e3fc002f	Add online endpointer reconfiguration	2023-10-16 13:40:32 +02:00
Nickolay Shmyrev	62fc308bdf	Sometimes python interpreter destroys the handle on exit before the object. See https://foss.heptapod.net/pypy/cffi/-/issues/340	2023-09-04 00:33:42 +02:00
Anton Romanov	128c216c61	Fix vosk-transcriber cli json format (#1304) Fix error UnboundLocalError: local variable 'monologue' referenced before assignment in transcriber.py line 102	2023-03-29 03:12:16 +03:00
Nickolay Shmyrev	790c564d14	Ignore non-audio files in folder transcription	2023-02-10 00:57:14 +01:00
Nickolay Shmyrev	4c72097478	Allow to configure GPU model path. See https://github.com/alphacep/vosk-server/issues/207	2022-11-19 01:13:42 +01:00
Tomas Mehdi	1053cfa0f8	JSON output_type with texts (#1180)	2022-10-26 22:41:04 +03:00
gmat	21a42cb6cd	new output type json for gecko project (#1166) add a output type `json` to get output file compatible with transcription file format for the gecko project annotation https://github.com/gong-io/gecko	2022-10-06 16:27:29 +03:00
Nickolay Shmyrev	32aa980069	Allow to change grammar of existing recognizer. Probably can be faster though if we do not reinit feature pipeline	2022-09-13 23:12:13 +02:00
vadimdddd	7b4d396eb1	fix quotes and import's order (#1143)	2022-09-13 21:51:45 +03:00
vadimdddd	36968fbb30	Fix python style issues (#1140) * pylint_fix v.2 * init: fix quotes	2022-09-12 23:08:59 +03:00
vadimdddd	7474888801	Fix python style issues (#1131)	2022-09-09 00:13:02 +03:00
vadimdddd	d46b7a43eb	Split results by newlines (#1124)	2022-08-26 17:22:10 +03:00
vadimdddd	2376b32a8a	Add SRTResult to KaldiRecognizer (#1080)	2022-08-24 12:48:18 +03:00
vadimdddd	6eee303d7e	Fix decoding files with spaces (#1084) Fix decoding files with spaces	2022-08-02 20:56:04 +03:00
Nickolay Shmyrev	1475b0e986	Async ffmpeg for transcriber and code refactoring	2022-07-05 01:13:05 +02:00
vadimdddd	8ceab0b9b1	Add vosk-server to transcriber (#1024) Allow to use vosk-server for transcription	2022-07-04 13:50:38 +03:00
vadimdddd	983519e629	Add compatibility with python 3.5 issue(#1021) (#1041) Added format for python3.5	2022-06-29 18:00:40 +03:00
vadimdddd	630edeb3d6	fixed error if parent directory doesn't exist #988 (#990) Create missing parent directory	2022-06-01 17:52:16 +03:00
Nickolay Shmyrev	55dd29b0ff	Fix transcriber bugs while processing folder	2022-05-26 14:05:54 +02:00
Nickolay Shmyrev	fb4ed21a7f	Shave transcriber code	2022-05-24 22:53:57 +02:00
vadimdddd	4209f3a9fe	Vosk model loader(#871) (#941) * methods get_model_by_name, get_model_by_lang, get_model were added into the model class * importing modules changed to using components; introduced constant MODELS_HOME_DIR; simplified code * added new model folders into init; changed samples and transcriber bin for new mode loader * changed back in cli.py lang arg to args.lang * added 3 directories instead of 1 to check for models * cli.py: added 3 args instead of 1 for model; __init__.py: changed script get_model_path for run get_model_by_name/lang inside current directory * deleted default env var * cli.py: changed arg_name; __init__.py: changed const name, changed model loading only for last directory * deleted unused method * changed by_name, by_lang methods, added download_model method * deleted env variable initialization * deleted print() * deleted unused modules * added progress_bar, added folder AppData/Local/vosk for model search * changed download_model method; added my_hook method	2022-05-24 21:06:13 +03:00
Nickolay Shmyrev	b0146782d6	Reorganize transcriber binary	2022-04-29 03:01:09 +02:00
Nickolay Shmyrev	7b7d814484	Introduce incremental decoder with confidences in partial results	2022-04-07 01:07:47 +02:00
Nickolay Shmyrev	a57a84f90e	Refactor GPU API to hide the ID and keep it closer to CPU recognizer	2022-03-03 21:09:09 +01:00
Nickolay Shmyrev	79b8395be0	Add NLSML output	2022-02-03 23:08:09 +01:00
Nickolay Shmyrev	cb0f8e6411	Per-stream wait API	2021-12-23 22:34:47 +01:00
Nickolay Shmyrev	848b2dc753	Expose results in Python	2021-12-17 22:57:00 +01:00
Nickolay Shmyrev	60f0396fe0	Reset lattice on endpoint	2021-12-17 01:13:09 +01:00
Nickolay Shmyrev	344e137a61	Decoding works, results are empty yet	2021-12-13 01:21:59 +01:00
Nickolay Shmyrev	6977be7fb7	Batch recognizer draft	2021-12-12 21:37:44 +01:00
Nickolay Shmyrev	5e46825474	Add try/catch wrapper for C++ method to raise native exceptions. Python and Java are implemented, others on the way	2021-10-12 22:31:36 +02:00
Nickolay Shmyrev	75bedfe06d	Add a method to show/hide words and their times	2021-06-07 01:04:37 +02:00
Nickolay Shmyrev	499b2f183a	Introduce new API to set speaker model to already initialized recognizer. Introduce a method to reset recognizer results to start from scratch without computation of the result.	2021-05-26 00:46:32 +02:00
Nickolay Shmyrev	f8189685e5	Add max alternatives output	2021-05-19 18:47:25 +02:00
sskorol	eea7ca571b	- Added an optional HAVE_CUDA flag to build Vosk with GPU support. - Added missing GpuInit/ThreadInit python wrappers.	2021-02-22 12:30:46 +02:00
Nickolay Shmyrev	db3e31d7ce	Automatically put .so file in resources	2021-02-16 17:33:09 +01:00
Nickolay Shmyrev	d917af21ab	Better search for Windows dlls	2021-02-12 22:24:54 +01:00
Nickolay Shmyrev	c3430e448a	Add build for Windows wheel	2021-02-07 18:11:38 +01:00
Nickolay Shmyrev	481881e59d	Fully move python to cffi	2021-02-07 13:23:03 +01:00
Nickolay Shmyrev	80219066e9	Expose verbose level in the API	2020-05-01 19:02:57 +02:00
Nickolay Shmyrev	aa91ccf68b	Added speaker identification	2020-02-16 17:04:37 +01:00
Nickolay Shmyrev	fe35675eaa	Added basic travis	2020-01-10 20:01:48 +01:00
Nickolay Shmyrev	78e66149f8	Imported Python bindings and Node bindings	2020-01-02 20:46:14 +01:00

46 Commits