alphacep_vosk-api

mirror of https://github.com/alphacep/vosk-api.git synced 2026-01-15 01:33:31 +08:00

Author	SHA1	Message	Date
Nickolay Shmyrev	7da70c6107	Add test for ITN	2024-03-29 12:57:01 +01:00
Nickolay Shmyrev	aba84973b1	Update to latest gradio	2023-12-15 00:23:37 +01:00
Nickolay Shmyrev	339b1c5d00	Fix t_max endpointer config and introduce t_start_max for silent inputs.	2023-12-13 04:45:30 +01:00
Nickolay Shmyrev	a47fa9147b	Add endpointer delays parameter	2023-11-27 15:29:31 +01:00
Edwin Hendriks	25c59b52e3	add -m (language model) argument to test_microphone.py (#1219 ) * added language model as parameter * fix fore more help explanation	2022-12-07 22:53:21 +03:00
Nickolay Shmyrev	4c72097478	Allow to configure GPU model path. See https://github.com/alphacep/vosk-server/issues/207	2022-11-19 01:13:42 +01:00
Nickolay Shmyrev	16c4a0d985	Bump version	2022-09-25 21:00:24 +02:00
Nickolay Shmyrev	32aa980069	Allow to change grammar of existing recognizer. Probably can be faster though if we do not reinit feature pipeline	2022-09-13 23:12:13 +02:00
vadimdddd	7b4d396eb1	fix quotes and import's order (#1143 )	2022-09-13 21:51:45 +03:00
vadimdddd	36968fbb30	Fix python style issues (#1140 ) * pylint_fix v.2 * init: fix quotes	2022-09-12 23:08:59 +03:00
vadimdddd	7474888801	Fix python style issues (#1131 )	2022-09-09 00:13:02 +03:00
vadimdddd	2376b32a8a	Add SRTResult to KaldiRecognizer (#1080 )	2022-08-24 12:48:18 +03:00
vadimdddd	b8a88cc30c	Add gradio recognizer (#1114 ) Demonstrate speech recognition in gradio	2022-08-19 17:31:47 +03:00
Nickolay V. Shmyrev	be0b117711	Add training colab	2022-08-17 19:11:45 +02:00
Nickolay V. Shmyrev	df0ee24084	Add adaptation colab	2022-08-17 12:38:17 +02:00
Nickolay V. Shmyrev	d3d8f53156	Add Kaldi build colab	2022-08-17 10:02:43 +02:00
Nickolay Shmyrev	053d71f5aa	Bug fix in notebook	2022-08-01 22:59:38 +02:00
Nickolay Shmyrev	4fbbf5882c	Add colab demo	2022-08-01 22:52:57 +02:00
vadimdddd	983519e629	Add compatibility with python 3.5 issue(#1021 ) (#1041 ) Added format for python3.5	2022-06-29 18:00:40 +03:00
Nickolay Shmyrev	859420809b	Add extra comments	2022-05-26 11:45:16 +02:00
vadimdddd	4209f3a9fe	Vosk model loader(#871 ) (#941 ) * methods get_model_by_name, get_model_by_lang, get_model were added into the model class * importing modules changed to using components; introduced constant MODELS_HOME_DIR; simplified code * added new model folders into init; changed samples and transcriber bin for new mode loader * changed back in cli.py lang arg to args.lang * added 3 directories instead of 1 to check for models * cli.py: added 3 args instead of 1 for model; __init__.py: changed script get_model_path for run get_model_by_name/lang inside current directory * deleted default env var * cli.py: changed arg_name; __init__.py: changed const name, changed model loading only for last directory * deleted unused method * changed by_name, by_lang methods, added download_model method * deleted env variable initialization * deleted print() * deteled unused modules * added progress_bar, added folder AppData/Local/vosk for model search * changed download_model methond; added my_hook method	2022-05-24 21:06:13 +03:00
Nickolay Shmyrev	def8c93711	Remove semicolon	2022-05-24 15:23:03 +03:00
Nickolay Shmyrev	7b7d814484	Introduce incremental decoder with confidences in partial results	2022-04-07 01:07:47 +02:00
Nickolay Shmyrev	a57a84f90e	Refactor GPU API to hide the ID and keep it closer to CPU recognizer	2022-03-03 21:09:09 +01:00
Nickolay Shmyrev	79b8395be0	Add NLSML output	2022-02-03 23:08:09 +01:00
Nickolay Shmyrev	d2c11a611f	Read list of files from arguments	2022-01-30 22:57:36 +01:00
Nickolay Shmyrev	72bf210164	Put the demo into main folder	2021-12-24 01:07:38 +01:00
Nickolay Shmyrev	cb0f8e6411	Per-stream wait API	2021-12-23 22:34:47 +01:00
Nickolay Shmyrev	848b2dc753	Expose results in Python	2021-12-17 22:57:00 +01:00
Nickolay Shmyrev	60f0396fe0	Reset lattice on endpoint	2021-12-17 01:13:09 +01:00
Nickolay Shmyrev	344e137a61	Decoding works, results are empty yet	2021-12-13 01:21:59 +01:00
Nickolay Shmyrev	6977be7fb7	Batch recognizer draft	2021-12-12 21:37:44 +01:00
Lars Kiesow	2349e66a97	Subtitles require word times (#607 ) This is a port of the recent addition of commit 7ccf743, adding `KaldiRecognizer.SetWords(True)` to the other examples dealing with subtitles to the WebVTT example. Without this, the example will not work with the most recent `vosk` (0.3.30) Python package.	2021-06-27 18:06:11 +03:00
Lars Kiesow	02ef49f67e	Allow Saving WebVTT (#605 ) This patch is a small extension to the WebVTT example which allows to directly save the WebVTT output to a file like this: ./test_webvtt.py test.wav out.vtt	2021-06-24 19:35:30 +03:00
Lars Kiesow	7cdf8f1d03	Add Python WebVTT Example (#601 ) This patch adds an example for using webvtt-py to generate WebVTT files from Vosk output. This is similar to the SRT example but still very useful for generating an example video subtitle usable in web contexts.	2021-06-23 01:16:33 +03:00
Nickolay Shmyrev	7ccf743bb6	SRT requires word times	2021-06-10 10:37:15 +02:00
Nickolay Shmyrev	75bedfe06d	Add a method to show/hide words and their times	2021-06-07 01:04:37 +02:00
Nickolay Shmyrev	6aa5af7640	Add reset test	2021-05-26 21:22:07 +02:00
Nickolay Shmyrev	499b2f183a	Introduce new API to set speaker model to already initialized recognizer. Introduce a method to reset recognizer results to start from scratch without computation of the result.	2021-05-26 00:46:32 +02:00
Nickolay Shmyrev	f8189685e5	Add max alternatives output	2021-05-19 18:47:25 +02:00
Nickolay Shmyrev	a5a3697b7c	Copy data before queue, original data can be destroyed. Fixes issue #444 Thanks to Alexander Zatvornitsky	2021-03-01 21:25:41 +01:00
Vlad Ki	c6119c4835	test_microphone: AcceptWavform wants bytes	2021-02-08 00:38:28 +02:00
Nickolay Shmyrev	6f2d6d0d69	Proper microphone recognizer with the queue	2021-01-09 23:16:35 +01:00
Nickolay Shmyrev	08c35e84f3	Update demo with spk vector check	2020-12-23 22:21:12 +01:00
Nickolay Shmyrev	dc3d03d742	Make sure we have result field in json	2020-11-29 19:19:50 +01:00
Nickolay Shmyrev	746ff47757	Split long lines in subtitles	2020-11-03 15:13:16 +01:00
Nickolay Shmyrev	7af3e9a334	Add srt example	2020-10-07 13:43:09 +02:00
Nickolay Shmyrev	57cc474c9f	Build bigram language model from grammars	2020-10-04 23:42:50 +02:00
Nickolay Shmyrev	f97383c17f	Update models location	2020-09-22 00:00:24 +02:00
Nickolay Shmyrev	41035485db	Fix x-vectors, now they actually work. Requires new version spk-model-0.4 with whitening transform matrix	2020-09-21 23:31:30 +02:00

1 2

71 Commits