30 Commits

Author SHA1 Message Date
Fangjun Kuang
26aa2fa932
Release v1.12.14 (#2608) 2025-09-18 15:09:10 +08:00
Fangjun Kuang
a45384b874
Provide pre-compiled whls for cuda 12.x on Linux x64 and Windows x64 (#2601) 2025-09-15 17:12:45 +08:00
Fangjun Kuang
32c248b8a0
Release v1.12.13 (#2593) 2025-09-12 16:03:15 +08:00
Fangjun Kuang
04a98ca8bf
Release v1.12.12 (#2586) 2025-09-10 22:55:01 +08:00
Fangjun Kuang
a9187d5c75
Release v1.12.11 (#2547) 2025-09-01 14:09:24 +08:00
Fangjun Kuang
6b1fbdedd2
Release v1.12.10 (#2523) 2025-08-25 11:49:31 +08:00
Fangjun Kuang
e8dd5cd2a0
Split sherpa-onnx Python package (#2521) 2025-08-25 10:16:58 +08:00
Karel Vesely
f3d23aa170
cmake build, configurable from env (#2115)
- make sure the defaults in `cmake/cmake_extension.py` variable
  `extra_cmake_args` can be overriden by `cmake_args` from
  `SHERPA_ONNX_CMAKE_ARGS` env variable
- fix a bug in `sherpa-onnx/csrc/parse-options.cc` which appears
  when using `-DSHERPA_ONNX_ENABLE_CHECK=ON`
- avoid copying binaries when these are disabled
2025-04-16 21:26:54 +08:00
Fangjun Kuang
4198d9a166
Provide pre-built wheels with CUDA support. (#1143) 2024-07-17 22:59:13 +08:00
Fangjun Kuang
0d258dd150
Support spoken language identification with whisper (#694) 2024-03-24 22:57:00 +08:00
Fangjun Kuang
87a7030c08
Support using alsa to access the microphone with non-streaming ASR models (#517) 2024-02-26 21:17:26 +08:00
Fangjun Kuang
67acd34dcd
Use alsa to read microphone in speaker identification demo. (#605) 2024-02-23 19:27:51 +08:00
Fangjun Kuang
0b18ccfbb2
C++ API demo for speaker identification with portaudio. (#561) 2024-01-30 11:21:43 +08:00
Fangjun Kuang
44efff4e47
Fix CI tests for Python and JNI. (#554) 2024-01-27 13:01:54 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting (#505)
* various fixes to ContextGraph to support open vocabulary keywords decoder

* Add keyword spotter runtime

* Add binary

* First version works

* Minor fixes

* update text2token

* default values

* Add jni for kws

* add kws android project

* Minor fixes

* Remove unused interface

* Minor fixes

* Add workflow

* handle extra info in texts

* Minor fixes

* Add more comments

* Fix ci

* fix cpp style

* Add input box in android demo so that users can specify their keywords

* Fix cpp style

* Fix comments

* Minor fixes

* Minor fixes

* minor fixes

* Minor fixes

* Minor fixes

* Add CI

* Fix code style

* cpplint

* Fix comments

* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
33c03f78b2
Fix CI (#485) 2023-12-15 10:25:03 +08:00
Fangjun Kuang
9ff6185b7c
fix building linux x86 wheels (#484) 2023-12-14 21:37:40 +08:00
Fangjun Kuang
b18812ceff
Play generated audio using alsa for TTS (#482) 2023-12-13 22:28:03 +08:00
Fangjun Kuang
99ff6a834c
Play generated audio as it is generating. (#457) 2023-12-02 15:35:11 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs (#453) 2023-11-30 23:57:43 +08:00
Fangjun Kuang
68f0e59688
Add a C++ example to show streaming VAD + non-streaming ASR. (#420) 2023-11-11 22:54:27 +08:00
Fangjun Kuang
0df0a735d7
Fix building wheels for windows (#365) 2023-10-14 23:14:45 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS (#364) 2023-10-14 14:21:53 +08:00
Fangjun Kuang
c471423125
Add Silero VAD (#313) 2023-09-17 14:54:38 +08:00
Wei Kang
47184f9db7
Refactor hotwords,support loading hotwords from file (#296) 2023-09-14 19:33:17 +08:00
Fangjun Kuang
496c5dd7f5
Fix pip install on Windows for Python 3.11 (#271) 2023-08-14 22:22:46 +08:00
Fangjun Kuang
c88e672733
Fix building wheels on Windows (#229) 2023-08-01 13:45:02 +08:00
Wei Kang
8562711252
Implement context biasing with a Aho Corasick automata (#145)
* Implement context graph

* Modify the interface to support context biasing

* Support context biasing in modified beam search; add python wrapper

* Support context biasing in python api example

* Minor fixes

* Fix context graph

* Minor fixes

* Fix tests

* Fix style

* Fix style

* Fix comments

* Minor fixes

* Add missing header

* Replace std::shared_ptr with std::unique_ptr for effciency

* Build graph in constructor

* Fix comments

* Minor fixes

* Fix docs
2023-06-16 14:26:36 +08:00
Fangjun Kuang
726680c5e0
Install binaries via pip install (#112)
When pepole use pip install sherpa-onnx, they also get the following binaries:

(py38) fangjuns-MacBook-Pro:bin fangjun$ ls -lh  sherpa-onnx*
-rwxr-xr-x  1 fangjun  staff    36K Apr  4 13:48 sherpa-onnx
-rwxr-xr-x  1 fangjun  staff    52K Apr  4 13:48 sherpa-onnx-microphone
-rwxr-xr-x  1 fangjun  staff    54K Apr  4 13:48 sherpa-onnx-microphone-offline
-rwxr-xr-x  1 fangjun  staff    37K Apr  4 13:48 sherpa-onnx-offline
-rwxr-xr-x  1 fangjun  staff   634K Apr  4 13:48 sherpa-onnx-offline-websocket-server
-rwxr-xr-x  1 fangjun  staff   710K Apr  4 13:48 sherpa-onnx-online-websocket-client
-rwxr-xr-x  1 fangjun  staff   651K Apr  4 13:48 sherpa-onnx-online-websocket-server
(py38) fangjuns-MacBook-Pro:bin fangjun$ pwd
/Users/fangjun/py38/bin
2023-04-04 15:45:59 +08:00
Fangjun Kuang
ea09d5fbc5
Add Python API (#31) 2023-02-19 19:36:03 +08:00