Fangjun Kuang
26aa2fa932
Release v1.12.14 ( #2608 )
2025-09-18 15:09:10 +08:00
Fangjun Kuang
a45384b874
Provide pre-compiled whls for cuda 12.x on Linux x64 and Windows x64 ( #2601 )
2025-09-15 17:12:45 +08:00
Fangjun Kuang
32c248b8a0
Release v1.12.13 ( #2593 )
2025-09-12 16:03:15 +08:00
Fangjun Kuang
04a98ca8bf
Release v1.12.12 ( #2586 )
2025-09-10 22:55:01 +08:00
Fangjun Kuang
a9187d5c75
Release v1.12.11 ( #2547 )
2025-09-01 14:09:24 +08:00
Fangjun Kuang
6b1fbdedd2
Release v1.12.10 ( #2523 )
2025-08-25 11:49:31 +08:00
Fangjun Kuang
e8dd5cd2a0
Split sherpa-onnx Python package ( #2521 )
2025-08-25 10:16:58 +08:00
Karel Vesely
f3d23aa170
cmake build, configurable from env ( #2115 )
...
- make sure the defaults in `cmake/cmake_extension.py` variable
`extra_cmake_args` can be overriden by `cmake_args` from
`SHERPA_ONNX_CMAKE_ARGS` env variable
- fix a bug in `sherpa-onnx/csrc/parse-options.cc` which appears
when using `-DSHERPA_ONNX_ENABLE_CHECK=ON`
- avoid copying binaries when these are disabled
2025-04-16 21:26:54 +08:00
Fangjun Kuang
4198d9a166
Provide pre-built wheels with CUDA support. ( #1143 )
2024-07-17 22:59:13 +08:00
Fangjun Kuang
0d258dd150
Support spoken language identification with whisper ( #694 )
2024-03-24 22:57:00 +08:00
Fangjun Kuang
87a7030c08
Support using alsa to access the microphone with non-streaming ASR models ( #517 )
2024-02-26 21:17:26 +08:00
Fangjun Kuang
67acd34dcd
Use alsa to read microphone in speaker identification demo. ( #605 )
2024-02-23 19:27:51 +08:00
Fangjun Kuang
0b18ccfbb2
C++ API demo for speaker identification with portaudio. ( #561 )
2024-01-30 11:21:43 +08:00
Fangjun Kuang
44efff4e47
Fix CI tests for Python and JNI. ( #554 )
2024-01-27 13:01:54 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
...
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
33c03f78b2
Fix CI ( #485 )
2023-12-15 10:25:03 +08:00
Fangjun Kuang
9ff6185b7c
fix building linux x86 wheels ( #484 )
2023-12-14 21:37:40 +08:00
Fangjun Kuang
b18812ceff
Play generated audio using alsa for TTS ( #482 )
2023-12-13 22:28:03 +08:00
Fangjun Kuang
99ff6a834c
Play generated audio as it is generating. ( #457 )
2023-12-02 15:35:11 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
68f0e59688
Add a C++ example to show streaming VAD + non-streaming ASR. ( #420 )
2023-11-11 22:54:27 +08:00
Fangjun Kuang
0df0a735d7
Fix building wheels for windows ( #365 )
2023-10-14 23:14:45 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS ( #364 )
2023-10-14 14:21:53 +08:00
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Wei Kang
47184f9db7
Refactor hotwords,support loading hotwords from file ( #296 )
2023-09-14 19:33:17 +08:00
Fangjun Kuang
496c5dd7f5
Fix pip install on Windows for Python 3.11 ( #271 )
2023-08-14 22:22:46 +08:00
Fangjun Kuang
c88e672733
Fix building wheels on Windows ( #229 )
2023-08-01 13:45:02 +08:00
Wei Kang
8562711252
Implement context biasing with a Aho Corasick automata ( #145 )
...
* Implement context graph
* Modify the interface to support context biasing
* Support context biasing in modified beam search; add python wrapper
* Support context biasing in python api example
* Minor fixes
* Fix context graph
* Minor fixes
* Fix tests
* Fix style
* Fix style
* Fix comments
* Minor fixes
* Add missing header
* Replace std::shared_ptr with std::unique_ptr for effciency
* Build graph in constructor
* Fix comments
* Minor fixes
* Fix docs
2023-06-16 14:26:36 +08:00
Fangjun Kuang
726680c5e0
Install binaries via pip install ( #112 )
...
When pepole use pip install sherpa-onnx, they also get the following binaries:
(py38) fangjuns-MacBook-Pro:bin fangjun$ ls -lh sherpa-onnx*
-rwxr-xr-x 1 fangjun staff 36K Apr 4 13:48 sherpa-onnx
-rwxr-xr-x 1 fangjun staff 52K Apr 4 13:48 sherpa-onnx-microphone
-rwxr-xr-x 1 fangjun staff 54K Apr 4 13:48 sherpa-onnx-microphone-offline
-rwxr-xr-x 1 fangjun staff 37K Apr 4 13:48 sherpa-onnx-offline
-rwxr-xr-x 1 fangjun staff 634K Apr 4 13:48 sherpa-onnx-offline-websocket-server
-rwxr-xr-x 1 fangjun staff 710K Apr 4 13:48 sherpa-onnx-online-websocket-client
-rwxr-xr-x 1 fangjun staff 651K Apr 4 13:48 sherpa-onnx-online-websocket-server
(py38) fangjuns-MacBook-Pro:bin fangjun$ pwd
/Users/fangjun/py38/bin
2023-04-04 15:45:59 +08:00
Fangjun Kuang
ea09d5fbc5
Add Python API ( #31 )
2023-02-19 19:36:03 +08:00