63 Commits

Author SHA1 Message Date
Fangjun Kuang
7e42ba2c0c
Add various language bindings for Wenet non-streaming CTC models (#2584)
This PR adds support for Wenet non-streaming CTC models to sherpa-onnx by introducing the SherpaOnnxOfflineWenetCtcModelConfig struct and integrating it across all language bindings and APIs. The implementation follows the same pattern as other CTC model types like Zipformer CTC.

- Introduces SherpaOnnxOfflineWenetCtcModelConfig struct with a single model field for the ONNX model path
- Adds the new config to SherpaOnnxOfflineModelConfig and updates all language bindings (C++, Pascal, Kotlin, Java, Go, C#, Swift, JavaScript, etc.)
- Provides comprehensive examples and tests across all supported platforms and languages
2025-09-10 18:52:18 +08:00
Fangjun Kuang
686b909e2f
Add various language bindings for streaming T-one Russian ASR models (#2576)
This PR adds support for streaming T-one Russian ASR models across various language bindings in the sherpa-onnx library. The changes enable T-one CTC (Connectionist Temporal Classification) model integration by adding new configuration structures and example implementations.

- Introduces OnlineToneCtcModelConfig structures across all language bindings (C, C++, Swift, Java, Kotlin, Go, etc.)
- Adds T-one CTC model support to WASM implementations for both ASR and keyword spotting
- Provides comprehensive example implementations demonstrating T-one model usage in multiple programming languages
2025-09-09 16:51:18 +08:00
Fangjun Kuang
353658eabb
Add C# API for KittenTTS (#2477) 2025-08-08 20:22:05 +08:00
Fangjun Kuang
8ab5cba598
Add APIs for Online NeMo CTC models (#2454) 2025-08-07 09:28:16 +08:00
Fangjun Kuang
71aea2f19c
Add C# API for ten-vad (#2385) 2025-07-12 18:39:18 +08:00
Fangjun Kuang
fd9a687ec2
Add Pascal/Go/C#/Dart API for NeMo Canary ASR models (#2367)
Add support for the new NeMo Canary ASR model across multiple language bindings by introducing a Canary model configuration and setter method on the offline recognizer.

- Define Canary model config in Pascal, Go, C#, Dart and update converter functions
- Add SetConfig API for offline recognizer (Pascal, Go, C#, Dart)
- Extend CI/workflows and example scripts to test non-streaming Canary decoding
2025-07-10 14:53:33 +08:00
Fangjun Kuang
3bf986d08d
Support non-streaming zipformer CTC ASR models (#2340)
This PR adds support for non-streaming Zipformer CTC ASR models across multiple language bindings, WebAssembly, examples, and CI workflows.

- Introduces a new OfflineZipformerCtcModelConfig in C/C++, Python, Swift, Java, Kotlin, Go, Dart, Pascal, and C# APIs
- Updates initialization, freeing, and recognition logic to include Zipformer CTC in WASM and Node.js
- Adds example scripts and CI steps for downloading, building, and running Zipformer CTC models

Model doc is available at
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/icefall/zipformer.html
2025-07-04 15:57:07 +08:00
Fangjun Kuang
bda427f4b2
Add API to get version information (#2309) 2025-06-25 00:22:21 +08:00
Fangjun Kuang
2dc0f91904
Add C# API for Dolphin CTC models (#2089) 2025-04-02 23:36:22 +08:00
Fangjun Kuang
0aacf02dd8
Add C++ runtime for vocos (#2014) 2025-03-17 17:05:15 +08:00
Fangjun Kuang
d3e27d5e21
Add C# API for speech enhancement GTCRN models (#1990) 2025-03-11 18:58:17 +08:00
Fangjun Kuang
b5d89d7bcb
Add Dart API for FireRedAsr AED Model (#1877) 2025-02-17 15:17:08 +08:00
Fangjun Kuang
d95d43112d
Add C# API for FireRedAsr Model (#1875) 2025-02-17 13:32:41 +08:00
Fangjun Kuang
ae32dfaa56
Add C# API for Kokoro TTS 1.0 (#1805) 2025-02-07 15:39:03 +08:00
Fangjun Kuang
8b989a851c
Fix keyword spotting. (#1689)
Reset the stream right after detecting a keyword
2025-01-20 16:41:10 +08:00
Fangjun Kuang
cc812e6237
Add C# API for Kokoro TTS models (#1720) 2025-01-16 16:30:10 +08:00
Fangjun Kuang
3eced3e7ee
Add C# and JavaScript (wasm) API for MatchaTTS models (#1682) 2025-01-05 15:08:19 +08:00
Michael Lamothe
8a60985363
Upgraded to .NET 8 and made code style a little more internally consistent. (#1680) 2025-01-04 16:39:06 +08:00
Fangjun Kuang
d9c586ccf2
Removed unused TTS example code in .Net examples (#1492) 2024-10-29 14:59:12 +08:00
Fangjun Kuang
3622104133
Add C# API for Moonshine models. (#1483)
* Also, return timestamps for non-streaming ASR.
2024-10-27 13:14:25 +08:00
Fangjun Kuang
a45e5dba99
C# API for speaker diarization (#1407) 2024-10-10 14:29:05 +08:00
Michael Twohey
b409b0a958
Fixed the C api calls and created the TTS project file (#1324)
Co-authored-by: Michael Twohey <mtwohey@americanambulance.com>
2024-09-07 23:25:02 +08:00
Fangjun Kuang
06fd50f536
Add test about whisper large-v3 for .Net (#1187) 2024-07-29 20:49:38 +08:00
Fangjun Kuang
e472180f2c
Add C# API for SenseVoice models (#1151) 2024-07-20 17:09:23 +08:00
Fangjun Kuang
54e6e962bf
Add microphone example for .Net keyword spotting (#1120) 2024-07-13 14:56:39 +08:00
Fangjun Kuang
08c758520f
Add keyword spotting for C# (#1105) 2024-07-10 21:18:46 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 (#906) 2024-07-10 17:05:26 +08:00
Fangjun Kuang
c2cc9dec58
Add Flush to VAD so that the last segment can be detected. (#1099) 2024-07-09 16:15:56 +08:00
Fangjun Kuang
8c4f576f1b
Support .Net framework 2.0 (#1062) 2024-06-28 11:27:19 +08:00
Fangjun Kuang
5cce159cf3
Fix passing C# string to C++ (#1055) 2024-06-25 10:52:59 +08:00
Fangjun Kuang
e7a45108ac
Remove unused files from .Net examples (#1051) 2024-06-24 10:25:14 +08:00
东风破
00de2bd00b
Refactor .Net example project (#1049)
Co-authored-by: 东风破 <birdfishs@163.com>
2024-06-24 10:10:13 +08:00
Fangjun Kuang
9dd0e03568
Enable to stop TTS generation (#1041) 2024-06-22 18:18:36 +08:00
Fangjun Kuang
36336b31f4
Build Android APK for Thai (#1036) 2024-06-20 18:05:57 +08:00
Fangjun Kuang
6789c909d2
Inverse text normalization API of streaming ASR for various programming languages (#1022) 2024-06-18 13:42:17 +08:00
Fangjun Kuang
6e09933d99
Inverse text normalization API for other programming languages (#1019) 2024-06-17 17:02:39 +08:00
Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
0196f1d546
Wrap VAD APIs to C# (#946) 2024-05-30 18:29:38 +08:00
Fangjun Kuang
a99c7cb35c
Wrap punctuation APIs to C#. (#945) 2024-05-30 16:19:56 +08:00
Fangjun Kuang
b445956675
Fix CI tests. (#898) 2024-05-21 20:37:29 +08:00
Fangjun Kuang
677bc1da3e
Add Speaker ID demo for C# (#862) 2024-05-11 13:27:33 +08:00
Fangjun Kuang
612002da57
Fix C# to support Chinese tts models using jieba (#815) 2024-04-26 11:50:07 +08:00
Fangjun Kuang
f7b3735621
Add CTC HLG decoding for JNI (#810) 2024-04-25 17:20:02 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
dbff2eaadb
Add C API for streaming HLG decoding (#734) 2024-04-05 10:31:20 +08:00
Fangjun Kuang
305c373107
Add C# API for spoken language identification (#697) 2024-03-25 18:45:09 +08:00
Fangjun Kuang
ab7cff2513
Add C API for spoken language identification. (#695) 2024-03-25 15:16:47 +08:00
Fangjun Kuang
1952772654
Add timestamps and tokens for .Net's online models. (#690) 2024-03-23 18:51:56 +08:00
Fangjun Kuang
2fc1201924
Add hotwords support to .Net (#689) 2024-03-22 21:40:42 +08:00
Fangjun Kuang
24f437a6f1
Refactor github actions tests (#688) 2024-03-22 21:22:42 +08:00