This PR adds support for Wenet non-streaming CTC models to sherpa-onnx by introducing the SherpaOnnxOfflineWenetCtcModelConfig struct and integrating it across all language bindings and APIs. The implementation follows the same pattern as other CTC model types like Zipformer CTC. - Introduces SherpaOnnxOfflineWenetCtcModelConfig struct with a single model field for the ONNX model path - Adds the new config to SherpaOnnxOfflineModelConfig and updates all language bindings (C++, Pascal, Kotlin, Java, Go, C#, Swift, JavaScript, etc.) - Provides comprehensive examples and tests across all supported platforms and languages
Introduction
This folder contains Go API examples for sherpa-onnx.
Please refer to the documentation https://k2-fsa.github.io/sherpa/onnx/go-api/index.html for details.
-
./add-punctuation It shows how to use a punctuation model to add punctuations to text
-
./non-streaming-decode-files It shows how to use a non-streaming ASR model to decode files
-
./non-streaming-speaker-diarization It shows how to use a speaker segmentation model and a speaker embedding model for speaker diarization.
-
./non-streaming-tts It shows how to use a non-streaming TTS model to convert text to speech
-
./real-time-speech-recognition-from-microphone It shows how to use a streaming ASR model to recognize speech from a microphone in real-time
-
./speaker-identification It shows how to use a speaker embedding model for speaker identification.
-
./streaming-decode-files It shows how to use a streaming model for streaming speech recognition
-
./streaming-hlg-decoding It shows how to use a streaming model for streaming speech recognition with HLG decoding
-
./vad It shows how to use silero VAD with Golang.
-
./vad-asr-paraformer It shows how to use silero VAD + Paraformer for speech recognition.
-
./vad-asr-whisper It shows how to use silero VAD + Whisper
-
./vad-speaker-identification It shows how to use Go API for VAD + speaker identification. for speech recognition.
-
./vad-spoken-language-identification It shows how to use silero VAD + Whisper for spoken language identification.