mirror of https://github.com/k2-fsa/sherpa-onnx.git synced 2026-01-09 07:41:06 +08:00

History

Add various languge bindings for Wenet non-streaming CTC models (#2584 )

This PR adds support for Wenet non-streaming CTC models to sherpa-onnx by introducing the SherpaOnnxOfflineWenetCtcModelConfig struct and integrating it across all language bindings and APIs. The implementation follows the same pattern as other CTC model types like Zipformer CTC.

- Introduces SherpaOnnxOfflineWenetCtcModelConfig struct with a single model field for the ONNX model path
- Adds the new config to SherpaOnnxOfflineModelConfig and updates all language bindings (C++, Pascal, Kotlin, Java, Go, C#, Swift, JavaScript, etc.)
- Provides comprehensive examples and tests across all supported platforms and languages

2025-09-10 18:52:18 +08:00

add-punctuation

go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920 )

2025-02-25 15:31:15 +08:00

audio-tagging

go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920 )

2025-02-25 15:31:15 +08:00

keyword-spotting-from-file

go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920 )

2025-02-25 15:31:15 +08:00

non-streaming-canary-decode-files

Add APIs for Online NeMo CTC models (#2454 )

2025-08-07 09:28:16 +08:00

non-streaming-decode-files

Add various languge bindings for Wenet non-streaming CTC models (#2584 )

2025-09-10 18:52:18 +08:00

non-streaming-speaker-diarization

go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920 )

2025-02-25 15:31:15 +08:00

non-streaming-tts

Add Go API for KittenTTS (#2478 )

2025-08-08 20:26:15 +08:00

offline-tts-play

Add Go API for KittenTTS (#2478 )

2025-08-08 20:26:15 +08:00

real-time-speech-recognition-from-microphone

Remove portaudio-go in Go API examples. (#2317 )

2025-06-26 11:33:50 +08:00

speaker-identification

go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920 )

2025-02-25 15:31:15 +08:00

speech-enhancement-gtcrn

Add Go API for speech enhancement GTCRN models (#1991 )

2025-03-11 19:33:05 +08:00

streaming-decode-files

Add various language bindings for streaming T-one Russian ASR models (#2576 )

2025-09-09 16:51:18 +08:00

streaming-hlg-decoding

go.mod set to use go 1.17, and use unsafe.Slice to optimize the code (#1920 )

2025-02-25 15:31:15 +08:00

vad

Add Go API for ten-vad (#2384 )

2025-07-12 15:45:49 +08:00

vad-asr-paraformer

Add Java/Kotlin API and Android support for ten-vad (#2389 )

2025-07-12 19:55:37 +08:00

vad-asr-whisper

Add Java/Kotlin API and Android support for ten-vad (#2389 )

2025-07-12 19:55:37 +08:00

vad-speaker-identification

Add Java/Kotlin API and Android support for ten-vad (#2389 )

2025-07-12 19:55:37 +08:00

vad-spoken-language-identification

Add Java/Kotlin API and Android support for ten-vad (#2389 )

2025-07-12 19:55:37 +08:00

.gitignore

Support streaming zipformer CTC (#496 )

2023-12-22 13:46:33 +08:00

README.md

Add Go API for Moonshine models (#1479 )

2024-10-27 09:39:09 +08:00

README.md

Introduction

This folder contains Go API examples for sherpa-onnx.

Please refer to the documentation https://k2-fsa.github.io/sherpa/onnx/go-api/index.html for details.

./add-punctuation It shows how to use a punctuation model to add punctuations to text
./non-streaming-decode-files It shows how to use a non-streaming ASR model to decode files
./non-streaming-speaker-diarization It shows how to use a speaker segmentation model and a speaker embedding model for speaker diarization.
./non-streaming-tts It shows how to use a non-streaming TTS model to convert text to speech
./real-time-speech-recognition-from-microphone It shows how to use a streaming ASR model to recognize speech from a microphone in real-time
./speaker-identification It shows how to use a speaker embedding model for speaker identification.
./streaming-decode-files It shows how to use a streaming model for streaming speech recognition
./streaming-hlg-decoding It shows how to use a streaming model for streaming speech recognition with HLG decoding
./vad It shows how to use silero VAD with Golang.
./vad-asr-paraformer It shows how to use silero VAD + Paraformer for speech recognition.
./vad-asr-whisper It shows how to use silero VAD + Whisper
./vad-speaker-identification It shows how to use Go API for VAD + speaker identification. for speech recognition.
./vad-spoken-language-identification It shows how to use silero VAD + Whisper for spoken language identification.