OpenAI Whisper-compatible ASR server using NVIDIA Parakeet TDT 0.6B (ONNX). CPU-only inference
-
Updated
Jan 17, 2026 - Go
OpenAI Whisper-compatible ASR server using NVIDIA Parakeet TDT 0.6B (ONNX). CPU-only inference
This project benchmarks vision (MobileNet, ResNet20) and NLP (DistilBERT) models across server vs. client inference, using backend servers and ONNX-converted local models. It evaluates latency, UX, accuracy, and performance, dedicated to fulfilling the Master’s thesis in AI & ML.
Add a description, image, and links to the onxx topic page so that developers can more easily learn about it.
To associate your repository with the onxx topic, visit your repo's landing page and select "manage topics."