summaryrefslogtreecommitdiffstats
path: root/meta-offline-voice-agent/README
diff options
context:
space:
mode:
authoramanarora_09 <aman.arora9848@gmail.com>2022-07-22 15:58:23 +0200
committerJan-Simon Moeller <jsmoeller@linuxfoundation.org>2022-07-28 13:58:18 +0000
commiteda84d25d819824d641cf89af1e4d61a5e946744 (patch)
tree38f4b7c34a5b91257518bcbcdc6b96d24f0641b2 /meta-offline-voice-agent/README
parentf206eab5753bd9ee2d5b1126c6ec5a18e75cac89 (diff)
meta-offline-voice-agent: Enable layer for offline voice recognition
- This enables offline voice recognition in AGL - Can successfully run vosk API and vosk websocket server Changes: - Create a new layer: meta-offline-voice-recognition - Create recipes for the the Vosk library, API, websocket server and model - Create feature template for the layer (agl-offline-voice-agent) - Add all required packages to IMAGE_INSTALL - ptest enable recipe python3-vosk-api - Few fixes Bug-AGL: SPEC-4497 Change-Id: Icb290ceee22e2c3a51cbd67f7431701795f1201b Signed-off-by: amanarora_09 <aman.arora9848@gmail.com>
Diffstat (limited to 'meta-offline-voice-agent/README')
-rw-r--r--meta-offline-voice-agent/README46
1 files changed, 46 insertions, 0 deletions
diff --git a/meta-offline-voice-agent/README b/meta-offline-voice-agent/README
new file mode 100644
index 00000000..233e0fc9
--- /dev/null
+++ b/meta-offline-voice-agent/README
@@ -0,0 +1,46 @@
+Offline Speech Recognition using Vosk (Kaldi-ASR) : meta-offline-voice-agent
+=========================================================================
+
+meta-offline-voice-agent is the AGL Layer using Vosk API based on Kaldi ASR Toolkit to enable Offline Speech Recognition capabilities for Automotive Grade Linux.
+
+
+WIP
+========
+The layer in its current state contains the Vosk library, and is capable of recognizing speech, as verified from the test scripts contained in https://github.com/alphacep/vosk-api/tree/master/python/example.
+
+### Working features:
+- [vosk-api (python)](https://github.com/alphacep/vosk-api/tree/master/python)
+- [vosk-websocket-server](https://github.com/alphacep/vosk-server/tree/master/websocket)
+
+
+Testing vosk-api on AGL
+======================
+
+### 1. Initializing the build environment:
+
+The `agl-offline-voice-agent` feature needs to be enabled when including aglsetup.sh:
+
+```shell
+$ source meta-agl/scripts/aglsetup.sh -m qemux86-64 -b build-voice-qemux86-64 agl-demo agl-offline-voice-agent ${AGL_META_PYTHON}
+
+$ bitbake agl-demo-platform
+```
+
+### 2. Running the image:
+
+```shell
+$ runqemu tmp/deploy/images/qemux86-64/agl-demo-platform-qemux86-64.qemuboot.conf kvm serialstdio slirp publicvnc audio
+```
+
+### 3. Run the test with ptest-runner:
+
+```shell
+$ ptest-runner python3-vosk-api
+```
+
+### Currently supported targets:
+- QEMU x86-64: Work in progress.
+
+Maintainers:
+- Aman Arora <aman.arora9848@gmail.com>
+