Offline Speech Recognition using Vosk (Kaldi-ASR) : meta-offline-voice-agent
=========================================================================
meta-offline-voice-agent is the AGL layer that uses the Vosk API, which is built on the Kaldi ASR toolkit, to enable offline speech recognition on Automotive Grade Linux.
WIP
========
In its current state, the layer packages the Vosk library and can recognize speech, as verified with the test scripts in https://github.com/alphacep/vosk-api/tree/master/python/example.
### Working features:
- [vosk-api (python)](https://github.com/alphacep/vosk-api/tree/master/python)
- [vosk-websocket-server](https://github.com/alphacep/vosk-server/tree/master/websocket)
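
To give a feel for what the Python bindings offer, here is a minimal sketch of transcribing a WAV file, loosely modelled on the upstream example scripts. The file and model paths are placeholders, and the helper names `extract_text` and `transcribe` are hypothetical; this layer does not provide them. It assumes a mono 16-bit PCM WAV file and an unpacked Vosk model directory on the target.

```python
import json
import sys
import wave


def extract_text(result_json: str) -> str:
    """Return the recognized text from a Vosk result JSON string."""
    payload = json.loads(result_json)
    # Final results carry a "text" key; partial results carry "partial".
    return payload.get("text", payload.get("partial", ""))


def transcribe(wav_path: str, model_path: str) -> str:
    """Transcribe a mono 16-bit PCM WAV file (hypothetical helper)."""
    # Imported lazily so extract_text() stays usable without vosk installed.
    from vosk import KaldiRecognizer, Model

    pieces = []
    with wave.open(wav_path, "rb") as wf:
        rec = KaldiRecognizer(Model(model_path), wf.getframerate())
        while True:
            data = wf.readframes(4000)
            if not data:
                break
            # AcceptWaveform() returns true once an utterance is complete.
            if rec.AcceptWaveform(data):
                pieces.append(extract_text(rec.Result()))
        # Flush whatever audio is still buffered in the recognizer.
        pieces.append(extract_text(rec.FinalResult()))
    return " ".join(p for p in pieces if p)


if __name__ == "__main__":
    # Usage (paths are placeholders): python3 transcribe.py test.wav model
    print(transcribe(sys.argv[1], sys.argv[2]))
```

The JSON handling is kept in its own function so it can be reused for partial results when streaming audio, for example from a microphone.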
Testing vosk-api on AGL
======================
### 1. Initializing the build environment:
The `agl-offline-voice-agent` feature must be enabled when sourcing aglsetup.sh:
```shell
$ source meta-agl/scripts/aglsetup.sh -m qemux86-64 -b build-voice-qemux86-64 agl-demo agl-offline-voice-agent ${AGL_META_PYTHON}
$ bitbake agl-demo-platform
```
### 2. Running the image:
```shell
$ runqemu tmp/deploy/images/qemux86-64/agl-demo-platform-qemux86-64.qemuboot.conf kvm serialstdio slirp publicvnc audio
```
### 3. Run the test with ptest-runner:
```shell
$ ptest-runner python3-vosk-api
```
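
If the ptest output is captured to a file, a small script can summarize it. The sketch below assumes the conventional ptest result lines of the form `PASS: <name>`, `FAIL: <name>` and `SKIP: <name>`; whether the `python3-vosk-api` ptest prints exactly this format has not been checked here, and `summarize_ptest` and `ptest.log` are hypothetical names.

```python
import re
import sys
from collections import Counter

# Assumed result-line format: "PASS: name", "FAIL: name" or "SKIP: name".
_RESULT_LINE = re.compile(r"^(PASS|FAIL|SKIP):\s*(.*)$")


def summarize_ptest(log_text):
    """Count results per status and collect the names of failed tests."""
    counts = Counter()
    failed = []
    for line in log_text.splitlines():
        match = _RESULT_LINE.match(line.strip())
        if not match:
            continue  # Ignore anything that is not a result line.
        status, name = match.groups()
        counts[status] += 1
        if status == "FAIL":
            failed.append(name)
    return dict(counts), failed


if __name__ == "__main__":
    # Usage (file name is a placeholder): python3 summarize.py ptest.log
    with open(sys.argv[1], encoding="utf-8") as log_file:
        counts, failed = summarize_ptest(log_file.read())
    print(counts)
    for name in failed:
        print("failed:", name)
```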
### Currently supported targets:
- QEMU x86-64: Work in progress.
Maintainers:
- Aman Arora <aman.arora9848@gmail.com>