pocketsphinx

Pocketsphinx Speech to Text Tutorial in Python

Harman Singh

Nov 3, 2018 — 1 min read

Pocket Sphinx Python

This tutorial will focus on how to use pocketsphinx for speech to text in python. Using CMU Sphinx with python is a non complicated task, when you install all the relevant packages.

What is CMU Sphinx and Pocketsphinx?
CMU Sphinx, called Sphinx in short is a group of speech recognition system developed at Carnegie Mellon University [Wikipedia].
PocketSphinx: A version of Sphinx specialized for embedded systems. This is a most popular version of Sphinx for mobile phone development.

How to Install PocketSphinx?
To install PocketSphinx, you need to install Sphinx base package on your machine first. Goto Sphinx website and download the package as per your operating system.

After installation of Sphinx, from https://github.com/cmusphinx/sphinxbase
Install PocketSphinx, from https://github.com/cmusphinx/pocketsphinx
Use PIP for installing PocketSphinx Library in Python
```
pip install pocketsphinx
```
If above doesn’t work, sometimes you need to upgrade the version of pip, use following commands then:
```
python -m pip install --upgrade pip setuptools wheel
pip install --upgrade pocketsphinx
```
Now start writing code for testing your first program, following is the first test program you can use in Python.

# Code retested by KhalsaLabs
# You can use your own audio file in code
# Raw or wav files would work perfectly
# For mp3 files, you need to modify code (add codex)

from __future__ import print_function
import os
from pocketsphinx import Pocketsphinx, get_model_path, get_data_path

model_path = get_model_path()
data_path = get_data_path()

config = {
'hmm': os.path.join(model_path, 'en-us'),
'lm': os.path.join(model_path, 'en-us.lm.bin'),
'dict': os.path.join(model_path, 'cmudict-en-us.dict')
}

ps = Pocketsphinx(**config)
ps.decode(
audio_file=os.path.join(data_path, 'goforward.raw'), # add your audio file here
buffer_size=2048,
no_search=False,
full_utt=False
)

print(ps.hypothesis())

I will explain the working of code, step by step in another post.

IOT 101 - Minimizing Wifi Battery Consumption in IOT projects

In the early development stages of PetDrifts, we faced significant challenges with our MVP, which included a custom PCB equipped with an ESP32 C3 mini, sensor, and a Li-ion battery. One of the major issues was the rapid draining of the battery, a critical problem in our IoT device design

Writing Kubernetes CronJobs: A Simple Guide

Kubernetes CronJobs are a powerful tool for running scheduled tasks within your Kubernetes cluster. Think of them as cron jobs for containers — they allow you to automate tasks like backups, batch jobs, or routine cleanups. In this guide, we'll walk through how to set up a CronJob in Kubernetes with

Elon Musk’s AI Startup, xAI, Hits $50 Billion Valuation

Elon Musk’s AI startup, xAI, is now valued at $50 billion, according to The Wall Street Journal. The xAI valuation has doubled since spring, showing its rapid growth in the competitive AI industry. xAI Raises $5 Billion in New Funding In its latest funding round, xAI raised $5 billion,

The Essence of Paul Grahm's Essay on Hard Work: How to Hard Work

In the realm of success and achievement, the concept of hard work often gets oversimplified. Paul Graham, a luminary in the tech and startup world, offers an exploration of what it truly means to work hard in his enlightening essay, "How to Work Hard." Through his rich narrative and insightful

This tutorial will focus on how to use pocketsphinx for speech to text in python. Using CMU Sphinx with python is a non complicated task, when you install all the relevant packages.

Read more

IOT 101 - Minimizing Wifi Battery Consumption in IOT projects

Writing Kubernetes CronJobs: A Simple Guide

Elon Musk’s AI Startup, xAI, Hits $50 Billion Valuation

The Essence of Paul Grahm's Essay on Hard Work: How to Hard Work