Speech Recognition Jukebox

For the Final Project in ECE 476: Designing with Microcontrollers, Robbins and Saha developed a Speech Recognition Jukebox, comprised of a speech recognition system that activated a simple music player. The speech recognition system was capable of recognizing four commands and could cycle through a simple play list of three songs. The jukebox could turn itself on, begin play, move between tracks, and stop play all through user voice commands.

In order to implement this design, Robbins and Saha needed to combine several different hardware and software elements. A small microphone was purchased and used to convert the human voice signal into a voltage signal. This alternating voltage signal was amplified by 1,000 times using three LM358 operational amplifiers. Hardware frequency filters were used to limit the frequency input and software frequency filters were used to parse the signal into different frequency regions.

The values of the signal in these different frequency regions helped to determine each individual word’s unique digital ‘fingerprint’. The fingerprints of important words, such as commands for the music-playing element of the design, were stored into the program. Each time a word was spoken, the fingerprint of this sample word was compared to the stored fingerprints to determine which command, if any, was spoken.

Recognized commands for the system are:

“ON”	Turn the music player on, play current song
“END”	Pause the music player
“SOON”	Play the next song
“PREV”	Play the previous song

Table 1: Voice Commands Recognized by the System

Given the correct combination of commands, a simple music tune would be played on the speaker of the television. A more in-depth analysis of the workings of both the software and hardware sections of the design can be found below.

Filter	Frequency Range
Band-Pass Filter #1	150 Hz – 350 Hz
Band-Pass Filter #2	350 Hz – 600 Hz
Band-Pass Filter #3	600 Hz – 850 Hz
Band-Pass Filter #4	850 Hz – 1100 Hz
Band-Pass Filter #5	1100 Hz – 1350 Hz
Band-Pass Filter #6	1350 Hz – 1600 Hz
High-Pass Filter	above 1600 Hz

Note	C	D	E	F	G	A	B	C	D	E	F	G	A	B	C	Rest
Value	239	213	189	179	159	142	126	120	106	94	90	80	71	63	60	0

Item	Unit Cost	# Used	Cost

Atmel Mega32 Microcontroller	$8.00	1	$8.00
White board	$6.00	1	$6.00
STK 500 board	$15.00	1	$15.00
Power Supply	$5.00	1	$5.00
Digi-Key Microphone #423-1027-ND Manufacturer Part #MD9752NSZ-0	$2.36	1	$2.36
Black and White Television	$5.00	1	$5.00
LM358 Operational Amplifier	$0.00	2	$0.00
Resistors
1 kΩ	$0.00	8	$0.00
2 kΩ	$0.00	3	$0.00
10 kΩ	$0.00	4	$0.00
Capacitors
1 μF	$0.00	7	$0.00
.1 μF	$0.00	1	$0.00

Total Project Cost			$41.36

Project Task	Member Responsible

Software	Robbins and Saha
Digital Filter Design	Saha
Control Section	Robbins and Saha
Audio Playback	Robbins and Saha
Debugging	Robbins and Saha
Testing	Robbins and Saha

Hardware	Robbins and Saha
Microphone Connection	Saha
Filter Design	Robbins
Amplifier Design	Robbins
Television Connection	Robbins

Project Research	Robbins and Saha

Lab Report	Robbins and Saha