+ All Categories
Home > Documents > Using a Theremin for Micro-Gesture Recognition in an Automotive Environment · Using a Theremin for...

Using a Theremin for Micro-Gesture Recognition in an Automotive Environment · Using a Theremin for...

Date post: 05-May-2018
Category:
Upload: hoangkiet
View: 216 times
Download: 1 times
Share this document with a friend
1
Using a Theremin for Micro-Gesture Recognition in an Automotive Environment Christoph Endres DFKI Saarbrücken, Germany [email protected] Svilen Dimitrov DFKI Saarbrücken, Germany [email protected] ABSTRACT The world’s first electronic musical instrumented was in- vented in the 1920ies by the Russian Professor L´ eon Theremin. It is based on proximity sensing and can be played touch- free. We use this feature for recognition of micro gestures as a first step for future use in a car environment. 1. INTRODUCTION In increasing awareness for a safe driver interface, sev- eral car manufacturers are carrying out research in gesture recognition [2]. Gestures represent a comfortable addition to existing interaction without the decline in recognition ac- curacy under noisy conditions that speech still suffers from [1]. However, as gesture recognition approaches are mainly camera-based, maintaining accuracy in varying light condi- tions and with dynamic background, as we encounter it in a car environment, are a major issue. We argue that an ap- proach based on proximity-sensing is superior to audio- and video-based approaches. 2. THEREMIN GESTURE RECOGNITION While doing research on proximity sensors for the Russian government, L´ eon Theremin invented an electronic musical instrument in 1920. It was named after him and patented in 1928 [3]. It became popular in the 1950ies trough the work of Robert Moog as well as by being featured in pop- ular movie themes like Bernard Herrmann’s soundtrack for The day the Earth stood still. Since the Theremin is based on proximity sensing, it can be played without actually touching it - a fact that we exploit here. The Theremin is controlled by two proximity-sensing anten- nae, one controlling the pitch of the produced sound and the other one its volume. The instruments pitch corresponds to the shortest distance of the players hand to the pitch antenna. In our experiment, we plug the audio output of a regular Theremin instrument (a Jupiter 4, manufactured by Golem Instruments, Germany) into the computers sound card. Since the relative proximity to the pitch antenna is Copyright held by author(s) AutomotiveUI’10, November 11-12, 2010, Pittsburgh, Pennsylvania Adjunct Proceedings. Figure 1: Gesture recognition via proximity sensing equivalent to the current frequency of the instrument, we can track one motion dimension via the received pitch fre- quency. For training, a predefined set of ten gestures (circle, square and triangle, each both clockwise and counterclockwise, as well as motion right, left, up and down) is performed in close proximity of the pitch antenna. The signal pitch is normal- ized and used as training data for a Dynamic-Time-Warping (DTW) algorithm. Sub sequentially, gestures performed at the Theremins pitch antenna can be recognized by classification based on the training data. The recognized gesture is displayed on the screen (see figure 1, bottom). With the prottoype described here, the current recognition accuracy is about 64% (on a 10% chance level), and mainly limited by the one-dimensional input signal. At the moment, we still use a push-to-gesture button to se- lect the start and end of a gesture, but consider an automatic recognition feasible. We plan to build a similar two-antennae-device, not based on audio generation, and mount it behind the steering wheel of a car. The improved accuracy of two-dimensional recog- nition is believed to be reliable enough to use the device as an additional input-modality. As an obvious advantage, the driver can then interact without taking the hands off the wheel (see figure 1, top). 3. REFERENCES [1] F. Althoff, R. Lindl, and L. Walchshaeusl. Robust multimodal hand- and head gesture recognition for controlling automotive infotainment systems. In VDI-Tagung: Der Fahrer im 21. Jahrhundert, Braunschweig, Germany, November 22-23 2005. [2] C. Pickering. Gesture ‘recognition’ could improve automotive safety. Asian Engineer - Automotive-Design, December 2006. [3] L. S. Theremin. Method of and apparatus for the generation of sounds. United States Patent US1661058, February 28th 1928. Adjunct Proceedings of the Second International Conference on Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI 2010), November 11-12, 2010, Pittsburgh, Pennsylvania, USA 8
Transcript

Using a Theremin for Micro-Gesture Recognitionin an Automotive Environment

Christoph EndresDFKI Saarbrücken, Germany

[email protected]

Svilen DimitrovDFKI Saarbrücken, [email protected]

ABSTRACTThe world’s first electronic musical instrumented was in-vented in the 1920ies by the Russian Professor Leon Theremin.It is based on proximity sensing and can be played touch-free. We use this feature for recognition of micro gesturesas a first step for future use in a car environment.

1. INTRODUCTIONIn increasing awareness for a safe driver interface, sev-

eral car manufacturers are carrying out research in gesturerecognition [2]. Gestures represent a comfortable additionto existing interaction without the decline in recognition ac-curacy under noisy conditions that speech still suffers from[1]. However, as gesture recognition approaches are mainlycamera-based, maintaining accuracy in varying light condi-tions and with dynamic background, as we encounter it ina car environment, are a major issue. We argue that an ap-proach based on proximity-sensing is superior to audio- andvideo-based approaches.

2. THEREMIN GESTURE RECOGNITIONWhile doing research on proximity sensors for the Russian

government, Leon Theremin invented an electronic musicalinstrument in 1920. It was named after him and patentedin 1928 [3]. It became popular in the 1950ies trough thework of Robert Moog as well as by being featured in pop-ular movie themes like Bernard Herrmann’s soundtrack forThe day the Earth stood still.Since the Theremin is based on proximity sensing, it can beplayed without actually touching it - a fact that we exploithere.The Theremin is controlled by two proximity-sensing anten-nae, one controlling the pitch of the produced sound and theother one its volume. The instruments pitch correspondsto the shortest distance of the players hand to the pitchantenna. In our experiment, we plug the audio output ofa regular Theremin instrument (a Jupiter 4, manufacturedby Golem Instruments, Germany) into the computers soundcard. Since the relative proximity to the pitch antenna is

Copyright held by author(s)AutomotiveUI’10, November 11-12, 2010, Pittsburgh, PennsylvaniaAdjunct Proceedings.

Figure 1: Gesture recognition via proximity sensing

equivalent to the current frequency of the instrument, wecan track one motion dimension via the received pitch fre-quency.For training, a predefined set of ten gestures (circle, squareand triangle, each both clockwise and counterclockwise, aswell as motion right, left, up and down) is performed in closeproximity of the pitch antenna. The signal pitch is normal-ized and used as training data for a Dynamic-Time-Warping(DTW) algorithm.Sub sequentially, gestures performed at the Theremins pitchantenna can be recognized by classification based on thetraining data.The recognized gesture is displayed on the screen (see figure1, bottom). With the prottoype described here, the currentrecognition accuracy is about 64% (on a 10% chance level),and mainly limited by the one-dimensional input signal.At the moment, we still use a push-to-gesture button to se-lect the start and end of a gesture, but consider an automaticrecognition feasible.We plan to build a similar two-antennae-device, not basedon audio generation, and mount it behind the steering wheelof a car. The improved accuracy of two-dimensional recog-nition is believed to be reliable enough to use the device asan additional input-modality. As an obvious advantage, thedriver can then interact without taking the hands off thewheel (see figure 1, top).

3. REFERENCES[1] F. Althoff, R. Lindl, and L. Walchshaeusl. Robust multimodal

hand- and head gesture recognition for controlling automotiveinfotainment systems. In VDI-Tagung: Der Fahrer im 21.Jahrhundert, Braunschweig, Germany, November 22-23 2005.

[2] C. Pickering. Gesture ‘recognition’ could improve automotivesafety. Asian Engineer - Automotive-Design, December 2006.

[3] L. S. Theremin. Method of and apparatus for the generation ofsounds. United States Patent US1661058, February 28th 1928.

Adjunct Proceedings of the Second International Conference on Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI 2010), November 11-12, 2010, Pittsburgh, Pennsylvania, USA

8

Recommended