Abstract
It has been expected that the computer speaks speech with human emotion for achieving a comfortable speech interaction with computer. In this paper, we try to ascertain the physical features included in our conversations. We employ the eight categories of human emotion and collect the speech signals of each emotion. These speech signals are compared with speech signals that have no emotions by perceptual experiments and analyzing their physical features. According to these results, it is found that the pitch frequency becomes high and the power becomes weak if the speech signals are with "surprise". Speech signals with "joy" have the same pitch characteristic, and opposite characteristic about the pewer. Speech signals with "sad", there are certain oscillation both peak frequency of spectra and pitch.