映像情報メディア学会誌
Online ISSN : 1881-6908
Print ISSN : 1342-6907
ISSN-L : 1342-6907
64 巻, 2 号
選択された号の論文の21件中1~21を表示しています
ふぉーかす
小特集
ディジタルコンテンツ制作の最新動向~ここまで来たディジタルコンテンツ制作~
話  題
講  座
マルチメディア検索の最先端(第2回)
知っておきたいキーワード(第49回)
私の研究開発ツール(第31回)
大学発のベンチャービジネス
100行で書く画像処理最先端(第2回)
ニュース
2009 国際放送機器展見聞記(InterBEE 2009)
論文・研究速報
論文
  • 大久保 隆, 小林 直樹, 大野 光平, 藤井 雅弘, 伊丹 誠
    2010 年64 巻2 号 p. 222-229
    発行日: 2010/02/01
    公開日: 2010/06/01
    ジャーナル フリー
    A scheme for cooperative reception of ISDB-T one-segment service is proposed and its performance is analyzed. It is very difficult to install multiple antennas for diversity reception in small mobile one-segment terminals, and therefore efficient diversity is not always achieved. Cooperative reception between multiple receivers via a dedicated communication network such as Bluetooth is one method of overcoming this problem. Since the communication-link capacity between terminals is limited, a scheme for the efficient exchange of required information such as channel information and sub-carrier data between terminals for diversity must be developed. We discuss possible schemes to reduce the information exchanged between terminals and to achieve maximum diversity performance. Analysis demonstrated the possibility of the proposed scheme for use in a cooperative reception of ISDB-T one-segment service.
  • Zisheng Li, Jun-ichi Imai, Masahide Kaneko
    2010 年64 巻2 号 p. 230-236
    発行日: 2010/02/01
    公開日: 2010/06/01
    ジャーナル フリー
    Facial expression recognition has many potential applications in areas such as human-computer interaction (HCI), emotion analysis, and synthetic face animation. This paper proposes a novel framework of facial appearance and shape information extraction for facial expression recognition. For appearance information extraction, a facial-component-based bag of words method is presented. We segment face images into four component regions: forehead, eye-eyebrow, nose, and mouth. We then partition them into 4 × 4 sub-regions. Dense SIFT (scale-invariant feature transform) features are calculated over the sub-regions and vector quantized into 4 × 4 sets of codeword distributions. For shape information extraction, PHOG (pyramid histogram of orientated gradient) descriptors are computed on the four facial component regions to obtain the spatial distribution of edges. Multi-class SVM classifiers are applied to classify the six basic facial expressions using the facial-component-based bag of words and PHOG descriptors respectively. Then the appearance and shape information is fused at decision level to further improve the recognition rate. Our framework provides holistic characteristics for the local texture and shape features by enhancing the structure-based spatial information, and makes it possible to use the bag of words method and the local descriptors in facial expression recognition for the first time. The recognition rate achieved by the fusion of appearance and shape features at decision level using the Cohn-Kanade database is 96.33%, which outperforms the state-of-the-art research works.
研究速報
feedback
Top