映像情報メディア学会誌

ふぉーかす

情報の量から質への転換

曽根原登

2012 年 66 巻 11 号 p. k19
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.k19

ジャーナルフリー

PDF形式でダウンロード (364K)

特　集

大規模データを活用した映像メディア処理

1. 大規模データに基づく画像•映像意味解析

佐藤真一

2012 年 66 巻 11 号 p. 887-890
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.887

ジャーナルフリー

PDF形式でダウンロード (3156K)
2. 情報メディアとその信憑性

田中克己, 山本祐輔

2012 年 66 巻 11 号 p. 891-895
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.891

ジャーナルフリー

PDF形式でダウンロード (679K)
3. 画像メディアと異種メディアの融合

角谷和俊, 北山大輔, 若宮翔子

2012 年 66 巻 11 号 p. 896-902
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.896

ジャーナルフリー

PDF形式でダウンロード (1143K)
4. 画像メディア技術の実世界データへの応用

柳井啓司

2012 年 66 巻 11 号 p. 903-906
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.903

ジャーナルフリー

PDF形式でダウンロード (1199K)
5. 大規模マルチメディアデータの統合と検索による気象イベントのモニタリング

北本朝展

2012 年 66 巻 11 号 p. 907-912
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.907

ジャーナルフリー

PDF形式でダウンロード (755K)

技術解説

ラウドネス測定法を用いたテレビ番組の音声レベル管理

岡本幹彦, 松永英一

2012 年 66 巻 11 号 p. 913-919
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.913

ジャーナルフリー

PDF形式でダウンロード (651K)

話　題

SIGGRAPH2012 見聞記

中嶋正之, 白井暁彦

2012 年 66 巻 11 号 p. 920-927
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.920

ジャーナルフリー

PDF形式でダウンロード (1656K)

講　座

拡張現実感技術の最前線（第11回）

消防防災分野における拡張現実の活用

細川直史

2012 年 66 巻 11 号 p. 928-933
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.928

ジャーナルフリー

PDF形式でダウンロード (1947K)

番組制作ファイル（第8回）

地球イチバン『コロワイ族•樹上の家』

図書博文

2012 年 66 巻 11 号 p. 934-938
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.934

ジャーナルフリー

PDF形式でダウンロード (2766K)

メディアウォッチ（第11回）

東京都現代美術館「館長庵野秀明特撮博物館」展見学レポート

辻井崇紘, 甲斐貴之

2012 年 66 巻 11 号 p. 939-941
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.939

ジャーナルフリー

PDF形式でダウンロード (1171K)

知っておきたいキーワード（第82回）

DisplayPort

長野英生

2012 年 66 巻 11 号 p. 942-945
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.942

ジャーナルフリー

PDF形式でダウンロード (406K)

ニュース

ニュース

岡田晋, 寺田光一

2012 年 66 巻 11 号 p. 946-947
発行日: 2012年
公開日: 2014/11/21

DOIhttps://doi.org/10.3169/itej.66.946

ジャーナルフリー

PDF形式でダウンロード (392K)

論文・研究速報

論文特集　画像処理・符号化とアプリケーション

特集論文

許容色差に基づくピクセルアート自動生成

滝本裕則, 吉森聖貴, 満倉靖恵

2012 年 66 巻 11 号 p. J399-J406
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J399

ジャーナルフリー

抄録を表示する抄録を非表示にする

In this paper, we propose a technique for automatic generation of pixel art from character images based on color-difference tolerance. Pixel art is one of image expressions for digital images and is based on the pixel level. Character images that are expressed by pixel art are composed of edge lines and a few colors. The proposed technique is for automatically generating pixel art from a photograph where there was only a single object. However, in the conventional method, it is difficult to determine several parameters of edge detection and decrease color for automatic generation of pixel art. In this paper, to create the optimal decreased color image for each target image, we propose a clustering method using a maximum distance algorithm (MDA) based on the human color-difference tolerance. Moreover, the complete automation of making pixel art is achieved by using the result obtained by clustering using the MDA for the threshold decision of the Canny edge detector. As a result, it is shown that several optimum parameters for pixel art are obtained by using the proposed method.

抄録全体を表示

PDF形式でダウンロード (1103K)
自然画像の埋め込みに適した色シフトによる高周波非可視化コード

北村一真, 福水洋平, 道関隆国

2012 年 66 巻 11 号 p. J407-J412
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J407

ジャーナルフリー

抄録を表示する抄録を非表示にする

We developed invisible code that is suitable for embedding into a natural image. Our code consists of a background layer, color adjustment layer and code layer. The code layer is composed of edgeless high-frequency blocks and edgeless low-frequency blocks. Two key technologies were devised to implement this idea. One is the color adjustment layer that adjusts the color of the background image for high-frequency blocks using an original blur filter. The other is the edgeless high-frequency block that increases the transparency of the periphery of the high-frequency block. To verify the effectiveness of our invisible code, we estimated the invisibility and reading accuracy for code embedded in four common types of images. The results demonstrate that the invisibility has been improved about 1.34 times and the reading accuracy has been improved about 1.38 times compared with the conventional invisible code.

抄録全体を表示

PDF形式でダウンロード (1952K)
円形カメラ配置EPIからの補間視点画像生成のための補間対象領域に注目した対応点探索法

鬼頭卓也, 阿知葉征彦, 辻拓実

2012 年 66 巻 11 号 p. J413-J419
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J413

ジャーナルフリー

抄録を表示する抄録を非表示にする

This paper proposes a new method of correspondence point search for intermediate viewpoint image generation using Lagrangian interpolation from multi-view images obtained from the circular camera arrangement. Correspondence point search is not performed on the basis of the real camera image but is performed on the basis of interpolation domain. In the conventional method, if the camera interval exceeds 4°, this causes disturbance in the image. However, the proposed method was able to produce an image with fewer disturbances to the intermediate viewpoint with a camera interval up to 6°. In addition, it was able to suppress a significant increase in processing costs by varying the block size to be used for correspondence point search by focusing on the amount of edge that is included in the search range.

抄録全体を表示

PDF形式でダウンロード (1991K)
DCT符号を高周波成分のインデックスとして用いた超解像

江田孝治

2012 年 66 巻 11 号 p. J420-J425
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J420

ジャーナルフリー

抄録を表示する抄録を非表示にする

The purpose of this research is to propose a new super-resolution method using DCT. The proposed method enlarges a small block in an input image to a large block through DCT. A high-definition image is created by applying this method to the entire low-resolution input image. The enlargement of small blocks is carried out by expanding the DCT coefficient's frequency domain to the same size as that of the large blocks. First, the high-frequency component that is lacking in a small block is searched for from a database. Then, it is added to a small block and restores the high-frequency component. The high frequency component is searched for using a DCT sign index. The database is trained using multiple natural images beforehand. The proposed method made a high-speed search possible by using DCT sign index. From many simulation results, it was found that the proposed method was more effective than the traditional one.

抄録全体を表示

PDF形式でダウンロード (687K)

論文

“デジタルエキストラ”を用いた映像制作手法

～動的3次元モデルのテレビ番組への応用～

久富健介, 冨山仁博, 片山美和, 岩舘祐一, 松永孝治, 井藤良幸, 石原渉

2012 年 66 巻 11 号 p. J426-J433
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J426

ジャーナルフリー

抄録を表示する抄録を非表示にする

This paper describes a method of producing scenes for TV program, using “dynamic 3D models”. Dynamic 3D models are 3D models generated from images shot by multiple cameras surrounding an actor for each frame using 3D-reconstruction and texture-mapping techniques. They can provide highly realistic images as well as natural motion without using a motion-capture system. In addition, they allow flexible production, so the positions of the models can be changed and the models can be duplicated. Dynamic 3D models are therefore suitable for producing crowd scenes with numerous people. Although techniques of 3D reconstruction and texture mapping have been discussed and several applications have been proposed, they have not been widely used into TV or movie production because of the size of data and insufficient textural quality. By establishing an efficient production flow with recent powerful processors, we succeeded in producing scenes with sufficient quality for a TV program, which was aired in December, 2009. The scenes involving one thousand soldiers could be produced by shooting several sequences with only two actors. We describe the production flow that we established and present some scenes that we produced using dynamic 3D models for the drama.

抄録全体を表示

PDF形式でダウンロード (1967K)
重み付きクロスバイラテラルフィルタによる奥行き推定精度の向上

松尾琢也, 福嶋慶繁, 石橋豊

2012 年 66 巻 11 号 p. J434-J443
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J434

ジャーナルフリー

抄録を表示する抄録を非表示にする

In this paper, we propose a refinement filter for depth maps. The filter convolutes an image and a depth map with a cross-computed kernel. We call the filter a weighted cross bilateral filter. The main advantages of the proposed method are the filter fits outlines of objects in the depth map to silhouettes in the image, while the filter reduces Gaussian noise in other areas. Additionally, its computational cost is independent of depth ranges. Thus, we can obtain accurate depth maps at the lower cost than the conventional approaches, which require Markov random field-based optimization methods. Experimental results show that the accuracy of the depth map in edge areas goes up and its running time is low.

抄録全体を表示

PDF形式でダウンロード (1590K)
フレーム補間を利用したフラッシュによる映像変動の補正手法

河合吉彦, 住吉英樹, 藤井真人, 八木伸行

2012 年 66 巻 11 号 p. J444-J452
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J444

ジャーナルフリー

抄録を表示する抄録を非表示にする

News programs often have sudden brightness changes caused by electronic flashes from still cameras. This paper proposes a correction method for the video deterioration caused by flashes using the frame interpolation technique. The proposed method creates an interpolated frame from neighboring frames to replace the frame area altered by the flash luminescence. To estimate accurate motion vectors used for frame interpolation, the proposed method adopts a novel block-based cost function that takes into account not only the frames before and after the flash but the flash frame itself. In addition, the hierarchical motion estimation and a new vector refinement filter are used to improve the interpolation quality. Experimental results for broadcasting video show that the proposed method is superior to the conventional interpolation method.

抄録全体を表示

PDF形式でダウンロード (1370K)
基本眼球運動統合システムの構築

宋洋, 張暁林

2012 年 66 巻 11 号 p. J453-J460
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J453

ジャーナルフリー

抄録を表示する抄録を非表示にする

A human-like active binocular vision system, inspired by binocular eye movements in animals, would help robots with automatic fast target switching, smooth target pursuing and efficient visual stabilization. In this paper, a control model that integrates saccadic eye movement, smooth pursuit eye movement, vestibulo-ocular reflex and optokinetic response is proposed. The control interface of the model has been simplified to one external saccadic command input. By Using this target selection command, target switching, target pursuing and visual stabilization of camera would run automatically. To implement the system with parallel processing, like the one used in neural network, the control model and multi-motor control are implemented in a FPGA chip. Finally, the proposed model was tested Using an image processing PC and a binocular robot head and the results show high efficiency of this control model.

抄録全体を表示

PDF形式でダウンロード (1799K)
映像と論文へのアノテーションに基づく論文読解支援システム

石戸谷顕太朗, 山本圭介, 大平茂輝, 長尾確

2012 年 66 巻 11 号 p. J461-J470
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J461

ジャーナルフリー

抄録を表示する抄録を非表示にする

Reading as many technical documents as possible is important to improve our research. When we read these documents, we can understand their contents more easily by referring to related resources such as images, audio clips, and videos. Videos contain a variety of helpful information, and facilitate our understanding of technical documents. We propose a method to define video scenes and document elements, and to annotate them with additional information such as relationships. Based on these annotations and relationships, we developed a support system that uses videos and helps readers understand technical documents. We performed some experiments to confirm whether the system was usable.

抄録全体を表示

PDF形式でダウンロード (879K)

研究速報

地上波デジタルCATV信号用のFDM/WDM光ADMにおいて空きチャネルによりクロストークが統計的に軽減される効果

中川慧, 菊島浩二

2012 年 66 巻 11 号 p. J471-J475
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J471

ジャーナルフリー

抄録を表示する抄録を非表示にする

We report the statistical simulation results of the success probability of optical FDM/WDM Add-Drop-Mux (ADM) for CATV systems transferring terrestrial broadcast digital TV signals. Statistical simulation results show that the required signal crosstalk attenuation value of the optical notch filter can be reduced by inserting blank TV signal channels. The success probability value of the optical FDM/WDM Add-Drop-Mux can consequently be increased.

抄録全体を表示

PDF形式でダウンロード (756K)
コンピュータによる写真からの斜め似顔絵生成法の検討

米山彩美, 高橋桂太, 金子正秀

2012 年 66 巻 11 号 p. J476-J480
発行日: 2012年
公開日: 2012/10/25

DOIhttps://doi.org/10.3169/itej.66.J476

ジャーナルフリー

抄録を表示する抄録を非表示にする

A method for generating oblique caricatures by computers is presented. The inputs are facial images captured from both frontal and several oblique directions. The proposed method can generate caricatures not only for the directions where the input images were provided but also for arbitrary directions by using the results of the principle component analysis of the data on facial features.

抄録全体を表示

PDF形式でダウンロード (1138K)

J-STAGEへの登録はこちら（無料）