詳細検索結果
以下の条件での結果を表示する: 検索条件を変更
クエリ検索: "1080p"
414件中 1-20の結果を表示しています
  • 大竹 桂一, 川原 功, 村井 隆一
    映像情報メディア学会年次大会講演予稿集
    2006年 2006 巻 3-1
    発行日: 2006/08/01
    公開日: 2017/05/24
    会議録・要旨集 フリー
    Panasonic has developed the world's largest 103-inch
    1080
    p
    PDF prototype. The goal is to create the greatest viewing experience offered by a consumer product for home usage. According to our study, the best viewing distance is not 3H (height) but rather 2H. In the case of the 103-inch
    1080
    p
    PDF, a viewing distance of 2.8 m, which is approximately 2H, corresponds with a 45 degree viewing angle. If one views it at a 2.8 m distance, one senses the same perception of reality at home as in a theater. Panasonic intends for this product to serve as the ideal TV for home theater.
  • Min ZHU, Leibo LIU, Shouyi YIN, Chongyong YIN, Shaojun WEI
    IEICE Transactions on Information and Systems
    2010年 E93.D 巻 12 号 3202-3210
    発行日: 2010/12/01
    公開日: 2010/12/01
    ジャーナル フリー
    This paper introduces a cycle-accurate Simulator for a dynamically REconfigurable MUlti-media System, called SimREMUS. SimREMUS can either be used at transaction-level, which allows the modeling and simulation of higher-level hardware and embedded software, or at register transfer level, if the dynamic system behavior is desired to be observed at signal level. Trade-offs among a set of criteria that are frequently used to characterize the design of a reconfigurable computing system, such as granularity, programmability, configurability as well as architecture of processing elements and route modules etc., can be quickly evaluated. Moreover, a complete tool chain for SimREMUS, including compiler and debugger, is developed. SimREMUS could simulate 270k cycles per second for million gates SoC (System-on-a-Chip) and produced one H.264
    1080
    p
    frame in 15 minutes, which might cost days on VCS (platform: CPU: E5200@ 2.5Ghz, RAM: 2.0GB). Simulation showed that
    1080
    p
    @30fps of H.264 High Profile@ Level 4 can be achieved when exploiting a 200MHz working frequency on the VLSI architecture of REMUS.
  • *村井 隆一
    画像電子学会研究会講演予稿
    2008年 08-03 巻 08-03-06
    発行日: 2008年
    公開日: 2009/01/15
    会議録・要旨集 認証あり
    Panasonic has developed the world's largest 150-inch 4k2k PDP prototype and developed 103-inch
    1080
    p
    PDP. Their goals are to create the greatest viewing experience offered by a consumer product for home usage and other applications for the public usage.
  • 松本 洋平
    日本航海学会論文集
    2012年 127 巻 205-214
    発行日: 2012/09/25
    公開日: 2017/01/15
    ジャーナル フリー
    The stability of the images is an important basis of the computer vision algorithms. To develop the vision based automated watch system for ship navigation, in this paper, I explored the optimal implementation of image stabilizer using inverse compositional algorithm for navigational images. Experimental results show the best size and position of the template regions for our
    1080
    p
    navigational images. Then my stabilizer achieved almost 100% convergence ratio in a real-time manner.
  • 村井 隆一, 安藤 亨, 村田 充弘
    映像情報メディア学会誌
    2007年 61 巻 9 号 1277-1280
    発行日: 2007/09/01
    公開日: 2009/12/25
    ジャーナル フリー
  • Bo LIU, Peng CAO, Min ZHU, Jun YANG, Leibo LIU, Shaojun WEI, Longxing SHI
    IEICE Transactions on Information and Systems
    2012年 E95.D 巻 7 号 1858-1871
    発行日: 2012/07/01
    公開日: 2012/07/01
    ジャーナル フリー
    This paper presents a novel architecture design to optimize the reconfiguration process of a coarse-grained reconfigurable architecture (CGRA) called Reconfigurable Multimedia System II (REMUS-II). In REMUS-II, the tasks in multi-media applications are divided into two parts: computing-intensive tasks and control-intensive tasks. Two Reconfigurable Processor Units (RPUs) for accelerating computing-intensive tasks and a Micro-Processor Unit (µPU) for accelerating control-intensive tasks are contained in REMUS-II. As a large-scale CGRA, REMUS-II can provide satisfying solutions in terms of both efficiency and flexibility. This feature makes REMUS-II well-suited for video processing, where higher flexibility requirements are posed and a lot of computation tasks are involved. To meet the high requirement of the dynamic reconfiguration performance for multimedia applications, the reconfiguration architecture of REMUS-II should be well designed. To optimize the reconfiguration architecture of REMUS-II, a hierarchical configuration storage structure and a 3-stage reconfiguration processing structure are proposed. Furthermore, several optimization methods for configuration reusing are also introduced, to further improve the performance of reconfiguration process. The optimization methods include two aspects: the multi-target reconfiguration method and the configuration caching strategies. Experimental results showed that, with the reconfiguration architecture proposed, the performance of reconfiguration process will be improved by 4 times. Based on RTL simulation, REMUS-II can support the
    1080
    p
    @32fps of H.264 HiP@Level4 and
    1080
    p
    @40fps High-level MPEG-2 stream decoding at the clock frequency of 200MHz. The proposed REMUS-II system has been implemented on a TSMC 65nm process. The die size is 23.7mm2 and the estimated on-chip dynamic power is 620mW.
  • 村井 隆一
    映像情報メディア学会技術報告
    2008年 32.49 巻 IDY2008-108/AIT2008-
    発行日: 2008/11/13
    公開日: 2017/09/20
    会議録・要旨集 フリー
    Panasonic has developed the world's largest 150-inch 4k2k PDP prototype and developed 103-inch
    1080
    p
    PDP. Their goals are to create the greatest viewing experience offered by a consumer product for home usage and other applications for the public usage.
  • Jia SU, Yiqing HUANG, Lei SUN, Shinichi SAKAIDA, Takeshi IKENAGA
    IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
    2011年 E94.A 巻 10 号 2013-2021
    発行日: 2011/10/01
    公開日: 2011/10/01
    ジャーナル 認証あり
    With the increasing demand of high video quality and large image size, adaptive interpolation filter (AIF) addresses these issues and conquers the time varying effects resulting in increased coding efficiency, comparing with recent H.264 standard. However, currently most AIF algorithms are based on either frame level or macroblock (MB) level, which are not flexible enough for different video contents in a real codec system, and most of them are facing a severe time consuming problem. This paper proposes a content based coarse to fine AIF algorithm, which can adapt to video contents by adding different filters and conditions from coarse to fine. The overall algorithm has been mainly made up by 3 schemes: frequency analysis based frame level skip interpolation, motion vector modeling based region level interpolation, and edge detection based macroblock level interpolation. According to the experiments, AIF are discovered to be more effective in the high frequency frames, therefore, the condition to skip low frequency frames for generating AIF coefficients has been set. Moreover, by utilizing the motion vector information of previous frames the region level based interpolation has been designed, and Laplacian of Gaussian based macroblock level interpolation has been proposed to drive the interpolation process from coarse to fine. Six 720p and six
    1080
    p
    video sequences which cover most typical video types have been tested for evaluating the proposed algorithm. The experimental results show that the proposed algorithm reduce total encoding time about 41% for 720p and 25% for
    1080
    p
    sequences averagely, comparing with Key Technology Areas (KTA) Enhanced AIF algorithm, while obtains a BDPSNR gain up to 0.004 and 3.122 BDBR reduction.
  • Tian Song, Naoyuki Ishikura, Kenji Watanabe, Takashi Shimamoto
    Journal of Signal Processing
    2012年 16 巻 6 号 611-616
    発行日: 2012/11/30
    公開日: 2013/03/15
    ジャーナル フリー
    H.264/AVC has been successful in the past 10 years owing to its wide use in many fields from high-definition digital TV (HDTV) to low-resolution one-segment mobile TV. In recent years, with the high demand for high-resolution applications, 4K resolution (UHDTV) which has four times the resolution of
    1080
    p
    HDTV, has become highly required for video streaming and broadcasting. However, the computational complexity for 4K-resolution applications is still the bottleneck preventing the realization of real-time decoders. In this work, a motion vector management architecture that is designed for total motion vector management and calculation is proposed. The proposed architecture can calculate predicted motion vectors, select reference motion vectors for the direct mode, and calculate the motion vectors for interlace decoding. The proposed architecture can perform 176 cycles/MB decoding in the worst case. The simulation result shows that when working at 200 MHz, the proposed architecture can be implemented byabout 99k gates.
  • Tianruo ZHANG, Guifen TIAN, Takeshi IKENAGA, Satoshi GOTO
    IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
    2008年 E91.A 巻 12 号 3630-3637
    発行日: 2008/12/01
    公開日: 2008/12/25
    ジャーナル 認証あり
    Intra coding in H.264/AVC has significantly enhanced video compression efficiency. However, computation complexity increases by the rate-distortion (RD) based mode decision. This paper proposes a novel fast mode decision algorithm in H.264/AVC intra prediction and its VLSI architecture. A novel edge-detection pattern is proposed and both edge-detection technique and spatial mode prediction technique are combined together to reduce the number of intra 4×4 candidate modes from 9 to an average of 2.50. VLSI architecture of intra mode decision module is designed with TSMC 0.18µm CMOS technology. The maximum frequency of 285MHz is achieved and 13.1k NAND gates are required. High frequency, efficient processing cycle reduction and small area make this design to be an excellent accelerator for HDTV
    1080
    p
    @30fps real time encoder.
  • Koichi MITSUNARI, Jaehoon YU, Takao ONOYE, Masanori HASHIMOTO
    IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
    2018年 E101.A 巻 9 号 1298-1307
    発行日: 2018/09/01
    公開日: 2018/09/01
    ジャーナル 認証あり

    Visual object detection on embedded systems involves a multi-objective optimization problem in the presence of trade-offs between power consumption, processing performance, and detection accuracy. For a new Pareto solution with high processing performance and low power consumption, this paper proposes a hardware architecture for decision tree ensemble using multiple channels of features. For efficient detection, the proposed architecture utilizes the dimensionality of feature channels in addition to parallelism in image space and adopts task scheduling to attain random memory access without conflict. Evaluation results show that an FPGA implementation of the proposed architecture with an aggregated channel features pedestrian detector can process 229 million samples per second at 100MHz operation frequency while it requires a relatively small amount of resources. Consequently, the proposed architecture achieves 350fps processing performance for

    1080
    P
    Full HD images and outperforms conventional object detection hardware architectures developed for embedded systems.

  • Yuteng Zhou, Xinming Huang
    IEICE Electronics Express
    2018年 15 巻 12 号 20180396
    発行日: 2018年
    公開日: 2018/06/25
    [早期公開] 公開日: 2018/05/28
    ジャーナル フリー

    This paper presents the hardware architecture and VLSI implementations of a PCANet-based object detector. The proposed PCANet model, cascaded with a linear support vector machine, can achieve better classification performance than traditional handcrafted computer vision methods, yet it is significantly more power efficient than multi-layer convolutional neural networks. The proposed pipeline hardware architecture, when implemented using Synopsys 32 nm process technology, results in 27.4 fps while processing

    1080
    P
    , with only 0.5 watt power consumption. Targeted for the application of advanced driver assistance system, the proposed design is evaluated on road marking and traffic light dataset with an accuracy result of 96.8% and 93.1% respectively. Therefore, the proposed VLSI implementation of PCANet algorithm provides a high-throughput and power-efficient solution for object detection applications.

  • Tongsheng GENG, Leibo LIU, Shouyi YIN, Min ZHU, Shaojun WEI
    IEICE Transactions on Information and Systems
    2010年 E93.D 巻 12 号 3223-3231
    発行日: 2010/12/01
    公開日: 2010/12/01
    ジャーナル フリー
    This paper proposes approaches to perform HW/SW (Hardware/Software) partition and parallelization of computing-intensive tasks of the H.264 HiP (High Profile) decoding algorithm on an embedded coarse-grained reconfigurable multimedia system, called REMUS (REconfigurable MUltimedia System). Several techniques, such as MB (Macro-Block) based parallelization, unfixed sub-block operation etc., are utilized to speed up the decoding process, satisfying the requirements of real-time and high quality H.264 applications. Tests show that the execution performance of MC (Motion Compensation), deblocking, and IDCT-IQ(Inverse Discrete Cosine Transform-Inverse Quantization) on REMUS is improved by 60%, 73%, 88.5% in the typical case and 60%, 69%, 88.5% in the worst case, respectively compared with that on XPP PACT (a commercial reconfigurable processor). Compared with ASIC solutions, the performance of MC is improved by 70%, 74% in the typical and in the worst case, respectively, while those of Deblocking remain the same. As for IDCT_IQ, the performance is improved by 17% no matter in the typical or worst case. Relying on the proposed techniques,
    1080
    p
    @30fps of H.264 HiP@ Level 4 decoding could be achieved on REMUSwhen utilizing a 200MHz working frequency.
  • Xiaofeng Huang, Kaijin Wei, Guoqing Xiang, Huizhu Jia, Don Xie
    IEICE Electronics Express
    2017年 14 巻 12 号 20170501
    発行日: 2017年
    公開日: 2017/06/25
    [早期公開] 公開日: 2017/06/12
    ジャーナル フリー

    In video encoder chip, memory interface design is a must to transfer various data between the encoder pipeline and the off-chip memory. Reducing the required off-chip memory bandwidth and improving the memory access efficiency are the two key targets for optimized memory interface design. To achieve these two targets, three novel technologies are proposed in Level C+ coding order based AVS HD video encoder. Firstly, an improved Level C+ coding order with necessary NOP insertions are proposed to achieve 61% bandwidth reduction and make MB pipeline scheduling regular. Secondly, MB-level synchronous memory interface design is proposed by trading off external bandwidth, MB pipeline structure, and internal buffer size. Finally, address mapping and arbitration are proposed to improve the access efficiency by 12%. The optimized memory interface design is successfully implemented on our

    1080
    P
    @45fps AVS encoder with Xilinx Virtex-6 FPGA at an operating frequency of 200 MHz.

  • Takashi MIYAMORI, Hui XU, Hiroyuki USUI, Soichiro HOSODA, Toru SANO, Kazumasa YAMAMOTO, Takeshi KODAKA, Nobuhiro NONOGAKI, Nau OZAKI, Jun TANABE
    IEICE Transactions on Electronics
    2014年 E97.C 巻 4 号 360-368
    発行日: 2014/04/01
    公開日: 2014/04/01
    ジャーナル 認証あり
    New media processing applications such as image recognition and AR (Augment Reality) have become into practical on embedded systems for automotive, digital-consumer and mobile products. Many-core processors have been proposed to realize much higher performance than multi-core processors. We have developed a low-power many-core SoC for multimedia applications in 40nm CMOS technology. Within a 210mm2 die, two 32-core clusters are integrated with dynamically reconfigurable processors, hardware accelerators, 2-channel DDR3 I/Fs, and other peripherals. Processor cores in the cluster share a 2MB L2 cache connected through a tree-based Network-on-Chip (NoC). Its total peak performance exceeds 1.5TOPS (Tera Operations Per Second). The high scalability and low power consumption are accomplished by parallelized software for multimedia applications. In case of face detection, the performance scales up to 64 cores and the SoC consumes only 2.21W. Moreover, it can execute the
    1080
    p
    48fps H.264 decoding about 520mW by 28 cores and the 4K2K 15fps super resolution about 770mW by 32 cores in one cluster. Exploiting parallelism by low power processor cores, the many-core SoC provides several tens of times better energy efficiency than that of a high performance desk-top quad-core processor.
  • 橘 弘之, 小川 兼司, 中島 徹, 仲川 整, 藤谷 守男, 住田 圭介
    映像情報メディア学会誌
    2008年 62 巻 10 号 1593-1597
    発行日: 2008/10/01
    公開日: 2010/05/01
    ジャーナル フリー
  • Xinning LIU, Chen MEI, Peng CAO, Min ZHU, Longxing SHI
    IEICE Transactions on Information and Systems
    2012年 E95.D 巻 2 号 374-382
    発行日: 2012/02/01
    公開日: 2012/02/01
    ジャーナル フリー
    This paper proposes a novel sub-architecture to optimize the data flow of REMUS-II (REconfigurable MUltimedia System 2), a dynamically coarse grain reconfigurable architecture. REMUS-II consists of a µPU (Micro-Processor Unit) and two RPUs (Reconfigurable Processor Unit), which are used to speeds up control-intensive tasks and data-intensive tasks respectively. The parallel computing capability and flexibility of REMUS-II makes itself an excellent candidate to process multimedia applications, which require a large amount of memory accesses. In this paper, we specifically optimize the data flow to deal with those performance-hazard and energy-hungry memory accessing in order to meet the bandwidth requirement of parallel computing. The RPU internal memory could work in multiple modes, like 2D-access mode and transformation mode, according to different multimedia access patterns. This novel design can improve the performance up to 26% compared to traditional on-chip memory. Meanwhile, the block buffer is implemented to optimize the off-chip data flow through reducing off-chip memory accesses, which reducing up to 43% compared to direct DDR access. Based on RTL simulation, REMUS-II can achieve
    1080
    p
    @30fps of H.264 High Profile@ Level 4 and High Level MPEG2 at 200MHz clock frequency. The REMUS-II is implemented into 23.7mm2 silicon on TSMC 65nm logic process with a 400MHz maximum working frequency.
  • Huang-Chih Kuo, Youn-Long Lin
    IPSJ Transactions on System and LSI Design Methodology
    2013年 6 巻 76-93
    発行日: 2013年
    公開日: 2013/08/05
    ジャーナル フリー
    Intra-frame encoding is useful for many video applications such as security surveillance, digital cinema, and video conferencing because it supports random access to every video frame for easy editing and has low computational complexity that results in low hardware cost. H.264/AVC, which is the most popular video coding standard today, also defines novel intra-coding tools to achieve high compression performance at the expense of significantly increased computational complexity. We present a VLSI design for H.264/AVC intra-frame encoder. The paper summaries several novel approaches to alleviate the performance bottleneck caused by the long data dependency loop among 4 × 4 luma blocks, integrate a high-performance hardwired CABAC entropy encoder, and apply a clock-gating technique to reduce power consumption. Synthesized with a TSMC 130nm CMOS cell library, our design requires 194.1K gates at 108MHz and consumes 19.8mW to encode
    1080
    p
    (1920 × 1088) video sequences at 30 frames per second (fps). It also delivers the same video quality as the H.264/AVC reference software. We suggest a figure of merit called Design Efficiency for fair comparison of different works. Experimental results show that the proposed design is more efficient than prior arts.
  • Dabwitso KASAUKA, Kenta SUGIYAMA, Hiroshi TSUTSUI, Hiroyuki OKUHATA, Yoshikazu MIYANAGA
    IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
    2019年 E102.A 巻 6 号 775-782
    発行日: 2019/06/01
    公開日: 2019/06/01
    ジャーナル フリー

    In recent years, much research interest has developed in image enhancement and haze removal techniques. With increasing demand for real time enhancement and haze removal, the need for efficient architecture incorporating both haze removal and enhancement is necessary. In this paper, we propose an architecture supporting both real-time Retinex-based image enhancement and haze removal, using a single module. Efficiently leveraging the similarity between Retinex-based image enhancement and haze removal algorithms, we have successfully proposed an architecture supporting both using a single module. The implementation results reveal that just 1% logic circuits overhead is required to support Retinex-based image enhancement in single mode and haze removal based on Retinex model. This reduction in computation complexity by using a single module reduces the processing and memory implications especially in mobile consumer electronics, as opposed to implementing them individually using different modules. Furthermore, we utilize image enhancement for transmission map estimation instead of soft matting, thereby avoiding further computation complexity which would affect our goal of realizing high frame-rate real time processing. Our FPGA implementation, operating at an optimum frequency of 125MHz with 5.67M total block memory bit size, supports WUXGA (1,920×1,200) 60fps as well as

    1080
    p
    60 color input. Our proposed design is competitive with existing state-of-the-art designs. Our proposal is tailored to enhance consumer electronic such as on-board cameras, active surveillance intrusion detection systems, autonomous cars, mobile streaming systems and robotics with low processing and memory requirements.

  • Gugang GAO, Peng CAO, Jun YANG, Longxing SHI
    IEICE Transactions on Information and Systems
    2013年 E96.D 巻 8 号 1654-1666
    発行日: 2013/08/01
    公開日: 2013/08/01
    ジャーナル フリー
    One of the largest challenges for coarse-grained reconfigurable arrays (CGRAs) is how to efficiently map applications. The key issues for mapping are (1) how to reduce the memory bandwidth, (2) how to exploit parallelism in algorithms and (3) how to achieve load balancing and take full advantage of the hardware potential. In this paper, we propose a novel parallelism scheme, called ‘Hybrid partitioning’, for mapping a H.264 high definition (HD) decoder onto REMUS-II, a CGRA system-on-chip (SoC). Combining good features of data partitioning and task partitioning, our methodology mainly consists of three levels from top to bottom: (1) hybrid task pipeline based on slice and macroblock (MB) level; (2) MB row-level data parallelism; (3) sub-MB level parallelism method. Further, on the sub-MB level, we propose a few mapping strategies such as hybrid variable block size motion compensation (Hybrid VBSMC) for MC, 2D-wave for intra 4×4, parallel processing order for deblocking. With our mapping strategies, we improved the algorithm's performance on REMUS-II. For example, with a luma 16×16MB, the Hybrid VBSMC achieves 4 times greater performance than VBSMC and 2.2 times greater performance than fixed 4×4 partition approach. Finally, we achieve
    1080
    p
    @33fps H.264 high-profile (HiP)@level 4.1 decoding when the working frequency of REMUS-II is 200MHz. Compared with typical hardware platforms, we can achieve better performance, area, and flexibility. For example, our performance achieves approximately 175% improvement than that of a commercial CGRA processor XPP-III while only using 70% of its area.
feedback
Top