詳細検索結果

Context-Sensitive Grammar Transform: Compression and Pattern Matching

Shirou MARUYAMA, Youhei TANAKA, Hiroshi SAKAMOTO, Masayuki TAKEDA

IEICE Transactions on Information and Systems
2010年 E93.D 巻 2 号 219-226
発行日: 2010/02/01
公開日: 2010/02/01

DOI https://doi.org/10.1587/transinf.E93.D.219

ジャーナルフリー

抄録を表示する抄録を非表示にする

A framework of context-sensitive grammar transform for speeding-up compressed pattern matching (CPM) is proposed. A greedy compression algorithm with the transform model is presented as well as a Knuth-Morris-Pratt (KMP)-type compressed pattern matching algorithm. The compression ratio is a match for gzip and Re-Pair, and the search speed of our CPM algorithm is almost twice faster than the KMP-type CPM algorithm on Byte-Pair-Encoding by Shibata et al.[18], and in the case of short patterns, faster than the Boyer-Moore-Horspool algorithm with the stopper encoding by Rautio et al.[14], which is regarded as one of the best combinations that allows a practically fast search.
抄録全体を表示

PDF形式でダウンロード (575K)
ページ記述言語としての PostScript

石田晴久

計測と制御
1989年 28 巻 3 号 208-212
発行日: 1989/03/10
公開日: 2009/11/26

DOI https://doi.org/10.11499/sicejl1962.28.208

ジャーナルフリー

PDF形式でダウンロード (696K)
株式会社正興電機製作所 ((九州地区中堅企業における情報化の進展 : 現状と今後の課題) (<特集II> 九州地区の傾向と問題点))

山川典宏, 林勝裕

オフィス・オートメーション
1997年 18 巻 2 号 57-61
発行日: 1997/09/10
公開日: 2019/01/15

DOI https://doi.org/10.20627/officeautomation.18.2_57

ジャーナルフリー

PDF形式でダウンロード (539K)
書評

理論と方法
1997年 12 巻 1 号 115-119
発行日: 1997/07/31
公開日: 2016/08/26

DOI https://doi.org/10.11218/ojjams.12.115

ジャーナルフリー

PDF形式でダウンロード (325K)
辞書配列利用による非モード方式のシフトJIS文書圧縮

伊藤雅, 佐藤泰司

電気学会論文誌Ｃ（電子・情報・システム部門誌）
2000年 120 巻 1 号 14-19
発行日: 2000年
公開日: 2008/12/19

DOI https://doi.org/10.1541/ieejeiss1987.120.1_14

ジャーナルフリー

抄録を表示する抄録を非表示にする

This paper proposes a new data compression method for a Japanese-text file, where the text is written in shift-JIS (JIS X 0208) codes. In the first pass, a dictionary array is built up by the higher frequency of both single and double byte characters. In the second pass, all the registered characters are replaced with the dictionary items: the code OxFF is put into a compressed file in front of non-registered ASCII character so as to distinguish non-registered characters from registered ones. It takes O (1) time on a hashing basis to confirm whether each input character belongs to the dictionary, and to transfer its code to a dictionary item. Furthermore, the run-length encoding is applied to a sequence of consecutive identical characters for the purpose of accomplishment of the much higher compression ratio. The code OxFE is a indicator to start this encoding. A feature of the method is to be a non-modal type of compression.
抄録全体を表示

PDF形式でダウンロード (1647K)
ノン・ラテン・タイポグラフィの史的展望 : アジア圏の多言語組版環境の現状と課題(<特集>タイポグラフィの史的研究)

劉賢国

デザイン学研究特集号
2012年 19 巻 3 号 2-11
発行日: 2012/06/30
公開日: 2017/11/27

DOI https://doi.org/10.11247/jssds.19.3_2

研究報告書・技術報告書フリー

PDF形式でダウンロード (19812K)

J-STAGEへの登録はこちら（無料）