行動計量学

原著

記述式テストにおける自動採点システムの最新動向

石岡恒憲

2004 年31 巻2 号 p. 67-87
発行日: 2004年
公開日: 2005/11/22

DOIhttps://doi.org/10.2333/jbhmk.31.67

ジャーナルフリー

抄録を表示する抄録を非表示にする

With the aim of removing human errors and providing critical feedback and suggestions for improvement, considerable research has be done on computer-based automated essay-scoring systems. Examples of these include e-rater, PEG, IEA, IntelliMetric, and BETSY. This paper summarizes how these systems work in an attempt to comprehend their features. They are also compared. An automated Japanese essay-scoring system named Jess is introduced, including our analysis of its performance. Lastly, difficulties caused by its treatment of Japanese passages and related problems are discussed.

抄録全体を表示

PDF形式でダウンロード (1023K)
連続反応モデルの等化係数のEMサイクル内非反復推定

荘島宏二郎, 大津起夫

2004 年31 巻2 号 p. 89-106
発行日: 2004年
公開日: 2005/11/22

DOIhttps://doi.org/10.2333/jbhmk.31.89

ジャーナルフリー

抄録を表示する抄録を非表示にする

In this study, we proposed an estimation method for equating coefficients of the continuous response model based on the common examinees design. This method applies the EM algorithm according to the Shojima (2003) method, but does not require numerical approximation in E-step or numerical iteration in M-step. We also presented a general framework for when data were missing at random and confirmed the unbiasedness of the proposed estimator using simulation studies. Finally, we illustrated an example of numerical analysis on a real data.

抄録全体を表示

PDF形式でダウンロード (435K)
科学史と科学者

—— 林知己夫氏公開インタビュー ——

高橋正樹編

2004 年31 巻2 号 p. 107-124
発行日: 2004年
公開日: 2005/11/22

DOIhttps://doi.org/10.2333/jbhmk.31.107

ジャーナルフリー

PDF形式でダウンロード (605K)

研究ノート

国際比較調査データの安定性についての検証

—— 2003年度韓国と台湾における「健康と文化調査」および「東アジア価値観国際比較調査」データの比較 ——

山岡和枝, 李相侖

2004 年31 巻2 号 p. 125-135
発行日: 2004年
公開日: 2005/11/22

DOIhttps://doi.org/10.2333/jbhmk.31.125

ジャーナルフリー

抄録を表示する抄録を非表示にする

In this research note, we examined the degree of difference/concordance among the response rates and the differences between the response rates due to the effect of weighting on the response data acquired from two surveys, the Health and Culture Survey and the East Asian Value Survey. The differences between the response rates, as well as the scale values between the two surveys, were relatively small and the structures of the response patterns were relatively similar. These findings indicate that the reliability of the survey results is relatively high. The difference between the original and the weighted Korean survey proved to be relatively minor (maximum difference of 3.5%), and the effect of weighting on the response data proved negligible. It has been recognized that comparison of pattern structures in groups of multiple questions, rather than comparison of response rates for each question, is important when seeking cross-national comparability.

抄録全体を表示

PDF形式でダウンロード (387K)

J-STAGEへの登録はこちら（無料）