主催: The Japanese Society for Artificial Intelligence
会議名: 2012年度人工知能学会全国大会(第26回)
回次: 26
開催地: 山口県山口市 山口県教育会館等
開催日: 2012/06/12 - 2012/06/15
This paper explores the use of MathML Pallel Markup Corpora for mathematical expression understanding, the task of which is formulated as a translation from Presentation to Content MathML Markups in our approach. In contrast to existing researches that mainly relied on manually encoded transformation rules, we adopt a Statistical-Machine-Translation-based method to automatically extract translation rules from parallel markup corpora. Our study shows that the structural features embedded in the MathML tree can be effectively exploited in the sub-tree alignment and the translation rules extracted from the alignment give boost to the translation system. Experimental results on the Wolfram Function Site show that our approach achieves an improvement against a rule-based system.