Authors
Srinivas Bangalore, Owen Rambow, Steve Whittaker
Publication date
2000/6
Conference
INLG’2000 Proceedings of the First International Conference on Natural Language Generation
Pages
1-8
Description
Certain generation applications may profit from the use of stochastic methods. In developing stochastic methods, it is crucial to be able to quickly assess the relative merits of different approaches or models. In this paper, we present several types of intrinsic (system-internal) metrics which we have used for baseline quantitative assessment. This quantitative assessment should then be extended into a fuller evaluation that examines qualitative aspects. To this end, we describe an experiment that tests correlation between the quantitative metrics and human qualitative judgment. The experiment confirms that intrinsic metrics cannot replace human evaluation, but some correlate significantly with human judgments of quality and understandability and can be used for evaluation during development.
Total citations
[Chart: citations per year, 2000 to 2024]
Scholar articles
S Bangalore, O Rambow, S Whittaker - INLG'2000 Proceedings of the First International …, 2000