Please use this identifier to cite or link to this item:
|Title:||Intrinsic vs. extrinsic evaluation measures for referring expression generation|
|Keywords:||Performance -- Evaluation|
|Publisher:||Association for Computational Linguistics|
|Citation:||Belz, A., & Gatt, A. (2008). Intrinsic vs. extrinsic evaluation measures for referring expression generation. 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies, Columbus. 197-200.|
|Abstract:||In this paper we present research in which we apply (i) the kind of intrinsic evaluation metrics that are characteristic of current comparative HLT evaluation, and (ii) extrinsic, human task-performance evaluations more in keeping with NLG traditions, to 15 systems implementing a language generation task. We analyse the evaluation results and find that there are no significant correlations between intrinsic and extrinsic evaluation measures for this task.|
|Appears in Collections:||Scholarly Works - InsLin|
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.