m2 scorer

最新推荐文章于 2026-06-22 20:57:52 发布

原创最新推荐文章于 2026-06-22 20:57:52 发布 · 844 阅读

2 ·

本内容遵循CC 4.0 BY-SA版权协议

标签

#自然语言处理

NLP 专栏收录该内容

7 篇文章

订阅专栏

论文: Better Evaluation for Grammatical Error Correction（2012NAACL）

Github h ttps://github.com/nusnlp/m2scorer

m2格式：

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-k6tNR5uP-1599300978720)(GEC.assets/image-20200828203148949.png)]

S开头的行表示原始句子，A开头的行表示注释。

每个注释行均包含编辑的开始和结束标记偏移量、错误类型、标记化的更正字符串。

出于历史原因，包含了下两个字段，可以将其忽略（请参阅CoNLL-2013共享任务）.

最后一个字段即数字（0、1、2）是注释者ID

目的：提出了一种计算system edit的方法，寻找最贴近gold的edit，phrase-level edits
难点：

（1）the set of edits that transforms one string into another is not necessarily unique(编辑的不唯一性)

（2）edits can consist of longer phrases which introduce additional ambiguity.（编辑的长度不定性）

符号	含义
$S = \{s_1, . . . , s_n\}$	source sentences
$H = \{h_1, . . . , h_n\}$	hypotheses
$G = \{g_1, . . . , g_n\}$	gold standard annotations
$g_i = \{g_i^1, . . . , g_i^r\}$	set of edits
(a,b,C)	start and end offsets a and b； correction C【gold可能含多个，系统的只有1个】

步骤：

（1）construct an edit lattice from a source-hypothesis pair.【finding the optimal sequence of edits is equivalent to solving a shortest path search through the lattice】

（2）evaluate the edits using F1 measure.