俄语多义词的标注语境
rl_wsd_labeled的Python项目详细描述
Contexts sampled from RutenTen and RNC.Sense definitions from Active Dictionary. 有些话有两个annotators。数量的上下文是100个字 and 500 for 7 words.
Annotators)>words:
-
阿纳斯塔西娅·洛普希娜)(47)
- (康斯坦丁·洛普申)
- (亚历山德拉乌达里佐夫)
- >阿纳斯塔西娅K.(( “李”安娜猫)
- (安娜塔塔塔连科)
- (鲍里斯约姆丁) 伊万·萨莫伊连科)(1)
Contexts are stored in ^{tt1}美元:
^{pr 1}美元A python interface is provided.Intall the package first:
^{pr 2}
然后在order to get labered contexts:
^{pr 3}Apart from senses,there are two special annotations:” “我不知道上下文是unclear/the contexts is invalid”,and“max sense+1” “mean”other sense,not listed among given senses.”contexts marked as“”0“or other” 没有返回,没有^{tt2}美元是通过。 如果有更多的人,然后一个annotator,上下文在哪里annotators did not agree are also not included.有一个函数^{tt3}美元,返回 ratio of senses where both annotators gave either the same concrete sense,or both skipped the senses)“…so”