使用(V1)python SDK API培训Watson Discovery不起作用

2024-05-14 10:34:02 发布

您现在位置:Python中文网/ 问答频道 /正文

我想使用Watson discovery V1API进行相关性培训。我尝试了以下方法,但仍然没有得到想要的结果。下面详细描述问题:

我有一组文档,其中一些包含单词'cloud''big data'。我想用query()api搜索单词'hadoop',然后返回那些文档,但是发现查询什么也不返回

现在,我想为discovery提供以下培训示例,以更新相关性得分,从而获得这些结果(我对相同的任务使用了查询扩展,并且它起了作用,现在我对相关性培训感兴趣)

我已经使用api add_training_data()将查询'hadoop'与相关文档关联(例如,由ID指定,包含'cloud'的文档)

现在,培训数据如下所示:

{
  "natural_language_query": "hadoop",
  "filter": "",
  "examples": [
    {
      "document_id": "1ad6f551-e092-4ce9-b08c-eb4f4cbc9458",
      "cross_reference": "",
      "relevance": 1,
      "created": "2020-01-30T23:16:19.674Z",
      "updated": "2020-01-30T23:16:19.716Z"
    },
    {
      "document_id": "f1d11f51-31b2-414f-b359-d5336b019575",
      "cross_reference": "",
      "relevance": 1,
      "created": "2020-01-30T23:16:19.674Z",
      "updated": "2020-01-30T23:16:19.722Z"
    },
    {
      "document_id": "5bfcea6a-c925-4db5-a490-89a9d1de8d4c",
      "cross_reference": "",
      "relevance": 1,
      "created": "2020-01-30T23:16:19.674Z",
      "updated": "2020-01-30T23:16:19.729Z"
    },
    {
      "document_id": "bf07e701-6893-428c-ab16-c5446e821291",
      "cross_reference": "",
      "relevance": 1,
      "created": "2020-01-30T23:16:19.674Z",
      "updated": "2020-01-30T23:16:19.735Z"
    },
    {
      "document_id": "75082812-5c96-4d2e-b388-821a0434ad4c",
      "cross_reference": "",
      "relevance": 1,
      "created": "2020-01-30T23:16:19.674Z",
      "updated": "2020-01-30T23:16:19.742Z"
    }
  ],
  "query_id": "cc1d3677eeafe70929aeccfb462860439f61b051",
  "created": "2020-01-30T23:16:19.677Z",
  "updated": "2020-01-30T23:16:19.677Z"
}

如果文档ID对应于集合中的文档,则包含单词'cloud'的文档ID。e、 g

创建了培训数据后,现在我想用查询文本'hadoop'再次运行前面的查询,并假设discovery将自动进行培训以获取相关结果(因为我找不到我所期望的任何类似'train()'的api)。但是,即使提供了培训示例,发现查询仍然不会返回任何结果

我不知道出了什么问题。我会非常感激你的帮助


Tags: 文档hadoopapiidcloudquery单词document
1条回答
网友
1楼 · 发布于 2024-05-14 10:34:02

桑迪潘

正如这里提到的:Improving result relevance with the API

When you provide a Discovery instance with training data, the service uses machine-learning Watson techniques to find signals in your content and questions. The service then reorders query results to display the most relevant results at the top. As you add more training data, the service instance becomes more accurate and sophisticated in the ordering of results it returns.

我相信相关性培训不会按你需要的方式进行。它只是重新排序

此外,“集合的训练数据集必须包含至少49个唯一的训练查询(即查询集和示例集)。”。您需要添加更多查询才能开始培训

相关问题 更多 >

    热门问题