如何使用mwapi库获取wikipedia页面？

1条回答

网友

1楼 · 发布于 2024-04-23 21:36:57

也许这段代码可以帮助您理解API：

import json  # Used only to pretty-print dictionaries.
import mwapi

USER_AGENT = 'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-GB; rv:1.9.0.6) Gecko/2009011913  Firefox'

session = mwapi.Session('https://en.wikipedia.org', user_agent=USER_AGENT)

query = session.get(action='query', titles='Earth Wind and Fire')
print('query returned:')
print(json.dumps(query, indent=4))

pages = query['query']['pages']
if pages:
    print('\npages:')
    for pageid in pages:
        data = session.get(action='parse', pageid=pageid, prop='text')
        print(json.dumps(data, indent=4))

输出：

query returned:
{
    "batchcomplete": "",
    "query": {
        "pages": {
            "313370": {
                "pageid": 313370,
                "ns": 0,
                "title": "Earth Wind and Fire"
            }
        }
    }
}

pages:
{
    "parse": {
        "title": "Earth Wind and Fire",
        "pageid": 313370,
        "text": {
            "*": "<div class=\"redirectMsg\"><p>Redirect to:</p><ul class=\"redirectText\"><li><a href=\"/wiki/Earth,_Wind_%26_Fire\" title=\"Earth, Wind &amp; Fire\">Earth, Wind &amp; Fire</a></li></ul></div><div class=\"mw-parser-output\">\n\n<!  \nNewPP limit report\nParsed by mw1279\nCached time: 20171121014700\nCache expiry: 1900800\nDynamic content: false\nCPU time usage: 0.000 seconds\nReal time usage: 0.001 seconds\nPreprocessor visited node count: 0/1000000\nPreprocessor generated node count: 0/1500000\nPost\u2010expand include size: 0/2097152 bytes\nTemplate argument size: 0/2097152 bytes\nHighest expansion depth: 0/40\nExpensive parser function count: 0/500\n >\n<! \nTransclusion expansion time report (%,ms,calls,template)\n100.00%    0.000      1 -total\n >\n</div>\n<!  Saved in parser cache with key enwiki:pcache:idhash:313370-0!canonical and timestamp 20171121014700 and revision id 16182229\n  >\n"
        }
    }
}

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何使用mwapi库获取wikipedia页面？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >