用Python列出在线目录中的所有文件？

import urllib2 url = "http://cdn.primarygames.com/taxi.swf" file_name = url.split('/')[-1] u = urllib2.urlopen(url) f = open(file_name, 'wb') meta = u.info() file_size = int(meta.getheaders("Content-Length")[0]) print "Downloading: %s Bytes: %s" % (file_name, file_size) file_size_dl = 0 block_sz = 8192 while True: buffer = u.read(block_sz) if not buffer: break file_size_dl += len(buffer) f.write(buffer) status = r"%10d [%3.2f%%]" % (file_size_dl, file_size_dl * 100. / file_size) status = status + chr(8)*(len(status)+1) print status, f.close()

1条回答

网友

1楼 · 发布于 2024-05-23 17:38:22

既然你想一次下载一大堆东西，那就从寻找一个网站索引或一个网页开始，它能清楚地列出你想下载的所有东西。该网站的移动版本通常比桌面更轻，更容易擦伤。

这个网站正是你要找的：All Games。

现在，做起来真的很简单。只是，提取所有的游戏页面链接。我使用BeautifulSoup和requests来执行此操作：

import requests
from bs4 import BeautifulSoup

games_url = 'http://www.primarygames.com/mobile/category/all/'

def get_all_games():
    soup = BeautifulSoup(requests.get(games_url).text)

    for a in soup.find('div', {'class': 'catlist'}).find_all('a'):
        yield 'http://www.primarygames.com' + a['href']

def download_game(url):
    # You have to do this stuff. I'm lazy and won't do it.

if __name__ == '__main__':
    for game in get_all_games():
        download_game(url)

剩下的就看你了。download_game()根据游戏的URL下载游戏，因此您必须找出<object>标记在DOM中的位置。

相关问题更多 >

编程相关推荐

热门问题

热门文章