Downloading a large file via FTP with Python

import os
from time import strftime
from ftplib import FTP
import smtplib
from email.mime.multipart import MIMEMultipart
from email.mime.base import MIMEBase
from email.mime.text import MIMEText
from email import encoders

# ftphost, ftp_user, ftp_pass, file_path, file_name, user_mail and mail()
# are defined elsewhere in the script.
day = strftime("%d")
today = strftime("%d-%m-%Y")

link = FTP(ftphost)
link.login(passwd = ftp_pass, user = ftp_user)
link.cwd(file_path)
link.retrbinary('RETR ' + file_name, open('/var/backups/backup-%s.tgz' % today, 'wb').write)
link.delete(file_name) # delete the file from the online server
link.close()
mail(user_mail, "Download database %s" % today, "Database successfully downloaded: %s" % file_name)
exit()

3 Answers

User

Answer 1 · edited 2024-06-01 00:15:10

You can try setting a timeout. From the docs:

# timeout in seconds
link = FTP(host=ftp_host, user=ftp_user, passwd=ftp_pass, acct='', timeout=3600)
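To show what the timeout changes in practice, here is a minimal sketch. When a blocking socket operation exceeds the timeout, ftplib raises socket.timeout (an alias of TimeoutError since Python 3.10), so the caller can tell a stalled transfer from other failures. The helper name try_download and its return values are made up for illustration, and ftp is assumed to be an already connected ftplib.FTP object.

```python
import socket
from ftplib import all_errors


def try_download(ftp, remote_name, local_path):
    """Download one file and report how the transfer ended.

    Returns 'ok', 'timeout' or 'error'. `ftp` is assumed to be a connected
    ftplib.FTP instance that was created with a timeout.
    """
    try:
        with open(local_path, 'wb') as fh:
            ftp.retrbinary('RETR ' + remote_name, fh.write)
    except all_errors as err:
        # ftplib.all_errors includes OSError, and socket.timeout is an
        # OSError subclass, so a timeout lands here too.
        return 'timeout' if isinstance(err, socket.timeout) else 'error'
    return 'ok'
```

Note that a timeout only makes the stall visible; the partially written local file is still there and the caller has to decide whether to resume or start over.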

User

Answer 2 · edited 2024-06-01 00:15:10

I implemented code with ftplib that monitors the connection, reconnects, and re-downloads the file on failure. Details here: How to download big file in python via ftp (with monitoring & reconnect)?

User

Answer 3 · edited 2024-06-01 00:15:10

Sorry for answering my own question, but I found a solution.

I tried many approaches without success; in the end, this one worked:

from ftplib import FTP
from time import strftime

def debug(txt):
    print(txt)

def ftp_connect(path):
    link = FTP(host = 'example.com', timeout = 5) # Keep the timeout low
    link.login(passwd = 'ftppass', user = 'ftpuser')
    debug("%s - Connected to FTP" % strftime("%d-%m-%Y %H.%M"))
    link.cwd(path)
    return link

# path (remote directory) and filename (remote file name) are defined elsewhere.
downloaded = open('/local/path/to/file.tgz', 'wb')

link = ftp_connect(path)
file_size = link.size(filename)

max_attempts = 5 # Avoid endless retry loops.

while file_size != downloaded.tell():
    try:
        debug("%s while > try, run retrbinary\n" % strftime("%d-%m-%Y %H.%M"))
        if downloaded.tell() != 0:
            # Resume at the current offset. It must be passed as rest=,
            # because the third positional argument of retrbinary is blocksize.
            link.retrbinary('RETR ' + filename, downloaded.write, rest=downloaded.tell())
        else:
            link.retrbinary('RETR ' + filename, downloaded.write)
    except Exception as myerror:
        if max_attempts != 0:
            debug("%s while > except, something going wrong: %s\n \tfile lenght is: %i > %i\n" %
                (strftime("%d-%m-%Y %H.%M"), myerror, file_size, downloaded.tell())
            )
            link = ftp_connect(path)
            max_attempts -= 1
        else:
            break
debug("Done with file, attempt to download m5dsum")
[...]

In my log file I found:

01-12-2011 23.30 - Connected to FTP
01-12-2011 23.30 while > try, run retrbinary
02-12-2011 00.31 while > except, something going wrong: timed out
    file lenght is: 1754695793 > 1754695793
02-12-2011 00.31 - Connected to FTP
Done with file, attempt to download m5dsum

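Unfortunately, I have to reconnect to the FTP server even when the file has already been downloaded completely. That is not a problem in my case, because I still have to download the md5sum anyway.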
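For the md5sum step, a minimal sketch of checking the downloaded file could look like the following. It assumes the md5sum file uses the usual md5sum(1) layout, with the hex digest as the first whitespace-separated field; the answer does not say what format it has. The function names are made up for illustration.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Hash the file in chunks so a multi-gigabyte file never sits in memory."""
    digest = hashlib.md5()
    with open(path, 'rb') as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def matches_md5sum(path, md5sum_text):
    """Compare a local file with the text of a downloaded md5sum file.

    Assumption: md5sum_text looks like '<hex digest>  <file name>'.
    """
    expected = md5sum_text.split()[0].lower()
    return md5_of_file(path) == expected
```

Comparing the digest catches a truncated or corrupted download even when the local size happens to match the size reported by the server.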

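As you can see, I did not manage to detect the timeout and retry on the same connection; when a timeout occurs, I simply reconnect. If anyone knows how to reconnect without creating a new ftplib.FTP instance, please let me know ;)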
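The reconnect-and-resume idea from this answer can also be wrapped in a function. This is only a sketch under a few assumptions: connect is a hypothetical zero-argument callable that returns a logged-in ftplib.FTP object already in the right directory (like ftp_connect with the path bound), the server supports the SIZE and REST commands, and every retry opens a new FTP instance as in the answer.

```python
from ftplib import all_errors


def download_with_resume(connect, remote_name, local_path, max_attempts=5):
    """Download remote_name, reconnecting and resuming after failures.

    Returns the number of bytes written; raises IOError if the file is
    still incomplete after max_attempts transfer attempts.
    """
    ftp = connect()
    ftp.voidcmd('TYPE I')  # many servers only answer SIZE in binary mode
    total = ftp.size(remote_name)
    if total is None:
        raise IOError('server did not report a size for %s' % remote_name)

    with open(local_path, 'wb') as fh:
        for _ in range(max_attempts):
            if fh.tell() >= total:
                break  # complete, even if the last attempt ended in a timeout
            try:
                if ftp is None:
                    ftp = connect()
                # rest=None starts at byte 0; an offset resumes from there.
                ftp.retrbinary('RETR ' + remote_name, fh.write,
                               rest=fh.tell() or None)
            except all_errors:
                ftp = None  # drop the broken connection, reconnect next round
        done = fh.tell()

    if done < total:
        raise IOError('got %d of %d bytes' % (done, total))
    return done
```

Because completeness is checked before each attempt, a timeout that fires after the last byte has arrived (as in the log above) ends the loop without another reconnect.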
