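Dead link check utility using Python modules
Detailed Python project description for deadcheck
Description
<Work in Progress>
This is a tool that can be used to analyze a website and identify its dead links. It can come in handy during periodic website maintenance.
The content uploaded here is alpha quality only. I have uploaded it to get feedback and suggestions on how to make it more effective.
Any suggestions, comments, feedback, or requests are welcome. Please use the following link to submit them.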
https://www.assembla.com/spaces/deadcheck/tickets
Installation

    pip install lxml
    pip install deadcheck

or

    pip install lxml
    git clone git@github.com:harshanarayana/deadcheck.git
    cd deadcheck
    python setup.py install
Usage

    prompt>python run.py -url <baseURLToAnalyze>
        [-proxy <proxyURL>:<prompt>
         -username <userName for Protected Page>
         -password <Password to access Protected Page>
         -auth_base <Super URL for Authentication>
         -log <logFile>
         -exempt <ExceptionURL File>
         -depth <int, default = 1>
         -v <default = True>
         -out <OutputDir>]
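Using the DeadcheckAPI

    from deadcheck.deadcheck import DeadcheckAPI

    checker = DeadcheckAPI([proxyURL, userName, password, auth_base_url])
    # Check a single URL; the returned object describes that link.
    urlObj = checker.amIDead(urlToCheck)
    # info() returns the link information as a string.
    print(urlObj.info())
    # Child links found for the checked URL.
    children = urlObj.getChildren()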
Pending Implementation
- Report Generation. ( A custom Package / Using liches ) : DONE
- Handling the Processing of Links using Multiple Threads. ( To reduce processing time )
- Regex support for Exception links. DONE
- Additional Log / Debug Option Support. DONE
- GUI (Work In Progress)
- Custom API to facilitate the validation of Minor Links or Single Links. DONE
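The multi-threading item in the list above is still pending. One possible shape for it, sketched with the standard library's `concurrent.futures`, is shown here. `check_links_concurrently` and the `check_one` callable are hypothetical names, not part of deadcheck; `check_one` stands in for whatever performs the check of a single link.

```python
from concurrent.futures import ThreadPoolExecutor


def check_links_concurrently(urls, check_one, max_workers=8):
    """Run check_one(url) for every URL on a thread pool.

    Hypothetical helper, not part of deadcheck. Returns a dict that maps
    each URL to whatever check_one returned for it.
    """
    urls = list(urls)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so results line up with urls.
        results = list(pool.map(check_one, urls))
    return dict(zip(urls, results))


if __name__ == "__main__":
    # Stand-in check that does no network I/O: flags URLs containing "missing".
    def fake_check(url):
        return "dead" if "missing" in url else "alive"

    print(check_links_concurrently(
        ["http://example.com/", "http://example.com/missing"], fake_check))
```

Link checking is dominated by network waits, which is why threads are a plausible fit here even under Python's global interpreter lock.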
Change Log
v0.0.1:
- Initial Draft
- Analysis method and URLLinks class were combined together.
- Report Information Extraction was manual.
v0.0.2:
- Second Draft
- Code Re-structured. Independent classes created for URLLinks, handling Custom Error and other necessary information.
  - URLLinks
    - Custom Class for Storing URL information.
    - info() method for accessing the URLLink information in the form of a string to display / Report.
    - Additional parameters included for Reporting.
      - File Size
      - Download Time
      - Check Time
      - Last Modified.
      - Status Information ( With Error and / or other information as applicable )
  - ErrorCodes
    - Custom class created for Storing and retrieving HTTPError codes with suitable messages.
  - ErrorHandler
    - Custom Error handler class for Handling Custom errors. Additional items will be added to this during the course of development.
  - deadcheck
    - Main Package now contains a class Deadcheck which handles all the operations.
    - Use the Deadcheck class to perform any operations.
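To make the v0.0.2 structure more concrete, the following is a minimal sketch of a record that stores the reporting fields listed above and renders them through an `info()` method, with the status message looked up from the standard library's `http.HTTPStatus`. `LinkRecord`, its field names, `is_dead`, and `status_message` are assumptions made for illustration; they are not the actual URLLinks or ErrorCodes classes.

```python
from dataclasses import dataclass
from http import HTTPStatus
from typing import Optional


def status_message(code):
    """Return the standard reason phrase for an HTTP status code."""
    try:
        return HTTPStatus(code).phrase
    except ValueError:
        # Codes outside the standard registry get a generic message.
        return "Unknown Status"


@dataclass
class LinkRecord:
    """Hypothetical stand-in for the information kept per URL."""
    url: str
    status: int
    file_size: Optional[int] = None        # bytes
    download_time: Optional[float] = None  # seconds
    check_time: Optional[str] = None
    last_modified: Optional[str] = None

    def is_dead(self):
        # Treat client and server errors (4xx / 5xx) as dead links.
        return self.status >= 400

    def info(self):
        """Render the record as a single line for display or reporting."""
        return "{0} [{1} {2}] size={3} download_time={4} checked={5} modified={6}".format(
            self.url, self.status, status_message(self.status),
            self.file_size, self.download_time,
            self.check_time, self.last_modified)


if __name__ == "__main__":
    print(LinkRecord("http://example.com/missing", 404).info())
```

Keeping the message lookup separate from the record mirrors the split between URLLinks and ErrorCodes described in the change log.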
v0.0.3:
- Third Draft
- Report Generation Modules Included.
- Class documentation updated. (2013-12-17)
v0.0.3.1:
- Low-level URL extraction from ‘javascript’ type hrefs.
- Fix for UnicodeEncodeError.
- Minor Cleanup / change for the reports.
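As an illustration of what low-level extraction from 'javascript' type hrefs might involve, the sketch below pulls quoted URLs out of an href such as `javascript:window.open('http://example.com/page')`. The function name `extract_js_urls` and the regular expression are assumptions; the project's own extraction logic is not shown in this description and may work differently.

```python
import re

# Matches an absolute http(s) URL or a root-relative path inside quotes.
_QUOTED_URL = re.compile(r"""["']((?:https?://|/)[^"']+)["']""")


def extract_js_urls(href):
    """Return quoted URLs found in a 'javascript:' href, else an empty list."""
    if not href.strip().lower().startswith("javascript:"):
        return []
    return _QUOTED_URL.findall(href)


if __name__ == "__main__":
    print(extract_js_urls("javascript:window.open('http://example.com/page')"))
```

A pattern like this only catches URLs written as string literals; URLs assembled at runtime by the script would be missed.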
v0.0.3.2:
- Fix for AttributeError in __cleanupJavaScript Function ( Bug #3 )
- Minor Modification for RegExp used in Exceptions file.
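The format of the exceptions file is not described here. Assuming it holds one regular expression per line, with blank lines and `#` comments ignored, the filtering could look like the sketch below. `load_exempt_patterns` and `is_exempt` are hypothetical helpers, not deadcheck functions.

```python
import re


def load_exempt_patterns(lines):
    """Compile one regex per non-empty, non-comment line (assumed format)."""
    patterns = []
    for line in lines:
        text = line.strip()
        if text and not text.startswith("#"):
            patterns.append(re.compile(text))
    return patterns


def is_exempt(url, patterns):
    """True if any exemption pattern matches somewhere in the URL."""
    return any(p.search(url) for p in patterns)


if __name__ == "__main__":
    rules = load_exempt_patterns(["# skip logout links", r"/logout\b", r"\.pdf$"])
    for url in ["http://example.com/logout", "http://example.com/a.html"]:
        print(url, "exempt" if is_exempt(url, rules) else "check")
```

Because `load_exempt_patterns` accepts any iterable of lines, an open file object can be passed to it directly.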
v0.0.3.3:
- Addition of a new class ‘DeadcheckAPI’ that provides the user with a small-scale function that can be used to analyze a single URL from the CLI.
- The return value of this function call is a URLLinks object, so you can iterate through its child links and call amIDead manually on each to analyze any number of links.
- Custom Test Suite has been included to support the running of tests during Installation process.
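The manual iteration over child links described in the v0.0.3.3 notes can be sketched as a breadth-first walk. The sketch takes any object exposing `amIDead(url)` whose result has `getChildren()`; it assumes `getChildren()` returns an iterable of URL strings, which this description does not confirm. `walk_links` and the `max_links` cap are hypothetical, and `FakeChecker` and `FakeLink` exist only so the example runs without network access or deadcheck installed.

```python
from collections import deque


def walk_links(checker, start_url, max_links=50):
    """Check start_url, then its children, breadth-first, up to max_links.

    Returns a dict mapping each visited URL to the object that
    checker.amIDead(url) returned for it.
    """
    seen = {start_url}
    queue = deque([start_url])
    results = {}
    while queue and len(results) < max_links:
        url = queue.popleft()
        link = checker.amIDead(url)
        results[url] = link
        for child in link.getChildren():  # assumed: iterable of URL strings
            if child not in seen:
                seen.add(child)
                queue.append(child)
    return results


class FakeLink:
    """Stand-in for the returned link object; holds only child URLs."""
    def __init__(self, children):
        self._children = children

    def getChildren(self):
        return self._children


class FakeChecker:
    """Stand-in for DeadcheckAPI backed by an in-memory site map."""
    def __init__(self, site):
        self._site = site

    def amIDead(self, url):
        return FakeLink(self._site.get(url, []))


if __name__ == "__main__":
    site = {"/": ["/a", "/b"], "/a": ["/", "/c"]}
    print(list(walk_links(FakeChecker(site), "/")))
```

The `seen` set keeps pages that link back to each other from being checked twice, and the cap bounds the walk on large sites.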
v0.0.3.3.1:
- Minor changes made to run.py