Scrapy 导入 Django 模型错误

2 投票
2 回答
841 浏览
提问于 2025-04-18 18:56

我一直在尝试使用 Django 和 DjangoItem 来设置一个 Scrapy 爬虫。但是当我运行我的爬虫时,item.py 总是无法导入 Django 应用的模型。

我新建了一个 Django 网站,并按照 这个链接 的说明进行操作,但我还是遇到了导入模型的问题。我觉得我的设置和那个帖子里说的一样(只是我把名字叫做 testsite,而不是 mysite 等等)。不过如果有人觉得我贴代码会有帮助,我可以发出来。

运行 scrapy crawl example 时出现的错误信息:

Traceback (most recent call last):
  File "/usr/bin/scrapy", line 9, in <module>
    load_entry_point('Scrapy==0.24.4', 'console_scripts', 'scrapy')()
  File "/usr/lib/pymodules/python2.7/scrapy/cmdline.py", line 143, in execute
    _run_print_help(parser, _run_command, cmd, args, opts)
  File "/usr/lib/pymodules/python2.7/scrapy/cmdline.py", line 89, in _run_print_help
    func(*a, **kw)
  File "/usr/lib/pymodules/python2.7/scrapy/cmdline.py", line 150, in _run_command
    cmd.run(args, opts)
  File "/usr/lib/pymodules/python2.7/scrapy/commands/crawl.py", line 57, in run
    crawler = self.crawler_process.create_crawler()
  File "/usr/lib/pymodules/python2.7/scrapy/crawler.py", line 87, in create_crawler
    self.crawlers[name] = Crawler(self.settings)
  File "/usr/lib/pymodules/python2.7/scrapy/crawler.py", line 25, in __init__
    self.spiders = spman_cls.from_crawler(self)
  File "/usr/lib/pymodules/python2.7/scrapy/spidermanager.py", line 35, in from_crawler
    sm = cls.from_settings(crawler.settings)
  File "/usr/lib/pymodules/python2.7/scrapy/spidermanager.py", line 31, in from_settings
    return cls(settings.getlist('SPIDER_MODULES'))
  File "/usr/lib/pymodules/python2.7/scrapy/spidermanager.py", line 22, in __init__
    for module in walk_modules(name):
  File "/usr/lib/pymodules/python2.7/scrapy/utils/misc.py", line 68, in walk_modules
    submod = import_module(fullpath)
  File "/usr/lib/python2.7/importlib/__init__.py", line 37, in import_module
    __import__(name)
  File "/home/myUserName/test-env/testbot/testbot/spiders/example.py", line 3, in <module>
    from testbot.items import UsefulModelItem
  File "/home/myUserName/test-env/testbot/testbot/items.py", line 11, in <module>
    from testapp.models import UsefulModel
ImportError: No module named testapp.models

文件结构:

├── testbot
│   ├── scrapy.cfg
│   └── testbot
│       ├── __init__.py
│       ├── __init__.pyc
│       ├── items.py
│       ├── items.pyc
│       ├── pipelines.py
│       ├── settings.py
│       ├── settings.pyc
│       └── spiders
│           ├── example.py
│           ├── example.pyc
│           ├── __init__.py
│           └── __init__.pyc
└── testsite
    ├── manage.py
    ├── testapp
    │   ├── admin.py
    │   ├── __init__.py
    │   ├── __init__.pyc
    │   ├── models.py
    │   ├── models.pyc
    │   ├── tests.py
    │   └── views.py
    └── testsite
        ├── __init__.py
        ├── __init__.pyc
        ├── settings.py
        ├── settings.pyc
        ├── urls.py
        └── wsgi.py

2 个回答

0
import sys
import os
sys.path.append('/home/testsite')

os.environ['DJANGO_SETTINGS_MODULE'] = 'testsite.settings'

# 设置 Django 项目的完整路径。

2

我觉得你可能忘了把你的 Django 项目的路径写上了。你现在添加的是应用,而不是整个项目。

在 testbot/settings.py 文件中添加:

# Setting up django's project full path.
import sys
sys.path.insert(0, '/home/TristanTrim/PycharmProjects/testsite')

# Setting up django's settings module name.
# This module is located at /home/rolando/projects/myweb/myweb/settings.py.
import os
os.environ['DJANGO_SETTINGS_MODULE'] = 'testsite.settings'

撰写回答