回答此问题可获得 20 贡献值,回答如果被采纳可获得 50 分。
<p>非常新的和没有经验的程序员在这里!在</p>
<p>我正在建立一个垃圾项目,可以刮这个网站的公司名称和地点,并输出一个JSON文件。在</p>
<p><a href="https://www.f6s.com/programs?type[]=accelerator&page=93&page_alt=1&sort=open" rel="nofollow noreferrer">https://www.f6s.com/programs?type[]=accelerator&page=93&page_alt=1&sort=open</a></p>
<p>目前,我的铲运机正在拉公司名称,但也在拉日期。此外,JSON输出分为几个部分,首先是公司名称列表,然后是位置列表(包含我不需要的附加信息)。在</p>
<p>如何将公司名称/位置拉出来并格式化,以便可以将每个公司名称与特定位置关联起来?在</p>
<p>我认为我的问题是位置没有被定义为一个特定的类。在</p>
<p>另外,对于如何设置JSON输出格式的建议,我们将不胜感激!!在</p>
<hr/>
<p><strong>我的项目目录:</strong></p>
<pre><code>`myproject`/
scrapy.cfg
__init__.py
items.py
pipelines.py
settings.py
spiders/
__init__.py
byub.py
F6sSpider.py
</code></pre>
<hr/>
<p><strong>我的蜘蛛文件:</strong></p>
^{pr2}$
<hr/>
<p><strong>我的终端</strong></p>
^{3}$
<hr/>
<p><strong>我的JSON输出:</strong></p>
<pre><code>[
{"program": ["K - LAUNCHPAD 2018"]},
{"program": ["Z Nation Lab Real Estate Cohort"]},
{"program": ["C-mint-International"]},
{"program": ["StartOut Growth Lab - 2018 Fall Cohort"]},
{"program": ["IBA Application"]},
{"program": ["WATT Factory Accelerator Programme 2018"]},
{"program": ["AdvantEdge Founder's Adda"]},
{"program": ["SpinLab - The HHL Accelerator"]},
{"program": ["Shell LiveWIRE Accelerator"]},
{"program": ["Shell France Accelerator "]},
{"program": ["ELEVATE by TheVentury"]},
{"program": ["F6S R&D Money Back"]},
{"location": ["\n Jun 1-Jul 20 \u2022\n Berlin, Germany \n "]},
{"location": ["\n Mumbai, India \n "]},
{"location": ["\n Atlanta, United States \n "]},
{"location": ["\n Jul 8-Dec 31 \u2022\n San Francisco, United States \n "]},
{"location": ["\n Mar 19-May 16 \u2022\n Los Angeles, United States \n "]},
{"location": ["\n Jun 3-Nov 30 \u2022\n Gent, Belgium \n "]},
{"location": ["\n Delhi, India \n "]},
{"location": ["\n Leipzig, Germany \n "]},
{"location": ["\n Jun 20 '18-Jun 21 '19 \u2022\n Paris, France \n "]},
{"location": ["\n Jun 20 '18-Jun 1 '19 \u2022\n Paris, France \n "]},
{"location": ["\n Sep 5 '18-Feb 14 '19 \u2022\n Vienna, Austria \n "]},
{"location": ["\n London, United Kingdom \n "]}
]
</code></pre>
<p>谢谢!在</p>