回答此问题可获得 20 贡献值,回答如果被采纳可获得 50 分。
<p>我有一个JSON文件,它有多个对象,例如:</p>
<pre><code> {"reviewerID": "bc19970fff3383b2fe947cf9a3a5d7b13b6e57ef2cd53abc52bb2dfedf5fb1cd", "asin": "a6ed402934e3c1138111dce09256538afb04c566edf37c16b9ba099d23afb764", "overall": 2.0, "helpful": {"nHelpful": 1, "outOf": 1}, "reviewText": "This remote, for whatever reason, was chosen by Time Warner to replace their previous silver remote, the Time Warner Synergy V RC-U62CP-1.12S. The actual function of this CLIKR-5 is OK, but the ergonomic design sets back remotes by 20 years. The buttons are all the same, there's no separation of the number buttons, the volume and channel buttons are the same shape as the other buttons on the remote, and it all adds up to a crappy user experience. Why would TWC accept this as a replacement? I'm skipping this and paying double for a refurbished Synergy V.", "summary": "Ergonomic nightmare", "unixReviewTime": 1397433600}
{"reviewerID": "3689286c8658f54a2ff7aa68ce589c81f6cae4c4d9de76fa0f66d5c114f79837", "asin": "8939d791e9dd035aa58da024ace69b20d651cea4adf6159d984872b44f663301", "overall": 4.0, "helpful": {"nHelpful": 21, "outOf": 22}, "reviewText": "This is a great truck GPS. I've tried others and nothing seems to come close to the Rand McNally TND-700.Excellent screen size and resolution. The audio is loud enough to be heard over road noise and the purr of my Kenworth/Cat engine. I've used it for the last 8,000 miles or so and it has only glitched once. Just restarted it and it picked up on my route right where it should have.Clean up the minor issues and this unit rates a solid 5.Rand McNally 528881469 7-inch Intelliroute TND 700 Truck GPS", "summary": "Great Unit!", "unixReviewTime": 1280016000}
</code></pre>
<p>我正在尝试使用以下代码将其转换为熊猫数据帧:</p>
<pre><code>train_df = pd.DataFrame()
count = 0;
for l in open('train.json'):
try:
count +=1
if(count==20001):
break
obj1 = json.loads(l)
df1=pd.DataFrame(obj1, index=[0])
train_df = train_df.<a href="https://www.cnpython.com/list/append" class="inner-link">append</a>(df1, ignore_index=True)
except ValueError:
line = line.replace('\\','')
obj = json.loads(line)
df1=pd.DataFrame(obj, index=[0])
train_df = train_df.append(df1, ignore_index=True)
</code></pre>
<p>但是,它为嵌套值提供了“NaN”,即“helping”属性。我希望输出使嵌套属性的两个键都是单独的列。</p>
<p>编辑:</p>
<p>备注:我使用try/except是因为我在一些对象中有'\'字符,这会导致JSON解码错误。</p>
<p>有人能帮忙吗?我还有别的办法吗?</p>
<p>谢谢你。</p>