我正在尝试将JSON文件解析为表,并且遇到了一些嵌套列表的问题。
该文件是以下内容的重复列表:
[{
"uuid": "a2d89c9b-6e2e-4e3a-8d60-bf3ce2fe3fda",
"timestamp": "2023-11-23 00:26:31.851000 UTC",
"process_timestamp": "2023-11-23 00:26:32.326000 UTC",
"visitor_id": "oeu1700282566730r0.9025758502018271",
"session_id": "AUTO",
"account_id": "25408250069",
"experiments": {
"list": [{
"element": {
"campaign_id": "26314710187",
"experiment_id": "26322360336",
"variation_id": "26314800349",
"is_holdback": "false"
}
}]
},
"entity_id": "25754820685",
"attributes": {
"list": [{
"element": {
"id": null,
"name": "",
"type": "browserId",
"value": "gc"
}
}, {
"element": {
"id": null,
"name": "",
"type": "campaign",
"value": "blablabla"
}
}, {
"element": {
"id": null,
"name": "",
"type": "device",
"value": "desktop"
}
}, {
"element": {
"id": null,
"name": "",
"type": "device_type",
"value": "desktop_laptop"
}
}, {
"element": {
"id": null,
"name": "",
"type": "referrer",
"value": "https://bookings.perrito.com/21df6542"
}
}, {
"element": {
"id": null,
"name": "",
"type": "source_type",
"value": "campaign"
}
}, {
"element": {
"id": null,
"name": "",
"type": "currentTimestamp",
"value": "1700699073915"
}
}, {
"element": {
"id": null,
"name": "",
"type": "offset",
"value": "300"
}
}]
},
"user_ip": "72.38.10.0",
"user_agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/116.0.0.0 Safari/537.36",
"referer": "https://bookings.perrito.com/",
"event_type": "other",
"event_name": "transaction",
"revenue": "240939",
"value": null,
"quantity": null,
"tags": {
"key_value": [{
"key": "tour_id",
"value": "386"
}, {
"key": "booking_id",
"value": "123456"
}, {
"key": "payment_type",
"value": "creditcard"
}, {
"key": "revenue",
"value": "240939"
}, {
"key": "pax",
"value": "1"
}, {
"key": "tour_name",
"value": "Best Viaje ever"
}, {
"key": "extras",
"value": "245.00"
}]
},
"revision": "859",
"client_engine": "js",
"client_version": "0.188.1",
"element": {
"campaign_id": "26314710187",
"experiment_id": "26322360336",
"variation_id": "26314800349",
"is_holdback": "false"
}
}]
字符串
它的某些部分很容易提取:
import json
# load data using Python JSON module
with open(url,'r') as f:
data = json.loads(f.read())
data = pd.json_normalize(data)
型
然而,在JSON的属性和标记字典中有一个列表,我无法解析。
1条答案
按热度按时间nwnhqdif1#
我在a previous answer的笔记中写道:
1b1b1b1x 1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b1b
您可以将
merge
中的所有数据放在meta
列上。的最小值
由于每个嵌套列表都必须与其他嵌套列表连接,因此可能存在重复项。
型
这些行的乘积将是
56
,即out
Dataframe 的长度。