我的数据来自 CSV,应该在 Tableau 中可视化。
但是,数据包含列category_list,该列由由竖线 ( |) 分隔的值组成。
由于 Tableau 无法处理属性内的数组,我使用 Python (Pandas) 加载 CSV 并操作数据:
import pandas as pd
companies = pd.read_csv("companies.csv")
我假设该category_list列需要分解并存储到另一个 CSV(包含permalink(唯一 ID)和category对)中。
像这样的东西:
permalink,category
/organization/-qounter,Application Platforms
/organization/-qounter,Real Time
/organization/-qounter,Social Network Media
/organization/-the-one-of-them-inc-,Apps
/organization/-the-one-of-them-inc-,Games
/organization/-the-one-of-them-inc-,Mobile
/organization/1-4-all,Entertainment
/organization/1-4-all,Games
/organization/1-4-all,Software
/organization/1-800-publicrelations-inc-,Internet
/organization/1-800-publicrelations-inc-,Marketing
/organization/1-800-publicrelations-inc-,Media
/organization/1-800-publicrelations-inc-,Public Relations
/organization/1-mainstream,Apps
/organization/1-mainstream,Cable
/organization/1-mainstream,Distribution
/organization/1-mainstream,Software
...
如何实现?
原始 CSV 的摘录:
permalink,category_list,...
/organization/-qounter,Application Platforms|Real Time|Social Network Media,...
/organization/-the-one-of-them-inc-,Apps|Games|Mobile,...
/organization/1-4-all,Entertainment|Games|Software,...
/organization/1-800-publicrelations-inc-,Internet|Marketing|Media|Public Relations,...
/organization/1-mainstream,Apps|Cable|Distribution|Software,...
...