如何在我的数据框中看到一个长字符串?

数据挖掘 Python 数据挖掘 数据清理
2022-02-13 11:48:41

我的数据框中有一列,其中的句子太长。我想把它们看成一个整体,但每次我执行一个简单的 iloc 操作时,我都会得到像“我要去......”这样的输出。如何删除 ... 并查看整个句子?

2个回答

首先,这里是数据。

data = {
    'Capacity' : [
        "qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze",
        "qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze",
        "qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze",
        "qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze"
    ]
}
df = pd.DataFrame(data)
print(df['Capacity'])
# 0  qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvg...
# 1  qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvg...
# 2  qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvg...
# 3  qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvg...

您可以设置pd.options.display.max_colwidth增加要显示的字符数。

pd.options.display.max_colwidth = 20
print(df['Capacity'])
# 0    qsfjvanzicgbaefz...
# 1    qsfjvanzicgbaefz...
# 2    qsfjvanzicgbaefz...
# 3    qsfjvanzicgbaefz...

pd.options.display.max_colwidth = 100
print(df['Capacity'])
# 0    qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzb...
# 1    qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzb...
# 2    qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzb...
# 3    qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzb...

# This seeks the maximum length of the current column 'Capacity'.
pd.options.display.max_colwidth = int(df['Capacity'].str.len().max()) + 1
print(df['Capacity'])

或者您可以改用该.values方法,以便您观看“原始”数据。

#   Reset the option.
pd.options.display.max_colwidth = 50
print(df['Capacity'].values)
# ['qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze'
#  'qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze'
#  'qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze'
#  'qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze']

或者直接从您感兴趣的列中选择一行。

print(df['Capacity'][2])
# qsfjvanzicgbaefzrkrsdvjcfhnefzraezsufchnezsuvgcfnezefzrdfchnzefzreicfjnefzrzekcrghzebefzrkcfjfzbefzrfkjhnvezkqcfhaze

试试这样:

import pandas as pd

pd.set_option('display.max_rows', 500)
pd.set_option('display.max_columns', 500)
print(df)