Split a pandas string column by fixed width (similar to Excel's fixed-width Text to Columns feature)

Question

I have a dataframe of CCYPair values and their corresponding spot values, similar to the one below:

Current DataFrame:

import pandas as pd
d = {'CCYPair': ['EURUSD', 'USDJPY'], 'Spot': [1.2, 109]}
df = pd.DataFrame(data=d)

I am looking to split the CCYPair column into CCY1 and CCY2. This would be easy in Excel using Text-to-columns or the Left and Right functions. However, even after searching for a while, I am finding it quite tricky to achieve the same result in a pandas dataframe.

I could only find pandas.read_fwf, but that is for reading from a file. I already have a dataframe and am looking to split one of its columns based on fixed width.

I am sure I am missing something basic here - I just can't figure out what.

I have tried df['CCY1'] = df['CCYPair'][0:3], but that applies [0:3] to the column as a whole rather than to each entry within it, so I end up with the first three CCYPair values followed by NaNs.
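To make the behavior described above concrete, here is a minimal sketch. The five-row dataframe and its spot values are made up for illustration (the original example has only two rows, which is too few to show the NaNs).

```python
import pandas as pd

# Illustrative data only: five rows so that a [0:3] row slice leaves a remainder.
df = pd.DataFrame({
    'CCYPair': ['EURUSD', 'USDJPY', 'GBPUSD', 'AUDUSD', 'USDCAD'],
    'Spot': [1.2, 109, 1.39, 0.77, 1.25],
})

# Slicing the Series with [0:3] selects the first three ROWS,
# not the first three characters of each string.
first_rows = df['CCYPair'][0:3]

# Assignment aligns on the index, so rows 3 and 4 have no match and become NaN.
df['CCY1'] = first_rows
print(df)
```

The whole six-character strings land in the first three rows of CCY1, which matches the symptom in the question.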

Expected outcome:

d = {'CCY1': ['EUR', 'USD'], 'CCY2': ['USD', 'JPY'], 'Spot': [1.2, 109]}
df = pd.DataFrame(data=d)

Answer 1

You can try str.extract with a regular expression that captures the first three characters and then the remainder:

df[['CCY1','CCY2']] = df.CCYPair.str.extract('(.{3})(.*)')

Output:

  CCYPair   Spot CCY1 CCY2
0  EURUSD    1.2  EUR  USD
1  USDJPY  109.0  USD  JPY
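A possible variation on this answer, sketched under the assumption that every pair is exactly six characters long: str.extract also accepts named capture groups, in which case the group names become the column names of the result. The variable name parts is just for illustration.

```python
import pandas as pd

df = pd.DataFrame({'CCYPair': ['EURUSD', 'USDJPY'], 'Spot': [1.2, 109]})

# Named groups (?P<name>...) become the column names of the extracted frame.
# Assumes each pair is two 3-character codes; rows that do not match yield NaN.
parts = df['CCYPair'].str.extract(r'^(?P<CCY1>.{3})(?P<CCY2>.{3})$')

# join aligns on the index and appends CCY1 and CCY2 to the original columns.
df = df.join(parts)
print(df)
```

Anchoring the pattern with ^ and $ is a design choice: a malformed pair shows up as NaN instead of being split silently.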

Answer 2

You can also use the str.slice method:

df['CCY1'] = df['CCYPair'].str.slice(stop=3)
df['CCY2'] = df['CCYPair'].str.slice(start=3)

Output:

    CCYPair   Spot  CCY1  CCY2
0    EURUSD    1.2   EUR   USD
1    USDJPY  109.0   USD   JPY
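If there are more than two fixed-width fields, the same str.slice idea can be wrapped in a loop. The following is only a sketch: split_fixed_width is a hypothetical helper written for this example, not a pandas function, and its widths and names parameters are assumptions about how one might want to call it.

```python
import pandas as pd

def split_fixed_width(series, widths, names):
    """Hypothetical helper: cut each string in `series` into consecutive
    fixed-width pieces, one output column per (width, name) pair."""
    pieces = {}
    start = 0
    for width, name in zip(widths, names):
        # str.slice works element-wise, unlike [start:stop] on the Series itself.
        pieces[name] = series.str.slice(start, start + width)
        start += width
    return pd.DataFrame(pieces, index=series.index)

df = pd.DataFrame({'CCYPair': ['EURUSD', 'USDJPY'], 'Spot': [1.2, 109]})
parts = split_fixed_width(df['CCYPair'], widths=[3, 3], names=['CCY1', 'CCY2'])
df = df.join(parts)
print(df)
```

For the two-column case in the question, the indexing shorthand df['CCYPair'].str[:3] and df['CCYPair'].str[3:] is equivalent to the str.slice calls above.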

Answer 1
2 2021-03-15 17:58:41

Answer 2
0 2021-03-15 19:01:29
