
Window sliding a substring in Python dataframe columns

I have a sequence 'DEFGHI JKLM NOPQRS' as a row in a column, which is also a substring of another column in the data frame.

The start position of J is 7 and that of M is 10.

I need to create a new column that, for each letter from J through M, takes that letter plus the 5 letters before it and the 5 letters after it, giving a total length of 11:

EFGHI J KLMNO
FGHIJ K LMNOP
GHIJK L MNOPQ
HIJKL M NOPQR

I have tried this using R. I would like suggestions for doing the same in Python, for practice.

Example (with a context of 7 letters on each side and centre letters J through O):

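s = 'ABCDEFGHIJKLMNOPQRSTUVWXYZ'
i = s.index('J')   # position of the first centre letter
j = s.index('O')   # position of the last centre letter
context = 7        # letters kept on each side of the centre
# One window per centre position from J through O (inclusive).
for mid in range(i, 1+j):
    print(s[mid-context:mid+context+1])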

Output:

CDEFGHIJKLMNOPQ
DEFGHIJKLMNOPQR
EFGHIJKLMNOPQRS
FGHIJKLMNOPQRST
GHIJKLMNOPQRSTU
HIJKLMNOPQRSTUV
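
The same idea can be adapted to the sequence from the question. This is only a minimal sketch: the function name sliding_windows and its parameters are made up for illustration, and it assumes the spaces in the sequence are just separators that should be dropped before counting letters. It also clamps the left edge, since a negative slice start would otherwise wrap around to the end of the string.

```python
def sliding_windows(seq, first, last, context=5):
    """Return one window per centre letter from `first` through `last`."""
    # Assumption: spaces only separate groups and are not part of the sequence.
    letters = seq.replace(" ", "")
    start = letters.index(first)
    stop = letters.index(last)
    windows = []
    for mid in range(start, stop + 1):
        # Clamp at 0 so a negative start index cannot wrap around.
        lo = max(0, mid - context)
        windows.append(letters[lo:mid + context + 1])
    return windows


for w in sliding_windows("DEFGHI JKLM NOPQRS", "J", "M"):
    print(w)
```

For a data frame, one option would be to apply such a function to the column with Series.apply and then call DataFrame.explode on the result to get one row per window. That part is untested here and depends on how the columns are laid out.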
