Sliding a window over a substring in Python dataframe columns
I have a sequence 'DEFGHI JKLM NOPQRS' as a row in one column; it is also a substring of another column in the data frame.
The start position of J is 7 and of M is 10.
I need to create a new column that, for each letter from J through M, takes the 5 letters before it and the 5 letters after it, for a total length of 11:
EFGHI J KLMNO
FGHIJ K LMNOP
GHIJK L MNOPQ
HIJKL M NOPQR
I have tried this using R. I need suggestions for doing the same in Python, for practice.
Example:
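s = 'ABCDEFGHIJKLMNOPQRSTUVWXYZ'
i = s.index('J')   # position of the first centre letter
j = s.index('O')   # position of the last centre letter
context = 7        # number of letters to keep on each side
for mid in range(i, 1+j):
    # slice from `context` letters before mid to `context` letters after it
    print(s[mid-context:mid+context+1])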
Output:
CDEFGHIJKLMNOPQ
DEFGHIJKLMNOPQR
EFGHIJKLMNOPQRS
FGHIJKLMNOPQRST
GHIJKLMNOPQRSTU
HIJKLMNOPQRSTUV
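
The example above uses the full alphabet and a context of 7. As a rough sketch of how the same idea could be adapted to the sequence in the question, the function below drops the spaces first, uses 5 letters on each side, and clamps the left edge so a negative slice start does not wrap around to the end of the string. The name `flanking_windows` and its parameters are made up for illustration and are not part of the original answer.

```python
def flanking_windows(seq, first, last, flank=5):
    """Return one window per letter from `first` to `last` (inclusive).

    Each window is the `flank` letters before the centre letter, the centre
    letter itself, and the `flank` letters after it. Spaces in `seq` are
    ignored. (Hypothetical helper, written for this example only.)
    """
    letters = seq.replace(" ", "")   # count letters only, not spaces
    start = letters.index(first)     # raises ValueError if `first` is absent
    stop = letters.index(last)
    windows = []
    for mid in range(start, stop + 1):
        # max(0, ...) keeps the slice start from going negative near the left edge
        left = letters[max(0, mid - flank):mid]
        right = letters[mid + 1:mid + 1 + flank]
        windows.append(f"{left} {letters[mid]} {right}")
    return windows


for window in flanking_windows("DEFGHI JKLM NOPQRS", "J", "M"):
    print(window)
```

On a DataFrame, a function like this could presumably be applied row by row with `Series.apply` on the relevant column; which column to use and how to store the resulting list depend on your data.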