Python - 數據框列在“.”后重命名為大寫字母

Question

我有一些列遵循模式“abc.def”，我正在嘗試使用函數將其更改為“abcDef”。 我可以用df.rename(columns={'abc.def': 'abcDef'}, inplace = True)但尋找一種可以應用於不同數據框的更通用的方法。 我是為簡單的字符串做的，我不知道如何將它應用於列名。 我試圖將列名添加到列表中並將函數附加到列表中，但這也不起作用。

我的 df 是：

import pandas as pd
import re
            
            
data = {'end.date': ['01/10/2020 15:23', '01/10/2020 16:31', '01/10/2020 16:20', '01/10/2020 11:00'],
                  'start.date': ['01/10/2020 13:38', '01/10/2020 14:49', '01/10/2020 14:30','01/10/2020 14:30']
                  }
            
df = pd.DataFrame(data, columns = ['end.Date','start.date'])

# below is my go at the text.             
text = 'abs.d'
splitFilter = re.compile('([.!?]\s*)')
splitColumnName = splitFilter.split(text)
print(splitColumnName)
        
final = ''.join([i.capitalize() for i in splitColumnName])
final = final.replace('.', '')
print(final)

Answer 1

我想你想要那樣的東西？

import pandas as pd
import re
            
            
data = {'end.date': ['01/10/2020 15:23', '01/10/2020 16:31', '01/10/2020 16:20', '01/10/2020 11:00'],
                  'start.date': ['01/10/2020 13:38', '01/10/2020 14:49', '01/10/2020 14:30','01/10/2020 14:30']
                  }
            
df = pd.DataFrame(data, columns = ['end.Date','start.date'])

# below is my go at the text.   
def formatColumn(column) :
  splitFilter = re.compile('([.!?]\s*)')
  splitColumnName = splitFilter.split(column)
          
  final = ''.join([i.capitalize() for i in splitColumnName])
  final = final.replace('.', '')
  return final[0].lower() + final[1:] 

df.rename(columns=dict(zip(df.columns, [formatColumn(c) for c in df.columns])))

Answer 2

我使用了@Arne 和@LeMorse 的答案並編譯了我需要的內容。 再次感謝！

import pandas as pd
import re
            
            
data = {'end.date': ['01/10/2020 15:23', '01/10/2020 16:31', '01/10/2020 16:20', '01/10/2020 11:00'],
                  'start.date': ['01/10/2020 13:38', '01/10/2020 14:49', '01/10/2020 14:30','01/10/2020 14:30']
                  }
            
df = pd.DataFrame(data, columns = ['end.Date','start.date'])

# below is my go at the text.   
def formatColumn(column) :
  splitFilter = re.compile('([.!?]\s*)')
  splitColumnName = splitFilter.split(column)
          
  final = ''.join([i.capitalize() for i in splitColumnName])
  final = final.replace('.', '')
  return final[0].lower() + final[1:] 

df.columns = [formatColumn(col) for col in df.columns]

Answer 3

您可以將代碼將單個字符串轉換為函數，然后將此函數應用於每個列名，例如使用列表理解：

def camelCase(text):
    splitFilter = re.compile('([.!?]\s*)')
    splitColumnName = splitFilter.split(text)
    final = ''.join([i.capitalize() for i in splitColumnName])
    final = final.replace('.', '')
    return final

df.columns = [camelCase(col) for col in df.columns]

請注意，目前您的代碼也將第一個字母大寫。

Answer 4

def splitAndRenameColumns(df, splitSignal):
 # get all the columns in a list
 columnNameList = df.columns.values.tolist()
 # create a map to rename columns
 # mapping is old.columnname : newColumnname
 newColNames = {}
 # loop pver all column names
 for clm in columnNameList :
    # split the column names on "provided split signal i.e dot in this case"
    tempStore = clm.split(splitSignal)
    # store the first word before dot in temparory string  
    newString = tempStore[0]
    # loop over all other string values we got after splitting  
    for index in range(1,len(tempStore)):
        # capitalise first character to upper case and concatenate all the strings 
        newString += tempStore[index][0].upper()+tempStore[index][1:]
    # create the mapping 
    # i.e {'end.Date.gate': 'endDateGate', 'start.date.bate': 'startDateBate'}
    newColNames[clm] = newString
 return newColNames


df = df.rename(columns=splitAndRenameColumns(df, "."))
print(df)

它幾乎與其他答案相似，但它在拆分信號方面更通用，並用注釋清楚地解釋了該過程。 如果您還需要對代碼進行更多評論，請告訴我

Python - 數據框列在“.”后重命名為大寫字母

問題描述

4 個解決方案

解決方案1
1 2020-10-05 10:39:52

解決方案2
1 2020-10-05 11:13:20

解決方案3
0 2020-10-05 10:37:12

解決方案4
0 2020-10-05 12:58:46

Python - 數據框列在“.”后重命名為大寫字母

問題描述

4 個解決方案

解決方案1 1 2020-10-05 10:39:52

解決方案2 1 2020-10-05 11:13:20

解決方案3 0 2020-10-05 10:37:12

解決方案4 0 2020-10-05 12:58:46

解決方案1
1 2020-10-05 10:39:52

解決方案2
1 2020-10-05 11:13:20

解決方案3
0 2020-10-05 10:37:12

解決方案4
0 2020-10-05 12:58:46