簡體 English 中英

根據重復次數選擇列

[英]Selecting columns based on how many times it repeats

原文 2021-11-02 14:10:00 1 2 python/ pandas

考慮我在 python pandas 中有一個列並且有 1000 個字符串值，我如何根據重復次數從中選擇前 10 個值

data['country_state'] = data['place'].str.rsplit(',').str[-1] #column

country_state有 1000 個值我必須根據相同字符串重復的次數從 1000 個中選擇前 10 個country_state

2 個解決方案

我認為 value_counts ( https://pandas.pydata.org/docs/reference/api/pandas.Series.value_counts.html ) 和 nlargest ( https://pandas.pydata.org/pandas-docs/stable/ reference/api/pandas.Series.nlargest.html ) 應該在這里工作：

data['country_state'].value_counts().nlargest(10)

嗨，您可以使用一些 Pandas 函數來解決這個問題，首先value_counts將通過重復對您的數據進行排序並對其進行計數，然后您可以拆分前 10 個並獲取它們的索引。 這里有一個例子：

import numpy as np
import pandas as pd

#create the dataframe I used numbers for simplicity it's the same for other var
n = np.random.randint(0,50,1000)
df_n = pd.DataFrame(n,columns= ['num'])

#get values by frequency 
nreps = df_n['num'].value_counts()

#get the top ten and print it's index
top10_values = nreps.iloc[:10].index
top10_counts    = nreps.iloc[:10].values

獲取列表中重復次數最多的元素重復的次數

[英]Getting how many times the most repetitive element repeats in a list

調試遞歸 function 以查看它重復某個計算的次數

[英]Debugging recursive function to se how many times it repeats a certain calculation

檢查元組的鍵在dict中重復多少次？

[英]Check how many times a key of a tuple repeats itself in a dict?

如何計算 python 上的 CSV 文件中某個值重復的次數？

[英]How do I count how many times a value repeats in a CSV file on python?

如何從文本文件中讀取值並計算值重復多少次，然后求平均值？

[英]How can I read in values from a text file and calculate how many times a value repeats and then find the average?

將某個值在一個列表中的多個列表中的重復列表中存儲多少次

[英]Store how many times a certain value repeats in multiple lists inside of a list to a dict

需要計算“AGAT”、“AATG”和“TATC”在具有 DNA 序列的 .txt 文件中重復了多少次

[英]Need to count how many times “AGAT” “AATG” and “TATC” repeats in .txt file that has a DNA sequence

Django-選擇相關集：它會打數據庫多少次？

[英]Django - Selecting related set : how many times does it hit the database?

熊貓不根據條件選擇列

[英]Pandas not selecting columns based on condition

pandas 根據日期在 dataframe 中出現的次數增加行

[英]pandas increment row based on how many times a date is in a dataframe

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 獲取列表中重復次數最多的元素重復的次數調試遞歸 function 以查看它重復某個計算的次數檢查元組的鍵在dict中重復多少次？如何計算 python 上的 CSV 文件中某個值重復的次數？如何從文本文件中讀取值並計算值重復多少次，然后求平均值？將某個值在一個列表中的多個列表中的重復列表中存儲多少次需要計算“AGAT”、“AATG”和“TATC”在具有 DNA 序列的 .txt 文件中重復了多少次 Django-選擇相關集：它會打數據庫多少次？熊貓不根據條件選擇列 pandas 根據日期在 dataframe 中出現的次數增加行

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM