Python How to calculate percentage of a text found in every row

Question

I have a CVS with one column and 4000 rows i want to make a script that can print each unique word and its percentage that is on that CSV

Example:

Trojan
Trojan
redirects
Exploits
Trojan

Trojan: 60% Redirects: 20% Exploits 20%

What is the easy/simple way to do this?

here is a image with the data i have

import csv
myDict = {}

with open('export.csv', 'rb') as csvfile:
    for word in csvfile:
        if word in myDict:
            myDict[word] += 1
        else:
            myDict[word] = 1

for word in myDict:
    print word, float(myDict[word])/len(csvfile)

Answer 1

You can use set to get all unique values and count to get the number of occurrences. Dividing by the length of the list with text yields the percentage:

text = ['a', 'a', 'b', 'c']
[(i, text.count(i) * 100. / len(text)) for i in set(text)]

resulting in:

[('a', 50.0), ('b', 25.0), ('c', 25.0)]

Answer 2

You can use dictionary as below:

import csv

myDict = {}
row_number = 0

with open('some.csv', 'rb') as f:
    reader = csv.reader(f, delimiter=' ')
    for row in reader:
        row_number +=1
        if row[0] in myDict:
            myDict[row[0]] += 1
        else:
            myDict[row[0]] = 1

for word in myDict:
    print word, float(myDict[word])/row_number

Works as below:

>>> ================================ RESTART ================================
>>> 
Trojan 0.6
Exploits 0.2
redirects 0.2
>>>

Python How to calculate percentage of a text found in every row

Question

2 answers

solution1
1 2016-04-20 07:05:50

solution2
0 ACCPTED 2016-04-20 07:21:41

Python How to calculate percentage of a text found in every row

Question

2 answers

solution1 1 2016-04-20 07:05:50

solution2 0 ACCPTED 2016-04-20 07:21:41

solution1
1 2016-04-20 07:05:50

solution2
0 ACCPTED 2016-04-20 07:21:41