計算字符串中某個單詞出現的次數

Question

我是 Java 字符串的新手，問題是我想計算字符串中特定單詞的出現次數。 假設我的字符串是：

i have a male cat. the color of male cat is Black

現在我也不想拆分它，所以我想搜索“公貓”這個詞。 它在我的字符串中出現兩次！

我正在嘗試的是：

int c = 0;
for (int j = 0; j < text.length(); j++) {
    if (text.contains("male cat")) {
        c += 1;
    }
}

System.out.println("counter=" + c);

它給了我 46 個計數器值！ 那么解決方案是什么？

Answer 1

您可以使用以下代碼：

String in = "i have a male cat. the color of male cat is Black";
int i = 0;
Pattern p = Pattern.compile("male cat");
Matcher m = p.matcher( in );
while (m.find()) {
    i++;
}
System.out.println(i); // Prints 2

演示

它能做什么？

它匹配"male cat" 。

while(m.find())

指示，在m找到匹配項時執行循環內給出的任何操作。 i通過i++增加i的值，很明顯，這給出了一個字符串所擁有的male cat的數量。

Answer 2

如果你只想要"male cat"的數量，那么我會這樣做：

String str = "i have a male cat. the color of male cat is Black";
int c = str.split("male cat").length - 1;
System.out.println(c);

如果您想確保"female cat"不匹配，則在拆分正則表達式中使用\\\\b單詞邊界：

int c = str.split("\\bmale cat\\b").length - 1;

Answer 3

apache commons-lang 中的StringUtils有CountMatches方法來計算一個字符串在另一個字符串中的出現次數。

   String input = "i have a male cat. the color of male cat is Black";
   int occurance = StringUtils.countMatches(input, "male cat");
   System.out.println(occurance);

Answer 4

這個static方法確實返回一個字符串在另一個字符串上出現的次數。

/**
 * Returns the number of appearances that a string have on another string.
 * 
 * @param source    a string to use as source of the match
 * @param sentence  a string that is a substring of source
 * @return the number of occurrences of sentence on source 
 */
public static int numberOfOccurrences(String source, String sentence) {
    int occurrences = 0;

    if (source.contains(sentence)) {
        int withSentenceLength    = source.length();
        int withoutSentenceLength = source.replace(sentence, "").length();
        occurrences = (withSentenceLength - withoutSentenceLength) / sentence.length();
    }

    return occurrences;
}

測試：

String source = "Hello World!";
numberOfOccurrences(source, "Hello World!");   // 1
numberOfOccurrences(source, "ello W");         // 1
numberOfOccurrences(source, "l");              // 3
numberOfOccurrences(source, "fun");            // 0
numberOfOccurrences(source, "Hello");          // 1

順便說一句，該方法可以寫在一行中，很糟糕，但它也有效:)

public static int numberOfOccurrences(String source, String sentence) {
    return (source.contains(sentence)) ? (source.length() - source.replace(sentence, "").length()) / sentence.length() : 0;
}

Answer 5

Java 8 版本：

    public static long countNumberOfOccurrencesOfWordInString(String msg, String target) {
    return Arrays.stream(msg.split("[ ,\\.]")).filter(s -> s.equals(target)).count();
}

Answer 6

Java 8 版本。

System.out.println(Pattern.compile("\\bmale cat")
            .splitAsStream("i have a male cat. the color of male cat is Black")
            .count()-1);

Answer 7

為什么不遞歸？

public class CatchTheMaleCat  {
    private static final String MALE_CAT = "male cat";
    static int count = 0;
    public static void main(String[] arg){
        wordCount("i have a male cat. the color of male cat is Black");
        System.out.println(count);
    }

    private static boolean wordCount(String str){
        if(str.contains(MALE_CAT)){
            count++;
            return wordCount(str.substring(str.indexOf(MALE_CAT)+MALE_CAT.length()));
        }
        else{
            return false;
        }
    }
}

Answer 8

使用 indexOf...

public static int count(String string, String substr) {
    int i;
    int last = 0;
    int count = 0;
    do {
        i = string.indexOf(substr, last);
        if (i != -1) count++;
        last = i+substr.length();
    } while(i != -1);
    return count;
}

public static void main (String[] args ){
    System.out.println(count("i have a male cat. the color of male cat is Black", "male cat"));
}

這將顯示：2

count() 的另一個實現，僅在 1 行中：

public static int count(String string, String substr) {
    return (string.length() - string.replaceAll(substr, "").length()) / substr.length() ;
}

Answer 9

公共類 TestWordCount {

public static void main(String[] args) {

    int count = numberOfOccurences("Alice", "Alice in wonderland. Alice & chinki are classmates. Chinki is better than Alice.occ");
    System.out.println("count : "+count);

}

public static int numberOfOccurences(String findWord, String sentence) {

    int length = sentence.length();
    int lengthWithoutFindWord = sentence.replace(findWord, "").length();
    return (length - lengthWithoutFindWord)/findWord.length();

}

}

Answer 10

這將工作

int word_count(String text,String key){
   int count=0;
   while(text.contains(key)){
      count++;
      text=text.substring(text.indexOf(key)+key.length());
   }
   return count;
}

Answer 11

將需要統計的String替換為空字符串，然后使用不帶字符串的長度計算出現次數。

public int occurrencesOf(String word)
    {
    int length = text.length();
    int lenghtofWord = word.length();
    int lengthWithoutWord = text.replace(word, "").length();
    return (length - lengthWithoutWord) / lenghtofWord ;
    }

Answer 12

一旦你找到這個詞，你需要將它從正在處理的 String 中刪除，這樣它就不會再次解析，使用indexOf()和substring() ，你不需要做包含檢查長度時間

Answer 13

該字符串在循環時始終包含該字符串。 你不想 ++ 因為它現在正在做的只是獲取字符串的長度，如果它包含“”公貓”

你需要 indexOf() / substring()

有點明白我在說什么？

Answer 14

如果您找到您正在搜索的字符串，您可以繼續查找該字符串的長度（如果您在 aaaa 中搜索 aa，您會考慮它 2 次）。

int c=0;
String found="male cat";
 for(int j=0; j<text.length();j++){
     if(text.contains(found)){
         c+=1;
         j+=found.length()-1;
     }
 }
 System.out.println("counter="+c);

Answer 15

這應該是一個更快的非正則表達式解決方案。
（注意 - 不是 Java 程序員）

 String str = "i have a male cat. the color of male cat is Black";
 int found  = 0;
 int oldndx = 0;
 int newndx = 0;

 while ( (newndx=str.indexOf("male cat", oldndx)) > -1 )
 {
     found++;
     oldndx = newndx+8;
 }

Answer 16

出現子串的方式有很多種，其中兩種主題是：-

public class Test1 {
public static void main(String args[]) {
    String st = "abcdsfgh yfhf hghj gjgjhbn hgkhmn abc hadslfahsd abcioh abc  a ";
    count(st, 0, "a".length());

}

public static void count(String trim, int i, int length) {
    if (trim.contains("a")) {
        trim = trim.substring(trim.indexOf("a") + length);
        count(trim, i + 1, length);
    } else {
        System.out.println(i);
    }
}

public static void countMethod2() {
    int index = 0, count = 0;
    String inputString = "mynameiskhanMYlaptopnameishclMYsirnameisjasaiwalmyfrontnameisvishal".toLowerCase();
    String subString = "my".toLowerCase();

    while (index != -1) {
        index = inputString.indexOf(subString, index);
        if (index != -1) {
            count++;
            index += subString.length();
        }
    }
    System.out.print(count);
}}

Answer 17

我們可以從多種方式計算子串的出現：-

public class Test1 {
public static void main(String args[]) {
    String st = "abcdsfgh yfhf hghj gjgjhbn hgkhmn abc hadslfahsd abcioh abc  a ";
    count(st, 0, "a".length());

}

public static void count(String trim, int i, int length) {
    if (trim.contains("a")) {
        trim = trim.substring(trim.indexOf("a") + length);
        count(trim, i + 1, length);
    } else {
        System.out.println(i);
    }
}

public static void countMethod2() {
    int index = 0, count = 0;
    String inputString = "mynameiskhanMYlaptopnameishclMYsirnameisjasaiwalmyfrontnameisvishal".toLowerCase();
    String subString = "my".toLowerCase();

    while (index != -1) {
        index = inputString.indexOf(subString, index);
        if (index != -1) {
            count++;
            index += subString.length();
        }
    }
    System.out.print(count);
}}

Answer 18

我這里有另一種方法：

String description = "hello india hello india hello hello india hello";
String textToBeCounted = "hello";

// Split description using "hello", which will return 
//string array of words other than hello
String[] words = description.split("hello");

// Get number of characters words other than "hello"
int lengthOfNonMatchingWords = 0;
for (String word : words) {
    lengthOfNonMatchingWords += word.length();
}

// Following code gets length of `description` - length of all non-matching
// words and divide it by length of word to be counted
System.out.println("Number of matching words are " + 
(description.length() - lengthOfNonMatchingWords) / textToBeCounted.length());

Answer 19

完整的例子在這里，

package com.test;

import java.util.HashMap;
import java.util.Iterator;
import java.util.Map;

public class WordsOccurances {

      public static void main(String[] args) {

            String sentence = "Java can run on many different operating "
                + "systems. This makes Java platform independent.";

            String[] words = sentence.split(" ");
            Map<String, Integer> wordsMap = new HashMap<String, Integer>();

            for (int i = 0; i<words.length; i++ ) {
                if (wordsMap.containsKey(words[i])) {
                    Integer value = wordsMap.get(words[i]);
                    wordsMap.put(words[i], value + 1);
                } else {
                    wordsMap.put(words[i], 1);
                }
            }

            /*Now iterate the HashMap to display the word with number 
           of time occurance            */

           Iterator it = wordsMap.entrySet().iterator();
           while (it.hasNext()) {
                Map.Entry<String, Integer> entryKeyValue = (Map.Entry<String, Integer>) it.next();
                System.out.println("Word : "+entryKeyValue.getKey()+", Occurance : "
                                +entryKeyValue.getValue()+" times");
           }
     }
}

Answer 20

公共類字數{

public static void main(String[] args) {
    // TODO Auto-generated method stub
    String scentence = "This is a treeis isis is is is";
    String word = "is";
    int wordCount = 0;
    for(int i =0;i<scentence.length();i++){
        if(word.charAt(0) == scentence.charAt(i)){
            if(i>0){
                if(scentence.charAt(i-1) == ' '){
                    if(i+word.length()<scentence.length()){
                        if(scentence.charAt(i+word.length()) != ' '){
                            continue;}
                        }
                    }
                else{
                    continue;
                }
            }
            int count = 1;
            for(int j=1 ; j<word.length();j++){
                i++;
                if(word.charAt(j) != scentence.charAt(i)){
                    break;
                }
                else{
                    count++;
                }
            }
            if(count == word.length()){
                wordCount++;
            }

        }
    }
    System.out.println("The word "+ word + " was repeated :" + wordCount);
}

}

Answer 21

簡單的解決方案在這里 -

下面的代碼使用 HashMap，因為它將維護鍵和值。 所以這里的鍵是單詞，值是計數（給定字符串中單詞的出現次數）。

public class WordOccurance 
{

 public static void main(String[] args) 
 {
    HashMap<String, Integer> hm = new HashMap<>();
    String str = "avinash pande avinash pande avinash";

    //split the word with white space       
    String words[] = str.split(" ");
    for (String word : words) 
    {   
        //If already added/present in hashmap then increment the count by 1
        if(hm.containsKey(word))    
        {           
            hm.put(word, hm.get(word)+1);
        }
        else //if not added earlier then add with count 1
        {
            hm.put(word, 1);
        }

    }
    //Iterate over the hashmap
    Set<Entry<String, Integer>> entry =  hm.entrySet();
    for (Entry<String, Integer> entry2 : entry) 
    {
        System.out.println(entry2.getKey() + "      "+entry2.getValue());
    }
}

}

Answer 22

public int occurrencesOf(String word) {
    int length = text.length();
    int lenghtofWord = word.length();
    int lengthWithoutWord = text.replaceAll(word, "").length();
    return (length - lengthWithoutWord) / lenghtofWord ;
}

Answer 23

對於 Scala，它只有 1 行

def numTimesOccurrenced(text:String, word:String) =text.split(word).size-1

計算字符串中某個單詞出現的次數

問題描述

23 個解決方案

解決方案1
42 已采納 2014-03-21 18:22:03

它能做什么？

解決方案2
14 2014-03-21 18:23:51

解決方案3
13 2015-07-13 13:00:12

解決方案4
4 2014-03-21 18:32:32

解決方案5
4 2015-04-16 15:01:45

解決方案6
4 2017-12-23 06:05:38

解決方案7
3 2015-04-17 06:43:38

解決方案8
2 2014-03-21 18:27:59

解決方案9
2 2017-04-15 19:45:38

解決方案10
1 2016-12-29 10:36:55

解決方案11
1 2017-08-11 02:53:11

解決方案12
0 2014-03-21 18:16:55

解決方案13
0 2014-03-21 18:21:17

解決方案14
0 2014-03-21 18:29:18

解決方案15
0 2014-03-21 18:50:35

解決方案16
0 2017-01-23 17:36:11

解決方案17
0 2017-01-23 17:43:57

解決方案18
0 2017-11-15 12:34:51

解決方案19
0 2017-12-11 17:25:22

解決方案20
0 2018-02-15 18:14:42

解決方案21
0 2018-04-19 14:12:17

解決方案22
0 2019-01-18 12:52:03

解決方案23
0 2020-09-28 04:20:03

計算字符串中某個單詞出現的次數

問題描述

23 個解決方案

解決方案1 42 已采納 2014-03-21 18:22:03

它能做什么？

解決方案2 14 2014-03-21 18:23:51

解決方案3 13 2015-07-13 13:00:12

解決方案4 4 2014-03-21 18:32:32

解決方案5 4 2015-04-16 15:01:45

解決方案6 4 2017-12-23 06:05:38

解決方案7 3 2015-04-17 06:43:38

解決方案8 2 2014-03-21 18:27:59

解決方案9 2 2017-04-15 19:45:38

解決方案10 1 2016-12-29 10:36:55

解決方案11 1 2017-08-11 02:53:11

解決方案12 0 2014-03-21 18:16:55

解決方案13 0 2014-03-21 18:21:17

解決方案14 0 2014-03-21 18:29:18

解決方案15 0 2014-03-21 18:50:35

解決方案16 0 2017-01-23 17:36:11

解決方案17 0 2017-01-23 17:43:57

解決方案18 0 2017-11-15 12:34:51

解決方案19 0 2017-12-11 17:25:22

解決方案20 0 2018-02-15 18:14:42

解決方案21 0 2018-04-19 14:12:17

解決方案22 0 2019-01-18 12:52:03

解決方案23 0 2020-09-28 04:20:03

解決方案1
42 已采納 2014-03-21 18:22:03

解決方案2
14 2014-03-21 18:23:51

解決方案3
13 2015-07-13 13:00:12

解決方案4
4 2014-03-21 18:32:32

解決方案5
4 2015-04-16 15:01:45

解決方案6
4 2017-12-23 06:05:38

解決方案7
3 2015-04-17 06:43:38

解決方案8
2 2014-03-21 18:27:59

解決方案9
2 2017-04-15 19:45:38

解決方案10
1 2016-12-29 10:36:55

解決方案11
1 2017-08-11 02:53:11

解決方案12
0 2014-03-21 18:16:55

解決方案13
0 2014-03-21 18:21:17

解決方案14
0 2014-03-21 18:29:18

解決方案15
0 2014-03-21 18:50:35

解決方案16
0 2017-01-23 17:36:11

解決方案17
0 2017-01-23 17:43:57

解決方案18
0 2017-11-15 12:34:51

解決方案19
0 2017-12-11 17:25:22

解決方案20
0 2018-02-15 18:14:42

解決方案21
0 2018-04-19 14:12:17

解決方案22
0 2019-01-18 12:52:03

解決方案23
0 2020-09-28 04:20:03