我如何计算一行中单词的出现

Question

我是java的新手。 我想计算特定行中单词的出现次数。 到目前为止，我只能统计单词，却不知道如何统计出现次数。

有没有简单的方法可以做到这一点？

Scanner file = new Scanner(new FileInputStream("/../output.txt"));
int count = 0;
  while (file.hasNextLine()) {
    String s = file.nextLine();
    count++;    
      if(s.contains("#AVFC")){
       System.out.printf("There are %d words on this line ", s.split("\\s").length-1);
       System.out.println(count);   
      }

  }
file.close();

输出：

    There are 4 words on this line 1

    There are 8 words on this line 13

    There are 3 words on this line 16

Answer 1

我能想到的最简单的方法是使用String.split("\\\\s") ，它将基于空格进行拆分。

然后使用一个HashMap其中包含一个单词作为键，其值是使用该单词的次数。

   HashMap<String, Integer> mapOfWords = new HashMap<String, Integer>();

      while (file.hasNextLine()) {
        String s = file.nextLine(); 
        String[] words = s.split("\\s");
        int count;
        for (String word : words) {
           if (mapOfWords.get(word) == null) {
              mapOfWords.put(word, 1);
           }
           else {
              count = mapOfWord.get(word);
              mapOfWords.put(word, count + 1);
           }
        }
      }

您请求跳过包含某些单词的字符串的实现

   HashMap<String, Integer> mapOfWords = new HashMap<String, Integer>();

   while (file.hasNextLine()) {
        String s = file.nextLine(); 
        String[] words = s.split("\\s");
        int count;

        if (isStringWanted(s) == false) {
           continue;  
        } 

        for (String word : words) {
           if (mapOfWords.get(word) == null) {
              mapOfWords.put(word, 1);
           }
           else {
              count = mapOfWord.get(word);
              mapOfWords.put(word, count + 1);
           }
        }
      }

private boolean isStringWanted(String s) {
    String[] checkStrings = new String[] {"chelsea", "Liverpool", "#LFC"};

    for (String check : checkString) {
        if (s.contains(check)) {
           return false;
        }
    }
    return true;
}

Answer 2

尝试下面的代码，它可能会解决您的问题，此外，您可以在将其放入哈希图中之前调用String.toLowerCase（）

String line ="a a b b b b a q c c";
...
Map<String,Integer> map = new HashMap<String,Integer>();
Scanner scanner = new Scanner(line); 
while (scanner.hasNext()) {
    String s = scanner.next();
    Integer count = map.put(s,1); 
    if(count!=null) map.put(s,count + 1);
}
...
System.out.println(map);

结果：

{b=4, c=2, q=1, a=3}

Answer 3

检查番石榴的Multiset 。 他们的描述始于'The traditional Java idiom for eg counting how many times a word occurs in a document is something like:' 。 您会找到一些代码片段，而不使用MultiSet怎么做。

顺便说一句：如果您只想计算字符串中的单词数，为什么不只计算空格呢？ 您可以使用来自Apache Commons的StringUtils 。 这比创建拆分部分的数组要好得多。 也看看它们的实现。

int count = StringUtils.countMatches(string, " ");

Answer 4

最快的方法是将拆分后的数据存储在ArrayList中，然后在ArrayList上进行迭代并使用[Collections.frequency]（ http://www.tutorialspoint.com/java/util/collections_frequency.htm ）

Answer 5

在给定的String ，一个给定的出现String可以使用计数String#indexOf(String, int)和通过一个环路

String haystack = "This is a string";
String needle = "i";
int index = 0;

while (index != -1) {
    index = haystack.indexOf(needle, index + 1);

    if (index != -1) {
        System.out.println(String.format("Found %s in %s at index %s.", needle, haystack, index));
    }
}

我如何计算一行中单词的出现

问题描述

5 个解决方案

解决方案1
4 已采纳 2014-03-10 16:09:37

解决方案2
4 2014-03-10 16:15:53

解决方案3
0 2014-03-10 16:12:41

解决方案4
0 2014-03-10 16:15:12

解决方案5
-2 2014-03-10 16:11:10

我如何计算一行中单词的出现

问题描述

5 个解决方案

解决方案1 4 已采纳 2014-03-10 16:09:37

解决方案2 4 2014-03-10 16:15:53

解决方案3 0 2014-03-10 16:12:41

解决方案4 0 2014-03-10 16:15:12

解决方案5 -2 2014-03-10 16:11:10

解决方案1
4 已采纳 2014-03-10 16:09:37

解决方案2
4 2014-03-10 16:15:53

解决方案3
0 2014-03-10 16:12:41

解决方案4
0 2014-03-10 16:15:12

解决方案5
-2 2014-03-10 16:11:10