Number of words in a string that are not in an array of strings

Question

I want to create a method that returns the number of words in a string that do not occur in a given array of strings. I want to implement this logic using only the java.lang package.

public int count(String a, String[] b) {

}

E.g.

count("  hey   are  you there    ", new String[]{ "are", "i", "am"})

would return 3, because of the four words in the string only "are" appears in the array.

First off, I think I have to use the String.split function to convert the string into an array of strings. Any ideas?

Answer 1

You could simply do something like:

public int count(String a, String[] b) {
    int count = b.length;
    for(String s : b) if(a.contains(s)) count--;
    return count;
}

EDIT: I may have been confused; I thought you wanted the number of strings in b that are not in a (in your example that would still be 3). If you want to count the words of a instead, split seems inconvenient for your example unless you use a regex, so you could collect the words using a Scanner:

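import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Scanner;

public int count(String a, String[] b) {
    // Scanner splits on whitespace by default, so runs of spaces are skipped.
    List<String> words = new ArrayList<String>();
    Scanner scan = new Scanner(a);
    while(scan.hasNext()) words.add(scan.next());

    List<String> excluded = Arrays.asList(b);
    int count = words.size();
    for(String s : words) if(excluded.contains(s)) count--;
    return count;
}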
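
For the regex route mentioned above, here is a minimal sketch that stays within java.lang. The class name SplitCount is only a placeholder for illustration, and it assumes a "word" is any run of non-whitespace characters:

```java
class SplitCount {
    // Counts the whitespace-separated words of a that do not appear in b.
    static int count(String a, String[] b) {
        String trimmed = a.trim();
        if (trimmed.isEmpty()) {
            return 0; // splitting an empty string would yield one empty token
        }
        int count = 0;
        // "\\s+" treats any run of whitespace as a single separator.
        for (String word : trimmed.split("\\s+")) {
            boolean found = false;
            for (String candidate : b) {
                if (candidate.equals(word)) {
                    found = true;
                    break;
                }
            }
            if (!found) {
                count++;
            }
        }
        return count;
    }
}
```

Trimming first matters because split would otherwise produce an empty leading token when the string starts with whitespace.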

Answer 2

Your logic should go something like this:

1. Split a, right. Now you have a list of words. In real life, you should probably also clarify the requirement: what exactly is a "word"? A reasonable assumption is that it's a sequence of non-whitespace characters, but it could be something different (for example, a sequence of letters).
2. Iterate over the words of a and check whether each one is in b. If it isn't, increment your counter. But every check is a linear search in b, leading to a total complexity of O(nm), so...
3. Before iterating, convert b into a HashSet. This is a linear operation, but then your main loop also becomes a linear operation, so the total complexity is O(m + n).
4. If you have to do this repeatedly for different strings but the same word list, consider creating a WordCounter class so you only have to build the HashSet once, in the constructor.
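
The steps above could be sketched roughly as follows. This is an illustration, not a definitive implementation: the WordCounter layout and its method names are assumptions, a "word" is taken to be a run of non-whitespace characters, and HashSet lives in java.util, so it steps outside the java.lang-only constraint from the question.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

class WordCounter {
    // Built once, so every later lookup is an expected O(1) hash probe.
    private final Set<String> excluded;

    WordCounter(String[] b) {
        this.excluded = new HashSet<>(Arrays.asList(b));
    }

    // Counts the words of a that are not in the excluded set.
    int count(String a) {
        int count = 0;
        for (String word : a.trim().split("\\s+")) {
            // An empty or blank input yields a single empty token; skip it.
            if (!word.isEmpty() && !excluded.contains(word)) {
                count++;
            }
        }
        return count;
    }
}
```

A call such as new WordCounter(new String[]{"are", "i", "am"}).count("  hey   are  you there    ") counts "hey", "you" and "there", and the same instance can then be reused for other strings.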

Answer 3

Follow these steps to complete the task.

Use StringTokenizer to tokenize the String a.
Convert the String array b to a List, so that you can check whether it contains a given token.
Use a loop to get the next token from the StringTokenizer and check whether the List contains it.

---

Try the code below; it should work.

EDIT: Using the java.util package.

public int count(String a, String[] b) {
    // StringTokenizer splits on whitespace by default, so tokens need no trimming.
    java.util.StringTokenizer tokenizer = new java.util.StringTokenizer(a);
    java.util.List<String> bList = java.util.Arrays.asList(b);
    int tokens = tokenizer.countTokens();
    int counter = tokens;
    for(int i=0;i<tokens;i++) {
        String token = tokenizer.nextToken();
        // Every word that also occurs in b reduces the count.
        if(bList.contains(token)) {
            counter--;
        }
    }
    return counter;
}

With this, you get the count in a single for loop.

EDIT: Using the java.lang package only.

public int count(String a, String[] b) {
    String[] words = a.split(" ");
    int tokens = words.length;
    int wordCount = 0;
    int counter = 0;
    for(int i=0;i<tokens;i++) {
        String token = words[i].trim();
        if(token.length() <= 0) {
            continue;
        }
        wordCount++;
        for(String bItem : b) {
            if(bItem.equals(token)) {
                counter++;
                break;
            }
        }
    }
    return wordCount - counter;
}
