簡體   English   中英

找到兩個特定單詞之間的字符串

[英]Find a string between two specific word

我有一個文本,我需要在兩個特定的單詞之間提取數據,例如在Activity:Sub-Activity:之間 這是我的文字:

Activity: S1. Outline Design
Sub-Activity: S3.3 Walk through Release Backlog
Question Tag: tag
Questioner (role or team): Solution Architect
Which response should the user read first?: Response 8
Responder 1 (role or team): Developer
Response 1: 
Responder 2 (role or team): Scrum Master
Response 2: response2
Responder 3 (role or team): Please select:
Response 3: 
Responder 4 (role or team): Please select:
Response 4: 
Responder 5 (role or team): Please select:
Response 5: 

我提出了這個代碼,但問題是這種模式習慣於說話,但是一旦我查了它就不再有用了,任何人都有任何想法:

private static String extractActivity(String text) {
    Pattern pattern = Pattern.compile("(?:\\W|\\w)*Activity:(?:\\W)*(.*)(?:\\W)*Sub-Activity:(?:\\W|\\w)*",
            Pattern.DOTALL);
    Matcher matcher = pattern.matcher(text);
    matcher.matches();
    String activities = matcher.group(1);
    return activities;
}

它向我顯示以下錯誤:

Feb 19, 2014 5:06:58 PM org.apache.catalina.core.StandardWrapperValve invoke
SEVERE: Servlet.service() for servlet [default-dispatcher] in context with path [/webmi] threw exception [Request processing failed; nested exception is java.lang.IllegalStateException: No match found] with root cause
java.lang.IllegalStateException: No match found
    at java.util.regex.Matcher.group(Matcher.java:485)
    at com.lloydsbanking.webmi.service.RSSReaderService.extractSubActivity(RSSReaderService.java:107)
    at com.lloydsbanking.webmi.service.RSSReaderService.read(RSSReaderService.java:61)
    at com.lloydsbanking.webmi.web.RssController.getFeed(RssController.java:34)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.springframework.web.method.support.InvocableHandlerMethod.invoke(InvocableHandlerMethod.java:219)
    at org.springframework.web.method.support.InvocableHandlerMethod.invokeForRequest(InvocableHandlerMethod.java:132)
    at org.springframework.web.servlet.mvc.method.annotation.ServletInvocableHandlerMethod.invokeAndHandle(ServletInvocableHandlerMethod.java:104)
    at org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.invokeHandleMethod(RequestMappingHandlerAdapter.java:745)
    at org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.handleInternal(RequestMappingHandlerAdapter.java:686)
    at org.springframework.web.servlet.mvc.method.AbstractHandlerMethodAdapter.handle(AbstractHandlerMethodAdapter.java:80)
    at org.springframework.web.servlet.DispatcherServlet.doDispatch(DispatcherServlet.java:925)
    at org.springframework.web.servlet.DispatcherServlet.doService(DispatcherServlet.java:856)
    at org.springframework.web.servlet.FrameworkServlet.processRequest(FrameworkServlet.java:920)
    at org.springframework.web.servlet.FrameworkServlet.doPost(FrameworkServlet.java:827)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:647)
    at org.springframework.web.servlet.FrameworkServlet.service(FrameworkServlet.java:801)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:728)
    at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:305)
    at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
    at org.tuckey.web.filters.urlrewrite.RuleChain.handleRewrite(RuleChain.java:176)
    at org.tuckey.web.filters.urlrewrite.RuleChain.doRules(RuleChain.java:145)
    at org.tuckey.web.filters.urlrewrite.UrlRewriter.processRequest(UrlRewriter.java:92)
    at org.tuckey.web.filters.urlrewrite.UrlRewriteFilter.doFilter(UrlRewriteFilter.java:394)
    at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:243)
    at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
    at com.lloydsbanking.webmi.web.VersionNumberFilter.doFilter(VersionNumberFilter.java:50)
    at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:243)
    at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
    at com.opensymphony.sitemesh.webapp.SiteMeshFilter.obtainContent(SiteMeshFilter.java:129)
    at com.opensymphony.sitemesh.webapp.SiteMeshFilter.doFilter(SiteMeshFilter.java:77)

表達有點過頭了。 而不是整個字符串的matches() ,可以為部件執行find() \\\\w匹配單詞字符, \\\\W表示非單詞字符。 因此\\\\W|\\\\w可能.

Pattern pattern = Pattern.compile("\\bActivity\\:(.*)\\bSub-Activity\\:",
        Pattern.DOTALL);
Matcher matcher = pattern.matcher(text);
if (matcher.find()) {
    String activities = matcher.group(1);
    return activities;
}
throw new IllegalStateException("No activity in: " + text);

我使用\\\\b作為單詞邊界,匹配/消耗0個字符,並檢測單詞邊界。 這也適用於文本開頭的Activity 我躲過了冒號( : )雖然我不能確定,但x:{2,3}將匹配xxxxx


正如@Pshemo評論的那樣,您的代碼原則上是正確的。 可能數據沒有按照參加者的方式到達。

更改

matcher.matches();

if (!matcher.matches)) {
    throw new IllegalStateException("No activity in: " + text);
}

試試這可能對你有幫助

 public static void main(String[] args){
    String str = "Activity: S1. Outline Design Sub-Activity: S3.3 Walk through " +
    "Release Backlog Question Tag: tag Questioner (role or team): Solution " +
    "Architect Which response should the user read first?: Response 8 Responder" +
    " 1 (role or team): Developer Response 1: Responder 2 (role or team): Scrum " +
    "Master Response 2: response2 Responder 3 (role or team):"+
    "Please select: Response 3: Responder 4 (role or team): Please select: Response 4:" +
    " Responder 5 (role or team): Please select: Response 5:";

    String regex = "(?<=Activity:).*?(?=Sub-Activity:)";

    Pattern pattern = Pattern.compile(regex);
    Matcher matcher = pattern.matcher(str);
    matcher.find();
    System.out.println(matcher.group());
}

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM