LinuxQuestions.org
Visit the LQ Articles and Editorials section
Go Back   LinuxQuestions.org > Forums > Non-*NIX Forums > Programming
User Name
Password
Programming This forum is for all programming questions.
The question does not have to be directly related to Linux and any language is fair game.

Notices



Reply
 
Search this Thread
Old 11-30-2011, 10:27 AM   #1
dwhitney67
Senior Member
 
Registered: Jun 2006
Location: Maryland
Distribution: Kubuntu, Fedora, RHEL
Posts: 1,523

Rep: Reputation: 332Reputation: 332Reputation: 332Reputation: 332
Help needed with understanding (Java) Regular Expression


I'm maintaining a Java application that queries for email replies from an POP server. I have very limited knowledge interpreting regular expressions, and was wondering if someone could help me understand the following statement:
Code:
Pattern subjStart = Pattern.compile("^\\s*(?:[Rr][Ee]:\\s*" + SUBJECT_PREFIX + " )?([0-9A-Z]+)\\s*(?:.*)?");
The SUBJECT_PREFIX is a string that can be anything, including an empty string. Pattern is from the java.util.regex package.
 
Old 11-30-2011, 12:13 PM   #2
firstfire
Member
 
Registered: Mar 2006
Location: Ekaterinburg, Russia
Distribution: Debian, Ubuntu
Posts: 640

Rep: Reputation: 375Reputation: 375Reputation: 375Reputation: 375
Hi.

First, inspect these two links: 1, 2(scroll to "Regular Expressions, Literal Strings and Backslashes").

According to second link:
Quote:
In literal Java strings the backslash is an escape character. The literal string "\\" is a single backslash. In regular expressions, the backslash is also an escape character. The regular expression \\ matches a single backslash. This regular expression as a Java string, becomes "\\\\". That's right: 4 backslashes to match a single one.
So, "\\s" is the character class \s equal to [ \t\n\x0B\f\r] -- a whitespace character.

From the first link:
Quote:
Greedy quantifiers
X? X, once or not at all
.....
Special constructs (non-capturing)
(?:X) X, as a non-capturing group
Therefore (?:.*)? means optional (note second ?) block of zero or more arbitrary characters. Such non-capturing groups are used to improve performance.

The same with "(?:[Rr][Ee]:\\s*" + SUBJECT_PREFIX + " )?" -- optional block, for example "Re: <SUBJECT_PREFIX>" or "RE: <SUBJECT_PREFIX>" etc.

The only capturing group here is '([0-9A-Z]+)' -- one or more capital alphanumeric characters.

Hope, I am correct and this will help.

Last edited by firstfire; 11-30-2011 at 01:25 PM.
 
1 members found this post helpful.
Old 11-30-2011, 02:07 PM   #3
dwhitney67
Senior Member
 
Registered: Jun 2006
Location: Maryland
Distribution: Kubuntu, Fedora, RHEL
Posts: 1,523

Original Poster
Rep: Reputation: 332Reputation: 332Reputation: 332Reputation: 332
Quote:
Originally Posted by firstfire View Post
Hope, I am correct and this will help.
I will read over the information from the links you provided. Thanks a lot for dissecting the regex string I provided earlier. With the information you supplied, and that found within the Java API site, it shouldn't be too hard to grasp the layout of the regex.

Again, thanks for your help.
 
  


Reply

Tags
java, regex


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
[SOLVED] Help with regular expression xavitxus Programming 3 08-28-2011 05:44 PM
Regular Expression 0.o Programming 3 06-09-2009 03:28 AM
regular expression in java sajith Programming 4 12-10-2008 06:40 AM
Regular Expression harkonen Programming 6 07-12-2008 01:06 PM
sed regular expression help needed Dew Linux - Newbie 1 03-30-2005 03:59 PM


All times are GMT -5. The time now is 04:58 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration