Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question?
If it is not in the man pages or the how-to's this is the place! |
Notices |
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
Are you new to LinuxQuestions.org? Visit the following links:
Site Howto |
Site FAQ |
Sitemap |
Register Now
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
|
|
|
03-03-2017, 04:01 PM
|
#1
|
Senior Member
Registered: Feb 2011
Distribution: Ubuntu, Centos
Posts: 1,240
Rep:
|
print only changed line with sed after double substition
Hi,
As the title says, I'm trying to print only the lines that have been changed with sed after a double substitution.
This is the text file:
Code:
The grand old Duke of York
He had ten thousand men
He marched them up to the top of the hill
And he marched them down again
And when they were up they were up
And when they were down they were down
And when they were only half-way up
They were neither up nor down
Normally I know that I need to use the -n option with the p flag, but in this case I'm trying to print only the lines that were altered by both substitions. The two substitions are 's/down/DOWN/' and 's/up/UP/'.
So for instance, sed -e '/s/up/UP' -e 's/down/DOWN' file.txt will display the whole file, including the lines which haven't been altered.
sed -n -e 's/up/UP/p' -e 's/down/DOWN/p' file.txt won't work either, because lines containing both 'up' and 'down' are going to be displayed twice (once for each substition).
So how should I go about this problem?
|
|
|
03-03-2017, 04:15 PM
|
#2
|
LQ Veteran
Registered: Aug 2003
Location: Australia
Distribution: Lots ...
Posts: 21,238
|
Use regex to do the substitution for both in one stanza.
|
|
1 members found this post helpful.
|
03-04-2017, 01:43 AM
|
#3
|
Senior Member
Registered: Feb 2011
Distribution: Ubuntu, Centos
Posts: 1,240
Original Poster
Rep:
|
I can only think of the\U option, which turns what it matches into uppercase. I don't want the solution, but I'd like to know in principle how I could actually use regex for two different strings and apply the same action to both. I was thinking of something like 's/"up|down"/\U/', which doesn't work because it interprets them literally. And neither would \| work between "up" and "down"
Last edited by vincix; 03-04-2017 at 02:07 AM.
|
|
|
03-04-2017, 02:16 AM
|
#4
|
LQ Guru
Registered: Apr 2005
Distribution: Linux Mint, Devuan, OpenBSD
Posts: 7,506
|
Quote:
Originally Posted by vincix
So how should I go about this problem?
|
You probably need to clarify the problem a little more. You can do a lot with t, b, and : alone. The t will branch if s/up/&/; succeeds, though in effect it is just a check.
If you are only ever going to be using GNU sed then you can do it more concisely with T instead and skip the jumping.
|
|
1 members found this post helpful.
|
03-04-2017, 02:19 AM
|
#5
|
LQ Veteran
Registered: Aug 2003
Location: Australia
Distribution: Lots ...
Posts: 21,238
|
Perhaps you shouldn't be so keen to reject options. You might be pleasantly surprised.
Some tips if I might:
- look at "-r"
- check the (GNU) doco for the first few sentences describing the "s" command. Particularly re the matched portion of the pattern space.
(no need for branching in this case)
Last edited by syg00; 03-04-2017 at 02:20 AM.
Reason: last comment
|
|
1 members found this post helpful.
|
03-04-2017, 02:58 AM
|
#6
|
Senior Member
Registered: Feb 2011
Distribution: Ubuntu, Centos
Posts: 1,240
Original Poster
Rep:
|
I'm not rejecting options. After all -n and -e are options. It's just that you suggested using regex and still don't know exactly how I could solve the problem through regex alone. Yes, actually, I've already looked at -r, and sed does interpret | as or, but that doesn't seem to be the right solution. Anyway, I haven't heard of t, b, or :, so I'll have to read a little bit more.
|
|
|
03-04-2017, 03:00 AM
|
#7
|
Moderator
Registered: Oct 2008
Distribution: Slackware [64]-X.{0|1|2|37|-current} ::12<=X<=15, FreeBSD_12{.0|.1}
Posts: 6,295
|
Think like this...
Code:
$ sed -rn /ADDRESS/s/MATCH/REPLACE/gp
...using the hints provided by syg00. It works.
I had to use the ADDRESS to get only the desired line(s), then using the not so subtle 's' hint provided by syg00 and a previously mentioned operator provided the right result.
Not to give everything away, here is an obfuscated example with the text in ud.txt:
Code:
$ sed -rn '/.../s/up|down/.../gp' ud.txt
They were neither UP nor DOWN
Perhaps syg00 has seen a way to get it without the address...?
Last edited by astrogeek; 03-04-2017 at 03:17 AM.
Reason: Added example
|
|
|
03-04-2017, 03:12 AM
|
#8
|
Senior Member
Registered: Dec 2011
Location: Simplicity
Posts: 2,918
|
With the t and d commands
Code:
sed '
s/up/UP/
t s2
d
:s2
s/down/DOWN/
t
d
' file.txt
With awk
Code:
awk 'sub(/up/,"UP") && sub(/down/,"DOWN")' file.txt
If you want to replace multiple ups and downs per line then you need the g modifier in sed or gsub in awk.
Last edited by MadeInGermany; 03-04-2017 at 03:16 AM.
|
|
1 members found this post helpful.
|
03-04-2017, 04:19 AM
|
#9
|
LQ Veteran
Registered: Aug 2003
Location: Australia
Distribution: Lots ...
Posts: 21,238
|
Quote:
Originally Posted by vincix
I don't want the solution, but I'd like to know in principle how I could actually use regex
|
This is the approach that will encourage contributions. You are making the effort, I am happy to help. If you eventually feel lost, ask and I will supply my solution. It may not be correct, or sufficient, but hopefully we may all learn something by the exercise.
|
|
|
03-04-2017, 12:42 PM
|
#10
|
Senior Member
Registered: Feb 2011
Distribution: Ubuntu, Centos
Posts: 1,240
Original Poster
Rep:
|
This is what I came up with:
Code:
sed -nE '/up|down/s/up|down/\U/gp' duke.txt
The problem is that "U" is interpreted as literal "U", and it doesn't convert to uppercase letters. How do I make sed interpret it correctly?
By the way, I think the correct option was -E, not -r, in order to make sed interpret extended regex. I was referring to -E when I said that sed was eventually interpreting | as "or".
|
|
|
03-04-2017, 01:22 PM
|
#11
|
LQ Guru
Registered: Apr 2005
Distribution: Linux Mint, Devuan, OpenBSD
Posts: 7,506
|
You'll probably use an ampersand & instead of \U
Here's another alternative:
Code:
sed -e '/down/s/up/&/; t; d;' duke.txt
Though neither example do much with regex, more with sed programming.
The t is a conditional jump. When used without a destination it defaults to a jump to the end of the sed script.
Thus if the // pattern matches AND the s/// substitution succeeds, hop over the command to delete the line.
|
|
|
03-04-2017, 01:29 PM
|
#12
|
Senior Member
Registered: Feb 2011
Distribution: Ubuntu, Centos
Posts: 1,240
Original Poster
Rep:
|
I don't insist doing it with regex (only). syg00 had suggested it at the beginning of the thread and that's why I was curious. I'm fine with using sed options. The question is, why doesn't \U work? I've seen several examples on the internet.
P.S. Only now did I see that on mac it works only with -E (for extended regex), but on Centos it seems to be working with -r (only?).
Last edited by vincix; 03-04-2017 at 01:32 PM.
|
|
|
03-04-2017, 01:33 PM
|
#13
|
LQ Guru
Registered: Apr 2005
Distribution: Linux Mint, Devuan, OpenBSD
Posts: 7,506
|
In which context have you seen \U mentioned? I don't see it in the regex manual or in the manual for sed itself.
Code:
man 7 regex
man sed
Though \U does have a meaning in perl's pattern matching
|
|
|
03-04-2017, 01:34 PM
|
#14
|
Senior Member
Registered: Feb 2011
Distribution: Ubuntu, Centos
Posts: 1,240
Original Poster
Rep:
|
https://www.gnu.org/software/sed/man...s_0022-Command
And yes, I was working with sed on mac, and now I see it's behaving slightly differently on Centos 7 when using \U. It doesn't interpret it as a literal \U, but it still doesn't work. It simply deletes both matches ("up" and "down").
Last edited by vincix; 03-04-2017 at 01:37 PM.
|
|
|
03-04-2017, 01:44 PM
|
#15
|
LQ Guru
Registered: Apr 2005
Distribution: Linux Mint, Devuan, OpenBSD
Posts: 7,506
|
If you want portability you'll need to give up on \U in sed scripting and stay closer to POSIX.
Code:
sed 's/up/UP/g; t up; d; b; :up { s/down/DOWN/g; t; d; }' duke.txt
|
|
|
All times are GMT -5. The time now is 06:44 AM.
|
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.
|
Latest Threads
LQ News
|
|