LinuxQuestions.org
Did you know LQ has a Linux Hardware Compatibility List?
Go Back   LinuxQuestions.org > Forums > Non-*NIX Forums > Programming
User Name
Password
Programming This forum is for all programming questions.
The question does not have to be directly related to Linux and any language is fair game.

Notices

Reply
 
Search this Thread
Old 03-12-2008, 03:42 PM   #1
exscape
Member
 
Registered: Aug 2007
Location: Sweden
Distribution: OS X, Gentoo, FreeBSD
Posts: 82

Rep: Reputation: 15
Perl: running a user-specified s/// regex?


The idea is simple, but I'm not sure the implementation is.
Anyway, as part of my program, I want the user to be able to specify a regex, like:

Enter a string: ... blah blah ...
Enter a regex: s/blah/yawn/g

I've come that far, in fact. I can also use regex stuff on the left side, i.e. s/\b\w{5}\b/yawn/g.
However, I can't get backreferences to work. What'd be the ultimate thing is if the user could enter something like
s/blah (\w+) blah/$1/g

Is this possible? I've tried a bit with qr// and eval, but no luck. I'm pretty new to Perl, btw. So far, all it does is print the literal '$1' instead of the value I want.
 
Old 03-12-2008, 04:01 PM   #2
osor
HCL Maintainer
 
Registered: Jan 2006
Distribution: (H)LFS, Gentoo
Posts: 2,450

Rep: Reputation: 70
In general, this may be a security risk. To bypass security checking, put this at the beginning of your source file:
Code:
use re 'eval';
 
Old 03-12-2008, 04:24 PM   #3
exscape
Member
 
Registered: Aug 2007
Location: Sweden
Distribution: OS X, Gentoo, FreeBSD
Posts: 82

Original Poster
Rep: Reputation: 15
Quote:
Originally Posted by osor View Post
In general, this may be a security risk. To bypass security checking, put this at the beginning of your source file:
Code:
use re 'eval';
Hmm, thanks, but unless I'm missing something (I hope so), it doesn't help.

Here's the (testing) code, and a sample run:
Code:
use strict;
use warnings;
use re 'eval';

print "Enter a regex: ";
chomp(my $regex = <STDIN>);
print "Enter a replacement: ";
chomp(my $repl = <STDIN>);
print "Regex: $regex\n";
print "Repl : $repl\n";

my $str = "A bunch of words to search and replace on";

print "Old: $str\n";

$str =~ s/$regex/$repl/g;

print "New: $str\n";
Quote:
Enter a regex: \b(\w{5})\b
Enter a replacement: >>$1<<
Regex: \b(\w{5})\b
Repl : >>$1<<
Old: A bunch of words to search and replace on
New: A >>$1<< of >>$1<< to search and replace on
It seems I had to split the regex for it to work. (To minimize bugs, I removed my split code and made it two lines for the test.)
 
Old 03-12-2008, 05:53 PM   #4
osor
HCL Maintainer
 
Registered: Jan 2006
Distribution: (H)LFS, Gentoo
Posts: 2,450

Rep: Reputation: 70
Quote:
Originally Posted by exscape View Post
Hmm, thanks, but unless I'm missing something (I hope so), it doesn't help.
You’re right it won’t help. I misunderstood your question.

Does this work for you (btw, it still might be insecure in certain situations)?
Code:
#!/usr/bin/perl

use strict;
use warnings;

print "Enter a regex: ";
chomp(my $regex = <STDIN>);
print "Enter a replacement: ";
chomp(my $repl = <STDIN>);
print "Regex: $regex\n";
print "Repl : $repl\n";

my $str = "A bunch of words to search and replace on";

print "Old: $str\n";

eval "\$str =~ s/$regex/$repl/g;";

print "New: $str\n";
 
Old 03-12-2008, 06:12 PM   #5
osor
HCL Maintainer
 
Registered: Jan 2006
Distribution: (H)LFS, Gentoo
Posts: 2,450

Rep: Reputation: 70
Btw, this will also work (notice the peculiar forced quoting of $repl):
Code:
#!/usr/bin/perl

use strict;
use warnings;

print "Enter a regex: ";
chomp(my $regex = <STDIN>);
print "Enter a replacement: ";
chomp(my $repl = <STDIN>);
$repl = '"' . $repl . '"';
print "Regex: $regex\n";
print "Repl : $repl\n";

my $str = "A bunch of words to search and replace on";

print "Old: $str\n";

$str =~ s/$regex/$repl/gee;

print "New: $str\n";
 
Old 03-12-2008, 06:13 PM   #6
osor
HCL Maintainer
 
Registered: Jan 2006
Distribution: (H)LFS, Gentoo
Posts: 2,450

Rep: Reputation: 70
Ignore this.

Last edited by osor; 03-12-2008 at 06:22 PM.
 
Old 03-12-2008, 06:14 PM   #7
exscape
Member
 
Registered: Aug 2007
Location: Sweden
Distribution: OS X, Gentoo, FreeBSD
Posts: 82

Original Poster
Rep: Reputation: 15
I solved it!
Relevant parts:
Code:
my $regex = pop @ARGV;

my ($search, $repl, $flags) = ($1, $2, $3) if $regex =~ m{^s/(.*?)/(.*?)/(\w+)?};
$repl = "" unless defined $repl;
$flags = "" unless defined $flags;
die "Unable to read regex! ...blah...\n" unless (defined $search && defined $repl);

eval '$newname =~ s/$search/' . "$repl/$flags";
        die "Unable to execute substitution! Invalid regular expression?\nError message was: $@\n" if $@;
Works great!
I'm writing a mass-rename util. Fully functional by now!

Quote:
# Start with a bunch of kernel source files.

serenity@exscape ~/programming/perl/testing $ ls
aperture.c crash_dump.c head64.c ioport.c mce_amd.c pci-calgary.c process.c smpboot.c tce.c vsyscall.c
apic.c e820.c hpet.c irq.c mce.c pci-dma.c ptrace.c smp.c time.c x8664_ksyms.c
asm-offsets.c early_printk.c i387.c k8.c mce_intel.c pci-gart.c reboot.c stacktrace.c traps.c
audit.c early-quirks.c i8259.c kprobes.c module.c pci-nommu.c setup64.c suspend.c tsc.c
bugs.c genapic.c init_task.c ldt.c mpparse.c pci-swiotlb.c setup.c syscall.c tsc_sync.c
crash.c genapic_flat.c io_apic.c machine_kexec.c nmi.c pmtimer.c signal.c sys_x86_64.c vsmp.c

# Rename all to .c.old
serenity@exscape ~/programming/perl/testing $ perl ../massrename.pl * 's/.c$/.c.old/g'
serenity@exscape ~/programming/perl/testing $ ls
aperture.c.old e820.c.old i387.c.old kprobes.c.old mpparse.c.old pmtimer.c.old smpboot.c.old time.c.old
apic.c.old early_printk.c.old i8259.c.old ldt.c.old nmi.c.old process.c.old smp.c.old traps.c.old
asm-offsets.c.old early-quirks.c.old init_task.c.old machine_kexec.c.old pci-calgary.c.old ptrace.c.old stacktrace.c.old tsc.c.old
audit.c.old genapic.c.old io_apic.c.old mce_amd.c.old pci-dma.c.old reboot.c.old suspend.c.old tsc_sync.c.old
bugs.c.old genapic_flat.c.old ioport.c.old mce.c.old pci-gart.c.old setup64.c.old syscall.c.old vsmp.c.old
crash.c.old head64.c.old irq.c.old mce_intel.c.old pci-nommu.c.old setup.c.old sys_x86_64.c.old vsyscall.c.old
crash_dump.c.old hpet.c.old k8.c.old module.c.old pci-swiotlb.c.old signal.c.old tce.c.old x8664_ksyms.c.old

# Rename back to .c with underscore words swapped (genapic_flat -> flat_genapic)

serenity@exscape ~/programming/perl/testing $ perl ../massrename.pl * 's/(.*?)_(.*?).c.old/$2_$1.c/g'
serenity@exscape ~/programming/perl/testing $ ls
amd_mce.c crash.c.old hpet.c.old kexec_machine.c nmi.c.old printk_early.c smpboot.c.old tce.c.old
aperture.c.old dump_crash.c i387.c.old kprobes.c.old pci-calgary.c.old process.c.old smp.c.old time.c.old
apic.c.old e820.c.old i8259.c.old ksyms_x8664.c pci-dma.c.old ptrace.c.old stacktrace.c.old traps.c.old
apic_io.c early-quirks.c.old intel_mce.c ldt.c.old pci-gart.c.old reboot.c.old suspend.c.old tsc.c.old
asm-offsets.c.old flat_genapic.c ioport.c.old mce.c.old pci-nommu.c.old setup64.c.old sync_tsc.c vsmp.c.old
audit.c.old genapic.c.old irq.c.old module.c.old pci-swiotlb.c.old setup.c.old syscall.c.old vsyscall.c.old
bugs.c.old head64.c.old k8.c.old mpparse.c.old pmtimer.c.old signal.c.old task_init.c x86_64_sys.c
Is there an app like this out there? In any case I feel good at the moment, this will be used a lot by myself!

BTW, with the regex checker in the code above, is security still a concern?
I don't see how it'd matter, though, re 'eval' is off, and even if it was exploitable it wouldn't really matter; it's not like it's going to be setuid root or anything. If you can run perl -e 'eval "unlink $_ for <*>"' you can run rm -f * too...?
 
Old 03-12-2008, 06:15 PM   #8
osor
HCL Maintainer
 
Registered: Jan 2006
Distribution: (H)LFS, Gentoo
Posts: 2,450

Rep: Reputation: 70
For your mass rename tool, are you duplicating the functionality of rename?

Last edited by osor; 03-12-2008 at 06:27 PM.
 
Old 03-12-2008, 06:16 PM   #9
exscape
Member
 
Registered: Aug 2007
Location: Sweden
Distribution: OS X, Gentoo, FreeBSD
Posts: 82

Original Poster
Rep: Reputation: 15
Double post.

Anyway, yes, I'm duplicating that. Damnit.
Well, still time well spent learning!

Last edited by exscape; 03-12-2008 at 06:25 PM.
 
Old 03-12-2008, 06:48 PM   #10
chrism01
Guru
 
Registered: Aug 2004
Location: Sydney
Distribution: Centos 6.6, Centos 5.10
Posts: 16,324

Rep: Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041Reputation: 2041
These are 2 good tutorials

http://perldoc.perl.org/perlrequick.html - quick tutorial
http://perldoc.perl.org/perlretut.html - full tutorial

In re the OP etc, look at the \Q ...\E construct in the 2nd tutorial
 
  


Reply

Tags
perl, regex


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
perl + regex issues. kurrupt Programming 2 10-11-2006 10:33 AM
Perl regex $ ShaqDiesel Programming 6 08-18-2006 03:40 PM
regex Perl help igotlongestname Programming 2 09-14-2005 08:51 PM
perl: using 'substr' in regex ananthbv Programming 5 11-03-2004 02:58 AM
Regex help needed in perl Dr Twox Programming 2 08-07-2004 06:58 AM


All times are GMT -5. The time now is 05:48 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration