How to Match Multiple Lines using Regex in Perl One-liners

ByEric Ma Apr 10, 2020Apr 10, 2020

Perl one-liners with perl’s regular expression statement can be a very powerful text processing tools used as commands in a terminal or a script. By default, the input to the perl one-liner with -p or -n options is passed line by line. However, when we want to match multiple lines, it gets us some trouble. In this post we take a look at a technique to match multiple lines using Perl one-liner.

As an example, let’s try to find and remove content between <PRE> and </PRE> (both tags included too) if the content contains only new lines (\n), spaces, and <BR>/<HR> tags. A simple regex like <PRE>[\s{<BR>}{<HR>}]*</PRE> matches such criteria. But it does not match across multiple line (that is \s does not match \n). The trick here is to add option -0777 so that the record separator is the char of octal number 777 instead of \n.

perl -0777 -pe 's|<PRE>[\s{<BR>}{<HR>}]*</PRE>||g'

You can find the meanings of the options to perl used here from perlrun manual.

Here is one example of usages of the above one-liner.

$ echo -e "text\n<PRE>\n<BR>\n<HR><HR>\n \n</PRE>more text"
text
<PRE>
<BR>
<HR><HR>

</PRE>more text
$ echo -e "text\n<PRE>\n<BR>\n<HR><HR>\n \n</PRE>more text" |
perl -0777 -pe 's|<PRE>[\s{<BR>}{<HR>}]*</PRE>||g'
text
more text

The same technique can be used for grep too: How to Grep 2 Lines using grep in Linux.

How to print a line to STDERR and STDOUT in Bash?

ByQ A Mar 24, 2018

In Bash, how to print a string as a line to STDOUT? That is, the string and the newline character, nicely? And similarly, how to print the line to STDERR? In Bash, you can simply use the echo command: echo “your message here” or echo your message here Examples: $ echo the message here the…

Force Linux to reboot

ByQ A Mar 24, 2018

How to force Linux to reboot when the reboot command does not work. Enable the use of the magic SysRq option: # echo 1 > /proc/sys/kernel/sysrq Reboot the machine: # echo b > /proc/sysrq-trigger Even if you could not log on the system but sshd is working, you can force the Linux to reboot by:…

How to install node.js on Fedora?

ByEric Ma Mar 24, 2018Mar 24, 2018

How to install node.js on Fedora? You may install it by: # yum install nodejs npm Read more: How to install node.js on Ubuntu/Linux Mint? How to parse POST data in node.js? How to exit the program in Node.js? How to convert an object to json in Node.js? In Node.js, how to import functions from…

Xterm color codes for Vim on Linux

ByEric Ma Mar 24, 2018Mar 24, 2018

Vim uses Xterm color codes like 9, 1 and 114 on Linux. What’s the overall mapping from color to codes or codes to color? Xterm color table: You can also find the Xterm color codes here. Read more: How to convert tiff images from RGB color to CMYK color on Linux? Convention of error codes…

How to make Linux automatically reboot after a kernel panic?

ByEric Ma Mar 24, 2018Mar 24, 2018

After a kernel panic, it is impossible to remotely connect to the Linux server to reboot it by SSH. How to make the panic kernel automatically reboot itself? Linux kernel has a nice feature that reboots itself after a timeout when a kernel panic happened. Usually, it is disabled by default. To turn it on,…

How to import a source file to OCaml’s toplevel?

ByQ A Mar 24, 2018

How to import a source file to OCaml’s toplevel? Say, I want to use a function implemented in a source file in the toplevel. #use “file-name”;; Read, compile and execute source phrases from the given file. This is textual inclusion: phrases are processed just as if they were typed on standard input. The reading of…

Similar Posts

Leave a Reply Cancel reply