Sed¶

Materials to download¶

Grep (an acronym for “Global Regular Expression Print”), finds a string in a given file or input.

Grep format:

grep [options] [regexp] [filename]

Grep usecases:

grep -i 'mary' mary-lamb.txt

grep -w 'as' mary-lamb.txt

grep -r '456' /<your_working_directory>/

grep -v ‘the’ mary-lamb.txt

grep -A1 'School'  mary-lamb.txt

grep -B2 'School'  mary-lamb.txt

Print additional (leading and trailing) context lines before and after the match (grep -C <NUM>):

grep -C3 'School' mary-lamb.txt

grep -H 'School' mary-lamb.txt

Regexp or regular expression:

Regexp is how we specify that we find to see a particular pattern (it could be words or characters).

grep 'M.a' mary-lamb.txt
grep 'M*y' Mary_Lamb_lyrics.txt

awk [options] [filename]

Named after the authors: Aho, Weinberger, Kernighan

awk '{print}' BRITE_students.txt

awk '{print $1}' BRITE_students.txt

awk '{print $1" "$3}' BRITE_students.txt

awk '$1=="Ali"' BRITE_students.txt

awk '/Kat/ {print $0}' BRITE_students.txt

Question for you: How do you print the name and favorite sport of students whose names contain the letter “u”?

<insert code here>

awk '/Kat/{++cnt} END {print "Count = ", cnt}' BRITE_students.txt

awk 'BEGIN {
   sum = 0; for (i = 0; i < 20; ++i) {
       sum += i; if (sum > 50) exit(10); else print "Sum =", sum
   }
}'

sed [options] [filename]

SED stands for “Stream EDitor”. It is a widely used text processing Linux tool.

cat BRITE_students.txt | sed -n 3p

cat mary-lamb.txt | sed 's/Mary/Maria/g'

sed -e '1d' -e '2d' -e '5d' BRITE_students.txt

echo -e "1d\n2d\n5d" > my_lines.txt
cat my_lines.txt
sed -f my_lines.txt BRITE_students.txt

cat BRITE_students.txt | sed -n 2,'$p'