matching two consecutive identical characters in awk

Question

Bhoot 0 Junior Poster in Training

15 Years Ago

i have come across a question which asks to write an awk command that displays all records having 2nd and 3rd characters same.

i could write the solution in sed, using tagged regular expression, as follows :

sed -n '/^.$.$\1.*$/' emp.list

However, as far as i know, tagged expression are not applicable in awk. I wrote the solution in awk using substr() :

awk '{ if (substr($0,2,1) == substr($0,3,1)) print $0; }' emp.list

What i want to know is is there any other way in awk to solve this question? I just want to use a regular expression as i did in sed.
Also, is there any other way to write this in sed as well?
Thank you :)

shell-scripting

4 Contributors
4 Replies
294 Views
5 Days Discussion Span
Latest Post 15 Years Ago Latest Post by ghostdog74

All 4 Replies

Fest3er 39 Posting Whiz in Training

15 Years Ago

You seem to be right; awk/gawk do seem to have limited REs.

But I do have a wiseacre solution. :)

alias awk=egrep
awk "^.(.)\1" emp.list

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

Bhoot 0 Junior Poster in Training · Answer 1 · 2009-03-25T07:13:15+00:00

You seem to be right; awk/gawk do seem to have limited REs.
But I do have a wiseacre solution. :)
alias awk=egrep
awk "^.(.)\1" emp.list

lol..i couldnt help myself from laughing when i saw your solution :D
a good one; but certainly not applicable for me :)
anyways thanks for that too :)

Aia 1,977 Nearly a Posting Maven · Answer 2 · 2009-03-25T20:01:13+00:00

>I wrote the solution in awk using substr() :

awk '{ if (substr($0,2,1) == substr($0,3,1)) print $0; }' emp.list

A slightly different approach. Since the targets to compare are found always in the beginning, use only the first field. Separate individual characters using split and then compare.

awk '{ split($1, a, ""); if(a[2] == a[3]) print }' emp.list

ghostdog74 57 Junior Poster · Answer 3 · 2009-03-28T08:42:54+00:00

regular expressions are usually not needed for string manipulations.

# echo "abcdd" | awk 'BEGIN{FS=""}$2==$3{print "ok"}'
# echo "abbdd" | awk 'BEGIN{FS=""}$2==$3{print "ok"}'
ok

matching two consecutive identical characters in awk

Recommended Answers Collapse Answers

All 4 Replies

Recommended Answers