The funny thing is, by and large my only use case for awk is to print out whites...

tyingq · on Sept 30, 2021

The syntax isn't nearly as nice, but Perl can be handy if you're doing something more after splitting into columns. And it's usually already there / installed, like awk. For just columns:

  $ printf "a b  c d   e\n1 2  3 4 5" | perl -lanE 'say "$F[2] $F[4]"'
  c e
  3 5

adamgordonbell · on Sept 30, 2021

It surprised me that AWK had dictionaries and no variable declarations, which makes it feel like a modern scripting language even though it was written in the 70s.

It turns out, though, that this is because Perl and later Ruby were inspired by AWK, and even support these line-by-line processing idioms with BEGIN and END as well.

    ruby -n -a -e 'puts "#{$F[0]} #{$F[1]}"'

    ruby -ne '
    BEGIN { $words = Hash.new(0) }

    $_.split(/[^a-zA-Z]+/).each { |word| 
    $words[word.downcase] += 1 }

    END {
        ...
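
For comparison, here is a rough awk sketch of the same BEGIN/END idiom. It is not a completion of the Ruby snippet above; the function name `word_freq` and the sample input are made up for illustration. It leans on the two features mentioned: associative arrays and no variable declarations.

```shell
# Count word frequencies on stdin (hypothetical helper, for illustration only).
word_freq() {
  awk '
    BEGIN { FS = "[^a-zA-Z]+" }   # runs before any input: split on non-letters
    {
      for (i = 1; i <= NF; i++)
        if ($i != "") counts[tolower($i)]++   # array entries need no declaration
    }
    END { for (w in counts) print counts[w], w }   # runs after the last line
  ' | sort -k1,1nr -k2,2          # highest count first, then alphabetical
}

printf 'The cat and the hat\nthe end\n' | word_freq
# 3 the
# 1 and
# 1 cat
# 1 end
# 1 hat
```

The `sort` is there because awk's `for (w in counts)` iterates in an unspecified order.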

tannhaeuser · on Oct 1, 2021

I think it's pretty obvious that awk syntax is ultimately the main inspiration for JavaScript syntax, with optional semicolon as stmt terminator, regexp literals, for (x in y), the function keyword, a[x] associative array accessors, etc.

popcube · on Oct 2, 2021

They spent a lot of time making sure a Perl one-liner can handle most of what awk, sed, et al. do.

flandish · on Sept 30, 2021

A long while ago I wrote up a little processor to determine field lengths in a given file - I forgot the original reason. ( https://github.com/sullivant/csvinfo )

However, I feel I really should have taken the time to learn Awk better as it could probably be done there, and simply! (It was a good excuse to tinker with rust, but that's an aside.)

tyingq · on Sept 30, 2021

To give you some idea, a one-liner to find the (first) longest username and its length in /etc/passwd:

  $ awk -F: '{len=length($1);if(len>max){max=len;user=$1}}END{print user,max}' /etc/passwd
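
Spread over several lines, the same logic might look like the sketch below. The wrapper name `longest_user` and the three-line sample input are assumptions so the result is predictable; on a real system you would feed it /etc/passwd instead.

```shell
# Print the longest first field of colon-separated input, plus its length.
longest_user() {
  awk -F: '
    {
      len = length($1)
      if (len > max) {   # max starts out empty, which compares as 0
        max = len        # strict ">" keeps the first of equally long names
        user = $1
      }
    }
    END { print user, max }
  '
}

printf 'root:x:0:0\ndaemon:x:1:1\nnobody:x:65534:65534\n' | longest_user
# daemon 6
```

Both `daemon` and `nobody` are six characters; changing `>` to `>=` would report the last one instead. Usage against the real file would be `longest_user < /etc/passwd`.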

flandish · on Sept 30, 2021

Thanks for that reply! It's good to work with an example.

genewitch · on Sept 30, 2021

I'll mark this on my GitHub when I get back on a computer, I take public datasets and make graphs and transforms and reports. The big survey companies have weird data records and having to write a parser is my least favorite part. I think other people who ingest my content don't appreciate the effort, but that's a near universal feeling I think, heh.

twic · on Sept 30, 2021

If i don't use awk, i throw tr -s ' ' into the pipeline, and then the delimiter is a single space, so you can just cut.

axiolite · on Oct 1, 2021

That will collapse multiple spaces, but won't handle a mix of spaces and tabs, which awk will handle.
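
A small sketch of the difference, and one possible workaround if you want to stay with cut. The helper name `field` is made up, and it still does not deal with leading whitespace.

```shell
# Tab between "one" and "two": squeezing spaces alone leaves the tab in place,
# so cut sees "one<TAB>two" as a single field.
printf 'one\ttwo three\n' | tr -s ' ' | cut -d ' ' -f 2
# three

# awk splits on any run of spaces and tabs by default.
printf 'one\ttwo three\n' | awk '{ print $2 }'
# two

# Possible workaround: turn tabs into spaces first, then squeeze, then cut.
field() {
  tr '\t' ' ' | tr -s ' ' | cut -d ' ' -f "$1"
}

printf 'one \t two\t\tthree\n' | field 2
# two
```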

adamgordonbell · on Sept 30, 2021

choose from your link does look nice for simple column selection.

   echo -e "foo   bar   baz" | choose 1 2

vs. awk's

   echo -e "foo   bar   baz" | awk '{ print $2, $3}'

I love the effort people are putting into reinventing the core unix tools.

I think I'll stick with Awk for now though.

foobarian · on Sept 30, 2021

The problem with new tools is

$ choose

bash: choose: command not found...

goohle · on Sept 30, 2021

  ls -l | tr -s ' ' | cut -d ' ' -f 5

foobarian · on Sept 30, 2021

Exactly! Exactly! And now fix it to work with tabs :-)

tyingq · on Sept 30, 2021

And leading whitespace. Compare:

  $ printf " one two  three"  | tr -s ' ' | cut -d ' ' -f 1

  $ printf " one two  three"  | awk '{print $1}'
  one

goohle · on Sept 30, 2021

  ps ax | sed 's/^\s\+//; s/\s\+/ /g;' | cut -d ' ' -f 4

goohle · on Sept 30, 2021

  echo -e '1\t2\t3\t4\t5' | expand -t 1 | cut -d ' ' -f 3