Taking dumps of analytics logs and pulling out relevant info for our customers o... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		bijanv on Nov 10, 2013 \| parent \| context \| favorite \| on: Ask HN: To everybody who uses MapReduce: what prob... Taking dumps of analytics logs and pulling out relevant info for our customers on app usage

sitkack on Nov 10, 2013 [–]

This is the `grep/awk` use case. The nice thing about streaming mr interface to hadoop (calling external programs) is that you can literally take your grep/awk workflow and move it to the cluster. Retaining line oriented records is a huge step in having a portable data processing workflow.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact