
This is simply nonsense. Big data has never been just about MapReduce.

It has always revolved around the concept of a data lake: data stored as objects, a series of data engineering pipelines moving data around, and a query engine on top. And in almost every enterprise company, this is the high-level architecture you see today.

And this model only continues to grow in popularity, as the use of siloed SaaS products drives data sprawl and the need for tools like Spark, Fivetran, etc. to move it all back to a centralised data lake for analysis.



Big data just means big data.

A data lake is one way to deal with it. It's convenient, but not always the best option.



