by Jesse Anderson | Sep 7, 2016 | Blog, Business, Data Engineering, Data Engineering is hard, Magnum Opus
After years of teaching Big Data, I’ve come up with the best explanation of why it isn’t easy, cheap, or quick. I wrote the in-depth piece published on...
by Jesse Anderson | Aug 31, 2016 | Blog, Business, Data Engineering, Data Engineering is hard
Today’s blog post comes from a question from a subscriber on my mailing list. The question comes from Alpesh D.: I have been getting your emails and they all seem to make sense. However, did I understand it correct that you believe all big data engineers need to...
by Jesse Anderson | Aug 24, 2016 | Blog, Business, Data Engineering, Data Engineering is hard
Crawl, Walk, Run with Big Data: Attacking a Big Data project with an all-or-nothing mindset leads to absolute failure. I highly suggest breaking the overall project into more manageable phases. These phases are called crawl, walk, and run. Crawling: In this phase,...
by Jesse Anderson | Aug 17, 2016 | Blog, Business, Data Engineering, Data Engineering is hard
Today’s blog post comes from a question from a subscriber to my mailing list. The question comes from André M.: How good a Big Data strategy can be defined by someone that doesn’t know the technology behind it? (By knowing I don’t mean being able to...
by Jesse Anderson | Aug 10, 2016 | Blog, Business, Data Engineering
Companies and individuals often come into Big Data thinking everything is cheap. After all, the entire stack is open source, right? Well, some things are cheap and some things are more expensive. Software: One of the important distinctions with Hadoop is that it...
by Jesse Anderson | Jul 27, 2016 | Blog, Business, Data Engineering, Data Engineering is hard
Some of the contenders for Big Data messaging systems are Apache Kafka, Google Cloud Pub/Sub, and Amazon Kinesis (not discussed in this post). While they are similar in many ways, there are subtle differences that a Data Engineer needs to know. These can range from nice...