by Jesse Anderson | Apr 21, 2016 | Blog, Business, Data Engineering, Magnum Opus |
I am proud to announce that my latest screencast on Apache Impala called Creating Big Data Solutions with Impala was released on O’Reilly. This caps off a long relationship with Impala that started well before it was released publicly. My relationship with... by Jesse Anderson | Apr 19, 2016 | Blog, Business, Data Engineering, Data Engineering is hard |
Unit testing, Apache Spark, and Java are three things you’ll rarely see together. And yes, all three are possible and work well together. Update: updated to Spark Testing Base 0.6.0. Why Unit Test With Spark? I’m not an advocate of TDD (Test-Driven... by Jesse Anderson | Apr 11, 2016 | Blog, Business, Data Engineering |
I spoke at Strata+Hadoop World two weeks on Kafka. There were three main themes from the conference that I came away with: real-time Big Data is the (present) future, we should be using intermediary libraries instead of programming directly to an API, and applied AI... by Jesse Anderson | Apr 6, 2016 | Blog, Business, Data Engineering |
Kafka 0.9.0 brings with it a bevy of goodness. Many of those features are aimed at operations, like security. Some are developer features, like a new Consumer API. This post will focus on the new features of 0.9.0 for developers. New Consumer API The most notable... by Jesse Anderson | Mar 28, 2016 | Blog, Business, Data Engineering |
An investment in knowledge pays the best interest. Benjamin Franklin Benjamin Franklin is saying that investing in yourself or your team’s knowledge will give you the highest ROI (return on investment). By investing in knowledge, you’re going to be far...