by Jesse Anderson | Apr 19, 2016 | Blog, Business, Data Engineering, Data Engineering is hard |
Unit testing, Apache Spark, and Java are three things you’ll rarely see together. And yes, all three are possible and work well together. Update: updated to Spark Testing Base 0.6.0. Why Unit Test With Spark? I’m not an advocate of TDD (Test-Driven... by Jesse Anderson | Apr 11, 2016 | Blog, Business, Data Engineering |
I spoke at Strata+Hadoop World two weeks on Kafka. There were three main themes from the conference that I came away with: real-time Big Data is the (present) future, we should be using intermediary libraries instead of programming directly to an API, and applied AI... by Jesse Anderson | Apr 6, 2016 | Blog, Business, Data Engineering |
Kafka 0.9.0 brings with it a bevy of goodness. Many of those features are aimed at operations, like security. Some are developer features, like a new Consumer API. This post will focus on the new features of 0.9.0 for developers. New Consumer API The most notable... by Jesse Anderson | Mar 28, 2016 | Blog, Business, Data Engineering |
An investment in knowledge pays the best interest. Benjamin Franklin Benjamin Franklin is saying that investing in yourself or your team’s knowledge will give you the highest ROI (return on investment). By investing in knowledge, you’re going to be far... by Jesse Anderson | Mar 24, 2016 | Blog, Business, Data Engineering |
Hadoop has a vast and vibrant developer community. Following the lead of Hadoop’s name, the projects in the Hadoop ecosystem all have names that don’t correlate to their function. This makes it really hard to figure out what each piece does or is used for.... by Jesse Anderson | Mar 23, 2016 | Business, Data Engineering |
It’s easy for someone in the training industry to identify great training. They live, eat, and breath it. What about everyone else? How can you identify great training even if you aren’t in the training industry? In a previous post, I raised the...