James Dixon’s Blog

James Dixon’s thoughts on commercial open source and open source business intelligence

Archive for May 20th, 2010

EMC’s Dan Hushon on Pentaho and Hadoop

leave a comment »

Dan Hushon, a Senior Director at EMC’s CTO office, has blogged about our Hadoop announcement: ETL & Hadoop/Map-Reduce… a match made in Orlando!

Dan has been at EMC for a number of years and know a lot about data. He is dead on when he talks about metadata and dimensionality of Map/Reduce and NoSQL data stores. These environments are rich in data but the metadata can be very sparse or non-existent. This makes reporting and analysis of the data harder.

Written by James

May 20, 2010 at 4:04 am