The World of Hadoop and Beyond
Hortonworks went through an IPO last Friday, December 12, 2014. It’s
initial price of $16 soared by 60% immediately after. Today the stock price
is $24.70 with a market cap of $1.02B. Another billion dollar club member.
They compete with Cloudera and MapR in packaging the open source Hadoop
platform for customers. How do they make money when the basic Apache Hadoop
is free? – by offering added services like training and consulting. They
can also add auxiliary products (not open source) that customers must pay
for. The interesting fact is that Hortonworks’s CEO had claimed a $100m
revenue this year, but looks like he is way short of that – $33m during
first nine months. The future is quite uncertain!
In the mean time, several new start-ups have come up in the Hadoop-sphere:
Databricks with venture funding of $47m so far (includes Andrees... (more)
Data Lake and Data Refinery – Gartner Controversy!
Much discussion has been going on the new phrase called Data Lake. Gartner
wrote a report on the ‘Data Lake’ fallacy, saying to be careful about
‘data lake’ or ‘data swamp’. Then Andrew Oliver wrote in the
InfoWorld these beginning words, “For $200, Gartner tells you ‘data
lakes’ are bad and advises you to try real hard, plan far in advance, and
get governance correct”. Wow, what an insight!
During my days at IBM and Oracle, Gartner wanted to get time on my calendar
to talk about database futures. Then afterwards, I realized that... (more)
This year’s CES expects to have 160,000 attendees and tonight’s keynote
by the CEO of Samsung Mr. BK Yoon was “unlocking infinite possibilities of
IoT”. The Internet of Things seems to be the overall theme this year.
Today I listened to an interesting panel on disruptive technologies over next
five years. Here is a brief summary.
3D Printing: This year expects to see 300,000 desktop 3D printers in the US.
Mainstream consumer adoption is doubtful. Someone jokingly said that you can
build a statue of yourself and install it in your yard. Another term for 3D
printing is additive ma... (more)
Fast Data vs. Big Data – How to Combine?
Today, all the discussion on Big Data centers around “static data” in a
data lake (old Data Warehouse) accessed by BI tools or SQL on Hadoop (Hawk,
Impala) or Map/Reduce algorithms (MapR) for analysis. This is looking at
historical data and finding trends. Some new tools are trying to provide
predictive analysis based on past trends. This area deals with mostly the
volume and variety aspect of Big Data, but not the velocity or for “data in
The term “Fast Data” is applied to data that is in motion. This component
is getting more ... (more)
On September 9, 2014, Apple made significant announcements at the historic
Flint Center in Cupertino, where exactly 30 years ago, the brand new Apple
McIntosh was introduced by Steve Jobs. It is worthwhile to see the entire
event (almost 1.5 hours, culminated by the band U2 playing on stage and
releasing a new album on iTune for free download).
Three key announcements were made: the new iPhone6 and iPhone6Plus, Apple
Pay, and Apple Watch.
The new iPhone6 and 6Plus offer bigger screen sizes, as expected, with a new
chip plus new iOS8 software. Screen size went from 4 inch (iPhone5... (more)