What do you get when you combine Big Data technologies….like Pig and
Hive? A flying pig?
No, you get a “Logical Data Warehouse”.
My general prediction is that Cloudera and Hortonworks are both aggressively
moving to fulfilling a vision which looks a lot like Gartner’s “Logical
Data Warehouse”….namely, “the next-generation data warehouse that
improves agility, enables innovation and responds more efficiently to
changing business requirements.”
In 2012, Infochimps (now CSC) leveraged its early use of stream processing,
NoSQLs, and Hadoop to create a design pattern which combined real-time,
ad-hoc, and batch analytics. This concept of combining the best-in-breed Big
Data technologies will continue to advance across the industry until the
entire legacy (and proprietary) data infrastructure stack will be replaced
with a new (and open) one.
As this is happening, I predi... (more)
The security community has a growing number of influential and important
people, especially as the industry rises to meet the need to address more
advanced security threats, such as targeted attacks. But how does a company
in the security industry truly identify the influential people? And then once
identified, how does one use influential voices to help promote their brand?
In this study, we answer the first question - how to identify the most
powerful voices in your industry, focusing on the security space, and as part
of this we provide you a list of people to follow for the b... (more)
Have you heard of products like IBM’s InfoSphere Streams, Tibco’s Event
Processing product, or Oracle’s CEP product? All good examples of
commercially available stream processing technologies which help you process
events in real-time.
I’ve been asked what I consider as “Big Data” versus “Small Data”
in this domain. Here’s my view.
Real-Time Analytics Small Data Big Data Data Volume None None Data Velocity
100K events / day (<<1K events / second) Billion+ events / day (>>1K events /
second) Data Variety 1-6 unstructured on sources AND 1 single destination (an
output file, a SQ... (more)
Do you think they truly understood just how fast the data infrastructure
marketplace was going to change? That is the question that comes to mind when
I think about Donald Feinberg and Mark Beyer at Gartner who, last year,
wrote about how the data warehouse market is undergoing a transformation.
Did they, or anyone for that matter, understand the significant change
underway in the data center?
I describe it as Big Data 1.0 versus Big Data 2.0.
Big Data 1.0
I was recently talking to friends at one of our largest banks about their Big
Data projects under way. In less than one yea... (more)
SAP customers are confused about the positioning between SAP Sybase IQ and
SAP Hana as it applies to data warehousing. Go figure, so is SAP. You want to
learn about their data warehousing offering, and all you hear is “Hana
this” and “Hana that”.
It reminds me of the time after I left Teradata when the BI appliances came
on the scene. First Netezza, then Greenplum, then Vertica and Aster Data,
then ParAccel. Everyone was confused about what the BI appliance was in
relation to the EDW. Do I need an EDW, a BI appliance, an EDW + BI appliance?
With SAP, Sybase IQ is supposed to be ... (more)