Sqoop Graduation Meetup
This blog was originally posted on the Apache Blog: https://blogs.apache.org/sqoop/entry/sqoop_graduation_meetup Cloudera hosted the Apache Sqoop Meetup last week at Cloudera HQ in Palo Alto. About 20...
View ArticleMeet the Presenters: Aaron Myers from Cloudera and Suresh Srinivas from...
This was originally posted on the Hadoop Summit 2012 blog. Today’s “Meet the Presenters” interview features two speakers: Aaron Myers from Cloudera and Suresh Srinivas from Hortonworks. Aaron and...
View ArticleWhy we build our platform on HDFS
It’s not often the case that I have a chance to concur with my colleague E14 over at Hortonworks but his recent blog post gave the perfect opportunity. I wanted to build on a few of E14’s points and...
View ArticleColumn Statistics in Apache Hive
Over the last couple of months the Hive team at Cloudera has been working hard to bring a bunch of exciting new features to Apache Hive. In this blog post, I’m going to talk about one such feature –...
View ArticleSeeking nominations for the 2012 Government Big Data Solutions Award
This post was contributed by Bob Gourley, editor, CTOvision.com. You are no doubt aware of the interesting situation we face with data today: The amount of data being created is growing faster than...
View ArticleMeet the Engineer: Aaron T. Myers
As I mentioned in my inaugural post last week, it’s important to shine a spotlight on the Cloudera engineers who have a hand in making the Hadoop projects run. It’s an obvious point, and yet an...
View ArticleCloudera Manager 4.0: Customer Feedback and Adoption
It’s been roughly three months since we announced GA of Cloudera Manager 4.0 (CM4) and I wanted to provide an update on its adoption and feedback from customers. For those new to it, Cloudera Manager...
View ArticleHow-to: Enable User Authentication and Authorization in Apache HBase
With the default Apache HBase configuration, everyone is allowed to read from and write to all tables available in the system. For many enterprise setups, this kind of policy is unacceptable....
View ArticleCDH4.1 Now Released!
Update time! As a reminder, Cloudera releases major versions of CDH, our 100% open source distribution of Apache Hadoop and related projects, annually and then updates to CDH every three months....
View ArticleVideos: Get Started with Hadoop Using Cloudera Enterprise
Our video animation factory has been busy lately. The embedded player below contains our two latest ones stitched together: Get Started with Hadoop Using Cloudera Enterprise, Part 1 To be a proactive...
View ArticleHBase at ApacheCon Europe 2012
Apache HBase will have a notable profile at ApacheCon Europenext month. Clouderan and HBase committer Lars George has two sessions on the schedule: HBase Sizing and Schema DesignAbstract: This talk...
View ArticleQuorum-based Journaling in CDH4.1
A few weeks back, Cloudera announced CDH 4.1, the latest update release to Cloudera’s Distribution including Apache Hadoop. This is the first release to introduce truly standalone High Availability for...
View ArticleTop Five Nominees for the 2012 Government Big Data Solutions Award
The following is a re-post from Bob Gourley of CTOVision.com. The amount of data being created in governments is growing faster than humans can analyze. But analysis can solve tough challenges. Those...
View ArticleThe Winner of the 2012 Government Big Data Solutions Award is the National...
The following is a re-post from CTOVision.com. The Government Big Data Solutions Award was established to highlight innovative solutions and facilitate the exchange of best practices, lessons learned...
View ArticleHow-to: Use Apache ZooKeeper to Build Distributed Apps (and Why)
It’s widely accepted that you should never design or implement your own cryptographic algorithms but rather use well-tested, peer-reviewed libraries instead. The same can be said of distributed...
View ArticleNew Products and Releases: Cloudera Navigator, Cloudera Enterprise BDR, and More
Today is an exciting day for Cloudera customers and users. With an update to our 100% open source platform and a number of new add-on products, every software component we ship is getting either a...
View ArticleHow-to: Set Up a Hadoop Cluster with Network Encryption
Hadoop network encryption is a feature introduced in Apache Hadoop 2.0.2-alpha and in CDH4.1. In this blog post, we’ll first cover Hadoop’s pre-existing security capabilities. Then, we’ll explain why...
View ArticleHow-to: Set Up Cloudera Manager 4.5 for Apache Hive
Last week Cloudera released the 4.5 release of Cloudera Manager, the leading framework for end-to-end management of Apache Hadoop clusters. (Download Cloudera Manager here, and see install instructions...
View ArticleHow-to: Use the Apache HBase REST Interface, Part 1
There are various ways to access and interact with Apache HBase. The Java API provides the most functionality, but many people want to use HBase without Java. There are two main approaches for doing...
View ArticleHow Apache Hadoop Helps Scan the Internet for Security Risks
The following guest post comes from Alejandro Caceres, president and CTO of Hyperion Gray LLC – a small research and development shop focusing on open-source software for cyber security. Imagine this:...
View Article