HBase 0.96.0 Released!
The following post, by Apache HBase 0.96 Release Manager/Cloudera Software Engineer Michael Stack, was published originally at blogs.apache.org and is provided below for your convenience. Our thanks to...
What are HBase znodes?
Apache ZooKeeper is a client/server system for distributed coordination that exposes an interface similar to a filesystem, where each node (called a znode) may contain data and a set of children. Each...
Migrating to MapReduce 2 on YARN (For Operators)
Cloudera Manager lets you add a YARN service in the same way you would add any other Cloudera Manager-managed service. In Apache Hadoop 2, YARN and MapReduce 2 (MR2) are long-needed upgrades for...
BinaryPig: Scalable Static Binary Analysis Over Hadoop
Our thanks to Telvis Calhoun, Zach Hanif, and Jason Trost of Endgame for the guest post below about their BinaryPig application for large-scale malware analysis on Apache Hadoop. Endgame uses data...
How-to: Get Started with Sentry in Hive
A quick on-ramp (and demo) for using the new Sentry module for RBAC in conjunction with Hive. One attribute of the Enterprise Data Hub is fine-grained access to data by users and apps. This post about...
What’s New in Cloudera Manager 5?
Learn the new features and enhancements in Cloudera Manager 5, including support for YARN, management of third-party apps and frameworks, and more. The response to the Oct. 2013 release of Cloudera...
Accumulo Comes to CDH
Apache Accumulo is now generally available on CDH 4. Cloudera is pleased to announce the immediate availability of its first release of Accumulo packaged to run under CDH, our open source distribution...
Where to Find Cloudera Tech Talks (Through March 2014)
Find Cloudera tech talks in Berlin, Budapest, London, Stockholm, Tokyo, and across the US during this calendar quarter. Below please find our regularly scheduled quarterly update about where to find...
Pro Tips for Pitching an HBaseCon Talk
These suggestions from the Program Committee offer an inside track to getting your talk accepted! With HBaseCon 2014 (in San Francisco on May 5) Call for Papers closing in just over three weeks (on...
Best Practices for Deploying Cloudera Enterprise on Amazon Web Services
This FAQ contains answers to the most frequently asked questions about the architecture and configuration choices involved. In December 2013, Cloudera and Amazon Web Services (AWS) announced a...
How-to: Make Hadoop Accessible via LDAP
Integrating Hue with LDAP can help make your secure Hadoop apps as widely consumed as possible. Hue, the open source Web UI that makes Apache Hadoop easier to use, easily integrates with your...
New Hue Demos: Spark UI, Job Browser, Oozie Scheduling, and YARN Support
Hue users can learn a lot about new features by following a steady stream of new demos. Hue, the open source Web UI that makes Apache Hadoop easier to use, is now a standard across the ecosystem —...
This Month in the Ecosystem (February 2014)
Welcome to our sixth edition of “This Month in the Ecosystem,” a digest of highlights from February 2014 (never intended to be comprehensive; for completeness, see the excellent Hadoop Weekly)....
Inside Apache Oozie HA
Oozie’s new HA qualities help cluster operators sleep well at night. Here’s how it works. One of the big new features in CDH 5 for Apache Oozie is High Availability (HA). In designing this feature, the...
How-to: Implement Role-based Security in Impala using Apache Sentry
This quick demo illustrates how easy it is to implement role-based access and control in Impala using Sentry. Apache Sentry (incubating) is the Apache Hadoop ecosystem tool for role-based access...
Index-Level Security Comes to Cloudera Search
The integration of Apache Sentry with Apache Solr helps Cloudera Search meet important security requirements. As you have learned in previous blog posts, Cloudera Search brings the power of Apache...
Sneak Preview: "Features & Internals" Track at HBaseCon 2014
The HBaseCon 2014 “Features & Internals” track covers the newest developments in Apache HBase functionality. The HBaseCon 2014 (May 5, 2014 in San Francisco) agenda has something for everyone –...
How-to: Configure JDBC Connections in Secure Apache Hadoop Environments
Learn how HiveServer, Apache Sentry, and Impala help make Hadoop play nicely with BI tools when Kerberos is involved. In 2010, I wrote a simple pair of blog entries outlining the general considerations...
Apache Spark Resource Management and YARN App Models
A concise look at the differences between how Spark and MapReduce manage cluster resources under YARN. The most popular Apache YARN application after MapReduce itself is Apache Spark. At Cloudera, we...
This Month in the Ecosystem (May 2014)
Welcome to our ninth edition of “This Month in the Ecosystem,” a digest of highlights from May/early June 2014 (never intended to be comprehensive; for that, see the excellent Hadoop Weekly). More good...